Monday, March 30, 2009
Sunday, March 29, 2009
Saturday, March 28, 2009
Progress with spring paper: I can read by 21 rows now and have all the population data into 18 datasets
I have now read all my data into 18 age band data sets and just need to now join these 18 data sets and then split these every 21*23 rows into 228 data sets and then again split this into 23 yearly tables. So the end result should be 227 countries and one world data sets with population for both genders and population for each of male and female by 17 age bands and total all ages, and 23 yearly tables for each country.
This is giving me an idea for the analysis. This idea is to use the 18 data sets and call these the age band data sets(18). Then I want to have 227 country and one world data sets and call these the country data sets(228). Then also have 23 data sets for years and call these the years data sets(23). Then I will work comparing these data sets for my populations analysis.In fact, rather than join these data sets I have now, the age band data sets to get the other data sets, I should instead simply read in the data for each data set. So my next reading in will be for the 227 countries and one world data sets. Then the third step will be to read in the data into years data sets.
I also need to start looking at a data source for finding the top three industries in each of the 228 countries. May be one google search like I did to find the population data will work.
I was offered a chance to test out Inference for R. This software it seems interfaces R with Microsoft Office. I have so far used it in both Word and Excel. Because it is an interface it is faster than copying and pasting code into the R command line window and also is faster because when you change code and run that changed code as in debugging you do not lose the code typed on the command line. I had in the past changed my code on the command line and would have to copy my successful code back to my text editor to keep a record of what I was doing.
One big problem for Inference for R is that it is only available on one platform. I am having to move back to the Windows world of computers because of statistics software like Inference for R, SAS and Excel and also because of the game Eve Online which is mostly a windows based game.
My eMac is functioning fine and I even found the DVD drive working again. The Macbook although it is filthy is working fine as my usual daily machine now in the living room, The new home built PC with Windows XP 64 bit and Debian Linux is also working fine now. The new Dell Mini 9 Netbook is fine and I carry it around all the time but it is not getting much use but it is very useful to have it if needed. I used it at a community group last week to keep minutes of a meeting.
I am thinking of buying a Windows laptop that I would build from an OCZ laptop skeleton. This PC laptop would also be running Windows XP 64 bit and Debian 4.2. I am also thinking of buying an iMac 20 inch desktop. I am looking at used MacBooks and Powerbooks and iBooks for a friend and perhaps one for myself for music recording. I am looking in the 200-400 dollar range for these. These would be running Tiger not Leopard for an OS.
Friday, March 20, 2009
After meeting my supervisor I felt confident that my thesis is progressing well. I am focused on the literature search and my paper for a conference in social economy in two months. These are the two basic tasks at present. I will need to find the twenty classic references for my topic. Then I will spend the summer term reading these references. I have four items in mind so far for this twenty item list.
In health reading I am reading about stress management. In entertainment reading I am reading two books about guitar. In programming I am studying SAS, Excel and have numerous computer "language" books. I will need to learn to program in Java or C++ this summer.
Thursday, March 19, 2009
Sunday, March 15, 2009
I have created columns next to the total population and male and female population counts columns that subtract a given age band from the previous age band. Thus if these new columns are negative it means a given age band is larger than the younger age band before it. For instance, if there are 300,000 persons aged 5 to 9 years old and there are only 200,000 babies aged 0-4 years old in a country in say 1996 then the next columns shows - 100,000. I then used conditional formatting to highlight the new column if a value was negative. This has given me a way of visually finding any suspected baby booms. But I still desire R code that will scan these new columns and produce a data set of only the population tables where these negative values occur. At this point, as well I have been offered the use of a documentation software to document in Office 2007 format, my use of R code.
I have some idea of the required indexes in the data set to use in R loops or array indexes. These are the re-occurrence rows of data in the large data set. In other words starting with the World population data tables then going from country names starting at A then down to Z we have repeated tables starting in 1996 to 2010 for each year and then 2015 and basically every five years projected until 2050. So there are tables in two dimensions, country names and then years. I will limit my searches to 1996 to 2005 at first but then if I am choosing a country as having booms in this decade of years 1996 to 2005, I will come back to the projections for the future.
I also need to get the path's right to open data files using R on windows. So once this path problem is solved I can move ahead with trying index's to parse the large data set into a smaller data set with only population tables that indicate a suspected baby boom.
- Whitty, Monica T., Joinson, Adam N. Truth, Lies and Trust on the Internet (New York, N.Y.: Routledge, 2009).
- This is interesting and I agree with these authors that not all self disclosure is anonymous. I am seeing this working into my findings that sites that allow social networking with those we know offline are different than the "virtual friends who are strangers" topics of past Internet studies.
Friday, March 06, 2009
here are some titles that are only pennies used from Amazon
If a few bucks is still too much you read his online tutorial
Old but still useful
The Html Sourcebook: A Complete Guide to Html 3.0 (Paperback)
by Ian S. Graham (Author), Ian S Graham (Author)
Html 4.0 Sourcebook (Paperback)
by Ian S. Graham (Author) "What is a text markup language?..." (more)
I am trying to start reading this one
HTML Stylesheet Sourcebook (Sourcebooks) (Paperback)
by Ian S. Graham (Author)
BTW I do not know Ian or have any monetary connection to him nor really academic beyond learning from his great books and always available online tutorial.
Monday, March 02, 2009
Here are the RSS feeds from ProQuest for the searches I did to complete the searching for the seven sub-component types of knowledge theorized to make up operational knowledge.
- Procedural and Process Knowledge
- Heuristic Knowledge
- Cultural Knowledge