Thursday, April 09, 2009

I can read in the country data sets in two lines of code.

I figured out how to read in the country data sets in two lines of code. The problem is I need 228 of these pairs of lines. To do this I turn to the automated features of "fill down" in Excel. I am learning that in SAS one can do this with macros. In Excel I have created the first line of code where I set a specific counter based on the position of the countries data in the large file. Once this so called counter or index of numbers is set the second line is a as.data.frame line that reads the large file but only reads the rows( as indicated by the counter/index set of numbers) needed for that country. So today I am going to use Excel to write the second line of code 228 times.

I am still using Inference for R and was mistaken in a sexist way about the representative changing her last name. Of course, many women have two last names and that was all it was. I have only completed the first tutorial for Inference for R and have not yet tried the studio application. This promises to allow breaks while running code.

At this point in testing this Inference for R software, after I have the second lines of code completed I will run this and try printing the Inference for R word document and adding a write up. I may use some of this blog for the write up. I will do this in word. I will after that try to make a Power Point for the conference and start to write up this part of the research. Ideally I will have only a few slides devoted to this part of the project but that show some of the code.

Then it is off to Google or back to the US census site to find data about major industries by country.

1 comment:

Jason said...

I have a really hard time thinking of any data that could be read into Excel and not into R. (Worst case, one could read it into Excel then import into R from there, but I've never had it happen.)

Could you post more details? You might get some suggestions on how to do it using a less painful approach in return.