TARTU 2013
Table of Contents
1. 2. 3. Abstract ............................................................................................................................... 3 US Census ........................................................................................................................... 4 Implementation ................................................................................................................... 5 3.1. 3.2. 3.3. Constraints ................................................................................................................... 5 Software ....................................................................................................................... 5 Implementation ............................................................................................................ 5
Conclusion .................................................................................................................................. 8 References .................................................................................................................................. 9
2
1. Abstract
The purpose of this project is to analyze the changes in United States education, income and demographics over ten years (from 2000 to 2010). For this US census data is used which should be available at the website of US Census Bureau (www.census.gov). The goals of this project are comparing the 2000 and 2010 US census data and find out how attributes like education, income and demographics have changed over ten years by making an OLAP solution for this. To achieve that, two datasets are needed – 2000 and 2010 US census full datasets. Another goal is to make snapshots of 2010 US census data so that they could fit into Excel and they could be used in Data Mining course exercises.
3
2. US Census
The United States Census is a decennial census that is organized by The Bureau of the
References: 1. Wikipedia.org, United States Census, 16.12.2012. http://en.wikipedia.org/wiki/United_States_Census - last viewed 04.01.2013. 2. Wikipedia.org, 2000 United States Census, 27.11.2012. http://en.wikipedia.org/wiki/2000_United_States_Census - last viewed 04.01.2013. 3. Wikipedia.org, 2010 United States Census, 01.01.2013. http://en.wikipedia.org/wiki/2010_United_States_Census - last viewed 04.01.2013. 4. 2000 US Census Data. http://biit.cs.ut.ee/~vilo/edu/Data/census2000/extract_full.csv.gz - last viewed 04.01.2013 5. MonteDB 5. http://www.monetdb.org/ - last viewed 04.01.2013. 6. SquirreL SQL Client. http://squirrel-sql.sourceforge.net/ - last viewed 04.01.2013. 7. Pentaho 4.8 http://www.pentaho.com/explore/pentaho-business-analytics/ - last viewed 04.01.2013 9