Homework 1
Reminders:
1. Due date: Jan-14-2012 (Saturday) in class.
2. Please submit only the hardcopy.
3. Please show the names and ID numbers of all your group members on the cover page. Please also indicate your session (DSME5110W).
1.
Problem 2.1 (p. 33)
The file P02_01.xlsx indicates the gender and nationality of the MBA incoming class in two successive years at the Kelley School of Business at Indiana University.
a. For each year, create tables of counts of gender and of nationality. Then create column charts of these counts. Do they indicate any noticeable change in the composition of the two classes?
b. Repeat part a for nationality, but recode this variable so that all nationalities that have counts of 1 or 2 are classified as Other.
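Note on part b: the recoding step can also be sketched outside Excel. The Python snippet below is only an illustration, not the required method. The sample list of nationalities is made up and is not taken from P02_01.xlsx, and the names recode_rare, max_rare_count, and other_label are hypothetical.

```python
from collections import Counter

def recode_rare(values, max_rare_count=2, other_label="Other"):
    """Replace every category whose count is <= max_rare_count with other_label."""
    counts = Counter(values)
    return [v if counts[v] > max_rare_count else other_label for v in values]

# Hypothetical sample data (NOT the contents of P02_01.xlsx).
nationalities = ["US", "US", "US", "India", "India", "India",
                 "China", "Peru", "Peru"]

recoded = recode_rare(nationalities)

# Table of counts after recoding; categories seen once or twice
# (China, Peru) are now pooled under "Other".
recoded_counts = Counter(recoded)
print(recoded_counts)
```

The counts are computed once on the original values, so a category is judged rare by its original frequency and not by the frequency after pooling.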
2.
Problem 2.5 (p. 33)
The file DJIA Monthly Close.xlsx contains monthly values of the Dow Jones Industrial Average from 1950 through 2009. It also contains the percentage changes from month to month. (This file will be used for an example later in this chapter.) Create a new column for recoding the percentage changes into six categories: Large negative (< -3%), Medium negative (< -1%, ≥ -3%), Small negative (< 0%, ≥ -1%), Small positive (< 1%, ≥ 0%), Medium positive (< 3%, ≥ 1%), and Large positive (≥ 3%). Then create a column chart of the counts of this categorical variable. Comment on its shape.
3.
Problem 2.6 (p. 55)
The file P02_06.xlsx lists the average time (in minutes) it takes citizens of 379 metropolitan areas to travel to work and back home each day.
a. Create a histogram of the daily commute times.
b. Find the most representative average daily commute time across this distribution.
c. Find a useful measure of the variability of these average commute times around the mean.
d. The empirical rule for standard deviation indicates that approximately 95% of these average travel times will fall between which two values? For this particular data set, is