Statistics
Student Name: Louis Chen
Teacher Name:Ian
Due Date: June 3 2015
Word Count:499
GAC010 AE2
Louis
ID
1. Create a dot plot to see the distribution of the data.
10
8
5
3
0
150
155
160
165
170
175
180
185
190
2. List data in order
157,160,162,165,166,167,167,167,168,169,170,171,172,173,174,174,176,178,180,182
3. What’s the Range of data:
Range=Highest-Lowest=182-157=25(cm)
4. Plot the data in a Histogram and describe the distribution
7
Frequency
5
4
2
0
157.5
162.5
167.5
172.5
177.5
182.5
Heights
The distribution is symmetrical. It has a normal shape. It has a slight skew to the right.
2015/6/2
!1
GAC010 AE2
Louis
ID
Cumulative Frequency Percentage
5. Create a cumulative frequency curve and describe its shape.
100%
75%
50%
25%
0%
157.5
162.5
167.5
172.5
177.5
182.5
Heights
S-shaped curve. typical of a normal distribution
6. Use the ogive to estimate the median and quartiles.
1st quartile:
Q1 about 167cm
Median:
about 170 cm
3rd quartile:
Q3 About 174cm
Estimated from the graph.
7. What is the inter-quartile range?
Inter-quartile range:
IQR=Q3-Q1=174-166=7cm
2015/6/2
!2
GAC010 AE2
Louis
ID
8. Box plot
160
165
170
175
180
9. Find the mean and standard deviation (sd)
!
(X − µ )2
= 6.5
!σ =
20
2015/6/2
!3
GAC010 AE2
Louis
ID
10. Calculate the Pearson measure of Skewness (modal version), and the Quartile measure of skewness.
Are they good indicators of skew in the data? Give reasons for your answer.
Mode=167cm
Pearson Skewness=!
µ − Mo 169.9 − 167
=
≈ 0.446 σ 6.5
Quartile Skewness=!
(Q1 − X) + (Q3 − X) (167 − 170) + (174 − 170)
=
≈ 0.143
IQR
7
They are good indicators. Both measures indicate a positive skew, though the histogram seems to be little negatively skewed.
11. Which of the three measures of central tendency (mean, median, mode) is the best choice for this data? Give a reason for your answer.
I think mean can be the best choice. Since the data has no extreme values, nor a large skew. And mean accounts for all datas.
12. What are (mean – 1 sd) and (mean + 1 sd)? What percentage of the data is between them?
?−? = 169.9−6.5 = 163.4cm
?+? = 169.9+6.5 = 176.4cm
There are 14 data between 163.4 and 166.4
The percentage of the data: 14/20=70%
13. What are (mean – 2 sd) and (mean +2 sd)? What percentage of the data is between them?
?−2?=169.9-13= 156.9cm
?+2?=169.9+13= 182.9cm
There are 20 data between 156.9 and 182.9
The percentage of the data: 20/20=100%
14. What are (mean – 3 sd) and (mean + 3 sd)? What percentage of the data is between them?
?−3?=169.9−19.5= 150.4cm
?+3?=169.9+19.5= 189.4cm
There are 20 data
The percentage of the data: 20/20=100%
2015/6/2
!4
GAC010 AE2
Louis
ID
15. Find the value in the data that is closest to z = 0.5.
X−µ σ !Z =
X close to 0.5*6.5+169.9=173.15
So X= 173
16. What percentage of the data are between z = –0.5 and 0.5?
!Z =
X−µ σ X1=Z?+? = -0.5*6.5+169.9= 166.65cm
X2=Z?+? = 0.5*6.5+169.9=173.15cm
The percentage of the data:
9/20=0.45=45%
17. The middle 50% of the data covers Q1 to Q3
So 167 ≤X≤ 174
! Z1 =
X − µ 167 − 169.9
=
≈ −0.45 σ 6.5
! Z2 =
X − µ 174 − 169.9
=
≈ 0.63 σ 6.5
Z range = 0.63-(-0.45)=1.08
18.If you chose a person from this sample randomly, what is the probability that they would be taller than
1 sd above the mean?
?=169.9
?=6.5
So, ?+?=169.9+6.5=176.4 x≥176.4 ??→ 178,180,182.
P (x ≥176.4) = 3 / 20 × 100% = 15%
2015/6/2
!5
You May Also Find These Documents Helpful
-
The Second will be the size chart. This will measure tendency, variation, mean, median and mode.…
- 481 Words
- 2 Pages
Satisfactory Essays -
For both data sets the distributions are not normal (p < 0.05). The data is skewed to the right.…
- 1250 Words
- 49 Pages
Good Essays -
The data for credit balance shows that it has a normal distribution with no skew. The Mean (3970), Median (4090), and Mode (3890) are all within very close proximity and the Mean is at the peak of the chart. All other data points fall nearly equal on either side of the Mean. The standard deviation is fairly high at 931.9.…
- 495 Words
- 2 Pages
Satisfactory Essays -
2 measurements were used in calculating the reported results. Median and Mean were both used in the reported results…
- 487 Words
- 2 Pages
Satisfactory Essays -
c) Construct a complete boxplot and determine if the data is skewed to the left, skewed to the right, or symmetric.…
- 759 Words
- 5 Pages
Satisfactory Essays -
In this activity you will collect data and then perform statistical analyses to determine measures of central tendency and variation of the data. You will also represent the data using a histogram.…
- 1129 Words
- 8 Pages
Good Essays -
The histogram above shows the Credit Balance variable of the 50 customers surveyed. The histogram is almost symmetrical with one outlier which is the credit balance of $2,000. While it being symmetrical you can almost fold the y-axis in half to have it look the same. While observing the histogram, its skewed to the left because of the outlier, and the skew is -.015043. Using the Anderson-Darling Normality Test, the P-value for Credit Balance is 0.400, and A^2 is 0.38. Throughout the mean, median, and Standard Deviation there is a 95% confidence interval as well.…
- 830 Words
- 4 Pages
Good Essays -
Which statement describes the data? [1] median = mode [3] mean < mode [2] median = mean [4] mean = mode…
- 1058 Words
- 5 Pages
Good Essays -
What is seen in the stem and leaf plot is that most of the data is shown on the lower end within the 0 stem. With this information the distribution is right-skewed.…
- 549 Words
- 3 Pages
Satisfactory Essays -
5. Under what circumstances is the median or mode a better measure of central tendency than the mean? Explain your response.…
- 569 Words
- 3 Pages
Satisfactory Essays -
Can mean, median, or mode be calculated from all statistical data? Explain why or why not. When is the mean the best measure of central tendency? When is the median the best measure of central tendency?…
- 413 Words
- 2 Pages
Satisfactory Essays -
Answer Key: BFeedback: When the mean is greater than the median, that can indicate that the distribution is skewed to the right. (See the discussion on page 125.) The presence of outliers will also indicate skewness. Using the procedure to identify outliers on page 160, the IQR is 4.4. There are two data values that are greater than Q3 + 1.5*IQR, indicating the distribution is positively skewed.…
- 2019 Words
- 9 Pages
Powerful Essays -
If you estimate the percentiles, then the quartiles and the percentiles correspond. If you calculate the percentiles, Arnold is at the 50th percentile, with a scored of 17.Ingrid is at the 25th percentile, with a score of 11 and Brooke is at the 75rd percentile with a score of 25. The median of the data is 16.5, and the values for the lower and upper quartiles are 10.5 and 23.5.…
- 278 Words
- 2 Pages
Satisfactory Essays -
Typically, there are two general types of statistic that are used to describe data. Measures of central tendency are ways of describing the central position of a frequency distribution for a group of data. Measures of spread are ways of summarizing a group of data by describing how spread of the scores. Week 4’s assignment was to…
- 415 Words
- 3 Pages
Satisfactory Essays -
Calculate and interpret the following for the data in Column K, “The degree of agreement among patrons that Remington’s has large portions,” on the Remington Data worksheet of the Remington’s Data Set workbook:…
- 248 Words
- 2 Pages
Satisfactory Essays