Submitted by: Garima Agrawal (Section D)
(Student name or group name) Group Member Name | PG ID | Garima Agrawal | 61410506 |
Question1: The data for home values has a considerable wide range (429578) as compared to the inter-quartile range (93522). This means the data has a huge spread and the same can be verified from coefficient of variation which is even more than 41%. Besides, as can be seen from graphical plot and the positive skewness (0.87) measure, the data is skewed towards right. Also, the outliers present towards the right end indicate the presence of few extremely high valued houses, due to which average price of houses is higher than the median price. The highest density of data is present in two lower quartiles, as can be seen from box plot. This shows that low valued houses are present in bulk, and thus must available in the market easily. | |
Question 2: Though normal distribution model is not an absolutely apt for the data set of prices, the data can still be analyzed by assuming normality owing to the fact that data points hover around the diagonal line of normal Quantile plot. Some data points also cross the permissible range, but the density of data (high in the middle, and low at the ends ) allows for the usage of normal distribution model.The same can be verified from the measure of Kurtosis (0.7) which is well in permissible range for usage of normal distribution model. | |
Question 3:
MEAN = 164K; STANDARD DEVIATION = 68K A.Z1 (@ x as 92.8K) = (92.8 – 164)/68 = -1.04Z2 (@ x as 255.5K) = (255.5 – 164)/68 = 1.34P(Z1 < Z < Z2) = 0.9099 – 0.1492 = 0.7607Percentage probability is 76.07, which seems to be more than the actual value, basis what can be seen via boxplot. | B.Z1 (@ x as 232K) = (232 – 164)/68 = 1P( Z < Z1) = 0.8413Percentage probability is 84.13, which is consistent with what can be seen via data distribution. | C.Prob (Z < Z1) = 0.75Z1 = 0.6745Price, X1 = (68)*(0.6745) + 164 = 209.86So, Theoretical value of house at 75th quartile is 209.9K as compared to the actual value of 205.3K. |
Question 4: On the analysis of data set, histogram clearly depicts the presence of white spaces in the data, which are values of living area unavailable in the market. Same is not directly evident from box plot.And the box plot on the other hand aptly shows the position of median and quartiles instantly. As can be seen from histogram as well as box plot, the data set is skewed towards right. The same can be verified from the measure of Skewness (0.807) which being positive indicates a right skewed data set. | |
Question 5:
On the analysis of data set, histograms clearly depict that original variable is a better fit for normal distribution variable as compared to the logarithmic one. The value of kurtosis for the new variable is -0.47 as compared to that of original at 0.392. Closer to zero is the value of Kurtosis, more is the normality in data.In my opinion, this change in normality is due to the fact that logarithm has scaled down the range and thus increased the number of bars relatively. This has caused data to deviate a little from normality. | Plot for living area | Plot for Log(Living Area) |
You May Also Find These Documents Helpful
-
P(-2.25 < z < 1.25) = F(1.25) - (1 - F(2.25)) = 0.89435 - (1 - 0.987776) = 0.882126…
- 761 Words
- 4 Pages
Satisfactory Essays -
The data for credit balance shows that it has a normal distribution with no skew. The Mean (3970), Median (4090), and Mode (3890) are all within very close proximity and the Mean is at the peak of the chart. All other data points fall nearly equal on either side of the Mean. The standard deviation is fairly high at 931.9.…
- 495 Words
- 2 Pages
Satisfactory Essays -
of jar fills is normally distributed, what percentage of jar fills will be (i) greater than 202.5…
- 333 Words
- 2 Pages
Satisfactory Essays -
What is the probability that a Type I error will be made for z > 2.575?…
- 2853 Words
- 9 Pages
Satisfactory Essays -
They were losing much less money on their net income but the main point is they were still losing a lot of money. With the cash flow they were also decreasing every year that they were operating. Also looking at the data they were not able to pay any of the debt that they owed until 2008.…
- 420 Words
- 2 Pages
Good Essays -
Find the z-score for having area 0.07 to its right under the standard normal curve, that is, find [pic].…
- 1192 Words
- 5 Pages
Good Essays -
In the world of real estate, location sometimes determines the future of a property, residential or non-residential. Revere Street, a residential property located in the heart of downtown Boston, Massachusetts, provides a wonderful location for residents who also like to enjoy the convenience and diversity the city could offer. Revere Street shares Boston's unique New England historical city view and modernized financial district landscape. The pin point with the letter A on the map below shows a geographic center of Revere Street inside the City of Boston.…
- 742 Words
- 4 Pages
Powerful Essays -
A histogram shows the distribution of data within the Income. In this Histogram graph of Income, it shows that the graph is not symmetrical. This histogram graph has a wider bell shape form. The graph shows that this graph is more like two graph because there is a clear difference between income generating from 20-40 and from 50-above. There are two separated cluster; therefore, the skewness of this graph is skewed right. Income has a lower value of kurtosis which indicates a lower, less distinct peak. The following table shows the numerical summary of Income:…
- 1166 Words
- 5 Pages
Good Essays -
* Finally, we will change the probability of a success to ¾. In column C4 enter the words ‘three fourths’ as the variable name. Again, use similar steps to that given above in order to calculate the probabilities for this column. The only difference is in Event probability: use 0.75.…
- 813 Words
- 4 Pages
Powerful Essays -
ex: probability of getting between 270 and 310 successes inclusive = 269.5 < x 157.5…
- 278 Words
- 3 Pages
Good Essays -
Y ~ N(98.2 , 0.62 / root(50) ) P(Y < 97.98) = P(Z < (97.98 - 98.2) / 0.0876...) = P(Z < -2.509...) = 1 - phi(2.509...) = 1 - 0.99395 = 0.0060 = 6% correct…
- 401 Words
- 2 Pages
Satisfactory Essays -
0.003 x 100 / 0.040 = 7.5... ≈ ±8% ---------------------> 1.5% + 8% = ±9.5%…
- 638 Words
- 3 Pages
Powerful Essays -
Assuming that the distribution is normal for weight relative to the ideal and 99% of the male participants scored between (-53.68,64.64), Where did 95% of the values for weight relative to the ideal lie? Round your answer to two decimal places.…
- 496 Words
- 3 Pages
Satisfactory Essays -
Percent error=> +/- 0.51% Percent error= | (theoretical-experimental) / (theoretical) |*100 Percent error= | (195-196) / (195) |*100 Percent error=> +/- 0.51% Percent error= | (theoretical-experimental) / (theoretical)…
- 470 Words
- 9 Pages
Powerful Essays -
Probability 0.1746 0.15 0.10 0.05 0.00 0 P(X=5)=0.175 5 X 10 What is the probability that Mary will get a score of no more than 35% on this exam? n=20, p=0.20 (0.35)(20)=7 P(X≤7) Distribution Plot Binomial, n=20, p=0.2 0.25 0.9679 Probability 0.20 0.15 0.10 0.05 0.00 X P(X≤7)=0.968 7 10 The following information is available on the number of calls received at the telephone switchboard of…
- 404 Words
- 2 Pages
Good Essays