Preview

Statistics

Good Essays
Open Document
Open Document
1252 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Statistics
| Statistics 101 Report | The Kentucky Milk Case Study | | |

|

Preliminary Analysis
2a)

Figure 1: X as a Data Object
X is a data frame as derived from the program R shown above in Figure 1. There are 274 observations of 11 variables. The number of observations is obtained from the number of rows while the number of variables is obtained from the number of columns.
2b)

Figure 2: Creating a sub-data frame from X

Figure 3:Sub-data frame from X
Figure 2 shows a screenshot of the commands entered into R to create a sub-data frame X containing observations of the 7 selected variables. Figure 3 shows the sub-data frame created from the commands in R.
2c)

Figure 4: Missing values of WWBID
The variable WWBID has 36 missing values and the fraction of the number of missing values out of the total number of cases is1291.
2d)
The percentage of cases in the dataset which contains one or more missing value as calculated using R is 13.19% which can also be seen from Figure 4.

2e)

Figure 5: Bid price variable for markets Tri-County & Surround in 1984

Figure 6: Bid price variable for markets Tri-County & Surround in 1985

Figure 7: Bid price variable for markets Tri-County & Surround in 1986

Figure 8: Bid price variable for markets Tri-County & Surround in 1987

Figure 9: Bid price variable for markets Tri-County & Surround in 1988

Figures 5-9 above show the box plots obtained using R for Tri-County and Surround for the years 1984 to 1988. After examining these plots from the combined data for Meyer and Trauth, there is presence of potential outliers in all the years.
There is a presence of potential outliers in all 5 years in Surround. On the other hand, they are only present in years 1985 and 1988 for Tri-County.
For the potential outliers in Surround, they are all the maximum values of the bid price variable. However, Tri-County has a potential outlier in 1988 which is the minimum value of the bid price variable.

You May Also Find These Documents Helpful

  • Good Essays

    The mean household size of the customers is found to be 3.42. The median of the data is 3 and the…

    • 1935 Words
    • 8 Pages
    Good Essays
  • Satisfactory Essays

    Mgmt 305 Question Paper

    • 479 Words
    • 2 Pages

    ____6. Refer to Exhibit 2. What is the probability that a randomly selected computer will have a…

    • 479 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    According to our computation for measure of dispersion (see attached excel file for mega stat), we found out that there are low and high outliers that existed on one of the variable. Under the size of the house column, the low outlier shows 1 deviation and 3 deviations on high outlier. It means that there is variability in the measurement or possibly, an error on the data gathered.…

    • 416 Words
    • 2 Pages
    Good Essays
  • Satisfactory Essays

    Sas Assignment

    • 300 Words
    • 2 Pages

    UNSW SCHOOL OF MATHEMATICS MATH2871 DATA MANAGEMENT FOR STATISTICAL ANALYSIS ASSIGNMENT 1 - ANSWERS TOTAL MARKS 20 (10% of final grade) Number of Questions: 2…

    • 300 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    The above chart represents the national average as proved by the Real Estate data set. The data has been sorted from lowest to highest and divided by six to form the various groups. The descriptive statistics is shown in the table below.…

    • 680 Words
    • 3 Pages
    Good Essays
  • Good Essays

    (total amount of data adds up to 88, then divided this total by the number of data sets, 30. 88 divided by 30, equals 2.93, which I rounded up to 3.)…

    • 1433 Words
    • 6 Pages
    Good Essays
  • Good Essays

    CJA314WK1HW

    • 1013 Words
    • 3 Pages

    This paper will discuss two metropolitan areas with different data. Which area had more reported incidents will also be discussed with the rates of the crime for each area. This paper will also discuss the rates change over time in both areas and what factors might explain the differences in the rates.…

    • 1013 Words
    • 3 Pages
    Good Essays
  • Better Essays

    Pro Forma

    • 1365 Words
    • 7 Pages

    Graph 2-1 shows the total interest income of Orrstown from year 2006-2013. As it takes a longer time for small financial institutions to react on the financial crisis, the negative interest income of orrstown showed in year 2012 and 2013. Because there would be little chance of any big financial crisis in the following five years, it is reasonable to take outliers. Take the average of the annual growth rate of total interest income from 2006 to 2011, and use it as the annual growth rate of total interest income for the nest five years.…

    • 1365 Words
    • 7 Pages
    Better Essays
  • Good Essays

    higher rates of robberies, and I found out that Chicago had the highest over the two years of data…

    • 831 Words
    • 4 Pages
    Good Essays
  • Powerful Essays

    Intimate Partner Violence

    • 3914 Words
    • 16 Pages

    Counties,” Proceedings of the Section on Government Statistics, American Statistical Association, Alexandria, VA, 1995, pp. 154– 159.…

    • 3914 Words
    • 16 Pages
    Powerful Essays
  • Satisfactory Essays

    Trouble group aims to be the best in all aspects our company’s mission is to succeed in all of the following:…

    • 5696 Words
    • 23 Pages
    Satisfactory Essays
  • Powerful Essays

    Georgia Atlantic Company

    • 4286 Words
    • 18 Pages

    This case can be used in several ways. In the introductory course, the case can be used as the basic structure for a lecture or as a written assignment in conjunction with lecture and text material. In our more advanced courses, which usually have smaller enrollments, we have found additional uses which produced very satisfactory results. To encourage active class discussion of various aspects of dividend policy, we divide the class into groups and assign each group one or several of the positions with respect to dividend policy. Students then present oral arguments in favor of the particular policy they have been assigned and against the others. Students exhibit a good deal of creativity in developing reasons for following a policy they may or may not believe in.…

    • 4286 Words
    • 18 Pages
    Powerful Essays
  • Powerful Essays

    Paragon

    • 1544 Words
    • 7 Pages

    1(a) Paragon Electronics, INC. LIFO Purchases By Paragon Sold to Ending Inventory Cost of goods sold Year Units Unit cost($) Aero Inc Units Unit Cost($) Total($) Unit Unit Cost($) Total($) 1986 100 700 80 20 700 14,000 80 700 56,000 1987 100 800…

    • 1544 Words
    • 7 Pages
    Powerful Essays
  • Good Essays

    Comparing Data

    • 268 Words
    • 2 Pages

    When outliers are present, the median and IQR are used to measure center and spread because they are unaffected by extreme values. When the data appears to be symmetric and there are no known outliers, the mean and standard deviation (another measure of spread) are used.…

    • 268 Words
    • 2 Pages
    Good Essays
  • Powerful Essays

    Course Work

    • 3677 Words
    • 15 Pages

    Question 1: We would have the work on the data set phones.sav. Therefore, we hoped to glance at the information of the data at initial. After the reading, the data set consisted of the following 12 variables that all relates to the year 2010. They were shown in the variable name and the description in the below Table 1.001.…

    • 3677 Words
    • 15 Pages
    Powerful Essays