t value and regression

SCHOOL OF MATHEMATICS, STATISTICS AND OPERATIONS RESEARCH
STAT 392

Tutorial – Ratio and Regression Estimation

1. Regression Estimation (from Lohr, Ex 3.6.4)
Foresters want to estimate the average age of trees in a stand. Determining age is cumbersome because one needs to count the tree rings on a core taken from the tree. In general, though, the older the tree, the larger the diameter, and diameter is easy to measure. The foresters measure the diameter of all 1132 trees and find that the population mean is 26.2 cm. They then randomly select 20 trees for age measurement.

Tree, k    Diameter, xk (cm)    Age, yk
   1            30.5              125
   2            29.0              119
   3            20.1               83
   4            22.9               85
   5            26.7               99
   6            20.1              117
   7            18.5               69
   8            25.9              133
   9            29.7              154
  10            28.7              168
  11            14.5               61
  12            20.3               80
  13            26.2              114
  14            30.5              147
  15            23.4              122
  16            21.6              106
  17            17.8               82
  18            27.2               88
  19            23.6               97
  20            20.8               99

(a) Treating the trees as a simple random sample, estimate the mean age of trees in the stand, $\bar{Y}$, with a variance estimate, 95% confidence interval, and RSE. Comment on the quality of the estimate.
(b) Draw a scatterplot of these data (make sure the x and y axes both start at zero). Fit a regression line $y = \alpha + \beta x + \varepsilon$ to the data, and draw it on the plot.
(c) Determine whether ratio estimation using diameter as the auxiliary variable would be beneficial.
[You will need to compute the correlation coefficient of x and y, and their respective coefficients of variation.]
(d) Make a ratio estimate of the mean age of trees in the stand, $\bar{Y}$, with a variance estimate, 95% confidence interval, and RSE. Comment on the quality of the estimate.
i. Fit the zero-intercept regression line $y = Rx + \varepsilon$ to the data.
ii. Add this line to your scatterplot.
iii. Estimate $\bar{Y}$ with $\hat{\bar{Y}}_R = \hat{R}\bar{X}$, where $\bar{X}$ is the population mean value of x.
iv. Compute the residuals $e_k = y_k - \hat{y}_k = y_k - \hat{R}x_k$.
v. Compute the variance of the residuals, $s_e^2$.
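
As a rough check on parts (a) and (c), the calculations can be sketched in Python. This is a minimal sketch, not the tutorial's official solution. It assumes the usual SRS variance estimate with finite population correction, $(1 - n/N)\,s_y^2/n$, a hard-coded t critical value of 2.093 (19 degrees of freedom), and the common rule of thumb that ratio estimation improves on the SRS mean when the correlation exceeds $CV(x) / (2\,CV(y))$. The helper names `mean` and `sample_var` are illustrative only.

```python
import math

# Sample from the table above: diameter x_k (cm) and age y_k for 20 trees.
x = [30.5, 29.0, 20.1, 22.9, 26.7, 20.1, 18.5, 25.9, 29.7, 28.7,
     14.5, 20.3, 26.2, 30.5, 23.4, 21.6, 17.8, 27.2, 23.6, 20.8]
y = [125, 119, 83, 85, 99, 117, 69, 133, 154, 168,
     61, 80, 114, 147, 122, 106, 82, 88, 97, 99]

N = 1132        # trees in the stand
n = len(y)      # sampled trees
T_CRIT = 2.093  # assumed: 97.5% point of the t distribution, n - 1 = 19 df


def mean(values):
    """Arithmetic mean of a list of numbers."""
    return sum(values) / len(values)


def sample_var(values):
    """Sample variance with divisor len(values) - 1."""
    m = mean(values)
    return sum((v - m) ** 2 for v in values) / (len(values) - 1)


# Part (a): SRS estimate of the mean age, with variance, 95% CI and RSE.
y_bar = mean(y)
var_y_bar = (1 - n / N) * sample_var(y) / n   # includes the fpc (1 - n/N)
se_y_bar = math.sqrt(var_y_bar)
ci_srs = (y_bar - T_CRIT * se_y_bar, y_bar + T_CRIT * se_y_bar)
rse_srs = se_y_bar / y_bar

# Part (c): correlation and coefficients of variation of x and y.
x_bar = mean(x)
s_xy = sum((a - x_bar) * (b - y_bar) for a, b in zip(x, y)) / (n - 1)
rho = s_xy / math.sqrt(sample_var(x) * sample_var(y))
cv_x = math.sqrt(sample_var(x)) / x_bar
cv_y = math.sqrt(sample_var(y)) / y_bar
ratio_helps = rho > cv_x / (2 * cv_y)   # rule-of-thumb criterion (assumed)

print(f"SRS mean = {y_bar:.1f}, SE = {se_y_bar:.2f}, RSE = {rse_srs:.1%}")
print(f"95% CI = ({ci_srs[0]:.1f}, {ci_srs[1]:.1f})")
print(f"rho = {rho:.3f}, CV(x) = {cv_x:.3f}, CV(y) = {cv_y:.3f}")
print("Ratio estimation expected to help:", ratio_helps)
```

The comparison in the last lines only indicates whether a ratio estimate is likely to have smaller variance than the SRS mean; the scatterplot from part (b) is still needed to judge whether a line through the origin is plausible.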

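Steps i to v of part (d) can be sketched in the same way. This sketch rests on assumptions that the handout does not confirm: it takes $\hat{R} = \bar{y}/\bar{x}$, the classical ratio estimator (an unweighted zero-intercept least-squares fit would instead give $\sum x_k y_k / \sum x_k^2$, a slightly different slope), and it estimates the variance as $(1 - n/N)\,s_e^2/n$ with $s_e^2$ computed using divisor $n - 1$. Some textbooks multiply this variance by $(\bar{X}/\bar{x})^2$; that factor is omitted here.

```python
import math

# Sample from the table above: diameter x_k (cm) and age y_k for 20 trees.
x = [30.5, 29.0, 20.1, 22.9, 26.7, 20.1, 18.5, 25.9, 29.7, 28.7,
     14.5, 20.3, 26.2, 30.5, 23.4, 21.6, 17.8, 27.2, 23.6, 20.8]
y = [125, 119, 83, 85, 99, 117, 69, 133, 154, 168,
     61, 80, 114, 147, 122, 106, 82, 88, 97, 99]

N = 1132        # trees in the stand
n = len(y)      # sampled trees
X_BAR = 26.2    # known population mean diameter (cm)
T_CRIT = 2.093  # assumed: 97.5% point of the t distribution, n - 1 = 19 df

x_bar = sum(x) / n
y_bar = sum(y) / n

# Steps i and iii: estimated ratio and ratio estimate of the mean age.
# Assumption: R_hat is the classical ratio estimator y_bar / x_bar.
R_hat = y_bar / x_bar
y_ratio = R_hat * X_BAR

# Step iv: residuals about the line through the origin, e_k = y_k - R_hat * x_k.
residuals = [yk - R_hat * xk for xk, yk in zip(x, y)]

# Step v: residual variance (divisor n - 1; the residuals sum to zero here).
s2_e = sum(e ** 2 for e in residuals) / (n - 1)

# Variance estimate, standard error, 95% CI and RSE of the ratio estimate.
var_ratio = (1 - n / N) * s2_e / n
se_ratio = math.sqrt(var_ratio)
ci_ratio = (y_ratio - T_CRIT * se_ratio, y_ratio + T_CRIT * se_ratio)
rse_ratio = se_ratio / y_ratio

print(f"R_hat = {R_hat:.4f}, ratio estimate = {y_ratio:.1f}")
print(f"s_e^2 = {s2_e:.1f}, SE = {se_ratio:.2f}, RSE = {rse_ratio:.1%}")
print(f"95% CI = ({ci_ratio[0]:.1f}, {ci_ratio[1]:.1f})")
```

Because the sampled trees have a mean diameter (23.9 cm) below the population mean (26.2 cm), the ratio estimate adjusts the sample mean age upward; comparing `se_ratio` with the SRS standard error shows how much precision the auxiliary variable buys under these assumptions.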