Preview

What is Bootstrapping (In Statistics)?

Good Essays
Open Document
Open Document
406 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
What is Bootstrapping (In Statistics)?
What is Bootstrapping (In Statistics)?
Bootstrapping is an interesting process or technique of assigning measure of accuracy. Depended upon calculation, Bootstrapping can be used to any statistic to measure estimation.

Definition
According to the Cambridge dictionary of statistics – “A confidence interval is a range of values, calculated from the sample observations that are believed, with a particular probability, to contain the true parameter value. A 95% confidence interval, for example, implies that were the estimation process repeated again and again, then 95% of the calculated intervals would be expected to contain the true parameter value.”

Reason for the name
The term has been derived from the expression "lifting oneself up by one's own bootstraps". This refers to raising oneself up by one's own means. This is also called bootstrap funding.

Example
Let’s understand this with an interested example of a candy factory. Knowing that the candy bars have mean weight. Since it is not feasible to weigh each candy that is produced in the factory, we use sampling techniques and randomly choose 100 candies. We calculate the mean of these 100 candies, and say that the population mean falls under a margin of error.
Suppose that after a couple of months, we want to know the mean candy weight on the day that we sampled the production line (with greater accuracy - or less of a margin of error).
We simply cannot use candies being produced today, as too many elements are now being used like; sugar, different flavours of milk, cocoa beans, different atmospheric conditions, different employees etc. All that we have from the day that we are curious about are the 100 weights.

When to use bootstrapping?
We can simply explain bootstrapping taking simple random sample from the observed sample of same size but with a replacement. This would mean each observation has equal probability of selecting in the sample.
We can use bootstrapping for inference and

You May Also Find These Documents Helpful

  • Satisfactory Essays

    M N M Project 5

    • 612 Words
    • 3 Pages

    The sampling method is the scientific procedure of selecting those sampling units which would provide the required estimates with associated margins of uncertainty, arising from examining only a part and not the whole. There are several types of sampling, however for this particular project we used random sampling in which each item of the data) has the same probability of being selected in the sample. In order to achieve true raw data, three bags of 1.69oz M & M candies were purchased from three different stores. The data from these bags were collected and placed into an excel worksheet to create a single data set (see below). This information was then entered into an excel spreadsheet and separated by individual bags.…

    • 612 Words
    • 3 Pages
    Satisfactory Essays
  • Good Essays

    QNT 561 Final Exam

    • 697 Words
    • 5 Pages

    11. We wish to develop a confidence interval for the population mean. The population follows the normal distribution, the standard deviation of the population is 3, and we have a sample of 10 observations. We decide to use the 90 percent level of confidence. The appropriate value of to represent the level of confidence is…

    • 697 Words
    • 5 Pages
    Good Essays
  • Satisfactory Essays

    3. Question : In a manufacturing process a random sample of 36 bolts manufactured has a mean length of 3 inches with a standard deviation of .3 inches. What is the 99% confidence interval for the true mean length of the bolt?…

    • 904 Words
    • 4 Pages
    Satisfactory Essays
  • Satisfactory Essays

    · Discuss the confidence intervals and their usefulness based on the number of data points.…

    • 468 Words
    • 3 Pages
    Satisfactory Essays
  • Powerful Essays

    Nt1310 Unit 1 Assignment 2

    • 2615 Words
    • 11 Pages

    Eg- imagine that the mean IQ of UiTM students is 105.00. (If UiTM students are our population, then this mean is a parameter.) If we take a sample of 10 UiTM students and compute their mean IQ, it will probably not be exactly 105.00. Instead, let us say that it is 103.25. (This would be a statistic.) If we then take a second sample of 10 UiTM students and compute their mean IQ, again it will probably not be 105.00 and it probably will not be 103.25 (the mean of our first sample). Instead, it might be 106.87. If we keep doing this—say, 100 times—then we will probably end up with 100 different sample means, and it is very likely that none of them is exactly 105.00. This variability in the sample means is sampling error. (Note that the term “error” here does not mean that anyone has made a mistake. “Error” here just refers to…

    • 2615 Words
    • 11 Pages
    Powerful Essays
  • Better Essays

    Skittles Project

    • 1058 Words
    • 5 Pages

    Confidence Interval: A confidence interval is an indicator of a measurement's precision. It is also an indicator of how stable an estimate is, which is the measure of how close a measurement will be to the original estimate if an experiment is repeated.…

    • 1058 Words
    • 5 Pages
    Better Essays
  • Good Essays

    Law of Averages

    • 1479 Words
    • 6 Pages

    Margin of error as per the text is “it estimates the largest distance you would reasonably expect to see between sample average and population average”1. Based on the data provided above, with ME = $5 we can be confident that in the population from which…

    • 1479 Words
    • 6 Pages
    Good Essays
  • Good Essays

    Team Paper Scenario

    • 1139 Words
    • 5 Pages

    Based on the health plan survey with a sample of people who reported receiving a flu shot, the percentage of all people who received the flu vaccine is probably between 67% and 73%. However, as stated in this case scenario, only 61% of all eligible health plan enrollees actually received a flu shot. Therefore, a miscalculation has occurred in the survey because the actual percentage of people who received the flu vaccine did not fall between the ranges as predicted by the margin of error. Bennett, Briggs, and Triola (2009) stated, “The margin of error in a statistical study is used to describe the range of values or confidence interval, likely to contain the population parameter” (p. 1). From this definition, we can conclude that a value predicted by the margin of error is never absolute. Furthermore, it also indicates that the higher the percentage of the margin of error, the less credibility the survey supports.…

    • 1139 Words
    • 5 Pages
    Good Essays
  • Good Essays

    Siop Lesson Plan

    • 856 Words
    • 4 Pages

    CCSS.Math.Content.7.SP.A.2 Use data from a random sample to draw inferences about a population with an unknown characteristic of interest. Generate multiple samples (or simulated samples) of the same size to gauge the variation in estimates or predictions. For example, estimate the mean word length in a book by randomly sampling words from the book; predict the winner of a school election based on randomly sampled survey data. Gauge how far off the estimate or prediction might be.…

    • 856 Words
    • 4 Pages
    Good Essays
  • Good Essays

    If many random samples were taken, 95% of the confidence intervals produced would contain the actual…

    • 770 Words
    • 6 Pages
    Good Essays
  • Good Essays

    Random sampling eliminates bias in choosing a sample and allows control of variability. So once we see the magic words “randomly selected” and “margin of error,” do we know we have trustworthy information before us? It certainly beats voluntary response, but not always by as much as we might hope. Sampling in the real world is more complex and less reliable than choosing a Simple Random Sample (SRS) from a list of names in an exercise. Confidence statements do not reflect all of the sources of error that are present in practical sampling. Most sample surveys are afflicted by errors other than random sampling errors. These errors can introduce bias that makes a confidence statement meaningless. Good sampling technique includes the art of reducing all sources of error. Part of this art is the science of statistics, with its random samples and confidence statements. In practice, however, good statistics isn’t all there is to good sampling. Let’s look at sources of errors in sample surveys and at how samplers combat them.…

    • 528 Words
    • 3 Pages
    Good Essays
  • Good Essays

    The purpose of sampling is to study a part of a whole group been studied. According to Monette, Sullivan, & DeJong, 2011, some groups are just two big and sampling allows the study of a workable number of cases from the large group to derive findings that are relevant to all members of the group (Chapter 6, The Purpose of Sampling). One example of probability sample is Simple Random Sampling (SRS), such as trying to do a research project that calls for a national sample of 2,000 households. SRS is considered the basic sampling procedure on which statistical theory is based. (Monette, Sullivan, & DeJong, 2011). An example of nonprobability sample is Ronald Feldman and Timothy Caplinger (1977) that were interested in factors that bring about behavior changes…

    • 1100 Words
    • 4 Pages
    Good Essays
  • Good Essays

    The Bootstrapping Method

    • 346 Words
    • 2 Pages

    1) Put a random, undetermined amount of jellybeans in a symmetrical jar with no deformities.…

    • 346 Words
    • 2 Pages
    Good Essays
  • Good Essays

    (i) Selection Error: When we need to select a sample, we use a list of elements from which we draw a sample, and this list usually does not include many members of the target population. Usually it is not feasible to include every member of the target population in this list. This list of members of the population that is used to select a sample is called the sampling frame. Thus, the sampling frame that is used to select a sample may not be representative of the population. This may cause the sample results to be different from the population results. The error that occurs because the sampling frame is not representative of the population is called the selection error. If a sample is nonrandom (and, hence, non representative), the sample results may be quite different from the census results.…

    • 564 Words
    • 3 Pages
    Good Essays