DOC PREVIEW
UW-Madison STAT 371 - STAT 371 Lecture Notes

This preview shows page 1 out of 3 pages.

Save
View full document
Premium Document
Do you want full access? Go Premium and unlock all 3 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

Example cont m mean thymus m 1 31 72 s sd thymus s 1 8 72909 n length thymus n 1 5 se s sqrt n se 1 3 903767 The sample standard deviation is an estimate of how far individual values differ from the population mean The standard error is an estimate of how far sample means from samples of size n differ from the population mean Statistics 371 Fall 2004 4 Confidence intervals Statistical Estimation 5 Derivation of a Confidence Interval From the sampling distribution of Y we have the following statement 0 9 Pr z Y z n n if we let z 1 645 because the area between 1 645 and 1 645 under a standard normal curve is 0 9 Different choices of z work for different confidence levels The first inequality is equivalent to Y z n and the second is equivalent to Y z n which are put together to give Pr Y z Y z n n Statistics 371 Fall 2004 Typical Problem Bret Larget Statistics 371 Fall 2004 Statistical inference is inference about unknown aspects of a population based on treating the observed data as the realization of a random process We focus in this course on inference in the setting of random samples from populations Statistical estimation is a form of statistical inference in which we use the data to estimate a feature of the population and to assess the precision the estimate Chapter 6 introduces these ideas in the setting of estimating a population mean Department of Statistics October 7 2004 We know that the sample mean y is likely to be close within a few multiples of n to the population mean Thus the unknown population mean is likely to be close to the observed sample mean y We can express a confidence interval by centering an interval around the observed sample mean y those are the possible values of that would be most likely to produce a sample mean y University of Wisconsin Madison The basic idea of a confidence interval for is as follows 6 Derivation of a Confidence Interval 1 The following data set are the weights mg of thymus glands from five chick embryos after 14 days of incubation The data was collected as part of a study on development of the thymus gland thymus 1 29 6 21 5 28 0 34 6 44 9 If we model this data as having been sampled at random from a population of chick embryos with similar conditions what can we say about the population mean weight 0 9 Statistics 371 Fall 2004 Confidence Intervals Statistics 371 Fall 2004 Here is some R code to compute the mean standard deviation and standard error for the example data Statistics 371 Fall 2004 2 Standard Error of the Mean This recipe for a confidence interval is then Y z n We know that SD of the sampling distribution of the sample mean y can be computed by this formula Y n This depends on knowing If we don t know as is usually the case we could use s as an alternative However the probability statement is then no longer true We need to use a different multiplier to account for the extra uncertainty This multiplier comes from the t distribution But if we only observe sample data y1 yn we do not know the value of the population SD so we cannot use the formula directly However we can compute the sample standard deviation s which is an estimate of the population standard deviation The expression s SEY n is called the standard error of the sample mean and is an estimate of the standard deviation of the sampling distribution of the sample mean You can understand why statisticians gave this concept a shorter name Statistics 371 Fall 2004 6 Statistics 371 Fall 2004 3 Mechanics of a confidence interval Sampling Distributions Notice that these multipliers 2 132 and 2 776 are each greater than the corresponding z multipliers 1 645 and 1 96 y Z n Had the sample size been 50 instead of 5 the t multipiers 1 677 and 2 01 would still be larger than the corresponding z but by a much smaller amount y T s n If the population is normal the statistic Z has a standard normal distribution If the population is not normal but n is sufficiently large the statistic Z has approximately a standard normal distribution by the Central Limit Theorem The distribution of the statistic T is more variable than that of Z because there is extra randomness in the denominator The extra randomness becomes small as the sample size n increases Statistics 371 Fall 2004 10 Interpretation of a confidence interval Statistics 371 Fall 2004 Student s t Distribution In our real data example we would interpret the 90 confidence interval as follows If Y1 Yn are a random sample from any normal distribution and if Y and S are the sample mean and standard deviation respectively then the statistic We are 90 confident that the mean thymus weight mg of all similar chick embyos that had been incubated under similar conditions would be between 23 4 and 40 04 T Notice that the interpretation of a confidence interval states the confidence level states the parameter being estimated is in the context of the problem including units and describes the population It is generally good practice to round the margin of error to two significant figures and then round the estimate to the same precision Statistics 371 Fall 2004 7 11 Another Example Y n S is said to have a t distribution with n 1 degrees of freedom All t distributions are symmetric bell shaped distributions centered at 0 but their shapes are not quite the same as normal curves and they are spread out a more than the standard normal curve The spread is largest for small sample sizes As the sample size and degrees of freedom increases the t distributions become closer to the standard normal distribution The Table in the back cover of your textbook provides a few key quantiles for several different t distributions Statistics 371 Fall 2004 8 The t Distributions in R The diameter of a wheat plant is an important trait because it is related to stem breakage which affects harvest The stem diameters mm of a sample of eight soft red winter wheat plants taken three weeks after flowering are below The functions pt and qt find areas and quantiles of t distributions in R The area to the right of 2 13 under a t distribution with 4 degrees of freedom is 1 pt 2 27 4 1 0 04286382 2 32 62 42 22 32 51 92 0 The mean and standard deviation are y 2 275 and s 0 238 To find the 95th percentile of the t distribution with four degrees of freedom you could do the following qt 0 95 df 4 1 2 131847 a Find a 95 confidence interval for the population mean This R code cecks the values of the 0 05 upper tail probability for the first several …


View Full Document

UW-Madison STAT 371 - STAT 371 Lecture Notes

Documents in this Course
HW 4

HW 4

4 pages

NOTES 7

NOTES 7

19 pages

Ch. 6

Ch. 6

24 pages

Ch. 4

Ch. 4

10 pages

Ch. 3

Ch. 3

20 pages

Ch. 2

Ch. 2

28 pages

Ch. 1

Ch. 1

24 pages

Ch. 20

Ch. 20

26 pages

Ch. 19

Ch. 19

18 pages

Ch. 18

Ch. 18

26 pages

Ch. 17

Ch. 17

44 pages

Ch. 16

Ch. 16

38 pages

Ch. 15

Ch. 15

34 pages

Ch. 14

Ch. 14

16 pages

Ch. 13

Ch. 13

16 pages

Ch. 12

Ch. 12

38 pages

Ch. 11

Ch. 11

28 pages

Ch. 10

Ch. 10

40 pages

Ch. 9

Ch. 9

20 pages

Ch. 8

Ch. 8

26 pages

Ch. 7

Ch. 7

26 pages

Load more
Download STAT 371 Lecture Notes
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view STAT 371 Lecture Notes and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view STAT 371 Lecture Notes and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?