Published by Augustine Dalton. Modified over 8 years ago.
1
Confidence Intervals
2
Point Estimate
• A specific numerical value used as an estimate of a parameter.
• The best point estimate of the population mean is the sample mean.
3
Properties of a Good Estimator
1. Unbiased
2. Consistent
3. Relatively efficient
4
Since there’s no way of knowing how good a point estimate is, statisticians generally prefer...
5
… an Interval Estimate.
• A range of values used to estimate the parameter.
• The interval may or may not contain the parameter (but typically does).
• The most common kind is the...
6
... Confidence Interval
• Confidence level = the probability that the interval estimate will contain the parameter.
• Common confidence levels: 90%, 95%, 98%, 99%
7
Confidence Intervals
• Tradeoff: for a fixed sample size, a greater confidence level requires a wider interval.
• Important z-values to remember:
90%: 1.645 (some use 1.65)
95%: 1.96
98%: 2.33
99%: 2.575 (some use 2.58)
8
Confidence Intervals
• The maximum error of estimate (E) is the maximum likely difference between the point estimate of a parameter and the actual value of the parameter.
9
Confidence Intervals
• Four steps:
– Compute α = 1 – confidence level.
– Divide α by 2.
– Look up the associated z-value(s) for α/2 and 1 – α/2.
– Compute the confidence interval.
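The first three steps can be sketched in Python. This is a minimal illustration rather than the slides' own method: it assumes the standard library's `statistics.NormalDist` in place of a printed z table, and the function name `z_critical` is made up for this example.

```python
from statistics import NormalDist


def z_critical(confidence_level):
    """Return (alpha, lower z, upper z) for a two-sided interval."""
    alpha = 1 - confidence_level              # step 1: alpha = 1 - confidence level
    tail = alpha / 2                          # step 2: split alpha between the two tails
    std_normal = NormalDist()                 # standard normal: mean 0, sd 1
    z_lower = std_normal.inv_cdf(tail)        # step 3: z with area alpha/2 to its left
    z_upper = std_normal.inv_cdf(1 - tail)    # step 3: z with area 1 - alpha/2 to its left
    return alpha, z_lower, z_upper


for level in (0.90, 0.95, 0.98, 0.99):
    alpha, z_lo, z_hi = z_critical(level)
    print(f"{level:.0%}: alpha = {alpha:.2f}, z = {z_lo:.3f} and {z_hi:.3f}")
```

The computed values agree with the z-values listed earlier up to table rounding (for example, about 2.326 where the table uses 2.33). Step 4 depends on which parameter is being estimated.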
10
NOTATION FOR PROPORTIONS
p = population proportion
$\hat{p} = x/n$ = sample proportion of x successes in a sample of size n.
$\hat{q} = 1 - \hat{p}$ = sample proportion of failures in a sample of size n.
11
MARGIN OF ERROR OF THE ESTIMATE FOR p
$E = z_{\alpha/2}\sqrt{\dfrac{\hat{p}\,\hat{q}}{n}}$
NOTE: n is the size of the sample.
12
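CONFIDENCE INTERVAL FOR THE POPULATION PROPORTION p
$\hat{p} - E < p < \hat{p} + E$, where $E = z_{\alpha/2}\sqrt{\hat{p}\,\hat{q}/n}$
The confidence interval is often expressed in the following equivalent formats:
$\hat{p} \pm E$ or $(\hat{p} - E,\ \hat{p} + E)$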
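A short Python sketch of this interval, assuming the usual large-sample margin of error E = z·sqrt(p̂q̂/n). The function name and the counts (120 successes in 400 trials) are hypothetical, chosen only for illustration.

```python
from math import sqrt
from statistics import NormalDist


def proportion_interval(x, n, confidence_level=0.95):
    """Confidence interval for a population proportion p."""
    p_hat = x / n                              # sample proportion of successes
    q_hat = 1 - p_hat                          # sample proportion of failures
    # Critical value z_{alpha/2}: area 1 - alpha/2 lies to its left.
    z = NormalDist().inv_cdf(1 - (1 - confidence_level) / 2)
    margin = z * sqrt(p_hat * q_hat / n)       # margin of error E
    return p_hat - margin, p_hat + margin, margin


# Hypothetical data: 120 successes in a sample of 400.
low, high, margin = proportion_interval(x=120, n=400)
print(f"{low:.4f} < p < {high:.4f}  (E = {margin:.4f})")
```

The same result can be reported as p̂ ± E or as the pair (p̂ − E, p̂ + E).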
13
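SAMPLE SIZES FOR ESTIMATING A PROPORTION p
When an estimate $\hat{p}$ is known: $n = \dfrac{z_{\alpha/2}^{2}\,\hat{p}\,\hat{q}}{E^{2}}$
When no estimate $\hat{p}$ is known: $n = \dfrac{z_{\alpha/2}^{2}\cdot 0.25}{E^{2}}$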
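As a rough sketch, both cases can be handled by one Python function, assuming the standard formula n = z²·p̂q̂/E² and the conservative value p̂q̂ = 0.25 when no estimate is available. The function name and example inputs are hypothetical; the result is rounded up because a sample size must be a whole number.

```python
from math import ceil
from statistics import NormalDist


def sample_size_proportion(margin, confidence_level=0.95, p_hat=None):
    """Smallest n that achieves the desired margin of error for a proportion."""
    z = NormalDist().inv_cdf(1 - (1 - confidence_level) / 2)
    # With no prior estimate, p_hat * q_hat is replaced by its maximum, 0.25.
    pq = 0.25 if p_hat is None else p_hat * (1 - p_hat)
    return ceil(z ** 2 * pq / margin ** 2)     # always round up


print(sample_size_proportion(0.03))              # no estimate of p available
print(sample_size_proportion(0.03, p_hat=0.30))  # prior estimate of 0.30
```

Using 0.25 gives the largest sample size any value of p̂ could require, so it is the safe choice when nothing is known about p.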
14
Confidence Intervals for the Mean
• When the population standard deviation or variance is known, the standard normal distribution can be used, depending on the sample size and the shape of the original distribution....
15
Confidence Intervals for the Mean
• When n ≤ 30, the original variable must be normally distributed.
• When n > 30, the distribution of sample means will be approximately normal even if the original distribution isn’t normal.
16
MARGIN OF ERROR FOR THE MEAN
The margin of error for the mean is the maximum likely difference between the sample mean $\bar{x}$ and the population mean µ, and is denoted by E. When the standard deviation, σ, for the population is known, the margin of error is given by
$E = z_{\alpha/2}\,\dfrac{\sigma}{\sqrt{n}}$
where 1 − α is the desired confidence level.
17
CONFIDENCE INTERVAL ESTIMATE OF THE POPULATION MEAN μ (WITH σ KNOWN and n > 30)
$\bar{x} - E < \mu < \bar{x} + E$, where $E = z_{\alpha/2}\,\dfrac{\sigma}{\sqrt{n}}$
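A minimal Python sketch of this interval, assuming the margin of error E = z·σ/√n. The function name and the summary statistics (sample mean 50, σ = 8, n = 64) are hypothetical values used only to show the arithmetic.

```python
from math import sqrt
from statistics import NormalDist


def mean_interval_sigma_known(x_bar, sigma, n, confidence_level=0.95):
    """Confidence interval for the population mean when sigma is known."""
    z = NormalDist().inv_cdf(1 - (1 - confidence_level) / 2)
    margin = z * sigma / sqrt(n)               # margin of error E
    return x_bar - margin, x_bar + margin, margin


# Hypothetical summary statistics: x_bar = 50, sigma = 8, n = 64.
low, high, margin = mean_interval_sigma_known(x_bar=50, sigma=8, n=64)
print(f"{low:.2f} < mu < {high:.2f}  (E = {margin:.2f})")
```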
18
SAMPLE SIZE FOR ESTIMATING µ
$n = \left[\dfrac{z_{\alpha/2}\,\sigma}{E}\right]^{2}$
where
$z_{\alpha/2}$ = critical z score based on desired confidence level
E = desired margin of error
σ = population standard deviation
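The calculation can be sketched in Python, assuming the standard formula n = (z·σ/E)² and rounding up to the next whole number. The function name and inputs (σ = 15, E = 3) are hypothetical.

```python
from math import ceil
from statistics import NormalDist


def sample_size_mean(sigma, margin, confidence_level=0.95):
    """Smallest n that achieves the desired margin of error for a mean."""
    z = NormalDist().inv_cdf(1 - (1 - confidence_level) / 2)
    return ceil((z * sigma / margin) ** 2)     # always round up


# Hypothetical inputs: sigma = 15, desired margin of error E = 3.
print(sample_size_mean(sigma=15, margin=3))
```

Halving the desired margin of error roughly quadruples the required sample size, because E appears squared in the denominator.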
19
Confidence Intervals for the Mean
• To summarize, we use the standard normal distribution (z values from Table A-2) in these two main cases:
– σ is known and the original variable is normally distributed, OR
– σ is known and n > 30
20
Confidence Intervals for the Mean
Now, if σ is unknown, s can be substituted for σ and we use a new distribution... the (Student) t distribution.
21
PROPERTIES OF THE STUDENT t DISTRIBUTION The Student t distribution is different for different sample sizes (see Figure below for the cases n = 3 and n = 12).
22
Features of the t-distribution
• Bell-shaped
• Symmetrical about the mean
• The mean, median and mode are equal to 0 and located at the center.
• The curve never touches the x-axis.
23
Features of the t-distribution
• Variance is greater than 1 (it equals d.f./(d.f. – 2) when d.f. > 2).
• Actually a family of curves based on degrees of freedom (related to sample size)
• As d.f. increases, the t distribution approaches the standard normal distribution.
24
ASSUMPTIONS: σ NOT KNOWN
1. The sample is a simple random sample.
2. Either the sample is from a normally distributed population OR n > 30.
When σ is not known we will use the Student t Distribution.
25
THE STUDENT t DISTRIBUTION
If the distribution of a population is essentially normal, then the distribution of
$t = \dfrac{\bar{x} - \mu}{s/\sqrt{n}}$
is essentially a Student t distribution for all samples of size n, and is used to find critical values denoted by $t_{\alpha/2}$. The Student t distribution is often referred to as the t distribution.
26
Confidence Intervals
• The degrees of freedom are the number of values that are free to vary after a sample statistic has been computed.
d.f. = n – 1
27
Confidence Intervals
• Two steps:
– Use the appropriate confidence level and the appropriate degrees of freedom [d.f. = n – 1] to look up the associated t-values in the table (A-3).
– Compute the interval.
28
MARGIN OF ERROR ESTIMATE OF µ (WITH σ NOT KNOWN)
$E = t_{\alpha/2}\,\dfrac{s}{\sqrt{n}}$
where (1 − α) is the confidence level and $t_{\alpha/2}$ has n − 1 degrees of freedom.
NOTE: The values for $t_{\alpha/2}$ are found in Table A-3, which is found on page 606, inside the back cover, and on the Formulas and Tables card.
29
CONFIDENCE INTERVAL ESTIMATE OF THE POPULATION MEAN μ (WITH σ NOT KNOWN)
$\bar{x} - E < \mu < \bar{x} + E$, where $E = t_{\alpha/2}\,\dfrac{s}{\sqrt{n}}$
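A minimal Python sketch of the t interval, assuming the margin of error E = t·s/√n. Python's standard library has no t distribution, so the critical value is passed in as read from a t table; 2.201 is the usual table value for 95% confidence with 11 degrees of freedom. The function name and the sample values are hypothetical.

```python
from math import sqrt
from statistics import mean, stdev


def mean_interval_sigma_unknown(sample, t_crit):
    """Confidence interval for the mean using a t critical value from a table."""
    n = len(sample)
    x_bar = mean(sample)                       # sample mean
    s = stdev(sample)                          # sample standard deviation (n - 1 divisor)
    margin = t_crit * s / sqrt(n)              # margin of error E
    return x_bar - margin, x_bar + margin, margin


# Hypothetical sample of n = 12 measurements, so d.f. = 11.
sample = [9.8, 10.2, 10.4, 9.9, 10.0, 10.1, 9.7, 10.3, 10.0, 9.9, 10.2, 10.1]
low, high, margin = mean_interval_sigma_unknown(sample, t_crit=2.201)
print(f"{low:.3f} < mu < {high:.3f}  (E = {margin:.3f})")
```

Passing the critical value in keeps the example dependency-free; a statistics package could supply it instead of the table.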
30
CHOOSING THE APPROPRIATE DISTRIBUTION
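The conditions stated on the earlier slides (σ known or not, population normal or n > 30) can be collected into a small decision function. This is a hypothetical sketch of those rules only; the "neither" branch is an assumption for the case the slides do not cover, where neither condition on the population holds.

```python
def choose_distribution(sigma_known, population_normal, n):
    """Pick the distribution for a confidence interval for the mean."""
    # Both z and t require a normal population or a sample larger than 30.
    if not (population_normal or n > 30):
        return "neither"                       # assumption: z and t do not apply
    # Sigma known -> standard normal (z); sigma unknown -> Student t.
    return "z" if sigma_known else "t"


print(choose_distribution(sigma_known=True, population_normal=False, n=50))
print(choose_distribution(sigma_known=False, population_normal=True, n=10))
print(choose_distribution(sigma_known=False, population_normal=False, n=10))
```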
31
CI for a Standard Deviation
1. Given sample values, estimate the population standard deviation σ or the population variance σ².
2. Determine the sample size required to estimate a population standard deviation or variance.
COMMENT: Estimating standard deviations is very useful in areas such as quality control in a manufacturing process, because manufacturers want their products to be consistent.
32
ASSUMPTIONS
1. The sample is a simple random sample.
2. The population must have normally distributed values (even if the sample is large).
33
CHI-SQUARE DISTRIBUTION
To estimate a population variance we use the chi-square distribution.
$\chi^{2} = \dfrac{(n-1)\,s^{2}}{\sigma^{2}}$
where
n = sample size
s² = sample variance
σ² = population variance
34
PROPERTIES OF THE CHI-SQUARE DISTRIBUTION
1. The chi-square distribution is not symmetric, unlike the normal and Student t distributions. As the number of degrees of freedom increases, the distribution becomes more symmetric.
[Figure 7-8: Chi-Square Distribution]
[Figure 7-9: Chi-Square Distribution for df = 10 and df = 20]
35
PROPERTIES (CONTINUED)
2. The values of chi-square can be zero or positive, but they cannot be negative.
3. The chi-square distribution is different for each number of degrees of freedom, which is df = n – 1 for this CI. As the number of degrees of freedom increases, the chi-square distribution approaches the shape of a normal distribution.
36
CRITICAL VALUES
In Table A-4, each critical value of χ² corresponds to an area given in the top row of the table, and that area represents the cumulative area located to the right of the critical value in the body of the table.
NOTE: Since the chi-square distribution is not symmetric, the left critical value and the right critical value are not just opposites of each other.
37
ESTIMATORS OF σ² and σ
The sample variance s² is the best point estimate of the population variance σ².
The sample standard deviation s is the best point estimate of the population standard deviation σ.
38
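CONFIDENCE INTERVAL FOR POPULATION VARIANCE σ²
$\dfrac{(n-1)\,s^{2}}{\chi^{2}_{R}} < \sigma^{2} < \dfrac{(n-1)\,s^{2}}{\chi^{2}_{L}}$
where $\chi^{2}_{R}$ is the right-tail critical value and $\chi^{2}_{L}$ is the left-tail critical value.
CONFIDENCE INTERVAL FOR POPULATION STANDARD DEVIATION σ
$\sqrt{\dfrac{(n-1)\,s^{2}}{\chi^{2}_{R}}} < \sigma < \sqrt{\dfrac{(n-1)\,s^{2}}{\chi^{2}_{L}}}$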
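A short Python sketch of both intervals, assuming the standard form (n − 1)s²/χ²_R < σ² < (n − 1)s²/χ²_L. The critical values are passed in as read from a chi-square table; 2.700 and 19.023 are the usual table values for 95% confidence with 9 degrees of freedom. The function name and the sample statistics (s = 2.0, n = 10) are hypothetical.

```python
from math import sqrt


def variance_interval(s, n, chi2_left, chi2_right):
    """Confidence interval for sigma^2 from table critical values."""
    df = n - 1                                 # degrees of freedom
    # The larger (right-tail) critical value gives the LOWER limit.
    var_low = df * s ** 2 / chi2_right
    var_high = df * s ** 2 / chi2_left
    return var_low, var_high


# Hypothetical sample: s = 2.0, n = 10, so df = 9.
# 95% confidence: right-tail areas 0.975 (left value) and 0.025 (right value).
var_low, var_high = variance_interval(s=2.0, n=10, chi2_left=2.700, chi2_right=19.023)
print(f"{var_low:.3f} < sigma^2 < {var_high:.3f}")
print(f"{sqrt(var_low):.3f} < sigma < {sqrt(var_high):.3f}")
```

Because the chi-square distribution is not symmetric, the resulting interval is not centered on s².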
39
DETERMINING SAMPLE SIZE To determine sample size, use Table 7-2 on page 364.