Lesson Introduction to the Practice of Statistics
Objectives Define statistics and statistical thinking Understand the process of statistics Distinguish between qualitative and quantitative variables Distinguish between discrete and continuous variables
Vocabulary Central Limit Theorem – the larger the sample size, the closer the sampling distribution for the sample mean from any underlying distribution approaches a Normal distribution Standard error of the mean – standard deviation of the sampling distribution of x-bar
Sample Mean, x̄ The behavior of x̄ in repeated sampling is much like that of the sample proportion, p-hat. Sample mean x̄ is an unbiased estimator of the population mean μ Spread is less than that of X. Standard deviation of x̄ is smaller than that of X by a factor of 1/√n
Sample Spread of x̄ If the random variable X has a normal distribution with a mean of 20 and a standard deviation of 12 –If we choose samples of size n = 4, then the sample mean will have a normal distribution with a mean of 20 and a standard deviation of 6 –If we choose samples of size n = 9, then the sample mean will have a normal distribution with a mean of 20 and a standard deviation of 4
Example 1 The height of all 3-year-old females is approximately normally distributed with μ = inches and σ = 3.17 inches. Compute the probability that a simple random sample of size n = 10 results in a sample mean greater than 40 inches. μ = σ = 3.17 n = 10 σ x = 3.17 / 10 = x - μ Z = σ x 40 – = = = normalcdf(1.277,E99) = normalcdf(40,E99,38.72,1.002) = a P(x-bar > 40)
Example 2 We’ve been told that the average weight of giraffes is 2400 pounds with a standard deviation of 300 pounds. We’ve measured 50 giraffes and found that the sample mean was 2600 pounds. Is our data consistent with what we’ve been told? μ = 2400 σ = 300 n = 50 σ x = 300 / 50 = x - μ Z = σ x 2600 – 2400 = = = normalcdf(4.714,E99) = normalcdf(2600,E99,2400, ) = a P(x-bar > 2600)
Example 3 Young women’s height is distributed as a N(64.5, 2.5), What is the probability that a randomly selected young woman is taller than 66.5 inches? μ = 64.5 σ = 2.5 n = 1 σ x = 2.5 / 1 = 2.5 !! x - μ Z = σ x 66.5 – 64.5 = = = 0.80 normalcdf(0.80,E99) = 1 – = normalcdf(66.5,E99,64.5,2.5) = a P(x > 66.5)
Example 4 Young women’s height is distributed as a N(64.5, 2.5), What is the probability that an SRS of 10 young women is greater than 66.5 inches? μ = 64.5 σ = 2.5 n = 1 σ x = 2.5 / 10 = 0.79 x - μ Z = σ x 66.5 – 64.5 = = = 2.53 normalcdf(2.53,E99) = 1 – = normalcdf(66.5,E99,64.5,2.5/√10) = a P(x > 66.5)
Central Limit Theorem Regardless of the shape of the population, the sampling distribution of x-bar becomes approximately normal as the sample size n increases. Caution: only applies to shape and not to the mean or standard deviation X or x-bar Distribution Population Distribution Random Samples Drawn from Population xxxxxxxxxxxxxxxx
Central Limit Theorem in Action n =1 n = 2 n = 10 n = 25
Example 5 The time a technician requires to perform preventive maintenance on an air conditioning unit is governed by the exponential distribution (similar to curve a from “in Action” slide). The mean time is μ = 1 hour and σ = 1 hour. Your company has a contract to maintain 70 of these units in an apartment building. In budgeting your technician’s time should you allow an average of 1.1 hours or 1.25 hours for each unit? μ = 1 σ = 1 n = 70 σ x = 1 / 70 = x - μ Z = σ x 1.1 – 1 = = = 0.83 normalcdf(0.83,E99) = 1 – = a P(x > 1.1) vs P(x > 1.25)
Example 5 cont The time a technician requires to perform preventive maintenance on an air conditioning unit is governed by the exponential distribution (similar to curve a from “in Action” slide). The mean time is μ = 1 hour and σ = 1 hour. Your company has a contract to maintain 70 of these units in an apartment building. In budgeting your technician’s time should you allow an average of 1.1 hours or 1.25 hours for each unit? μ = 1 σ = 1 n = 70 σ x = 1 / 70 = x - μ Z = σ x 1.25 – 1 = = = normalcdf(2.083,E99) = 1 – = a P(x > 1.25)
Shape, Center and Spread of Population Distribution of the Sample Means ShapeCenterSpread Normal with mean, μ and standard deviation, σ Regardless of sample size, n, distribution of x-bar is normal μ x-bar = μ σ σ x-bar = n Population is not normal with mean, μ and standard deviation, σ As sample size, n, increases, the distribution of x-bar becomes approximately normal μ x-bar = μ σ σ x-bar = n Summary of Distribution of x
Summary and Homework Summary –Take an SRS and use the sample proportion x̄ to estimate the unknown parameter μ –x̄ is an unbiased estimator of μ –Increase in sample size decreases the standard deviation of x̄ (by a factor of 1/√n) –If the population is normal, then so is x̄ –Central Limit Theorem: for large n, the sampling distribution of x̄ is approximately normal for any population (with a finite σ) Homework –Day 1: pg 595-6; –Day 2: pg 601-4; 9.35, 36, 38, 42-44