Sampling Distributions Welcome to inference!!!! Chapter 9.

Slides:



Advertisements
Similar presentations
Chapter 6 – Normal Probability Distributions
Advertisements

Chapter 8: Estimating with Confidence
Sampling Distributions and Sample Proportions
Reminders: Parameter – number that describes the population Statistic – number that is computed from the sample data Mean of population = µ Mean of sample.
Chapter 7 Introduction to Sampling Distributions
3.3 Toward Statistical Inference. What is statistical inference? Statistical inference is using a fact about a sample to estimate the truth about the.
Chapter 6 Introduction to Sampling Distributions
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Sampling Distributions
9.1 Sampling Distributions A parameter is a number that describes the population. A parameter is a fixed number, but in practice we do not know its value.
Chapter 10: Estimating with Confidence
CHAPTER 11: Sampling Distributions
UNIT FOUR/CHAPTER NINE “SAMPLING DISTRIBUTIONS”. (1) “Sampling Distribution of Sample Means” > When we take repeated samples and calculate from each one,
Section 9.3 Sample Means.
A P STATISTICS LESSON 9 – 1 ( DAY 1 ) SAMPLING DISTRIBUTIONS.
Chapter 7 Sampling Distributions
CHAPTER 11: Sampling Distributions
Chapter 5 Sampling Distributions
The Distribution of Sample Proportions Section
AP Statistics Chapter 9 Notes.
PARAMETRIC STATISTICAL INFERENCE
AP Statistics 9.3 Sample Means.
Chapter 9: Sampling Distributions “It has been proved beyond a shadow of a doubt that smoking is one of the leading causes of statistics.” Fletcher Knebel.
Chapter 18: Sampling Distribution Models
9.3: Sample Means.
CHAPTER 11: Sampling Distributions ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
CHAPTER 15: Sampling Distributions
Stat 1510: Sampling Distributions
AP Statistics Chapter 10 Notes. Confidence Interval Statistical Inference: Methods for drawing conclusions about a population based on sample data. Statistical.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a measure of the population. This value is typically unknown. (µ, σ, and now.
Section 6.3: How to See the Future Goal: To understand how sample means vary in repeated samples.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 7 Sampling Distributions 7.1 What Is A Sampling.
Sampling Distribution CHAPTER 18. Modeling the Distribution of Sample Proportions  Rather than showing real repeated samples, imagine what would happen.
Sampling Distributions & Sample Means Movie Clip.
Sampling Distributions: Suppose I randomly select 100 seniors in Anne Arundel County and record each one’s GPA
Sampling Distributions Chapter 18. Sampling Distributions If we could take every possible sample of the same size (n) from a population, we would create.
Population Distributions vs. Sampling Distributions There are actually three distinct distributions involved when we sample repeatedly andmeasure a variable.
Chapter 8: Estimating with Confidence
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Chapter 9 Day 2. Warm-up  If students picked numbers completely at random from the numbers 1 to 20, the proportion of times that the number 7 would be.
Chapter 9 Sampling Distributions 9.1 Sampling Distributions.
 A national opinion poll recently estimated that 44% (p-hat =.44) of all adults agree that parents of school-age children should be given vouchers good.
Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 5 Sampling Distributions
Chapter 5 Sampling Distributions
Chapter 5 Sampling Distributions
Click the mouse button or press the Space Bar to display the answers.
Sampling Distributions
What Is a Sampling Distribution?
Chapter 7: Sampling Distributions
Chapter 7 Sampling Distributions
Chapter 7: Sampling Distributions
CHAPTER 7 Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 9: Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 5 Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 5: Sampling Distributions
Presentation transcript:

Sampling Distributions Welcome to inference!!!! Chapter 9

Parameter  A Parameter is a number that describes the population.  A parameter always exists but in practice we rarely know it’s value b/c of the difficulty in creating a census.  We use Greek letters to describe them (likeμor σ). If we are talking about a percentage parameter, we use rho (ρ)  Ex: If we wanted to compare the IQ’s of all American and Asian males it would be impossible, but it’s important to realize that μ Americans and μ males exist.  Ex: If we were interested in whether there is a greater percentage of women who eat broccoli than men, we want to know whether ρ women > ρ men

Statistic  A statistic is a number that describes a sample. The value of a statistic can always be found when we take a sample. It’s important to realize that a statistic can change from sample to sample.  Statistics use variables like x bar, s, and phat (non greek).  We often use statistics to estimate an unknown parameter.  Ex: I take a random sample of 500 American males and find their IQ’s. We find that x bar =  I take a random sample of 200 women and find that 40 like broccoli. Then phat w =.2  IMPORTANT! A POPULATION NEEDS TO BE AT LEAST 10 TIMES AS BIG AS A SAMPLE TAKEN FROM IT. IF NOT, YOU NEED A SMALLER SAMPLE

Bias  We say something is biased if it is a poor predictor

Variability *Variability of population doesn’t change- (scoop example) size of scoop matters

How can we use samples to find parameters if they give us different results?  Imagine an archer shooting many arrows at a target: 4 situations can occur  a) consistent but off target. b) all over the place. tends to average a bulls eye but each result is far from center. c) worse than a as the archer is consistently missing high and to the right but nearly as consistently as situation a. d) is ideal- low bias and low variability.

Here’s an example:  Suppose our goal was to estimate μ american male.  We can’t take a census, so we take many samples. We find the average IQ of american males in each sample (x bar).  a) if our many samples of IQ’s are consistent but higher than the true average IQ of AM’s, then we have a situation with high bias and low variability  b) If our many samples are inconsistent- some high, some low than the true mean of AM’s Iqs, we have low bias and high variability.

 If our sample means are not close to each other but all higher than μ AM then we have high bias and high variability (c)  Finally, if our samples all just slightly higher or slightly lower than μ AM, we have our desired situation: low bias and low variability.

But…  You aren’t taking a bunch of samples…you’re only going to take 1! and we want it to predict μ am  If we used the data from situation d) then any of the samples would provide a good predictor for μ am  We already know some ways to get a good sample- using an SRS and being very sure to have no bias when choosing our sample.  Inference is using our sample statistic (assuming it’s a good sample) to predict our parameter with a certain degree of confidence.

The Sampling Distribution  The sampling distribution of a statistic is the distribution of means of all possible samples of the same size from the population.  When we sample, we sample with replacement.  A sampling distribution is a sample space- it describes everything that can happen when we sample. Cool demo

Central Limit Theorem (CLT)  As you take more and more SRS’s of the same size, the distribution of their means will get closer and closer to a normal curve centered around the true population mean…NO MATTER WHAT THE SHAPE OF THE PARENT POPULATION!! Why?!  The Sampling Distribution of means has a mean of μand a standard deviation of σ/√n** N(μ, σ/√n)

CLT summary  1. The mean of the population (what we want to find) will be the same as the mean of all your many samples.  2. The Standard Deviation of all your many samples will be the population standard deviation divided by √n (your sample size)  3. The histogram of the samples will appear normal (bell shaped).  4. The larger the sample size (n), the smaller the standard deviation will be and the more constricted the graph will be.

What if we’re talking about proportions?  Same thing except we use our proportion formulas for mean and standard deviation

Don’t forget our rule of 10!  Only use σ/√n or if the 10 times N (the number of people in our population) is ≥ n (the number of people in our sample)

Example  The true average study time for a final exam in history is found to be 6 hours and 25 minutes with a standard deviation of 1 hour and 45 minutes. Assume the distribution is normal. N(6.417, 1.75)  What is the probability that a student chosen at random spends more than 7 hours studying? Normalcdf(7,100,6.417,1.75) = 37%  What is the probability that an SRS of 4 students will average more than 7 hours in studying? Normalcdf(7,100,6.417,1.75/√4) = 25.3%.  Why did the probability go down?  A student to study more than 7 hours is not probable…a group of 4 to average more than 7 is less probable.

Example 2  The length of pregnancy from conception to birth varies normally with a mean of 266 days and a standard deviation of 16 days  N(266,16)  What is the probability that a woman chosen at random has a pregnancy lasting more than 270 days?  40.1%  What is the mean and standard deviation of my sampling distribution?  What is the probability that an SRS of 16 women have pregnancies averaging more than 270 days?