Introduction to Inference

Slides:



Advertisements
Similar presentations
Introduction to Inference
Advertisements

AP Statistics: Section 10.1 A Confidence interval Basics.
Chapter 10: Estimating with Confidence
CONFIDENCE INTERVALS HONORS ADVANCED ALGEBRA PRESENTATION 1-9.
Chapter 10: Estimating with Confidence
10.1: Estimating with Confidence Statistical inference uses language of probability to express the strength of our conclusions 2 types of formal statistical.
ESTIMATING with confidence. Confidence INterval A confidence interval gives an estimated range of values which is likely to include an unknown population.
Chapter 8 Introduction to Inference Target Goal: I can calculate the confidence interval for a population Estimating with Confidence 8.1a h.w: pg 481:
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Statistics 101 Chapter 10. Section 10-1 We want to infer from the sample data some conclusion about a wider population that the sample represents. Inferential.
Statistical Inference: Making conclusions about the population from sample data.
10.1: Confidence Intervals – The Basics. Introduction Is caffeine dependence real? What proportion of college students engage in binge drinking? How do.
Confidence Intervals: The Basics BPS chapter 14 © 2006 W.H. Freeman and Company.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Estimating with Confidence Section 10.1 Confidence Intervals: The Basics.
BPS - 3rd Ed. Chapter 131 Confidence Intervals: The Basics.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
+ “Statisticians use a confidence interval to describe the amount of uncertainty associated with a sample estimate of a population parameter.”confidence.
1 Section 10.1 Estimating with Confidence AP Statistics January 2013.
Chapter 10: Introduction to Statistical Inference.
QUICK: Review of confidence intervals Inference: provides methods for drawing conclusions about a population from sample data. Confidence Intervals estimate.
Chapter 10: Confidence Intervals
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
CONFIDENCE INTERVALS: THE BASICS Unit 8 Lesson 1.
Ch 8 Estimating with Confidence 8.1: Confidence Intervals.
10.1 – Estimating with Confidence. Recall: The Law of Large Numbers says the sample mean from a large SRS will be close to the unknown population mean.
Warm Up- Class work Activity 10 Pg.534 -each person tosses the pin 50 times
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Christopher, Anna, and Casey
Inference: Conclusion with Confidence
CHAPTER 8 Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
CHAPTER 9 Sampling Distributions
Confidence Intervals with Means
Inference: Conclusion with Confidence
Week 10 Chapter 16. Confidence Intervals for Proportions
Chapter 8: Estimating with Confidence
Review of Chapter 10 Confidence Intervals
Ch. 8 Estimating with Confidence
Data Analysis and Statistical Software I ( ) Quarter: Autumn 02/03
Introduction to Inference
CHAPTER 14: Confidence Intervals The Basics
Daniela Stan Raicu School of CTI, DePaul University
CHAPTER 22: Inference about a Population Proportion
CHAPTER 8 Estimating with Confidence
Section 7.7 Introduction to Inference
EXAMPLE: The weight of a can of Coca Cola is supposed to have mean = 12 oz with std. dev.= 0.82 oz. The product is declared underweight if it weighs.
Chapter 10: Estimating with Confidence
Click the mouse button or press the Space Bar to display the answers.
Confidence Intervals Accelerated Math 3.
Chapter 8: Estimating with Confidence
Confidence Intervals with Proportions
Basic Practice of Statistics - 3rd Edition Introduction to Inference
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating With Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Daniela Stan Raicu School of CTI, DePaul University
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Confidence Intervals
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Pg. 637 # ) A.(18.9,25.1) We are 95% confident that the mean gain for all SAT math second-try scores to be between 18.9 and 25.1 points. 90%
2.) A. Incorrect, the prob. Is either 0 or 1, but we don’t know which.
Chapter 8: Estimating with Confidence
How Confident Are You?.
Presentation transcript:

Introduction to Inference

What is the goal in statistics? to infer from the sample data a conclusion about the population.

Statistical Inference Provides methods for drawing conclusions about a population from sample data. So what’s new about this? We can use probability to express the strength of our conclusions.

Confidence Intervals Have the form estimate ± margin of error Our guess for the value of the unknown parameter Shows how accurate we believe our guess is (also denoted as m)

Confidence Interval A level C confidence interval for a parameter has two parts: An interval calculated from the data, usually of the form estimate ± margin of error A confidence level C, which gives the probability that the interval will capture the true parameter value in repeated samples.

10.1 Estimating with Confidence Example 10.2, p. 537 Suppose you want to estimate the mean SAT Math score for the more than 350,000 high school seniors in California. Only about 49% of California students take the SAT. These self-selected seniors are planning to attend college and so are not representative of all California seniors. You know better than to make inferences about the population based on any sample data. At considerable effort and expense, you give the test to a simple random sample (SRS) of 500 California high school seniors. The mean for your sample is 𝑥 =461. What can you say about the mean score µ in the population of all 350,000 seniors?

Some things to notice: 𝑥 =461 Suppose you want to estimate the mean SAT Math score for the more than 350,000 high school seniors in California. Only about 49% of California students take the SAT. These self-selected seniors are planning to attend college and so are not representative of all California seniors. You know better than to make inferences about the population based on any sample data. At considerable effort and expense, you give the test to a simple random sample (SRS) of 500 California high school seniors. The mean for your sample is 𝑥 =461. What can you say about the mean score µ in the population of all 350,000 seniors? Some things to notice: We are not provided the population mean or standard deviation 𝑛=500 𝑥 =461  𝑛≥30 Follows CLT; approximately normal

≈4.5 𝑛=500 𝑥 =461 Let us suppose that 𝜎=100. Then, 𝜎 𝑥 = 100 500 Suppose you want to estimate the mean SAT Math score for the more than 350,000 high school seniors in California. Only about 49% of California students take the SAT. These self-selected seniors are planning to attend college and so are not representative of all California seniors. You know better than to make inferences about the population based on any sample data. At considerable effort and expense, you give the test to a simple random sample (SRS) of 500 California high school seniors. The mean for your sample is 𝑥 =461. What can you say about the mean score µ in the population of all 350,000 seniors? 𝑛=500 𝑥 =461 Let us suppose that 𝜎=100. Then, 𝜎 𝑥 = 100 500 We’ll learn how to deal with not being given the standard deviation later. ≈4.5

Suppose you want to estimate the mean SAT Math score for the more than 350,000 high school seniors in California. Only about 49% of California students take the SAT. These self-selected seniors are planning to attend college and so are not representative of all California seniors. You know better than to make inferences about the population based on any sample data. At considerable effort and expense, you give the test to a simple random sample (SRS) of 500 California high school seniors. The mean for your sample is 𝑥 =461. What can you say about the mean score µ in the population of all 350,000 seniors? 𝑛=500 𝑥 =461 𝜎=100 𝜎 𝑥 ≈4.5 What if we were to collect data on more SRSs of 500?

Suppose you want to estimate the mean SAT Math score for the more than 350,000 high school seniors in California. Only about 49% of California students take the SAT. These self-selected seniors are planning to attend college and so are not representative of all California seniors. You know better than to make inferences about the population based on any sample data. At considerable effort and expense, you give the test to a simple random sample (SRS) of 500 California high school seniors. The mean for your sample is 𝑥 =461. What can you say about the mean score µ in the population of all 350,000 seniors? 𝑛=500 𝑥 =461 𝜎=100 𝜎 𝑥 ≈4.5 What if we were to collect data on more SRSs of 500?

Recall the Empirical Rule: 68% is within 1 standard deviation of the mean 95% is within 2 standard deviations of the mean 99.7% is within 3 standard deviations of the mean

Statistical Confidence 𝑛=500 𝑥 =461 𝜎=100 𝜎 𝑥 ≈4.5 So in 95% of the samples, the unknown µ lies between 𝑥 +2 𝜎 𝑥 and 𝑥 −2 𝜎 𝑥 . 𝑥 +2 𝜎 𝑥 = 𝑥 −2 𝜎 𝑥 = 𝑥 +2 4.5 = 𝑥 +9 𝑥 −2 4.5 = 𝑥 −9

𝑛=500 𝑥 =461 𝜎=100 𝜎 𝑥 ≈4.5 What does this mean? 𝑛=500 𝑥 =461 𝜎=100 𝜎 𝑥 ≈4.5 What does this mean? In 95% of all samples, 𝑥 lies within ±9 of the unknown population mean µ. So µ also lies within ±9 of 𝑥 in those samples.

95% Confidence 𝑥 −9= 461−9= 452 𝑥 +9= 461+9= 470 Our sample of 500 California seniors gave 𝑥 =461. We say that we are 95% confident that the unknown mean SAT Math score for all California high school seniors lies between 𝑥 −9= 461−9= 452 𝑥 +9= 461+9= 470

Understanding the Grounds for Confidence 𝑥 −9= 461−9= 452 𝑥 +9= 461+9= 470 Two possibilities: The interval between452 and 470 contains the true µ. Our SRS was one of the few samples for which 𝑥 is not within 9 points of the true µ Only 5% of all samples give such inaccurate results.

Writing in context (Because you should!) We are 95% confident that the unknown µ lies between 452 and 470. This is called a 95% Confidence Interval. “We got these numbers by a method that gives correct results 95% of the time”

Homework P. 542: 10.1, 10.3, 10.4 a – d Due: Tuesday