STAT 110 - Section 5 Lecture 7 Professor Hao Wang University of South Carolina Spring 2012 TexPoint fonts used in EMF. Read the TexPoint manual before.

Slides:



Advertisements
Similar presentations
+ Sampling and Surveys Inference for Sampling The purpose of a sample is to give us information about alarger population. The process of drawing conclusions.
Advertisements

STAT Section 5 Lecture 3 Professor Hao Wang University of South Carolina Spring 2012.
Sampling Methods.
Chapter 7: Data for Decisions Lesson Plan
* Students will be able to identify populations and samples. * Students will be able to analyze surveys to see if there is bias. * Students will be able.
CHAPTER 8: Producing Data: Sampling
Lesson Designing Samples. Knowledge Objectives Define population and sample. Explain how sampling differs from a census. Explain what is meant by.
Causes of Error in Sampling. Sampling Error Sampling error is error caused by the way you chose your sample – Volunteer Sampling & Convenience Sampling.
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
QBM117 Business Statistics Statistical Inference Sampling 1.
AP Statistics Chapter 5 Notes.
Why sample? Diversity in populations Practicality and cost.
Chapter 12 Sample Surveys
CHAPTER 4 Designing Studies
Section 5.1. Observational Study vs. Experiment  In an observational study, we observe individuals and measure variables of interest but do not attempt.
Copyright ©2005 Brooks/Cole, a division of Thomson Learning, Inc. How to Get a Good Sample Chapter 4.
AP Statistics!!! Test Review Sampling Error ◦ Occurs in the act of choosing the sample ◦Undercoverage – certain members of population are ‘left.
Chapter 5 Section 3 Part 1.  Often when we hear of a sample we do not know the truth behind the sampling process.  The whole truth about opinion polls.
From Sample to Population Often we want to understand the attitudes, beliefs, opinions or behaviour of some population, but only have data on a sample.
BPS - 5th Ed. Chapter 81 Producing Data: Sampling.
Producing Data: Sampling BPS - 5th Ed.Chapter 81.
CHAPTER 8 Producing Data: Sampling BPS - 5TH ED.CHAPTER 8 1.
Sample Surveys.  The first idea is to draw a sample. ◦ We’d like to know about an entire population of individuals, but examining all of them is usually.
Population We are almost always interested in knowledge about a population. We would have little interest in samples if we could always ask everyone.
Section 2 Part 2.   Population - entire group of people or items for which we are collecting data  Sample – selections of the population that is used.
Sampling is the other method of getting data, along with experimentation. It involves looking at a sample from a population with the hope of making inferences.
Chapter 12 Notes Surveys, Sampling, & Bias Examine a Part of the Whole: We’d like to know about an entire population of individuals, but examining all.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.1 Samples and Surveys.
Part III Gathering Data.
Chapter 7: Data for Decisions Lesson Plan Sampling Bad Sampling Methods Simple Random Samples Cautions About Sample Surveys Experiments Thinking About.
Section 5.1 Designing Samples Malboeuf AP Statistics, Section 5.1, Part 1 3 Observational vs. Experiment An observational study observes individuals.
Chapter 41 Sample Surveys in the Real World. Chapter 42 Thought Question 1 (from Seeing Through Statistics, 2nd Edition, by Jessica M. Utts, p. 14) Nicotine.
Other Probability Sampling Methods
CHAPTER 8: Producing Data Sampling ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Designing Samples Chapter 5 – Producing Data YMS – 5.1.
Lecture # 6:Designing samples or sample survey Important vocabulary Experimental Unit: An individual person,animal object on which the variables of interest.
BPS - 5th Ed. Chapter 81 Producing Data: Sampling.
C HAPTER 5: P RODUCING D ATA Section 5.1 – Designing Samples.
Section 5.1 Designing Samples AP Statistics
AP STATISTICS LESSON AP STATISTICS LESSON DESIGNING DATA.
AP STATISTICS Section 5.1 Designing Samples. Objective: To be able to identify and use different sampling techniques. Observational Study: individuals.
CONFIDENCE STATEMENT MARGIN OF ERROR CONFIDENCE INTERVAL 1.
Part III – Gathering Data
1 Chapter 2: Sampling and Surveys. 2 Random Sampling Exercise Choose a sample of n=5 from our class, noting the proportion of females in your sample.
Chapter 10 Sampling: Theories, Designs and Plans.
Chapter 5 Sampling: good and bad methods AP Standards Producing Data: IIB4.
I can identify the difference between the population and a sample I can name and describe sampling designs I can name and describe types of bias I can.
 An observational study observes individuals and measures variable of interest but does not attempt to influence the responses.  Often fails due to.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.1 Samples and Surveys.
Chapter 7 Data for Decisions. Population vs Sample A Population in a statistical study is the entire group of individuals about which we want information.
Chapter 12 Vocabulary. Matching: any attempt to force a sample to resemble specified attributed of the population Population Parameter: a numerically.
Chapter 5 Sampling and Surveys. Section 5.3 Sample Surveys in the Real World.
Status for AP Congrats! We are done with Part I of the Topic Outline for AP Statistics! (20%-30%) of the AP Test can be expected to cover topics from chapter.
Designing Studies In order to produce data that will truly answer the questions about a large group, the way a study is designed is important. 1)Decide.
1 Chapter 11 Understanding Randomness. 2 Why Random? What is it about chance outcomes being random that makes random selection seem fair? Two things:
Sample Surveys.  Population- all exp. units that you want to make a conclusion about  Sampling frame – list of individuals from whom the sample is drawn.
Chapter 4: Designing Studies... Sampling. Convenience Sample Voluntary Response Sample Simple Random Sample Stratified Random Sample Cluster Sample Convenience.
Plan for Today: Chapter 1: Where Do Data Come From? Chapter 2: Samples, Good and Bad Chapter 3: What Do Samples Tell US? Chapter 4: Sample Surveys in the.
Errors in Sampling Sampling Errors- Errors caused by the act of taking a sample. Makes sample results inaccurate. Random Sampling Error Errors caused by.
Introduction/ Section 5.1 Designing Samples.  We know how to describe data in various ways ◦ Visually, Numerically, etc  Now, we’ll focus on producing.
MATH Section 6.1. Sampling: Terms: Population – each element (or person) from the set of observations that can be made Sample – a subset of the.
DATA COLLECTION AND EXPERIMENTAL DESIGN SECTION 1.3 NOTES.
Chapter 3 What Do Samples Tell Us?.
Unit 5 Notebook Pages 31 -.
Chapter 5: Producing Data
MATH 2311 Section 6.1.
Chapter 5: Producing Data
Basic Practice of Statistics - 5th Edition Producing Data: Sampling
10/18/ B Samples and Surveys.
EQ: What is a “random sample”?
Presentation transcript:

STAT Section 5 Lecture 7 Professor Hao Wang University of South Carolina Spring 2012 TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAA

Last time: Picturing Bias and Variability

Last time: Margin of Error The CNN Poll interviewed 1000 people. The approval rating was 57%. What is the margin of error for 95% confidence (using the quick formula)? Answer: Recall 95% confidence

Margin of Error (continued)

Use MOE to calculate an interval that we think includes the parameter Form for most confidence intervals: Approximate (because we’re using the quick MOE) 95% confidence interval for p Confidence Interval

Confidence Statements A confidence statement interprets a confidence interval and has two parts: a margin of error and a level of confidence. Margin of error says how close the statistic lies to the parameter. Level of confidence says what percentage of all possible samples result in a confidence interval which contains the true parameter

Example: President Bush Pre 9/11: 57% with MOE 3% Post 9/11: 90% with MOE 3% Interpretations –We are 95% confident that the percent of all Americans who approve of the job president Bush was doing was between 54% and 60% before 9/11. –We are 95% confident that the percent of all Americans who approve of the job president Bush was doing was between 87% and 93% after 9/11.

Example: College Education This May 2011 survey finds that 57% of the 2142 adult Americans polled think that “the higher education system in the United States fails to provide students good value for the money they and their families spend”. Using the quick formula for MOE, compute a 95% confidence interval for p.

Example: Coke or Pepsi Suppose you take a sample of 1231 people and ask them if they prefer Coke over Pepsi. You find that 696 say they do. What is, the observed percent from the population? A.725 = 72.5% B.565 = 56.5% C.029 = 2.9% D.038 = 3.8%

Example Coke Or Pepsi continued Suppose you take a sample of 1231 people and ask them if they prefer Coke over Pepsi. You find that 696 say they do. What is the margin of error for 95% confidence? A square root of 1231 = = 35.06% Bsquare root of 696 = = 26.38% C1/square root of 1231 = = 2.85% D1/square root of 696 = = 3.79%

Hints for Interpretation The conclusion of a confidence statement always applies to the population, not to the sample. Our conclusion about the population is never completely certain. If you want a smaller margin of error with the same confidence, take a larger sample.

Hints for Interpretation It is very common to report the margin of error for 95% confidence. –If the level of confidence is not mentioned, assume 95% confidence. Can choose to use a confidence level other than 95%. –Other popular levels: 80%, 90%, 99% –For a fixed sample size, if you increase the level of confidence, your interval will become wider. –For a fixed confidence level, if you increase sample size, your interval will become narrower

Population Size Doesn’t Matter The variability of a statistic from a SRS does not depend on the size of the population as long as the population is at least 100 times larger than the sample.

Example: Population Size Doesn’t Matter Suppose we take a sample of size 1000 from a population of 4,000,000 (e.g., South Carolina). Then we take a sample of 1000 from a population of 300,000,000 (e.g., the whole US). Which sample statistic would have more variability (i.e., MOE) ? A. The one from 4,000,000 B. The one from 300,000,000 C. They are the same.

Chapter 4 – Sample Surveys in the Real World

Type of errors: 1. Sampling Errors a. Random Sampling Error b. Bad Sampling Methods 2. Non-sampling Errors a. Processing errors b. Poorly worded questions c. Response error d. Non-Response

Chapter 4 – Sample Surveys in the Real World sampling errors – errors caused by the act of taking a sample  They cause sample results to be different from the results of a census. sampling frame – a list of individuals from which we will draw our sample  should list every individual in the population

Errors in Sampling random sampling error – results from chance selection in the simple random sample MOE lets us calculate how serious the error is. The error is due to chance – always present. A large sample helps control this. MOE includes only random sampling error. Most sample surveys are afflicted with errors other than random sampling errors.

Errors in Sampling Bad sampling method – a convenience sample or a voluntary response sample is also a form of sampling error. Voluntary sample Convenience sample undercoverage – occurs when some groups in the population are left out of the process of choosing the sample

nonsampling errors – errors not related to the act of selecting a sample from the population  can even be present in a census nonrespone (missing data) response errors processing errors effects of data collection procedure

Example The subject lies about past drug use. A. Sampling Error: Bad Sampling Method B. Non Sampling Error: Response Error C. Non Sampling Error: Non Response Error D. Non Sampling Error: Processing Error

The subject cannot be contacted after five calls. A. Sampling Error: Bad Sampling Method B. Non Sampling Error: Response Error C. Non Sampling Error: Non Response Error D. Non Sampling Error: Processing Error Example

Interviewers choose people on the street to interview. A. Sampling Error: Bad Sampling Method B. Non Sampling Error: Response Error C. Non Sampling Error: Non Response Error D. Non Sampling Error: Processing Error Example

Consider Wording Be aware that the wording of a question influences the answers. Examples: Is our government providing too much money for welfare programs? –44% said “yes” Is our government providing too much money for assistance to the poor? –13% said yes

More Complex Sample Designs Sometimes a strict simple random sample is difficult to obtain. - Multistage Sampling Design - Cluster Sampling - Systematic Sampling - Stratified Random Sampling

Stratified Random Sample Step 1: Divide the sampling frame into distinct groups of individuals, called strata. –Choose strata because you have an interest in the groups or because the individuals within each group are similar –Example: graduate/undergraduate students Step 2: Take a separate SRS in each stratum and combine these to make up the complete sample.

Stratified Random Sample. A club has 25 student members and 10 faculty members. The club can send 4 students and 2 faculty members to a convention. Students 01 Barrett06 Frazier11 Hu16 Liu21 Ren 02 Brady07 Gibellato12 Jimenez17 Marin22 Santos 03 Chen08 Gulati13 Katsaounis18 Nemeth23 Sroka 04 Draper09 Han14 Kim19 O ’ Rourke24 Tordoff 05 Duncan10 Hostetler15 Kohlschmidt20 Paul25 Wang Faculty 0 Berliner2 Dean4 Goel6 Moore8 Stasney 1 Craigmile3 Fligner5 Lee7 Pearl9 Wolfe Line 116: Choose a Stratified RS of 4 Students, then of 2 Faculty

Cluster Sampling In order to reduce costs in sampling, researchers focus on efficiency by sampling from clusters Clusters are often formed by geographic location, resulting in decreased travel costs for the research company. Randomly sample clusters then survey everyone in each cluster.

Cluster Sample - Divide population into clusters. Select one or more clusters and include everyone in those clusters in the sample. Example: SC has 46 counties. Select 5 counties at random, use all household in each selected county as sample. Example: USC has 5000 dorms. Select 100 dorms at random, use all students in each selected dorm as sample.

Want to find the opinions of US adults, but want to save on time and money by randomly selecting residences. All adults residing in a sampled residence will be interviewed. A. Stratified B. Cluster C. Both

Want to find the opinions of US adults and need to make sure that 3 specific religious groups are represented. You sample 100 Christians, 100 Jewish, and 100 Muslims. A. Stratified B. Cluster C. Both

Want to find the opinions of city dwelling US adults and need to make sure that the east and west coasts are represented. You send 5 interviewers to the east coast and 5 to the west coast. 5 City blocks are chosen at random. Everyone living in a chosen city block is interviewed. (similarly for the east coast) A. Stratified B. Cluster C. Both

Questions to Ask Before You Believe a Poll Who carried out the survey? What was the population? How was the sample selected? How large was the sample? What was the margin of error? What was the response rate? How were the subjects contacted? When was the survey conducted? What questions were asked?

A – a cluster sample B – a systematic sample C – a stratified random sample D - undercoverage USC has 20,065 undergraduates and 7,423 graduate students. In an effort to gauge the opinions of all students on campus parking issues, a simple random sample consisting of 201 undergraduates and a simple random sample of 74 graduate students are taken. This is an example of:

A – a cluster sample B – a systematic sample C – a stratified random sample D - undercoverage USC has 20,065 undergraduates and 7,423 graduate students. In an effort to gauge the opinions of all students on campus parking issues, a simple random sample consisting of 201 undergraduates is taken. This is an example of: