213Sampling.pdf When one is attempting to study the variable of a population, whether the variable is qualitative or quantitative, there are two methods.

Slides:



Advertisements
Similar presentations
Introduction Simple Random Sampling Stratified Random Sampling
Advertisements

Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 1 An Introduction to Business Statistics.
4 Uses and Abuses of Statistics 4.1 Introduction to Statistics
Lesson Designing Samples. Knowledge Objectives Define population and sample. Explain how sampling differs from a census. Explain what is meant by.
QBM117 Business Statistics Statistical Inference Sampling 1.
Why sample? Diversity in populations Practicality and cost.
Sampling Prepared by Dr. Manal Moussa. Sampling Prepared by Dr. Manal Moussa.
11 Populations and Samples.
Chapter 12 Sample Surveys
The Excel NORMDIST Function Computes the cumulative probability to the value X Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc
Sampling Methods.
Information from Samples Alliance Class January 17, 2012 Math Alliance Project.
Chapter 4 Selecting a Sample Gay, Mills, and Airasian
Copyright ©2005 Brooks/Cole, a division of Thomson Learning, Inc. How to Get a Good Sample Chapter 4.
POPULATION- the entire group of individuals that we want information about SAMPLE- the part of the population that we actually examine in order to gather.
SESSION 5 & 6 Last Update 23 rd February 2011 Introduction to Statistics.
Chapter 1 Getting Started
COLLECTING QUANTITATIVE DATA: Sampling and Data collection
Homework Check Homework check now... Please take out your homework so we can check it.
Chapter 1: Introduction to Statistics
Chapter 2: Statistics of One Variable
Intro Stats Lesson 1.3 B Objectives: SSBAT classify different ways to collect data. SSBAT distinguish between different sampling techniques. Standards:
Chapter 7 Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction to Sampling Distributions Introduction to.
Do Now: 1.Be sure to have picked up three papers upon entry. 2.Work with a partner to complete “The White House is not a Metronome Questions”
Sampling.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Chapter 5 Selecting a Sample Gay, Mills, and Airasian 10th Edition
Sampling Methods. Definition  Sample: A sample is a group of people who have been selected from a larger population to provide data to researcher. 
1-3 Data Collection and Sampling Techniques Surveys are the most common method of collecting data. Three methods of surveying are: 1) Telephone surveys.
1  Specific number numerical measurement determined by a set of data Example: Twenty-three percent of people polled believed that there are too many polls.
Variables, sampling, and sample size. Overview  Variables  Types of variables  Sampling  Types of samples  Why specific sampling methods are used.
Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 13 Collecting Statistical Data 13.1The Population 13.2Sampling.
1 Chapter 7 Sampling and Sampling Distributions Simple Random Sampling Point Estimation Introduction to Sampling Distributions Sampling Distribution of.
Data Gathering Techniques. Essential Question: What are the different methods for gathering data about a population?
CHAPTER 12 DETERMINING THE SAMPLE PLAN. Important Topics of This Chapter Differences between population and sample. Sampling frame and frame error. Developing.
Chapter 10– Estimating Voter Preferences Statistics is the science of making decisions in the face of uncertainty. We use information gathered from a sample.
Chapter 1 Getting Started 1.1 What is Statistics?.
STANDARD ERROR Standard error is the standard deviation of the means of different samples of population. Standard error of the mean S.E. is a measure.
STA Lecture 51 STA 291 Lecture 5 Chap 4 Graphical and Tabular Techniques for categorical data Graphical Techniques for numerical data.
1 Chapter Two: Sampling Methods §know the reasons of sampling §use the table of random numbers §perform Simple Random, Systematic, Stratified, Cluster,
Section 1.2 Random Samples 1 Larson/Farber 4th ed.
An Overview of Statistics Section 1.1. Ch1 Larson/Farber 2 Statistics is the science of collecting, organizing, analyzing, and interpreting data in order.
Unit 1 – Intro to Statistics Terminology Sampling and Bias Experimental versus Observational Studies Experimental Design.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 7 Sampling and Sampling Distributions.
Notes 1.3 (Part 1) An Overview of Statistics. What you will learn 1. How to design a statistical study 2. How to collect data by taking a census, using.
Part III – Gathering Data
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 In an observational study, the researcher observes values of the response variable and explanatory.
Chapter Eleven Sampling: Design and Procedures Copyright © 2010 Pearson Education, Inc
1 Chapter 2: Sampling and Surveys. 2 Random Sampling Exercise Choose a sample of n=5 from our class, noting the proportion of females in your sample.
SAMPLING TECHNIQUES LECTURE - 2 GE 608 Experimental Methods and Analysis Oct 28, 2015 Muharrum 14, 1437.
Review HW: E1 A) Too high. Polltakers will never get in touch with people who are away from home between 9am and 5pm, eventually they will eventually be.
1 Introduction to Statistics. 2 What is Statistics? The gathering, organization, analysis, and presentation of numerical information.
STATISTICAL DATA GATHERING: Sampling a Population.
Ch1 Larson/Farber 1 1 Elementary Statistics Larson Farber Introduction to Statistics As you view these slides be sure to have paper, pencil, a calculator.
Sampling technique  It is a procedure where we select a group of subjects (a sample) for study from a larger group (a population)
Selecting a Sample. outline Difference between sampling in quantitative & qualitative research.
Topics Semester I Descriptive statistics Time series Semester II Sampling Statistical Inference: Estimation, Hypothesis testing Relationships, casual models.
Chapter 1 Getting Started What is Statistics?. Individuals vs. Variables Individuals People or objects included in the study Variables Characteristic.
Elementary Statistics (Math 145) June 19, Statistics is the science of collecting, analyzing, interpreting, and presenting data. is the science.
Sect. 1-3 Experimental Design Objective: SWBAT learn how to design a statistical Study, How to collect data by taking a census using a sampling, using.
Sampling Dr Hidayathulla Shaikh. Contents At the end of lecture student should know  Why sampling is done  Terminologies involved  Different Sampling.
Math 145 June 19, Outline 1. Recap 2. Sampling Designs 3. Graphical methods.
Sampling Why use sampling? Terms and definitions
Graduate School of Business Leadership
Understandable Statistics
Definitions Covered Statistics Individual Variable
Daniela Stan Raicu School of CTI, DePaul University
Chapter 5: Producing Data
6A Types of Data, 6E Measuring the Centre of Data
Data Collection and Sampling Techniques
Presentation transcript:

213Sampling.pdf When one is attempting to study the variable of a population, whether the variable is qualitative or quantitative, there are two methods of data collection which can be employed: 1. Conduct a Census. 2. Take a Sample.

Some Population characteristics:   = population average (mean) 2. p = population proportion Some Sample characteristics: 1. = sample average 2. = sample proportion

A census is taken when every element (or individual if the population consists of people) in the population of interest is inspected with regards to the population variable of interest. Taking a Census:

Problems with a Census 1.Can each member of the population be accessed? --- Cause for bias. 2. Control of information gathering is very difficult for large populations. --- Cause for bias. 3. Very expensive and time consuming.

Sample Take only part of the population: We hope that the characteristics of the sample reflect those of the population.

Suppose the mean (or average) number of courses taken by the 100 students in the sample was 4.2, or = 4.2. What does this statistic represent? Does this indicate that the average number of courses taken in the Fall 2005 semester for ALL undergraduate students is 4.2, or µ = 4.2? No, it does not.

Taking a Random Sample There are four ways to select a random sample: 1. Simple random sampling (SRS). 2. Stratified sampling. 3. Cluster sampling. 4. Systematic sampling (or 1-in-k sampling)

1. Simple Random Sampling - SRS A simple random sample is taken when every conceivable subgroup of a size n has the same “chance” of being selected as the sample.

In order to take a simple random sample, you require: 1.A sampling frame - a complete list of all the elements in the population of interest. 2. A method of numerically differentiating between each element in the sampling frame. This is done by assigning each element in the sampling frame a unique numerical number - a number that “belongs” to only them. 3. A random number generator - such as the table on pages 847 and 848 of your text.

Consider a regular sized lecture section of Statistics 213 (having 115 students, or N = 115) as the population of interest. Do you have at least one part time job? (Yes = 1, No = 0) How many part-time jobs do you have? (Answer = 0, 1, 2, …) Example

Below is a condensed version of the class list, or the sampling frame. 1. Student A 2. Student B 3. Student C Student XYZ 1 and 2:

3. Using the random number generator A simple random sample of 5 will be taken. The first three digit number (row 1, columns 1 - 3) is “104”. If the next three digit number is not between 1 and 115, continue until you find a three digit number between 1 and 115. It should be different from the previously selected number. The next four choices are: 094, 103, 071, 023. The SRS will be the 23rd, 94th, 71st, 103rd and 104th student.

Suppose of the five students selected, two have at least one part-time job. The sample proportion,, is then

Advantages of SRS A simple random sample is the purest method of random selection. The simple random sample criteria allows the selection of the sample to be done in a completely objective manner. There are no issues with selection bias

A stratified sample is taken when the population of interest is subdivided into k-different groups, or k-strata. Once this is done a simple random sample (SRS) is taken from each stratum. The simple random samples taken from each stratum are put together and constitute the random sample, or n. How is a population stratified? 2. Stratified Sampling

The population is stratified according to some other population variable: 1. geographic - stratify according to some ‘location’ variable of the underlying population: province, region (West, Central, East, Maritimes), quadrant (NW, NE, SW, SE), rural vs. urban, etc. 2. non-geographic - stratify according to some ‘non- location’ variable of the population: gender (male, female), income level/tax bracket (lower, middle, upper), age level (18 < 30, 30 < 40, 40 < 55, 55 and up), education level, etc.

Consider the population of Canadian voters, and the variable of interest is: “whether or not a politician can be trusted”. To measure such, suppose a random sample of 1000 Canadian voters is to be selected using stratified sampling, and the population will be stratified into 4 strata. (k = 4) Example

Stratifying according to region of the country, we have Stratum #1Stratum #2Stratum #3Stratum #4 The West Ontario Quebec Atlantic n W = 250 n O = 250 n Q = 250 n M = 250 = 58% = 52% = 61% = 68% Since the samples are “equally weighted”, the sample proportion is simply the average of the individual sample proportions: = =

Stratum #1 Stratum #2 Stratum #3 Stratum #4 West Ontario Quebec Atlantic 30.3 % 37.9 % 24.0 % 7.8 % n W = 303 n O = 379 n Q = 240 n M = 78 = 58% = 52% = 61% = 68% Proportionally Stratified Sampling1

Stratum #1 Stratum #2 Stratum #3 Stratum #4 West Ontario Quebec Atlantic 30.3 % 37.9 % 24.0 % 7.8 % n W = 303 n O = 379 n Q = 240 n M = 78 = 58% = 52% = 61% = 68% Because the stratified sample has been conducted proportionally, we then “weight” the individual percentages and calculate the weighted-average: = 303 (0.58) (0.52) +240 (0.61) + 78 (0.68) =

1.Approximately 58% of voters in the West and Territories believe politicians cannot be trusted. 2.Approximately 52% of voters in Ontario believe politicians cannot be trusted. 3.Approximately 61% of voters in Quebec believe politicians cannot be trusted. 4.Approximately 68% of voters in Atlantic Canada believe politicians cannot be trusted.

Often one does not have the luxury of a large budget or time frame to complete a study on a large population. In such cases one can attempt to sample from the population using a method that seems to closely follow a stratified sample, but is much easier. If one wishes to study the annual income of households in Calgary, clearly it would be difficult to have a complete list of all the households of Calgary. Identify “clusters” of a population - those non-overlapping groups that elements in a population naturally fall within.

Once this is done, the researcher can either: 1. randomly select, using SRS, one cluster (or many clusters) and then inspect every element falling within the randomly selected cluster(s) 2. randomly select, using SRS, one cluster (or many clusters) and then take a simple random sample of elements from the randomly selected cluster(s). Option #1 is deemed a single stage cluster sample. Option #2 is called a double (or multi) stage cluster sample.

The distinction is the following: In a single stage cluster sample, random selection is only occurring once: the cluster (or clusters) are RANDOMLY selected. In a double (or multi) stage cluster sample, random selection is occurring twice (or more than twice).

The sampling error is simply the difference between the sample and the population. There will be difference, albeit slight, between what is happening in the sample and what is really happening in the population. Both the sample mean ( ) and the sample proportion ( ) have a sampling error. The sampling error of is approximated by the following: Error = Clearly the larger the sample size, the smaller the sampling error.

A random sample of 500 Calgarians indicated that 45% of Calgarians think that Highway #2 between Calgary and Edmonton should be a toll highway. In this example the population of interest consists of Calgarians. The variable is an opinion about whether #2 between Calgary and Edmonton should be a toll highway -a qualitative variable that can be measured using a nominal scale (why?). The sample size, or n, is 500. The sample proportion, or = What is the error of this sample?

Error = We get an interval estimate: That is, from:

1.Error of Non-Inclusion - errors due to a member of the population having no “chance” of appearing in the sample. (Nonrandom sampling) 2.Errors of Non-Observation - errors that arise due to problems with sampling. (Non-response) 3.Errors of Observation - errors related to the collecting of the data. Such errors can occur even when the sample is selected using random methods. (Incorrect answer, or measurement bias) There are other errors in sampling that can occur, which are often called “biases”. Preventative measures can be taken to reduce, and in some cases eliminate, such sampling bias. These biases, or errors, can be classified into:

Summarizing the Data Four types of graphical methods will be discussed. These four methods are used for displaying data on a population variable that is quantitative. These four graphical routines are: Dotplot. Stem-and-Leaf plot (or stemplot). Histogram. Boxplot.