QMS 6351 Statistics and Research Methods Chapter 7 Sampling and Sampling Distributions Prof. Vera Adamchik.

Slides:



Advertisements
Similar presentations
University of Minnesota-Duluth, Econ-2030 (Dr. Tadesse) Inferential Statistics.
Advertisements

Statistics for Managers Using Microsoft® Excel 5th Edition
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Sampling and Sampling Distribution
1 1 Slide 2009 University of Minnesota-Duluth, Econ-2030(Dr. Tadesse) Chapter 7, Part B Sampling and Sampling Distributions Other Sampling Methods Other.
Chapter 7 Sampling Distributions
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
CHAPTER 7: SAMPLING DISTRIBUTIONS. 2 POPULATION AND SAMPLING DISTRIBUTIONS Population Distribution Sampling Distribution.
Sampling and Sampling Distributions: Part 2 Sample size and the sampling distribution of Sampling distribution of Sampling methods.
Sampling and Sampling Distributions Simple Random Sampling Point Estimation Sampling Distribution.
1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
The Excel NORMDIST Function Computes the cumulative probability to the value X Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc
7/2/2015 (c) 2001, Ron S. Kenett, Ph.D.1 Sampling for Estimation Instructor: Ron S. Kenett Course Website:
7-1 Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter 7 Sampling and Sampling Distributions Statistics for Managers using Microsoft.
Chapter 7 Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction to Sampling Distributions Introduction to.
SAMPLING AND SAMPLING DISTRIBUTIONS
DATA Exploration: Statistics (One Variable) 1.Basic EXCELL/MATLAB functions for data exploration 2.Measures of central tendency, Distributions 1.Mean 2.Median.
Chapter 7 Sampling and Sampling Distributions n Simple Random Sampling n Point Estimation n Introduction to Sampling Distributions n Sampling Distribution.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Sampling: Theory and Methods
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western/Thomson Learning.
1 1 Slide © 2003 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2004 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2001 South-Western/Thomson Learning  Anderson  Sweeney  Williams Anderson  Sweeney  Williams  Slides Prepared by JOHN LOUCKS  CONTEMPORARYBUSINESSSTATISTICS.
Chapter 7 Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction to Sampling Distributions Introduction to.
Sampling The sampling errors are: for sample mean
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2016 Cengage Learning. All Rights Reserved. A population is the set of all the elements of interest. A population is the set of all the elements.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Chapter 7, Part A Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide Chapter 7 (b) – Point Estimation and Sampling Distributions Point estimation is a form of statistical inference. Point estimation is a form of.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved OPIM 303-Lecture #5 Jose M. Cruz Assistant Professor.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 7 Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution.
Copyright ©2011 Pearson Education 7-1 Chapter 7 Sampling and Sampling Distributions Statistics for Managers using Microsoft Excel 6 th Global Edition.
1 1 Slide Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction to Sampling Distributions Introduction to.
1 Chapter 7 Sampling and Sampling Distributions Simple Random Sampling Point Estimation Introduction to Sampling Distributions Sampling Distribution of.
Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 7 Sampling and Sampling Distributions.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Sampling and Sampling Distributions Basic Business Statistics 11 th Edition.
Basic Business Statistics
Sampling and Confidence Interval How can it be that mathematics, being after all a product of human thought independent of experience, is so admirably.
CHAPTER 7 SAMPLING DISTRIBUTIONS Prem Mann, Introductory Statistics, 7/E Copyright © 2010 John Wiley & Sons. All right reserved.
Learning Objectives Determine when to use sampling. Determine the pros and cons of various sampling techniques. Be aware of the different types of errors.
Chapter 7 Introduction to Sampling Distributions Business Statistics: QMIS 220, by Dr. M. Zainal.
Topics Semester I Descriptive statistics Time series Semester II Sampling Statistical Inference: Estimation, Hypothesis testing Relationships, casual models.
Sampling Dr Hidayathulla Shaikh. Contents At the end of lecture student should know  Why sampling is done  Terminologies involved  Different Sampling.
Fundamentals of Business Statistics chapter7 Sampling and Sampling Distributions 上海金融学院统计系 Statistics Dept., Shanghai Finance University.
Sampling Distributions
Chapter 7 (b) – Point Estimation and Sampling Distributions
Sampling Distributions and Estimation
St. Edward’s University
Sampling and Sampling Distribution
Chapter 5, Part A [SBE 7/e, Ch. 7] Sampling and Sampling Distributions
Slides by JOHN LOUCKS St. Edward’s University.
Econ 3790: Business and Economics Statistics
Chapter 7 Sampling and Sampling Distributions
Presentation transcript:

QMS 6351 Statistics and Research Methods Chapter 7 Sampling and Sampling Distributions Prof. Vera Adamchik

Chapter 7 Outline Simple random sampling Point estimation Introduction to sampling distributions Sampling distribution of Other sampling methods

Statistical inference The purpose of statistical inference is to obtain information about a population from information contained in a sample. A population is the set of all the elements of interest in a study. A sample is a subset of the population.

A parameter is a numerical characteristic of a population. A sample statistic is a numerical characteristic of a sample. We will use a sample statistic in order to judge tentatively or approximately the value of the population parameter.

The sample results provide only estimates (that is, rough and approximate values) of the values of the population characteristics. The reason is simply that the sample contains only a portion of the population. With proper sampling methods, the sample results will provide “good” estimates of the population characteristics.

Simple random sampling procedure

Selecting a sample Sampling from a finite population. Finite populations are often defined by lists such as organization membership roster, class roster, inventory product numbers, etc. Sampling from an infinite population (a process). The population is usually considered infinite if it involves an ongoing process that makes listing or counting every element impossible. For example, parts being manufactured on a production line, customers entering a store, etc.

Sampling from a finite population A simple random sample from a finite population of size N is a sample selected such that each possible sample of size n has the same probability of being selected. Replacing each sampled element before selecting subsequent elements is called sampling with replacement. Sampling without replacement is the procedure used most often. In large sampling projects, computer- generated random numbers are often used to automate the sample selection process.

St. Andrew’s College received 900 applications for admission in the upcoming year from prospective students. The applicants were numbered, from 1 to 900, as their applications arrived. The Director of Admissions would like to select a simple random sample of 30 applicants. Example: St. Andrew’s College

Sampling from a finite population using Excel =RAND() Excel generates a random number between 0 and 1 =RAND()*N Excel generates a random number greater than or equal to 0 but less than or equal N =INT(RAND()*900)

In the case of infinite populations, it is impossible to obtain a list of all elements in the population. The random number selection procedure cannot be used for infinite populations. Sampling from an infinite population

A simple random sample from an infinite population is a sample selected such that the following conditions are satisfied: Each element selected comes from the same population. Each element is selected independently. Sampling from an infinite population

Point estimation

In point estimation we use the data from the sample to compute a value of a sample statistic that serves as an estimate of a population parameter. A point estimate is a statistic computed from a sample that gives a single value for the population parameter. An estimator is a rule or strategy for using the data to estimate the parameter.

Terminology of point estimation We refer to as the point estimator of the population mean  We refer to as the point estimator of the population standard deviation 

Terminology of point estimation We refer to as the point estimator of the population proportion p. The actual numerical value obtained for in a particular sample is called the point estimate of the parameter.

Example: St. Andrew’s College Recall that St. Andrew’s College received 900 applications from prospective students. The application form contains a variety of information including the individual’s scholastic aptitude test (SAT) score and whether or not the individual desires on- campus housing. At a meeting in a few hours, the Director of Admissions would like to announce the average SAT score and the proportion of applicants that want to live on campus, for the population of 900 applicants.

However, the necessary data on the applicants have not yet been entered in the college’s computerized database. So, the Director decides to estimate the values of the population parameters of interest based on sample statistics. The sample of 30 applicants selected earlier with Excel’s RAND() function will be used. Example: St. Andrew’s College

Excel Value Worksheet Note: Rows are not shown. Point estimation using Excel A 1 Applicant Number CD SAT Score On-Campus Housing 1107No 1043Yes 991Yes 1008No 1127Yes 982Yes 1163Yes 1008No

Point estimates Note: Different random numbers would have identified a different sample which would have resulted in different point estimates.

Once all the data for the 900 applicants were entered in the college’s database, the values of the population parameters of interest were calculated. Population parameters

PopulationParameterPointEstimatorPointEstimateParameterValue  = Population mean SAT score SAT score  = Population std. deviation for deviation for SAT score SAT score 80 s = Sample std. s = Sample std. deviation for deviation for SAT score SAT score75.2 p = Population pro- portion wanting portion wanting campus housing campus housing Summary of point estimates obtained from a simple random sample = Sample mean = Sample mean SAT score SAT score = Sample pro- = Sample pro- portion wanting portion wanting campus housing campus housing

Making inferences about a population mean

Making inferences about a population mean The value of is used to make inferences about the value of . The sample data provide a value for the sample mean. A simple random sample of n elements is selected from the population. Population with mean  = ?

Population vs sampling distribution The population distribution is the probability distribution derived from the information on all elements of a population. The probability distribution of a sample statistic ( ) is called its sampling distribution.

Sampling distribution of The sampling distribution of the sample mean ( ) is the probability distribution of all possible values of. We need to know: Expected value of Standard deviation of Form of the sampling distribution of

Mean of the sampling distribution of The mean of the sampling distribution of is equal to the mean of the population. Thus, x x

Standard deviation of the sampling distribution of Finite population and n/N  0.05 (1) Infinite population (N is unknown); (2) Finite population and n/N 0.05 is referred to as the standard error of the mean. x is the finite population is the finite population correction factor correction factor

Two important observations 1. The spread of the sampling distribution of is smaller than the spread of the corresponding population distribution. In other words,. 2. The standard deviation of the sampling distribution of decreases as the sample size increases. x x

Form of the sampling distribution of 1. The population has a normal distribution. If the population from which the samples are drawn is normally distributed, then the sampling distribution of the sample mean will also be normally distributed for any sample size. x

Form of the sampling distribution of 2. The population is not normally distributed but the sample size is large (n 30). According to the Central Limit Theorem, for a large sample size (n 30), the sampling distribution of the sample mean is approximately normal, irrespective of the shape of the population distribution. In cases where the population is highly skewed or outliers are present, samples of size 50 or more may be needed. x

Form of the sampling distribution of 3. The sample size is small (n < 30) and the population is not normally distributed. Use special statistical procedures. x

Sampling Distribution of for SAT Scores Sampling distribution of x Example: St. Andrew’s College

What is the probability that the sample mean will be between 980 and 1000? In other words, what is the probability that a simple random sample of 30 applicants will provide an estimate of the population mean SAT score that is within +/-10 of the actual population mean ? P (980 < < 1000) = ?

Area =.5034 SamplingDistributionof for SAT Scores Example: St. Andrew’s College

The probability of means that, for a large number of samples of size 30 selected from the population, we can expect that in 50.34% of all cases the sample mean will be within +/-10 of the actual population mean (that is, ) and in 49.66% of all cases the sample mean will be further than +/-10 of the actual population mean (that is, below 980 or above 1000). Example: St. Andrew’s College

Relationship between the sample size and the sampling distribution of Example: St. Andrew’s College Suppose we select a simple random sample of 100 applicants instead of the 30 originally considered.

regardless of the sample size. In our example, E( ) remains at 990. Whenever the sample size is increased, the standard error of the mean is decreased. With the increase in the sample size to n = 100, the standard error of the mean is decreased from 14.6 to:

With n = 30, With n = 100, Example: St. Andrew’s College Relationship between the sample size and the sampling distribution of

Recall that when n = 30, P(980 < < 1000) = Now, with n = 100, P(980 < < 1000) = Because the sampling distribution with n = 100 has a smaller standard error, the values of have less variability and tend to be closer to the population mean than the values of with n = 30. Example: St. Andrew’s College

Area =.7888 Sampling Distribution of for SAT Scores Example: St. Andrew’s College

The probability of means that, for a large number of samples of size100 selected from the population, we can expect that in 78.88% of all cases the sample mean will be within +/-10 of the actual population mean (that is, ) and in 21.12% of all cases the sample mean will be further than +/-10 of the actual population mean (that is, below 980 or above 1000). Example: St. Andrew’s College

Making inferences about a population proportion

of is used The value of is used to make inferences p. about the value of p. The sample data provide a value for the proportion. sample proportion. A simple random sample of n elements is selected from the population. Population with proportion p = ? Making inferences about a population proportion

Sampling distribution of The sampling distribution of the sample proportion ( ) is the probability distribution of all possible values of. We need to know: Expected value of Standard deviation of Form of the sampling distribution of

Mean of the sampling distribution of The mean of the sampling distribution of is equal to the population proportion. Thus,

Standard deviation of the sampling distribution of Finite population and n/N  0.05 (1) Infinite population (N is unknown); (2) Finite population and n/N 0.05 is referred to as the standard error of the proportion.

Form of the sampling distribution of The sampling distribution of can be approximated by a normal probability distribution whenever the sample size is large. The sample size is considered large whenever the following two conditions are satisfied:

For values of p near.50, sample sizes as small as 10 permit a normal approximation. With very small (approaching 0) or very large (approaching 1) values of p, much larger samples are needed. Form of the sampling distribution of

For our example, with n = 30 and p =.72, the normal distribution is an acceptable approximation because: n (1 - p ) = 30(.28) = 8.4 > 5 and np = 30(.72) = 21.6 > 5 Example: St. Andrew’s College

Sampling Distributionof Example: St. Andrew’s College Sampling distribution of

Recall that 72% of the prospective students applying to St. Andrew’s College desire on-campus housing. What is the probability that a simple random sample of 30 applicants will provide an estimate of the population proportion of applicant desiring on-campus housing that is within plus or minus.05 of the actual population proportion? Example: St. Andrew’s College P (.67 < <.77) = ?

Area =.4582 Sampling Distributionof Example: St. Andrew’s College

Other sampling methods

Stratified random sampling Cluster sampling Systematic sampling Convenience sampling Judgment sampling

Stratified random sampling The population is first divided into groups of elements called strata. Each element in the population belongs to one and only one stratum. Best results are obtained when the elements within each stratum are as much alike as possible (i.e. a homogeneous group). A simple random sample is taken from each stratum.

Cluster sampling The population is first divided into separate groups of elements called clusters. Ideally, each cluster is a representative small-scale version of the population (i.e. heterogeneous group). A simple random sample of the clusters is then taken. All elements within each sampled (chosen) cluster form the sample.

Systematic sampling If a sample size of n is desired from a population containing N elements, we might sample one element for every n/N elements in the population. We randomly select one of the first n/N elements from the population list. We then select every n/Nth element that follows in the population list.

Convenience sampling It is a nonprobability sampling technique. Items are included in the sample without known probabilities of being selected. The sample is identified primarily by convenience. Example: A professor conducting research might use student volunteers to constitute a sample.

Judgment sampling The person most knowledgeable on the subject of the study selects elements of the population that he or she feels are most representative of the population. It is a nonprobability sampling technique. Example: A reporter might sample three or four senators, judging them as reflecting the general opinion of the senate.