Sampling Design and Analysis MTH 494 Lecture-30 Ossam Chohan Assistant Professor CIIT Abbottabad.

Slides:



Advertisements
Similar presentations
Introduction Simple Random Sampling Stratified Random Sampling
Advertisements

Mean, Proportion, CLT Bootstrap
Ex Post Facto Experiment Design Ahmad Alnafoosi CSC 426 Week 6.
Estimation in Sampling
Sampling Distributions
Sampling with unequal probabilities STAT262. Introduction In the sampling schemes we studied – SRS: take an SRS from all the units in a population – Stratified.
Selection of Research Participants: Sampling Procedures
QBM117 Business Statistics Statistical Inference Sampling 1.
AP Statistics C5 D2 HW: p.287 #25 – 30 Obj: to understand types of samples and possible errors Do Now: How do you think you collect data?
MISUNDERSTOOD AND MISUSED
Dr. Chris L. S. Coryn Spring 2012
Why sample? Diversity in populations Practicality and cost.
Sampling and Sampling Distributions: Part 2 Sample size and the sampling distribution of Sampling distribution of Sampling methods.
PROBABILITY SAMPLING: CONCEPTS AND TERMINOLOGY
STAT262: Lecture 5 (Ratio estimation)
QMS 6351 Statistics and Research Methods Chapter 7 Sampling and Sampling Distributions Prof. Vera Adamchik.
Ratio estimation with stratified samples Consider the agriculture stratified sample. In addition to the data of 1992, we also have data of Suppose.
Stratified Simple Random Sampling (Chapter 5, Textbook, Barnett, V
PROBABILITY SAMPLING: CONCEPTS AND TERMINOLOGY
Chapter Outline  Populations and Sampling Frames  Types of Sampling Designs  Multistage Cluster Sampling  Probability Sampling in Review.
Sample Design.
Chapter 7 Sampling and Sampling Distributions n Simple Random Sampling n Point Estimation n Introduction to Sampling Distributions n Sampling Distribution.
Chapter 12: AP Statistics
Copyright 2010, The World Bank Group. All Rights Reserved. Agricultural Census Sampling Frames and Sampling Section A 1.
COLLECTING QUANTITATIVE DATA: Sampling and Data collection
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Sampling January 9, Cardinal Rule of Sampling Never sample on the dependent variable! –Example: if you are interested in studying factors that lead.
Sampling. Concerns 1)Representativeness of the Sample: Does the sample accurately portray the population from which it is drawn 2)Time and Change: Was.
Definitions Observation unit Target population Sample Sampled population Sampling unit Sampling frame.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Sampling: Theory and Methods
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western/Thomson Learning.
1 1 Slide © 2003 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Virtual COMSATS Inferential Statistics Lecture-6
1 1 Slide Chapter 7 (b) – Point Estimation and Sampling Distributions Point estimation is a form of statistical inference. Point estimation is a form of.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Chapter 4 Statistics. 4.1 – What is Statistics? Definition Data are observed values of random variables. The field of statistics is a collection.
Agricultural and Biological Statistics. Sampling and Sampling Distributions Chapter 5.
Chapter 11 – 1 Chapter 7: Sampling and Sampling Distributions Aims of Sampling Basic Principles of Probability Types of Random Samples Sampling Distributions.
1 Chapter 7 Sampling and Sampling Distributions Simple Random Sampling Point Estimation Introduction to Sampling Distributions Sampling Distribution of.
Population and Sampling
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Chapter 7 The Logic Of Sampling. Observation and Sampling Polls and other forms of social research rest on observations. The task of researchers is.
Sampling Design and Analysis MTH 494 Ossam Chohan Assistant Professor CIIT Abbottabad.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
Lecture 4. Sampling is the process of selecting a small number of elements from a larger defined target group of elements such that the information gathered.
Tahir Mahmood Lecturer Department of Statistics. Outlines: E xplain the role of sampling in the research process D istinguish between probability and.
Sampling Techniques 19 th and 20 th. Learning Outcomes Students should be able to design the source, the type and the technique of collecting data.
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
Chapter 6: 1 Sampling. Introduction Sampling - the process of selecting observations Often not possible to collect information from all persons or other.
11 Norton university Research Method Topic: Determination of sample size Facilitated by: Dr. pheak sothea DEPARTMENT: MDM.
Sampling Design and Analysis MTH 494 Lecture-22 Ossam Chohan Assistant Professor CIIT Abbottabad.
Sampling technique  It is a procedure where we select a group of subjects (a sample) for study from a larger group (a population)
Sampling Design and Analysis MTH 494 Lecture-21 Ossam Chohan Assistant Professor CIIT Abbottabad.
Probability Sampling. Simple Random Sample (SRS) Stratified Random Sampling Cluster Sampling The only way to ensure a representative sample is to obtain.
Population vs. Sample. Population: a set which includes all measurements of interest to the researcher (The collection of all responses, measurements,
Sampling Design and Analysis MTH 494 Ossam Chohan Assistant Professor CIIT Abbottabad.
Sampling Design and Analysis MTH 494 LECTURE-11 Ossam Chohan Assistant Professor CIIT Abbottabad.
Sampling Design and Procedure
Sampling.
Sampling Design and Analysis MTH 494
Sampling Why use sampling? Terms and definitions
Graduate School of Business Leadership
Meeting-6 SAMPLING DESIGN
Sampling with unequal probabilities
Slides by JOHN LOUCKS St. Edward’s University.
Chapter 7 Sampling and Sampling Distributions
Presentation transcript:

Sampling Design and Analysis MTH 494 Lecture-30 Ossam Chohan Assistant Professor CIIT Abbottabad

Sampling with unequal probabilities Up to now, we have only discussed sampling schemes in which the probabilities of choosing sampling units are equal. Equal probabilities give schemes that are often easy to design and explain. Such schemes are not, however, always possible or, if practicable, as efficient as schemes using unequal probabilities. Cluster sample with equal probabilities may result in large variance for the design-unbiased estimator of the population mean and total 2

Primary Sampling Units(PSUS) In sample surveys, primary sampling unit (commonly abbreviated as PSU) arises in samples in which population elements are grouped into aggregates and the aggregates become units in sample selection. The aggregates are, due to their intended usage, called "sampling units." Primary sampling unit refers to sampling units that are selected in the first (primary) stage of a multi-stage sample ultimately aimed at selecting individual elements. In selecting a sample, one may choose elements directly; in such a design, the elements are the only sampling units. One may also choose to group the elements into aggregates and choose the aggregates in a first stage of selection and then elements at a later stage of selection. The aggregates and the elements are both sampling units in such a design. For example, if a survey is selecting households as elements, then counties may serve as the primary sampling unit, with blocks and households... 3

Sampling one primary unit As a special case, suppose we select just one (n=1) of N psus to be in the sample. The total for psu I is denoted by t i, and we want to estimate the population total, t. Sampling one psu will demonstrate the ideas of unequal-probability sampling without introducing the complications. 4

Understanding example Let us start out by looking at what happens for a situation in which we know the whole population. A town has four supermarkets, ranging in size from 100 square meters (m 2 ) to 1000 m 2. We want to estimate the total amount of sales in the four stores for last month by sampling just one of the stores. Of course, this is just an illustration-if we really had only four supermarkets we would probably take a census) 5

You might expect that a larger store would have more sales than a smaller store, and that the variability in total sales among several 1000 m 2 stores will be greater than the variability in total sales among several 100 m 2 stores. Since we sample only one store, the probability that a store is selected on the first draw is (ϒ i ) is the same as the probability that the store is included in the sample (π i ). For this example, take π i = ϒ i = Pr(store i selected) 6

This is proportional to the size of the store. Since store A accounts for 1/16 of the total floor area of the four stores, it is sampled with probability 1/16. for illustrative purposes, we know the values of t i for the whole population. Values are given on next slide. 7

StoreSize (m 2 )ϒiϒiπ i (in thousand) A1001/1611 B2002/1620 C3003/1624 D100010/16245 Total

We could select a probability sample of size 1 with probabilities given above by shuffling cards numbered 1 through 16 and choosing one card. If the card’s number is 1, choose store A; if 2 or 3, choose B; if 4, 5 or 6, choose C; and if 7 through 16, choose D. or we could spin once on a spinner like this: 9

10

We compensate for the unequal probabilities of selection by also using ϒ i in the estimator. We have already seen such compensation fro unequal probabilities in stratified sampling: if we select 10% of the units in stratum 1 and 20% of the units in stratum 2, the sampling weight is 10 for each units in stratum 1 and 5 for each unit in stratum 2. Here, we select store A with probability 1/16, so store A’s sampling weight is

If the size of store is roughly proportional to the total sales for that store, we would expect that store A also has about 1/16 of the total sales and that multiplying store A’s sales by 16 would estimate the total sales for all four stores. As always, the sampling weight of unit i is the reciprocal of the probability of selection: 12

Unequal-probability sampling without replacement Generally, sampling with replacement is less efficient than sampling without replacement; with replacement is introduced first because of the case in selecting and analyzing samples. Nevertheless, in large surveys with many small strata, the inefficiencies may wipe out the gains in convenience. Much research has been done on unequal-probability sampling without replacement; the theory is more complicated because the probability that a unit is selected is different for the first unit chosen than for the second, third and subsequent units. When you understand the probabilistic arguments involved, however, you can find the properties of any sampling scheme. 13

Example The supermarket example discussed above can be used to illustrate some of the features of unequal-probability sampling with replacement. Here is the population again. 14

StoreSize (m 2 )ϒiϒiπ i (in thousand) A1001/1611 B2002/1620 C3003/1624 D100010/16245 Total

Let’s select two psus without replacement and with unequal probabilities. As we discussed above ϒ i = P(Select unit i on first draw) Since we are sampling without replacement, though, the probability that unit j is selected on the second draw depends on which unit was selected on the first draw. 16

One way to select the units with unequal probabilities is to use ϒ i as the probability of selecting unit i on the first draw, and then adjust the probabilities of selecting the other stores on the second draw. If store A was chosen on the first draw, then for selecting the second store would spin the wheel while clocking out the selection for store A, or shuffle the deck and redeal without Card-1. Thus 17

Rest of the solution 18

19

Summary Summary of the Designs and Methods 20

You will recall that the objective of statistics is to make inferences about a population from information contained in a sample. This unit discusses the design of sample surveys and associated methods of inference for populations containing a finite number of elements. Practical examples have been selected primarily from the fields of business and the social sciences where finite populations of human responses are frequently the target of surveys. Natural resource management examples are also included. 21

Summary The method of inference employed for most sample surveys is estimation. Thus we consider appropriate estimators for population parameters and the associated two-standard deviation bound on the error of estimation. In repeated sampling the error of estimation will be less than its bound, with probability approximately equal to Equivalently, we construct confidence intervals that, in repeated sampling, enclose the true population parameter approximately 95 times out of

The quantity of information pertinent to a given parameter is measured by the bound on the error of estimation. 23

The first segment, presented in initial discussions, reviews the objective of statistics and points to the peculiarities of problems arising in the social sciences, business and natural resource management that make them different from traditional type of experiment conducted in the laboratory. 24

The basic sample survey design, simple random sampling, is presented first. For this design the sample is selected so that every sample of size n in the population has an equal chance of being chosen. The design does not make a specific attempt to reduce the cost of the desired quantity of information. It is the most basic type of sample survey design, and all other designs are compared with it. 25

The second type of design, stratified random sampling divides the population into homogeneous groups called strata, this procedure usually produces an estimator that possesses a smaller variance than can be acquired by simple random sampling. Thus the cost of survey can be reduced by selecting fewer elements to achieve an equivalent bound on the error of estimation. 26

The third type of experimental design is systematic sampling, which is usually applied to population elements that are available in a list or line, such as names on file cards in a drawer or people coming out a factory. A random starting point is selected and then every k th element thereafter is sampled. Systematic sampling is frequently conducted when collecting a simple random or a stratified random sample is extremely costly or impossible. Once again, the reduction in survey cost in primarily associated with the cost of collecting the sample. 27

The fourth type of sample survey design is cluster sampling. Cluster sampling may reduce cost because each sampling unit is a collection of elements usually selected so as to be physically close together. Cluster sampling is most often used when a frame that lists all population elements is not available or when travel costs from element to element are considerable. Cluster sampling reduces the cost of the survey primarily by reducing the cost of collecting the data. 28

A discussion of ratio, regression, and difference estimators, which utilize information on an auxiliary variable is covered in third segment of the material. The ratio estimator illustrates how additional information, frequently acquired at little cost, can be used to reduce the variance of estimator and, consequently, reduce the overall cost of a survey. It also suggests the possibility of acquiring more sophisticated estimators by using information on more than one auxiliary variable. 29

This unit on ratio estimation follows naturally the discussion on simple random sampling in previous unit of SRS. That is, you can take a measurement of y, the response of interest, for each element of the SRS and utilize the traditional estimators. 30

To summarize, we have presented various elementary sample survey designs along with their associated methods of inference. Treatment of the topics has been directed towards practical applications so that you can see how sample survey design can be employed to make inferences at minimum cost when sampling from finite social, business or natural resource populations. 31