Sampling Design and Analysis MTH 494 Lecture-22 Ossam Chohan Assistant Professor CIIT Abbottabad.

Slides:



Advertisements
Similar presentations
Stratified Sampling Module 3 Session 6.
Advertisements

1 Module 3 Session 7 Systematic Sampling. 2 Session Objectives To introduce basic sampling concepts in systematic sampling Demonstrate how to select a.
Introduction Simple Random Sampling Stratified Random Sampling
Interpolation A standard idea in interpolation now is to find a polynomial pn(x) of degree n (or less) that assumes the given values; thus (1) We call.
Chapter 7 Statistical Data Treatment and Evaluation
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.
Math 144 Confidence Interval.
Copyright © Cengage Learning. All rights reserved. 9 Inferences Based on Two Samples.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 10 th Edition.
Introduction to Probability and Statistics Linear Regression and Correlation.
STAT262: Lecture 5 (Ratio estimation)
8-1 Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter 8 Confidence Interval Estimation Statistics for Managers using Microsoft.
Copyright ©2011 Pearson Education 8-1 Chapter 8 Confidence Interval Estimation Statistics for Managers using Microsoft Excel 6 th Global Edition.
Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:
7-1 Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter 7 Sampling and Sampling Distributions Statistics for Managers using Microsoft.
STAT 4060 Design and Analysis of Surveys Exam: 60% Mid Test: 20% Mini Project: 10% Continuous assessment: 10%
5-3 Inference on the Means of Two Populations, Variances Unknown
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
CORRELATION & REGRESSION
Correlation.
Chapter 7 Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction to Sampling Distributions Introduction to.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 11 th Edition.
1 1 Slide Chapter 7 (b) – Point Estimation and Sampling Distributions Point estimation is a form of statistical inference. Point estimation is a form of.
Confidence Interval Estimation
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Probabilistic and Statistical Techniques 1 Lecture 24 Eng. Ismail Zakaria El Daour 2010.
Chapter 7 Estimates and Sample Sizes
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 8-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Copyright ©2011 Pearson Education 7-1 Chapter 7 Sampling and Sampling Distributions Statistics for Managers using Microsoft Excel 6 th Global Edition.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 6 Normal Probability Distributions 6-1 Review and Preview 6-2 The Standard Normal.
1 Chapter 7 Sampling and Sampling Distributions Simple Random Sampling Point Estimation Introduction to Sampling Distributions Sampling Distribution of.
Sampling Design and Analysis MTH 494 Lecture-30 Ossam Chohan Assistant Professor CIIT Abbottabad.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
Sampling Design and Analysis MTH 494 Ossam Chohan Assistant Professor CIIT Abbottabad.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.
Chap 8-1 Chapter 8 Confidence Interval Estimation Statistics for Managers Using Microsoft Excel 7 th Edition, Global Edition Copyright ©2014 Pearson Education.
College Algebra & Trigonometry
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Sampling and Sampling Distributions Basic Business Statistics 11 th Edition.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 8-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Basic Business Statistics
Sampling Design and Analysis MTH 494 Lecture-21 Ossam Chohan Assistant Professor CIIT Abbottabad.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
MBF1413 | Quantitative Methods Prepared by Dr Khairul Anuar 8: Time Series Analysis & Forecasting – Part 1
Sampling Design and Analysis MTH 494 Ossam Chohan Assistant Professor CIIT Abbottabad.
Sampling Design and Analysis MTH 494 LECTURE-11 Ossam Chohan Assistant Professor CIIT Abbottabad.
6-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
1 Ka-fu Wong University of Hong Kong A Brief Review of Probability, Statistics, and Regression for Forecasting.
Slide Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple.
Virtual University of Pakistan
Chapter 7 Confidence Interval Estimation
Sampling Design and Analysis MTH 494
Chapter 9 Audit Sampling: An Application to Substantive Tests of Account Balances McGraw-Hill/Irwin ©2008 The McGraw-Hill Companies, All Rights Reserved.
Chapter 7 (b) – Point Estimation and Sampling Distributions
Representativeness The aim of any sample is to represent the characteristics of the sample frame. There are a number of different methods used to generate.
Virtual COMSATS Inferential Statistics Lecture-11
Metode Sampling (Ekologi Kuantitatif)
Sampling Design and Analysis MTH 494 Lecture-9
Chapter 8 Confidence Interval Estimation.
Elementary Statistics
Elementary Statistics
Chapter 7 Sampling and Sampling Distributions
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

Sampling Design and Analysis MTH 494 Lecture-22 Ossam Chohan Assistant Professor CIIT Abbottabad

Review 2

Regression Estimation We observed that the ratio estimator is most appropriate when the relationship between y and x is linear through the origin. If there is evidence of a linear relationship between the observed y’s and x’s, but not necessarily one that would pass through the origin, then this extra information provided by the auxiliary variable x may be taken into account through a regression estimator of the mean µ y. 3

One must still have knowledge of µ x before the estimator can be employed, as it was in the case of ratio estimation of µ y. The underlying line that shows the basic relationship between y’s and x’s is sometimes referred to as the regression line of y upon x. Thus the subscript L in the ensuing formulas is used to denote linear regression. 4

The estimator given in next section assumes the x’s to be fixed in advance and the y’s to be random variable. We can think of the x values as something that has already been observed, like last year’s first quarter earnings, and the y response as a random variable yet to be observed, such as the current quarterly earnings of a company for which x is already known. The probabilistic properties of the estimator then depend only on y for a given set of x’s. 5

If stratum sample sizes are very small, or if the within-stratum ratios are all approximately equal, then the combined ratio estimator may perform better. Of course, an estimator of the population total can be found by multiplying either of the estimators above by the population size N, and the variances can be adjusted accordingly. Thus we might use the notation 6

Estimators Regression estimator of the population mean µ y. (3.28) Estimated Variance of (3.29) 7

Estimator Bound of the error of estimation: (3.30) When calculating b from observed pairs (y 1,x 1 ),…,(y n, x n ), we may use the fact that 8

Example 3.9 A mathematical achievement test was given to 486 students prior to their entering a certain college. From these students a simple random sample of n=10 students was selected and their progress in calculus observed. Final calculus grades were then reported, as given in the accompanying table. It is known that µ x =52 for all 486 students taking the achievement test. Estimate µ y for this population, and place a bound on the error of estimation. 9

Data for problem StudentAchievement test score, xFinal Calculus grade, y

Solution 11

Solution 12

A close examination of the data on sugar content and weight of oranges given in example 3.2 might suggest that a regression estimator is more appropriate than ratio estimator. A plot of the points will show that the regression line does not appear to go through the origin. However, the regression estimator of a total is of the form, specifically requiring knowledge of N. Since the ratio estimator also works well in this case, determining the number of oranges in the truckload may not be worth the extra cost and time 13

In other cases N may be known or easily found. Thus one should carefully consider the choice between ratio and regression estimators when estimating population means or totals. 14

Difference Estimation The difference method of estimating a population mean or total is similar to the regression method in that it adjusts the value up or down by an amount depending on the difference ( ). However, the regression coefficient b is not computed. In effect, b is set equal to unity. The difference method is, then, easier to employ than the regression method and frequently works just as well. 15

It is commonly employed in auditing procedures, and we will consider such an example in this section. The following formulas hold provided that simple random sampling was employed. 16

Estimators Difference estimator of a population µ y : (3.31) Estimated variance of : (3.32) 17

Estimators Bound on the error of estimation (3.33) 18

Example 3.10 Auditors are often interested in comparing the audited value of item with the book value. Generally, book values are known for every item in the population, and audit values are obtained for a sample of these items. The book values can be used to obtain a good estimate of the total or average audit value for the population. Suppose a population contains 180 inventory items with a stated book value of $13,320. Let xi denote the book value and y i the audit value of the i th item. A simple random sample of n=10 items yields the results shown in the accompanying table. Estimate the mean audit value of µ y by the difference method and estimate the variance of. 19

Data for Problem SampleAudit Value, y i Book Value, x i didi

Solution 21

22 Systematic Sampling

23 Session Objectives To introduce basic sampling concepts in systematic sampling Demonstrate how to select a random sample using systematic sampling design Estimation of different parameters in systematic random sampling

24 Sample Selection Procedure List all the units in the population from 1,2,…,N – Sampling frame Select a random number g in the interval 1 g K, using a random mechanism e.g. random number tables, where K = K is called the Sampling Interval N is the population size; n is the sample size The random number g is called the random start and constitutes the first unit of the sample

25 Sample Selection Procedure Take every k th unit after the random start The selected units will be g, g+k, g+2k, g+3k, g+4k, …,g+(n-1)k Until we have n units Example N =10000, n=100 k = =100 Suppose g=87

26 Sample Selection Procedure We select the following units 87, 187, 287, 387,…, 9987 NB: This procedure is however only valid if k is an integer (whole number) If k is not an integer (whole number) there are a number of methods we can use. We will consider just two of them

27 Sample Selection Procedure Method 1: Use Circular Sampling Treat the list as circular so that the last unit is followed by the first Select a random start g between 1 and N, using a random mechanism Add the intervals k until n units are selected Any convenient interval k will result into a random sample

28 Sample Selection Procedure One suitable suggestion is to choose the integer k closest to the ratio Method 2: Use Fractional Intervals Suppose we want to select a sample of 100 units from a population of 21,156. Calculate k = = Select a random start g between 1 and using a random mechanism

29 Sample Selection Procedure Suppose g = 582 Add the interval successively obtaining exactly 100 numbers The numbers will be 582, 21738, 42894, … Divide each number by 100 and round to the nearest whole number to get the selected sample, i.e. 6, 217, 429, etc

30 Advantages and Disadvantages of Systematic sampling Advantages: – The major advantage is that it is easy, almost foolproof and flexible to implement – It is especially easy to give instructions to fieldworkers – If we order our list prior to taking the sample, the sample will reflect the ordering and as such can easily give a proportionate sample

31 Advantages and Disadvantages of Systematic sampling Disadvantages: – The main disadvantage is that if there is an ordering (monotonic trend or periodicity) in the list which is unknown to the researcher, this may bias the resulting estimates – There is a problem of estimating variance from systematic sampling- variance is biased