1 A Resampling Study of NASS Survey MPPS Sampling Strategy By Stanley Weng National Agricultural Statistics Service U.S. Department of Agriculture.

Slides:



Advertisements
Similar presentations
Chapter 4 Sampling Distributions and Data Descriptions.
Advertisements

1
1 Vorlesung Informatik 2 Algorithmen und Datenstrukturen (Parallel Algorithms) Robin Pomplun.
© 2008 Pearson Addison Wesley. All rights reserved Chapter Seven Costs.
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Part 3 Probabilistic Decision Models
Chapter 1 The Study of Body Function Image PowerPoint
Copyright © 2011, Elsevier Inc. All rights reserved. Chapter 6 Author: Julia Richards and R. Scott Hawley.
Author: Julia Richards and R. Scott Hawley
STATISTICS Joint and Conditional Distributions
STATISTICS POINT ESTIMATION Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
Properties Use, share, or modify this drill on mathematic properties. There is too much material for a single class, so you’ll have to select for your.
Variance Estimation in Complex Surveys Third International Conference on Establishment Surveys Montreal, Quebec June 18-21, 2007 Presented by: Kirk Wolter,
Linearization Variance Estimators for Survey Data: Some Recent Work
UNITED NATIONS Shipment Details Report – January 2006.
RXQ Customer Enrollment Using a Registration Agent (RA) Process Flow Diagram (Move-In) Customer Supplier Customer authorizes Enrollment ( )
What causes the commodity price boom? AGRI Green Team Seminar on the Health Check May 15, 2008 AGRI-G1 Agricultural Policy Analysis and Perspectives DG.
1 RA I Sub-Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Casablanca, Morocco, 20 – 22 December 2005 Status of observing programmes in RA I.
Properties of Real Numbers CommutativeAssociativeDistributive Identity + × Inverse + ×
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
1 Discreteness and the Welfare Cost of Labour Supply Tax Distortions Keshab Bhattarai University of Hull and John Whalley Universities of Warwick and Western.
Multistage Sampling Module 3 Session 9.
Chapter 7 Sampling and Sampling Distributions
REVIEW: Arthropod ID. 1. Name the subphylum. 2. Name the subphylum. 3. Name the order.
Biostatistics Unit 5 Samples Needs to be completed. 12/24/13.
Table 12.1: Cash Flows to a Cash and Carry Trading Strategy.
PP Test Review Sections 6-1 to 6-6
EU market situation for eggs and poultry Management Committee 20 October 2011.
EU Market Situation for Eggs and Poultry Management Committee 21 June 2012.
Bright Futures Guidelines Priorities and Screening Tables
Chi-Square and Analysis of Variance (ANOVA)
5-1 Chapter 5 Theory & Problems of Probability & Statistics Murray R. Spiegel Sampling Theory.
2 |SharePoint Saturday New York City
Exarte Bezoek aan de Mediacampus Bachelor in de grafische en digitale media April 2014.
VOORBLAD.
Hypothesis Tests: Two Independent Samples
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
1 RA III - Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Buenos Aires, Argentina, 25 – 27 October 2006 Status of observing programmes in RA.
Factor P 16 8(8-5ab) 4(d² + 4) 3rs(2r – s) 15cd(1 + 2cd) 8(4a² + 3b²)
Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.
1..
CONTROL VISION Set-up. Step 1 Step 2 Step 3 Step 5 Step 4.
© 2012 National Heart Foundation of Australia. Slide 2.
Adding Up In Chunks.
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt Synthetic.
Module 17: Two-Sample t-tests, with equal variances for the two populations This module describes one of the most utilized statistical tests, the.
Model and Relationships 6 M 1 M M M M M M M M M M M M M M M M
25 seconds left…...
Statistical Inferences Based on Two Samples
© The McGraw-Hill Companies, Inc., Chapter 10 Testing the Difference between Means and Variances.
Analyzing Genes and Genomes
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
Essential Cell Biology
Chapter Thirteen The One-Way Analysis of Variance.
Chapter 8 Estimation Understandable Statistics Ninth Edition
Intracellular Compartments and Transport
PSSA Preparation.
Essential Cell Biology
Module 20: Correlation This module focuses on the calculating, interpreting and testing hypotheses about the Pearson Product Moment Correlation.
Immunobiology: The Immune System in Health & Disease Sixth Edition
Simple Linear Regression Analysis
Physics for Scientists & Engineers, 3rd Edition
Energy Generation in Mitochondria and Chlorplasts
Murach’s OS/390 and z/OS JCLChapter 16, Slide 1 © 2002, Mike Murach & Associates, Inc.
Commonly Used Distributions
Presentation transcript:

1 A Resampling Study of NASS Survey MPPS Sampling Strategy By Stanley Weng National Agricultural Statistics Service U.S. Department of Agriculture

2 INTRODUCTION MPPS Multivariate Probability Proportional to Size Address multiple, and often competing, purposes (multi targets) of a survey Used for NASS Crops Survey (CS) etc., since 1999

3 MPPS Technically Sample was selected using a Poisson method. Each farm i had a unique probability of selection, formed by

4 MPPS where is the item m selection probability, determined by auxiliary data with the assumption of the variance proportional to (a power of) the auxiliary variable value optimal allocation a desired item-level sample size

5 MPPS Development and application of the MPPS strategy at NASS: Amrhein, Hicks and Kott (1996) Amrhein and Bailey (1998) Bailey and Kott (1997) Hicks, Amrhein and Kott (1996) Kott, Amrhein and Hicks (1998).

6 A COMPARISON STUDY This study was designed to compare MPPS with the previously used SRS ((Stratified) Simple Random Sampling) strategy

7 THIS STUDY Explored the resampling approach to reveal the statistical characteristics/ behavior of NASS Ag survey data Raised issues for further investigation to improve our understanding and practice of NASS Ag survey sampling /estimation

8 RESAMPLING Population bootstrap Base sample June Crop Survey MPPS samples Pseudo population Composed of replicates of base sample elements, according to the (integerized) weight of the element

9 RESAMPLING Resamples Independent samples, drawn from by Poisson and SRS sampling strategies respectively

10 RESAMPLING Resample totals, and

11 RESAMPLING Resampling variance estimate for the sample total estimate Bootstrap statistic

12 DATA The crop component of the 2004 and 2005 June QAS, for all 42 participating states Certainty elements were eliminated from sample, to avoid unnecessary complication,

13 RESAMPLING VAR ESTIMATES Based on 1000 resamples Naive Comparison Log-Log Plot Resampling variance est vs sample total across crops – for each state Overlay: Poisson (*) vs SRS (^)

14 Naive Comparison General linear trend (Assumption: the variance proportional to a power of the total) For majority of crops, SRS variance appeared greater than Poisson variance (but often not appreciably)

15 Log-Log Plot of Resampling Variance Est vs Total Across Crops: CA Overlay: Poisson (*) vs SRS (^) pot srg saf sun dwh bar ctp oat ohy ctu wwh ric crn alf

16 Validness of the Comparison Need additional information to justify The quality of the resampling variance estimate depends on the statistical quality of the resample totals, which also provides evidence for the appropriateness of the sampling strategy Among various aspects, the most important: NORMALITY

17 Normality Q – Q plot of resample totals Demonstration: CA Most crops: Good shape of Q-Q Plot (Corn, Potatoes) Exception: Other Hay Evidence that Poisson was better than SRS

18

19

20

21

22

23

24 Outliers on the log-log plot Located far apart from the general trend The two sampling strategies gave appreciably different estimates Demonstration: CA: Other Hay MT: Potatoes Evidence that SRS was better

25 Log-Log Plot of Resampling Variance Est vs Total Across Crops: MT Overlay: Poisson (*) vs SRS (^) mus sun can pot saf fla crn oat ohy dwh bar alf wwh swh

26

27

28 FINITE SAMPLE RESAMPLING Complexities - Due to the special features of survey sampling Nonindependence arising in sampling without replacement Other complexities of finite population structure by designs and estimators

29 FINITE SAMPLE RESAMPLING Effects of discreteness (Davison & Hinkley, 1997, 2.3.2) Discrete empirical distribution and in particular, In finite population sampling, the pseudo population formed by replicates of sample elements

30 FINITE SAMPLE RESAMPLING Issues with this study Comparable sample size - Addressed by size adjustment Impact of the base sample - Not clear

31 Impact of Base Sample For finite population resampling, the general guideline The resampling population mimics the original population, and The resamples, mimic the base sample, drawn from by a design identical to the one by which the base sample was originally drawn (Sarndal, et al., 1992, Ch. 11)

32 AT ISSUE How the resampling technique should be correctly modified to accommodate the finite sampling situation?

33 AT ISSUE In literature, most reported finite sample resampling studies used (stratified) SRS, which bears the most similarity to the infinite population independent random sampling - the standard setting that the resampling technique is based on

34 SUMMARY An Approach Resampling & analysis of resamples, using statistical graphical and diagnostic techniques, to reveal statistical characteristics / behavior of NASS Ag survey data

35 SUMMARY Sampling strategy comparison Poisson seemed to be preferable to stratified simple random sampling A national comparison table of the two strategies across crops and states is to be produced for a comprehensive picture with likely causal factors identified

36 FURTHER INVESTIGATION To develop statistical understanding, the resampling setting of this study and other statistical information techniques will be further explored

37 FURTHER INVESTIGATION Behavior of Studentized bootstrap statistics Statistical function (Booth, Butler, and Hall, 1994; Davison & Hinkley, 1997) Examine different survey data

38 THANK YOU

39 ALFAlfalfa All Harvested Acres BAR Barley All Planted Acres CAN Canola All Planted Acres CRN Corn Planted Acres CTP Pima Cotton Planted Acres CTU Upland Cotton Planted Acres DEB Dry Beans Planted Acres DWH Durum Wheat Planted Acres FLA Flaxseed Planted Acres MUS Mustard All Planted Acres OAT Oats All Planted Acres OHY Other Hay Harvested Acres PNT Peanuts All Planted Acres POT Potatoes All Planted Acres RIC Rice All Grain Planted Acres RYE Rye All Planted Acres SAF Safflower All Planted Acres SGBSugarcane All Planted Acres* SOY Soybeans All Planted Acres SPT Sweet potatoes Planted Acres SRG Sorghum All Planted Acres SUG Sugarcane For Sugar Harvested Acres SUN Sunflowers All Planted Acres SWH Spring Wheat Irr Planted Acres WWHWinter Wheat All Planted Acres