INFO 7470/ECON 7400/ILRLE 7400 Understanding Social and Economic Data John M. Abowd and Lars Vilhuber January 21, 2013.

Slides:



Advertisements
Similar presentations
1 Longitudinal Employer- Household Dynamics (LEHD) Program Jeremy S. Wu U.S. Census Bureau May 11, 2005 Jeremy S. Wu U.S. Census Bureau May 11, 2005.
Advertisements

1 The Synthetic Longitudinal Business Database Based on presentations by Kinney/Reiter/Jarmin/Miranda/Reznek 2 /Abowd on July 31, 2009 at the Census-NSF-IRS.
Business Register Outputs in Support of Regional Policy John Perry UK Office for National Statistics.
What are Wage Records? Wage records are an administrative database used to calculate Unemployment Insurance benefits for employees who have been laid-off.
© 2007 John M. Abowd, Lars Vilhuber, all rights reserved Introduction to Record Linking John M. Abowd and Lars Vilhuber April 2007.
Semi-Permeable Boundaries Among Institutions: Non-Public Data and the Census RDC at Berkeley IASSIST 2009 – Tampere, Finland Jon StilesMay 27, 2009.
Nordisk Statistikermøde i København august 2010 The archive statistical method years - A Summary by Svein Nordbotten 8/11/20101Svein.
© John M. Abowd 2005, all rights reserved Household Samples John M. Abowd March 2005.
© John M. Abowd 2005, all rights reserved Analyzing Frames and Samples with Missing Data John M. Abowd March 2005.
INFO 4470/ILRLE 4470 Social and Economic Data Introduction John M. Abowd and Lars Vilhuber January 24, 2011.
INFO 4470/ILRLE 4470 Social and Economic Data Federal Data Providers (extras) John M. Abowd and Lars Vilhuber January 31, 2011.
Synthetic Data – A New Future for Public Use Micro-Data? John M. Abowd December 7, 2004.
INFO 4470/ILRLE 4470 Social and Economic Data Populations and Frames John M. Abowd and Lars Vilhuber February 7, 2011.
INFO 4470/ILRLE 4470 Social and Economic Data The 2010 Census of Population John M. Abowd and Lars Vilhuber January 26, 2011.
Are Public Use (Micro) Data a Thing of the Past? John M. Abowd Cornell University US Census Bureau Prepared for IASSIST 2002.
© John M. Abowd 2005, all rights reserved Sampling Frame Maintenance John M. Abowd February 2005.
© John M. Abowd 2005, all rights reserved Statistical Programs of the Federal Government John M. Abowd February 2005.
© John M. Abowd and Lars Vilhuber 2005, all rights reserved Introduction to Probabilistic Record Linking John M. Abowd and Lars Vilhuber March 2005.
Recent Advances In Confidentiality Protection – Synthetic Data John M. Abowd April 2007.
(In)Consistency of Economic Data across Federal Statistical Agencies: What Information Professionals Can Do September 29, 2014 Katherine R. Smith, Executive.
© John M. Abowd 2005, all rights reserved Economic Surveys John M. Abowd March 2005.
© John M. Abowd 2005, all rights reserved Introduction John M. Abowd January 2005.
Profile of US Data Sources on Entrepreneurship Richard Clayton and Jim Spletzer US Bureau of Labor Statistics OECD Entrepreneurship Indicators Steering.
INFO 4470/ILRLE 4470 Register-based statistics by example: County Business Patterns John M. Abowd and Lars Vilhuber February 14, 2011.
INFO 7470/ILRLE 7400 Universes, Populations, Frames, and Sampling John M. Abowd and Lars Vilhuber February 1, 2011.
Treasure Trove of Data: Conducting Research Using Federal Statistical Surveys.
Use of administrative data in statistics - challenges and opportunities ICES III End Panel Discussion Montreal, June 2007 Heli Jeskanen-Sundström Statistics.
INFO 7470/ILRLE 7400 Survey of Income and Program Participation (SIPP) Synthetic Beta File John M. Abowd and Lars Vilhuber April 26, 2011.
Improvements in the BLS Business Register Richard Clayton David Talan 12th Meeting of the Group of Experts on Business Registers Paris, France September.
Data Sharing to Reduce Respondent Burden for the U.S. Census Bureau’s Business Register Presented to 12 th Meeting of the Group of Experts on Business.
Household Surveys ACS – CPS - AHS INFO 7470 / ECON 8500 Warren A. Brown University of Georgia February 22,
INFO 7470/ILRLE 7400 Statistical Tools: Missing Data Methods John M. Abowd and Lars Vilhuber March 15, 2011.
Labor Market Information in the Americas: the United States Workshop On Labor Migration and Labor Market Information Systems Inter-American Network for.
Introduction to Record Linking John M. Abowd and Lars Vilhuber April 2011 © 2011 John M. Abowd, Lars Vilhuber, all rights reserved.
Liesl Eathington Iowa Community Indicators Program Iowa State University October 2014.
1 Supplementing ACS: The LEHD Program Jeremy S. Wu Marc Roemer U.S. Census Bureau May 12, 2005 Jeremy S. Wu Marc Roemer U.S. Census Bureau May 12, 2005.
INFO 7470/ILRLE 7400 Statistical Tools: Edit and Imputation John M. Abowd and Lars Vilhuber March 25, 2013.
Record matching for census purposes in the Netherlands Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics Netherlands.
Emerging methodologies for the census in the UNECE region Paolo Valente United Nations Economic Commission for Europe Statistical Division International.
Economics 2327 Economic Data Sources. DataFerrett Federal Reserve System United State Census Bureau of Economic Analysis U.S. Labor Department.
© John M. Abowd 2007, all rights reserved Analyzing Frames and Samples with Missing Data John M. Abowd March 2007.
1 Longitudinal Employer- Household Dynamics (LEHD) Program Jeremy S. Wu U.S. Census Bureau May 11, 2005 Jeremy S. Wu U.S. Census Bureau May 11, 2005.
© Copyright ONS Joint ECE/EUROSTAT work session on Population Censuses Geneva November 2004 Ian White.
Longitudinal Data Recent Experience and Future Direction August 2012.
Assessing Disclosure for a Longitudinal Linked File Sam Hawala – US Census Bureau November 9 th, 2005.
MCRDC Michigan Census Research Data Center The MCRDC is a joint project of the U.S. Bureau of the Census and the University of Michigan to enable qualified.
INFO 4470/ILRLE 4470 Ethical Aspects of Data Collection and Privacy Protection John M. Abowd and Lars Vilhuber March 30, 2011.
2008 NCHS Data Users’ Conference Omni Shoreham Hotel Washington, DC Wednesday, August 13, 2008.
Editing of linked micro files for statistics and research.
© John M. Abowd 2007, all rights reserved General Methods for Missing Data John M. Abowd March 2007.
Business model Transformation Strategy (BmTS) John Pearson and Tracey Savage Statistics NZ’s.
© John M. Abowd 2005, all rights reserved Assessing Data Quality John M. Abowd April 2005.
U.S. DEPARTMENT OF HEALTH AND HUMAN SERVICES Centers for Disease Control and Prevention National Center for Health Statistics Improving Estimates of the.
Understanding Social and Economic Data Technical notes John M. Abowd and Lars Vilhuber February 1, 2016.
INFO 4470/ILRLE 4470 Visualization Tools and Data Quality John M. Abowd and Lars Vilhuber March 16, 2011.
© John M. Abowd 2005, all rights reserved Using the Decennial Census of Population and Housing John M. Abowd February 2005.
Using administrative data to produce official social statistics New Zealand’s experience.
The LEHD Program and Employment Dynamics Estimates Ronald Prevost Director, LEHD Program US Bureau of the Census
Marc Hamel and Julie Trépanier May 21, 2014 Canadian Statistical Demographic Database: A research project.
INFO 7470/ECON 7400/ILRLE 7400 Register-based statistics John M. Abowd and Lars Vilhuber March 4, 2013 and April 4, 2016.
INFO 7470 Statistical Tools: Edit and Imputation Examples of Multiple Imputation John M. Abowd and Lars Vilhuber April 18, 2016.
Measuring Data Quality in the BLS Business Register Richard Clayton Sherry Konigsberg David Talan WiesbadenGroup on Business Registers Tallin, Estonia.
INFO 7470/ECON 7400/ILRLE 7400 Alternate Data Sources of the 21 st Century John M. Abowd and Lars Vilhuber March 4, 2013 and April 4, 2016.
Using Census Data at the Federal Statistical Research Data Centers Barbara A. Downs Director, FSRDC Center for Economic Studies U.S. Census Bureau.
INFO 7470 Session 12 Updates John M. Abowd and Lars Vilhuber April 25, 2016.
Understanding Social and Economic Data
John M. Abowd and Lars Vilhuber February 16, 2011
Sub-regional workshop on integration of administrative data, big data
Presentation transcript:

INFO 7470/ECON 7400/ILRLE 7400 Understanding Social and Economic Data John M. Abowd and Lars Vilhuber January 21, 2013

Session 1: History and Current State of the Federal Statistical System ● Overview of the (U.S.) federal statistical infrastructure, and how it came to be ● Guest lecture by Margo Anderson Margo Anderson 1/21/20132 © John M. Abowd and Lars Vilhuber 2013, all rights reserved

Session 2: Technical Statistical Terminology and Tools Censuses, surveys, administrative records, contextual data, genomes, spatial records, web sources Populations Frames Sampling Coverage Bias Other errors 1/21/20133 © John M. Abowd and Lars Vilhuber 2013, all rights reserved

Session 3: Measuring People and Households Censuses of population Goals and methods Guest lecture by Warren Brown U.S. Decennial Census of Population and Housing American Community Survey Current Population Survey American Housing Survey 1/21/20134 © John M. Abowd and Lars Vilhuber 2013, all rights reserved

Session 4: Measuring Business and Economic Activity The fundamental concepts of the national income and product accounts Business entities Frame management – Census – BLS Births and deaths The Employer Business Register Economic Censuses Establishment Surveys 1/21/20135 © John M. Abowd and Lars Vilhuber 2013, all rights reserved

Session 5: Enhancing Traditional Methods ● Health surveys, Program participation surveys – SIPP – NHIS – Benefit recipient surveys ● Survey-driven linkages – Validation studies (CPS, PSID) – Two-stage sampling schemes (Canada's Workplace and Employee Survey, WES) WES 1/21/20136 © John M. Abowd and Lars Vilhuber 2013, all rights reserved

Session 5: Introduction to Integrated Data Systems ● Validation, augmentation using administrative data – SIPP linked to SSA – Retirement History Survey (RHS) linked to SSA – Health and Retirement Survey (HRS) linked to SSA ● Integrating data from multiple sources – What is it? – Key tools – Where is it applied 1/21/20137 © John M. Abowd and Lars Vilhuber 2013, all rights reserved

Session 6: 21 st Century Statistical Systems ● Integrated administrative data systems – Longitudinal Employer-Household Dynamics (LEHD) data – IRS linked data ● Register-based Censuses – Variants in Europe – Canada: augmenting the Census with administrative data – US: planning 2020? 1/21/20138 © John M. Abowd and Lars Vilhuber 2013, all rights reserved

Session 6: 21 st Century Statistical Systems ● Non-traditional data collection methods – Administrative – Electronic (web) – Non-traditional sources (Google, Twitter, etc.) ● Confidentiality and access methods – RDCs – Enclaves – Methods of gaining access – Justifications for gaining access – Learning about confidential data 1/21/20139 © John M. Abowd and Lars Vilhuber 2013, all rights reserved

Session 7: Replicable science ● How do you find literature? – (assumed to be known) – Review of how to cite literature ● Assessing quality of replicability ● Some tools ● Assessing quality of replicability ● Some tools ● How do you find data? – Less well developed, if at all – Referencing data used in articles – Developing standards on how to cite data – The conundrum of how to cite confidential data 1/21/ © John M. Abowd and Lars Vilhuber 2013, all rights reserved

Sessions 8-10: Statistical tools ● Edit and imputation – Why and when? – Methods – Do it yourself! ● Record linkage – Why, when, and what? – Methods and tools – Do it yourself! 1/21/ © John M. Abowd and Lars Vilhuber 2013, all rights reserved

Sessions 8-10: Statistical tools ● Disclosure limitation methods – Why and when? – Methods – Do it yourself! ● Session 13 – Releasing synthetic data combines many of these tools: ● Extreme case of imputation ● Use as a disclosure limitation method ● Record linkage as a way to prove that protections are valid 1/21/ © John M. Abowd and Lars Vilhuber 2013, all rights reserved

Sessions 11-13: More Tools Geographic Information Systems Guest lecture by Nicholas Nagle Basic Geocoding Spatial data analysis methods 1/21/ © John M. Abowd and Lars Vilhuber 2013, all rights reserved

Sessions Modeling integrated data The relational database model Alternative representations: graphs, networks Bayesian methods for edit, imputation, estimation Synthetic data used to represent the simulation outputs 1/21/ © John M. Abowd and Lars Vilhuber 2013, all rights reserved

TECHNICAL SETUP 1/21/2013 © John M. Abowd and Lars Vilhuber 2013, all rights reserved 15

Technical setup Primary website: ell.edu/info7470/

Submitting labs and quizzes

Course Management System

Entering the CMS

First-time CMS password reset 1/21/2013 © John M. Abowd and Lars Vilhuber 2013, all rights reserved 20

In the CMS

In the CMS course page

Video setup ● Two-screen setup – Screen 1 will have people – Screen 2 will have slides/live demos/etc – Is this what you are seeing now? ● Recording will merge both streams

Video etiquette ● Please mute your mike! (Source: BY-NC 2.0) BY-NC 2.0

Video etiquette ● If asking questions, try and get close to the mike ● If speaking, try and get the camera focussed on the person asking the question

Recording ● We will be recording all sessions ● Recording will focus primarily on the main camera, but all pictures and sounds are liable to be recorded and made available in the recorded classes ● We will edit the recordings after the class ends, with the goal of making a MOOC-like experience possible in ● All presenters will be asked for permission; other participants will not appear in those recordings