1 Confidentiality and Data Access: Perspectives on Demographic Data Pat Doyle U.S. Census Bureau Prepared for the IASSIST Annual Conference, University of Connecticut, Storrs CT, June 2002

2 The Problem Seeking Balance

3 Survey Administrative Records

4 Survey Administrative Records

5 Survey Administrative Records

6 Administrative Data is Necessary but Not Sufficient
Full re-identification of sample cases not verifiable because
  – Noise/error in administrative data
  – Noise/error in survey data
  – Insufficient number of matching keys
Risk of re-identification slightly higher now than in the past due to reduced cost of accessing and using administrative lists
But what about selected re-identification?
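The point about noise and matching keys can be illustrated with a minimal sketch of exact-match linkage between a survey file and an administrative list. Everything here is hypothetical and not taken from the presentation: the function name `link_on_keys`, the field names, and the toy records are assumptions made for illustration only.

```python
from collections import defaultdict

def link_on_keys(survey, admin, keys):
    """Map each survey id to the admin ids that agree exactly on every key."""
    index = defaultdict(list)
    for rec in admin:
        index[tuple(rec[k] for k in keys)].append(rec["admin_id"])
    return {
        rec["survey_id"]: index.get(tuple(rec[k] for k in keys), [])
        for rec in survey
    }

# Hypothetical toy data (assumed, not from the slides).
survey = [
    {"survey_id": "s1", "age": 34, "sex": "F", "zip": "06268"},
    {"survey_id": "s2", "age": 51, "sex": "M", "zip": "06268"},
    {"survey_id": "s3", "age": 29, "sex": "F", "zip": "06269"},
]
admin = [
    {"admin_id": "a1", "age": 34, "sex": "F", "zip": "06268"},
    {"admin_id": "a2", "age": 34, "sex": "F", "zip": "06268"},  # same keys as a1
    {"admin_id": "a3", "age": 52, "sex": "M", "zip": "06268"},  # age off by one (noise)
    {"admin_id": "a4", "age": 29, "sex": "F", "zip": "06269"},
]

matches = link_on_keys(survey, admin, ["age", "sex", "zip"])
# Only a one-to-one link is a candidate re-identification.
unique_links = [sid for sid, hits in matches.items() if len(hits) == 1]
print(matches)
print(unique_links)
```

In this toy case s1 matches two administrative records (too few keys to tell them apart), s2 matches none (noise in one file), and only s3 links one-to-one. Even that link would still need outside verification.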

7 Administrative Records Local Announcements

8 Administrative and Local Records

9 Survey Administrative and Local Records

10 Local Information Increases Likelihood of Selected Re-Identification
Local information increases known attributes of people with rare characteristics
More verifiable information for re-identification purposes
In the absence of change in disclosure methods: increased risk of re-identification of selected situations with more and more accessible local information
Families at higher risk than individuals

11 Growth in Publicly Available Administrative Data (from Sweeney, 2001)
Data Source: Number of items by Year
Birth Certificates
Hospital Discharges 0663
Grocery Purchases 321272

12 Change in Disclosure Methods at the Census Bureau
Eliminate sub-state geography, or
Suppress information on rare characteristics or events, or
Perturb information on rare characteristics or events
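The three options on this slide can be sketched on a toy microdata file as follows. This is a minimal illustration under stated assumptions, not the Census Bureau's procedure: the function names, the `min_count` threshold, the noise `scale`, and the records are all hypothetical.

```python
import random
from collections import Counter

def drop_substate_geography(records, geo_fields=("county", "tract")):
    """Option 1: remove every geographic field below the state level."""
    return [{k: v for k, v in r.items() if k not in geo_fields} for r in records]

def suppress_rare(records, field, min_count=3, marker=None):
    """Option 2: blank out values of `field` held by fewer than min_count records."""
    counts = Counter(r[field] for r in records)
    return [
        {**r, field: r[field] if counts[r[field]] >= min_count else marker}
        for r in records
    ]

def perturb_rare(records, cat_field, num_field, min_count=3, scale=0.1, seed=0):
    """Option 3: add bounded multiplicative noise to num_field for rare categories."""
    rng = random.Random(seed)
    counts = Counter(r[cat_field] for r in records)
    out = []
    for r in records:
        r = dict(r)  # copy so the input is left unchanged
        if counts[r[cat_field]] < min_count:
            r[num_field] = round(r[num_field] * (1 + rng.uniform(-scale, scale)))
        out.append(r)
    return out

# Hypothetical toy file (assumed values).
records = [
    {"state": "CT", "county": "Tolland", "occupation": "teacher", "income": 52000},
    {"state": "CT", "county": "Tolland", "occupation": "teacher", "income": 48000},
    {"state": "CT", "county": "Windham", "occupation": "teacher", "income": 50000},
    {"state": "CT", "county": "Windham", "occupation": "astronaut", "income": 250000},
]

no_geo = drop_substate_geography(records)
suppressed = suppress_rare(records, "occupation")
perturbed = perturb_rare(records, "occupation", "income")
```

Each function returns a new list, so the options can be compared side by side on the same input. In practice the threshold and the noise distribution would be set by the releasing agency.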

13 Highlights of the Book
Confidentiality, Disclosure, and Data Access: Theory and Practical Applications for Statistical Agencies
By Pat Doyle, Julia Lane, Laura Zayatz and Jules Theeuwes

14 Current Methods and Data
Felsö et al.: Survey of disclosure methods in use
  – Most statistical agencies do not release demographic microdata to the public
  – Many agencies providing demographic microdata have limited access
Sweeney: Re-identification methods
  – Points out the accessibility and richness of private data
  – Illustrates the increased risk associated with data integrated across administrative data sources

15 Risk and Data Quality Assessments for Public Demographic Data
Elliot: Review of state-of-the-art in quantifying risk of disclosure
  – Absolute risk measure not feasible, relative risk may be
  – Risk is a function of sampling rate, level of detail, number of match keys, compatibility of keys
Domingo-Ferrer and Torra: Disclosure methods for microdata, information loss and risk
  – Summarize disclosure methods and information loss measures
  – Evaluate disclosure methods based on re-identification risk and information loss
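One rough way to see how risk depends on the number of match keys is to count the share of records that are unique on a chosen key combination. The sketch below is an assumption-laden illustration, not the measure reviewed in Elliot's chapter: `share_unique`, the field names, and the six records are hypothetical, and sample uniqueness is at best a relative indicator.

```python
from collections import Counter

def share_unique(records, keys):
    """Fraction of records whose combination of key values occurs exactly once."""
    combos = Counter(tuple(r[k] for k in keys) for r in records)
    unique = sum(1 for r in records if combos[tuple(r[k] for k in keys)] == 1)
    return unique / len(records)

# Hypothetical toy sample (assumed values).
sample = [
    {"age_group": "30-39", "sex": "F", "region": "NE", "occupation": "teacher"},
    {"age_group": "30-39", "sex": "F", "region": "NE", "occupation": "nurse"},
    {"age_group": "30-39", "sex": "M", "region": "NE", "occupation": "teacher"},
    {"age_group": "40-49", "sex": "M", "region": "S", "occupation": "farmer"},
    {"age_group": "40-49", "sex": "M", "region": "S", "occupation": "farmer"},
    {"age_group": "40-49", "sex": "F", "region": "S", "occupation": "farmer"},
]

# The unique share grows as more keys become available to an intruder.
one_key = share_unique(sample, ["age_group"])
two_keys = share_unique(sample, ["age_group", "sex"])
four_keys = share_unique(sample, ["age_group", "sex", "region", "occupation"])
print(one_key, two_keys, four_keys)
```

Comparing the same statistic before and after a disclosure method is applied gives a relative risk reading. It says nothing about absolute risk, which also depends on the sampling rate and on how compatible the keys are across files.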

16 New Disclosure Methods for Demographic Microdata
Abowd and Woodcock adopt Rubin’s multiple imputation strategy for demographic surveys to mask the underlying data

17 Access to Non-Public Data
Seastrom: Licensing and its use at NCES
Dunne: Secure research sites
Blakemore: Remote access

18 Perceptions
Singer: Individuals’ perceptions and attitudes toward confidentiality
Gerber: An ethnographic study of the relationship between privacy attitudes and response