Federal Statistical Office Germany Research Data Centre


Federal Statistical Office Germany, Research Data Centre
New Techniques and Technologies for Statistics 2009, Brussels, 18-20 February 2009
Special Session on Access to Microdata
“An informational infrastructure for the E-Science Age - On the way to remote data access for business data”
Maurice Brandt, Federal Statistical Office Germany, Research Data Centre

Overview
1. Introduction
2. Current situation at the research data centres
3. Content of the project “InfinitE”
4. Production of data structure files
5. Result-based confidentiality
6. Summary

1. Introduction
- Requests for (business) microdata are shifting towards microdata without data-perturbing methods, ideally original microdata -> more and more researchers ask for remote data execution or a safe centre
- This leads to a huge number of tables, which have to be checked for confidentiality
- The development at national level will probably also happen at EU level
- Researchers require more data, preferably non-anonymised microdata

2. Current situation in the RDCs
- Output checking: at present each researcher's output is checked by two persons (four-eyes principle), one person in the RDC and one person in the statistical division
- Only absolutely anonymous tables may be published
- Construction of combined and integrated datasets for business microdata -> difficult to anonymise
- Combined dataset: the data are first enriched with other data -> for anonymisation reasons this information then has to be deleted or strongly anonymised

2. Current situation in the RDCs
Why this project:
- Researchers still have reservations about data-perturbing methods for economic microdata
- Workload of manual output checking
- Increasing demand for original microdata

3. Content of the project “InfinitE”
- “An informational infrastructure for the E-Science Age - On the way to remote data access for business data” deals with the improvement of remote access at the Federal Statistical Office Germany
- The project aims to find solutions for better remote access in Germany through so-called data structure files and (automatic) output checking procedures
- Data structure files:
  - goal: semantically and syntactically correct data structure files
  - application to the original data without any adaptations

4. Production of data structure files
- Methods to produce data structure files:
  - stochastic noise
  - multidimensional microaggregation
  - synthetic data -> multiple imputation
- Test of confidentiality and measurement of re-identification risk
  - development of new procedures to measure the re-identification risk of synthetic data -> Joerg Drechsler: “Disclosure Control in Business Data” at this conference
- Assessment of the utility and applicability of data structure files
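Two of the methods named on this slide can be illustrated with a minimal sketch. It is not the project's implementation: the function names, the uniform noise distribution, the `spread` and `k` parameters, and the grouping by row sum (a crude stand-in for a real multidimensional distance) are all assumptions made for illustration.

```python
import random
from statistics import mean


def add_multiplicative_noise(values, spread=0.1, seed=0):
    """Multiply each value by a random factor from [1 - spread, 1 + spread].

    The uniform distribution and the default spread are illustrative assumptions.
    """
    rng = random.Random(seed)
    return [v * rng.uniform(1 - spread, 1 + spread) for v in values]


def microaggregate(records, k=3):
    """Replace each record by the mean of a group of at least k similar records.

    records: list of equal-length tuples of numbers.
    Similarity is approximated by sorting on the row sum (a simplification).
    """
    if len(records) < k:
        raise ValueError("need at least k records")
    order = sorted(range(len(records)), key=lambda i: sum(records[i]))
    # Cut the sorted order into groups of k records each.
    groups = [order[i:i + k] for i in range(0, len(order), k)]
    # A short final group is merged into the previous one so no group is below k.
    if len(groups) > 1 and len(groups[-1]) < k:
        groups[-2].extend(groups.pop())
    width = len(records[0])
    result = [None] * len(records)
    for group in groups:
        centroid = tuple(mean(records[i][j] for i in group) for j in range(width))
        for i in group:
            result[i] = centroid
    return result
```

Because every record is replaced by its group mean, column totals are preserved while each published record is shared by at least `k` units. Noise, by contrast, keeps one record per unit but distorts each value.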

5. Result-based confidentiality
Output checking procedures:
- Classification of outputs into “safe” and “unsafe” output
- Identification of output where anonymisation procedures are necessary
- Evaluation and development of practicable anonymisation methods for “unsafe” output
- The project also evaluates the analytical validity of the anonymised output
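An automatic classification of table cells into “safe” and “unsafe” could look roughly like the sketch below. The slides do not state which rules the project uses. The minimum-count rule and the dominance rule shown here are common disclosure-control checks, and the thresholds (`min_count=3`, `dominance_share=0.85`) and function names are assumptions chosen only for illustration.

```python
def classify_cell(contributions, min_count=3, dominance_share=0.85):
    """Return 'safe' or 'unsafe' for one table cell.

    contributions: non-negative values of the units that make up the cell.
    Both rules and their thresholds are illustrative assumptions.
    """
    # Minimum-count rule: too few contributing units.
    if len(contributions) < min_count:
        return "unsafe"
    # Dominance rule: the largest unit accounts for too much of the cell total.
    total = sum(contributions)
    if total > 0 and max(contributions) / total > dominance_share:
        return "unsafe"
    return "safe"


def classify_table(cells, **rules):
    """Classify every cell of a table given as {label: list of contributions}."""
    return {label: classify_cell(values, **rules) for label, values in cells.items()}
```

For example, `classify_table({"A": [5, 7, 9, 4], "B": [100, 2], "C": [950, 20, 30]})` returns `{"A": "safe", "B": "unsafe", "C": "unsafe"}`: cell B has too few contributors, and cell C is dominated by one unit. Only the cells flagged “unsafe” would then need an anonymisation step or manual review.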

5. Result-based confidentiality
- Confidentiality methods for tables and (regression) output
  - rounding, controlled tabular adjustment, stochastic noise
  - evaluation of automatic output checking procedures
- Feasibility study on changing the legal framework under which researchers publish tables
  - more responsibility for the researcher
  - this leads to less anonymisation and suppression in the output
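Of the table methods listed, rounding is the simplest to illustrate. The sketch below shows conventional rounding and unbiased random rounding of cell counts. The base of 5 and the function names are assumptions, and controlled tabular adjustment, which requires an optimisation step, is not covered here.

```python
import random


def round_to_base(count, base=5):
    """Round a non-negative integer count to the nearest multiple of base (ties round up)."""
    return base * ((count + base // 2) // base)


def random_round(count, base=5, rng=random):
    """Round a count down or up to a multiple of base.

    The probability of rounding up is remainder / base, so the expected
    value of the result equals the original count.
    """
    lower = base * (count // base)
    remainder = count - lower
    if remainder == 0:
        return count
    return lower + base if rng.random() < remainder / base else lower
```

With base 5, `round_to_base(7)` gives 5 and `round_to_base(13)` gives 15, while `random_round(12)` returns either 10 or 15. Random rounding hides the exact count but stays unbiased on average, which matters for the analytical validity of the output.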

6. Summary
- User needs and requirements for microdata access are visibly changing
- This national project will improve the data infrastructure in Germany to take these developments into account
- It is time to change the remote data execution procedure; otherwise the amount of output will no longer be manageable
- National and ESSnet projects can benefit from each other

Thank you for your attention
Maurice Brandt
Research Data Centre, Federal Statistical Office Germany
Tel. +49 611/75 4349
maurice.brandt@destatis.de
http://www.forschungsdatenzentrum.de
http://www.destatis.de