February 1, 2011 Workshop: Persistent Identifiers for the Social Sciences 1 SOEP and DOI Requirements and Challenges Jan Goebel.

Slides:



Advertisements
Similar presentations
FDZ-RV Research Data Centre of the German Pension Insurance Integration of Metadata in FDZ-RV 1 Integration of Metadata in FDZ-RV (Research Data Centre.
Advertisements

Archiving Trevor Croft MICS3 Data Archiving, Dissemination and Further Analysis Workshop Geneva - November 6th, 2006.
Overview of Preliminary and Final Reports MICS3 Data Analysis and Report Writing Workshop.
The English Longitudinal Study of Ageing (ELSA) Data & Documentation 2008 Jibby Medina NatCen.
ESDS Resources Anthony Rafferty ESDS Government Centre for Census and Survey Research University of Manchester.
Documentation and Additional Resources Alexander Mack.
Dr. Markus Quandt GESIS – Leibniz-Institute for the Social Sciences Workshop: Persistent Identifiers for the Social Sciences University Club, Bonn, February.
Resources for Social Sciences
ICCS 2009 IDB Seminar – Nov 24-26, 2010 – IEA DPC, Hamburg, Germany Using the IEA IDB Analyzer to merge and analyze data.
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid Using the IEA IDB Analyzer to merge and analyze data.
Data Citation for the Social Sciences Mary Vardigan ICPSR CODATA Conference on Data Attribution and Citation August 22-23, 2011.
Access to and specifics of detailed national LFS data – the case of Slovenia Sebastian Kočar Social Science Data Archives University of Ljubljana 4th DwB.
Panel Micro-Databases for Socio- Economic Research in Europe: ECHP, CHER, CNEF & EPUNet.
Institute of Transport Research - Berlin-Adlershof 1 Clearing House for Transport Data & Transport Models IASSIST Strength in Numbers, Ottawa (Canada)
Hannele Keckman-Koivuniemi and Mari Kleemola : Data Processing in FSD : CHALLENGES IN A NEW ARCHIVE IASSIST2003 Ottawa,
EPUNet Training Course 2005 Day 1 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005.
A database-driven tool to create items, variables and questionnaires NEPS Metadata Editor.
The MetaDater Model and the formation of a GRID for the support of social research John Kallas Greek Social Data Bank National Center for Social Research.
IASSIST Conference 2006 – Ann Arbor, May Metadata as report and support A case for distinguishing expected from fielded metadata Reto Hadorn S I.
M. Fall, JP. Lorgnet et alii 26/02/2010 Individual Dynamics of Poverty, a study tackling changes in poverty in France via the SILC survey.
The education variables in the European Social Survey: Advantages in using the DDI for documentation Hilde Orten and Hege Midtsæter Norwegian Social Science.
Multiple Indicator Cluster Surveys Data Interpretation, Further Analysis and Dissemination Workshop Data Archiving.
 Access & Dissemination of Data from the SHS Lisa Taylor SHS Research Officer.
Implementing Digital Object Identifiers at the GESIS Data Archive for the Social Sciences Workshop “Persistent Identifiers for the Social Sciences” Bonn,
1 The planned use of DDI 3.0 within a German Research Data Center IASSIST, Session “Tools and Implementations of DDI 3.0”, May 27, 2009 Dana Müller.
World Bank, Africa Region, Africa Household Survey Databank - The World Bank - Africa.
Introduction to EU-SILC
DOI Registration for Social and Economic Data da|ra Brigitte Hausstein GESIS Leibniz-Institute for the Social Sciences, Berlin.
FORMS OF COOPERATION BETWEEN NATIONAL STATISTICAL INSTITUTES AND DATA ARCHIVES Sebastian Kočar (ADP, UL) First Regional Workshop – Microdata Access in.
Research Data Centre network for transnational access - four years of experiences by seven European RDCs Karen Dennison (UK Data Archive) and David Schiller.
Supporting transnational access to government microdata from four European countries Karen Dennison, SDS and David Schiller, IAB P resented by Karen Dennison,
Die ZBW ist Mitglied der Leibniz-Gemeinschaft Statistical Research Data on the Semantic Web SWIB 2012 Cologne, Germany Daniel Bahls Leibniz Information.
Workshop on International Standards, Contemporary Technologies and Regional Cooperation, Noumea, New Caledonia, 04–08 February 2008 Results Generated from.
Mara Cammarrota Italian National Institute of Statistics Development of Information System and Corporate Products, Information Management and Quality Assessment.
Assessing Quality for Integration Based Data M. Denk, W. Grossmann Institute for Scientific Computing.
Creating a collection of standardized datasets on household consumption Olivier Dupriez World Bank, Development Data Group 6 June.
1 Archiving Michael J. Levin Harvard Center for Population and Development Studies
Finding Microlevel Data for Economists at Princeton University: Education and Labor.
ELSA ELSA datasets and documentation available from the archive or by special arrangement Kate Cox National Centre for Social.
Statistics Canada Citizenship and Immigration Canada Longitudinal Survey of Immigrants to Canada Ryerson University April 16, 2004.
OECD’s approach to Manage, Publish, and Cite data 16 May-2011.
SOC 503 Techniques & Methods of Social Science Data Resources at Princeton University.
Working with EU-SILC: data files, variables and data management Practical computing session I – Part 1 Heike Wirth GESIS – Leibniz Institut für Sozialwissenschaften.
Colectica: A Platform for DDI 3 based Metadata Management Design. Collect. Share.
Gateway to Global Aging Data September 17 th, 2014 APRU Data Workshop Drystan Phillips.
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.
Data Collection with Surveybe
MICS Data Processing Workshop Multiple Indicator Cluster Surveys Data Processing Workshop Creating Analysis Files: Description of Preparation Steps.
13-Jul-07 Item 1 – Introduction. 13-Jul-07WG Core variables in social surveys Name of the presentation 16 Core Variables… 1.Geographic data I (linked.
Providing access to your data: Determining your audience Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
“Mobility into and out of poverty in 14 European countries” Author-Presenter: Eirini Andriopoulou ATHENS UNIVERSITY OF ECONOMICS AND BUSINESS DEPARTMENT.
Multiple Indicator Cluster Surveys Data Processing Workshop Overview of SPSS structural check programs and frequencies MICS Data Processing Workshop.
PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA …………………………………………………………………………………………………… LOUISE CORTI …………………….…………………………….… UK DATA ARCHIVE.
Estonian experience in implementation and maintenance of statistical database using PX-Web Eda Fros Statistics Estonia UNECE Workshop on Developing Data.
Introduction to Training Data. SILC data as delivered from Eurostat Features of training dataset 2.
The Reproducible Research Advantage Why + how to make your research more reproducible Presentation for the Center for Open Science June 17, 2015 April.
© Federal Statistical Officewww.forschungsdatenzentrum.de Workshop „Integrating European Census Microdata“ Barcelona, July 2005 Country Report for.
Open Science=Open Methodology Oshrat Hochman & Christof Wolf
Country report Germany
An Overview of Data-PASS Shared Catalog
SowiDataNet - A User-Driven Repository for Data Sharing and Centralizing Research Data from the Social and Economic Sciences in Germany Monika Linne, 30.
Questasy: Documenting and Disseminating Longitudinal Data Online with DDI 3 Edwin de Vet 11/14/2018.
Third Annual Financial Capability and Asset Building (FCAB) Convening Event: Afternoon Panel Wednesday, January 10, 2018, 1:30 pm – 5:30 pm     FCAB Over.

Third Annual Financial Capability and Asset Building (FCAB) Convening Event: Afternoon Panel Wednesday, January 10, 2018, 1:30 pm – 5:30 pm     FCAB Over.
in the data production process
DATA ACCESS IASSIST workshop on Access Policies and Licensing for Archives and Repositories Eric Balster (CentERdata) Cologne, May 28, 2013.
“Mobility into and out of poverty in 14 European countries”
Questasy: Documenting and Disseminating Longitudinal Data Online with DDI 3 Edwin de Vet 5/21/2019.
SOEP and DOI Requirements and Challenges
Presentation transcript:

February 1, 2011 Workshop: Persistent Identifiers for the Social Sciences 1 SOEP and DOI Requirements and Challenges Jan Goebel

February 1, 2011 Workshop: Persistent Identifiers for the Social Sciences 2 Content 1.SOEP Overview 2.Problems 3.Conclusions

February 1, 2011 Workshop: Persistent Identifiers for the Social Sciences 3 SOEP Overview Socio-Economic Panel Study (SOEP) is a representative longitudinal study of private households in Germany Annual survey since 1984 of about 10,000 households (around 20,000 persons) Some of the many topics include household composition, occupational biographies, employment, earnings, health and indicators of subjective well-being

February 1, 2011 Workshop: Persistent Identifiers for the Social Sciences 4 SOEP is an ongoing Survey Common with all panel surveys Each year we distribute an enhanced version with new and changed data Question are changing, new topics,... → We do a lot but not just replication! Even changes for „archived data“, like a change in the coding scheme of ISCO

February 1, 2011 Workshop: Persistent Identifiers for the Social Sciences 5 The SOEP currently (User DVD) consists of: –More than 320 data files –About Variables Granulation to choose for citation? –Complete SOEP distribution of one year? –„Connected“ SOEP parts, e.g. Individual questionnaires, HH-questionnaires, generated datasets –Each data file –Each Variable (for each year or only once, longitudinal concept?) SOEP is not one dataset but a complex data structure

February 1, 2011 Workshop: Persistent Identifiers for the Social Sciences 6 European user: 100% Version (English, German, different formats for SAS/SPSS/Stata/ASCII) Non-EU user: 95% Version (of cases) International comparative research: Part of the CNEF (Cross National Equivalent File) SOEP Geocodes (supplementary CD): Regional Planning Regions, Community types, etc. Country codes, Community codes, zip codes, microm: only by remote execution or at the Research Data Center (RDC SOEP) SOEP Pretests SOEP Related Studies „The SOEP” is available in different versions

February 1, 2011 Workshop: Persistent Identifiers for the Social Sciences 7 SOEP can change during the period, because of updates Updates of weighting schemes or even bug fixes (also possible for older waves) Sometimes more than one update between distributions (cumulative updates?) How can a user know what version she is using? Message-Digest Algorithm (MD5) Secure Hash Algorithm (SHA-2) Universal Numeric Fingerprint (UNF) Does rounding matter? German/English Labels, different formats (SPSS, STATA, …) Only update of a label bug?

February 1, 2011 Workshop: Persistent Identifiers for the Social Sciences 8 Conclusions Nesting of DOI should be possible: PrintDOISOEP exampleSOEP DOI Edited bookSurveySOEP DVD /soep.26 Article in bookData fileSOEP dataset $PGEN /soep.26.hgen Table in article in book VariableSOEP dataset $pgen variable ihinc$$ /soep.26.hgen.ihinc It should be possible for a user to identify the data, including version  The metadata of a DOI should include a SHA for each data file and format, which must also be persistent, like SHA-2 Commitment about the persistence of the data provider It is not enough to identify the data source to make an scientific empirical analysis reproducible, you normally need the syntax also

February 1, 2011 Workshop: Persistent Identifiers for the Social Sciences 9 Thank you for your attention!