Meta Dater Metadata Management and Production System for surveys in Empirical Socio-economic Research A Project funded by EU under the 5 th Framework Programme.

Slides:



Advertisements
Similar presentations
MICS4 Survey Design Workshop Multiple Indicator Cluster Surveys Survey Design Workshop Data Archiving.
Advertisements

Metadata Management at GESIS-ZA Reiner Mauer GESIS – Data Archive and Data Analysis CESSDA-Expert Seminar Odense, September 11th 2008.
SDMX in the Vietnam Ministry of Planning and Investment - A Data Model to Manage Metadata and Data ETV2 Component 5 – Facilitating better decision-making.
Federal Department of Home Affairs FDHA Federal Statistical Office FSO Meeting of the OECD Expert Group on SDMX September, OECD, Paris Centralized.
A Toolbox for Blackboard Tim Roberts
JTX Overview Overview of Job Tracking for ArcGIS (JTX)
Why, what were the idea ? 1.Create a data infrastructure, 2.Data + the knowledge products that are produced on the basis of data a) Efficiant access to.
IASSIST / IFOD: Mobile Data and the Life Cycle – Tampere, Finland May 26-29, 2009 Lifecycle & Comparative Studies Metadata Needs of the Future CESSDA RI.
CHARMCATS: Harmonisation demands for source metadata and output management CESSDA Expert Seminar: Towards the CESSDA- ERIC common Metadata Model and DDI3.
Metadata to Support the Survey Life Cycle Alice Born, Statistics Canada Joint UNECE/Eurostat/OECD Work Session on Statistical Metadata (METIS) Geneva,
ESSnet on SDMX phase II Laura Vignola ISTAT Rome, 3-4 December 2012.
MP IP Strategy Stateye-GUI Provided by Edotronik Munich, May 05, 2006.
& Slovak Archive of Social Data “SASD” and DDI 3 IASSIST/IFDO Conference, Tampere, Juraj.
Group & Resource Package - Potentials to re-use metadata with DDI 3 - Uwe Jensen, GESIS – cessda Expert Seminar Nov Ljubljana, Slovenia Group &
STARDAT DATA ARCHIVING SUITE European Survey Research Association (ESRA), July 18 – 22, 2011, Lausanne, Switzerland Monika Linne, Evelyn Brislinger, Wolfgang.
Meta Dater Metadata Management and Production System for Surveys in Empirical Socio-economic Research A Project funded by EU under the 5 th Framework Programme.
DDI 3.0 Conceptual Model Chris Nelson. Why Have a Model Non syntactic representation of the business domain Useful for identifying common constructs –Identification,
IASSIST Conference 2006 – Ann Arbor, May Metadata as report and support A case for distinguishing expected from fielded metadata Reto Hadorn S I.
Codebook Centric to Life-Cycle Centric In the beginning….
Evelyn Brislinger, Wolfgang Zenk-Möltgen
Managing the Metadata Lifecycle The Future of DDI at GESIS and ICPSR Peter Granda, ICPSR Meinhard Moschner, GESIS Mary Vardigan, ICPSR Joachim Wackerow,
Data Management: Documentation & Metadata Types of Documentation.
IPUMS to IHSN: Leveraging structured metadata for discovering multi-national census and survey data Wendy L. Thomas 4 th Conference of the European Survey.
2 nd Training Workshop 4 – 5 June 2007 Common Data Index - CDI By Dick M.A Schaap Technical Coordinator SeaDataNet.
Implementing Digital Object Identifiers at the GESIS Data Archive for the Social Sciences Workshop “Persistent Identifiers for the Social Sciences” Bonn,
United Nations Statistics Division
WP.5 - DDI-SDMX Integration
MEDIN Data Guidelines. Data Guidelines Documents with tables and Excel versions of tables which are organised on a thematic basis which consider the actual.
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
Case Studies: Statistics Canada (WP 11) Alice Born Statistics UNECE Workshop on Statistical Metadata.
Implementing ESS standards for reference metadata and quality reporting at Istat Work Session on Statistical Metadata Topic (i): Metadata standards and.
DOI Registration for Social and Economic Data da|ra Brigitte Hausstein GESIS Leibniz-Institute for the Social Sciences, Berlin.
Metadata management and statistical business process at Statistics Estonia Work Session on Statistical Metadata (Geneva, Switzerland 8-10 May 2013) Kaja.
SDMX and DDI Working Together Technical Workshop 5-7 June 2013
Development of metadata in the National Statistical Institute of Spain Work Session on Statistical Metadata Genève, 6-8 May-2013 Ana Isabel Sánchez-Luengo.
European Interoperability Architecture e-SENS Workshop : Collecting data for the Cartography Tool 7-8 January 2015.
Metadata Models in Survey Computing Some Results of MetaNet – WG 2 METIS 2004, Geneva W. Grossmann University of Vienna.
« 8-11 July 2008 « Metadata Life Cycle « STATISTICS PORTUGAL.
February 17, 1999Open Forum on Metadata Registries 1 Census Corporate Statistical Metadata Registry By Martin V. Appel Daniel W. Gillman Samuel N. Highsmith,
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
United Nations Regional Seminar on Census Data Archiving for Africa, Addis Ababa, Ethiopia, September, 2011 Documentation and Cataloguing in Data.
The european ITM Task Force data structure F. Imbeaux.
IBISAdmin Utah’s Web-based Public Health Indicator Content Management System.
Experts Workshop on the IPT, v. 2, Copenhagen, Denmark The Pathway to the Integrated Publishing Toolkit version 2 Tim Robertson Systems Architect Global.
United Nations Economic Commission for Europe Statistical Division Mapping Data Production Processes to the GSBPM Steven Vale UNECE
Developing Statistical Information Systems and XML Information Technologies - Possibilities and Practicable Solutions Geneva,
DDI AND EXPERIENCES AT ICPSR Prepared for Expert Seminar Finnish Social Science Data Archive Tampere, Finland September 1-2, 2000.
ICP Tool Pack ICP Regional Coordinators’ Meeting Washington, DC May 19, 2004 Vilas Mandlekar The World Bank.
Eurostat 4. SDMX: Main objects for data exchange 1 Raynald Palmieri Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October.
Metadata Working Group Jean HELLER EUROSTAT Directorate A: Statistical Information System Unit A-3: Reference data bases.
MetaPlus Klas Blomqvist Statistics Sweden Research and Development – Central Methods
RECENT DEVELOPMENT OF SORS METADATA REPOSITORIES FOR FASTER AND MORE TRANSPARENT PRODUCTION PROCESS Work Session on Statistical Metadata 9-11 February.
5.8 Finalise data files 5.6 Calculate weights Price index for legal services Quality Management / Metadata Management Specify Needs Design Build CollectProcessAnalyse.
FORSbase SEEDS meeting May 5 th, 2015, Lausanne Bojana Tasic.
Prepared by: Galya STATEVA, Chief expert
What’s New in Colectica 5.3 Part 1
SDMX Information Model
Questasy: Documenting and Disseminating Longitudinal Data Online with DDI 3 Edwin de Vet 11/14/2018.
The Re3gistry software and the INSPIRE Registry
Workshop on the Validation of Waste Statistics
Generic Statistical Business Process Model (GSBPM)

Enhancing ICPSR metadata with DDI-Lifecycle
Social Research Methodology and Supplementary Documentation John Kallas University of the Aegean, Department of Sociology.
ESS.VIP VALIDATION An ESS.VIP project for mutual benefits
Mapping Data Production Processes to the GSBPM
The Next Generation of the Microdata Information System MISSY: An Integrated Solution for the Documentation of European Microdata European DDI User Conference,
Questasy: Documenting and Disseminating Longitudinal Data Online with DDI 3 Edwin de Vet 5/21/2019.
Work Session on Statistical Metadata (Geneva, Switzerland May 2013)
Presentation transcript:

Meta Dater Metadata Management and Production System for surveys in Empirical Socio-economic Research A Project funded by EU under the 5 th Framework Programme Key Action: Improving the Socio-economic Knowledge Base - Data infrastructure – HPSE-CT Project period – MetaDater: Towards Standards and Tools to document comparative surveys Internet: Ekkehard Mochmann (Project Co-ordinator) Uwe Jensen (Project Manager) GESIS / ZA University of Cologne Edinburgh, May 27, 2005

Objectives & Products Standards for Metadata Data model for metadata covering the life-cycle of complex comparative surveys on space & time axis; no panel Efficient User system for Data Collector and Provider to produce, manage & exchange metadata according to new standards

Project Status 2003 – User analysis Data archive / Researcher & field work institute 2. Data modelling –Design of a unique data model based on analysis of events & processes of the life-cycle of a comparative studies –Conceptual Data models > Relational Data model (2 nd edition) –Provision & Evaluation of the Data model by DDI / CSDI 3. System Architecture & Functional Core –Design of 3 tier system (GUI, Application Server, RDBMS) –Programming in Java 4. Application development & programming –Modular system for Data provider & collector 5. Usability –Preparation of a test database & tests 6. Dissemination activities

1. User Analysis - Results Archive – Analysis on work sequences & systems Common shared aspects Acquisition > Entry control & Study documentation > Archiving > Dissemination Differences in detail (Archives; Studies) Complexity: Cross-section studies > comparative Studies Technical & functional differences in used systems Researcher & Research institutes Capture of Metadata at first occurrence General Interest vs. missing concepts or tools Specific Interests given Capture of Metadata on Study design & Field work Support on Questionnaire development / Question DB Easy exchange of metadata between researcher & archives

1. Conceptual Data Model Major Life-cycle phases of Studies 1.Study design and documentation 2.Questionnaire development & testing 3.Sample selection & field work 4.Data & document. control – Data Processing of Cross-section 5.Study extension, harmonization & standardization 6.Indexing & Classification 7.Data and Metadata publication & distribution 8.Analysis, correction and update of Metadata Documentation Recent discussions & evaluation in the DDI & CSDI network Study Life-cycle is adopted in DDI data modelling; DDI version 3 shall be compatible After evaluation provision to interested users 2. Relational Data Model Relational schema of Metadata & metadata management by RDBMS Compatible with DDI v2.0 > extension for comparative Studies 2. Data modelling of metadata

3. System Architecture 3-tier system User interface for the modules Application server Relational database management system

Overview: M1 Study description M2 Q/V editor – define standard as a reference M3 Process single data set (cross-section; comparative study) M4 Process extension: harmonize & standardise comparative data M5 Import & Export of XML files or SPSS metadata 4. User Modules & functional components

Study description with metadata sections on Project > Study > Data set > Design - Questionnaire – Data - Context – Classification Module M1 Study Description (cross-section dataset)

Basic task – define the standard for a study or a study collection –Capture or edit questions from the (basic) questionnaire –Define the depending variables add variables to the data file structure (standard setup) –Organise, route & filter, index & classify questions - variables Principle options to work on questions with the Q/V editor: –Import (basic) questionnaire or questions to DB Xml compatible & provided by a project; from XML codebook –Copy from database complete basic questionnaire (e.g. replicate ISSP) single questions from previous wave (e.g. EB trend questions) –Capture Questions from scratch with the Q/V editor Q/V screens to capture of different question types : –Question type 1: Single response on simple Question –Question type 2: Single response on Question with Items –Question type 3: Multiple response by Answer Instances (1st. item; 2nd. Item ) –Question type 4: Multiple response by Answer categories (Mentioned / not mentioned ) –Question type 5: Multiple response on single Items of a question (Grid) Module M2 Q/V editor

Module M2 Q/V editor / example Question type 4: Multiple response on Question with Items Step1: > select Question type > enter 1.st / last Item No. … > CREATE activates Question / Variable pane > offers respective no. of table lines for - Items 1 to 17 - depending variables Step2: enter Question text … & Variable definitions Step 3 : next screen

Module M2 Q/V editor / example Question type 4: Multiple response on Question with Items Step 3 at Tab Value/ Missing > select valid & missing values

Application design & programming M3 – Process single data set (cross-section; comparative study) with reference to the Q/V standard  Document deviations in question wording or demographic items  Process metadata of the single datasets load SPSS data and adjust them to the standard  Technical standardisation of space or time bound variables  Adjust the variables to the standard (recode, rename, filtering etc) M4 - Process Extension: harmonize & standardise comparative data –integrated dataset (space compound datasets) > e.g. ISSP –cumulated dataset (time compound datasets) > e.g. ALLBUS –cumulated-integrated dataset > e.g. EB Trend file –Generalized dataset harmonization Index & classify on Study and variable level M5 - Import & Export of XML or SPSS based metadata –Metadata exchange & publications (XML) > SD, CB, Quest. – SPSS metadata: portable / syntax / setup –Access via WWW front-ends (MADIERA; local systems;) Preparation on Usability testing & user workshops Other modules to come 2005

The end of the presentation

1. Conceptual Data Model Life-cycle of Studies from the Concept > Dissemination Relational schema of Metadata Compatible with DDI v2.0 > extension for comparative Studies After evaluation provision to interested users 2. Relational Data Model Model implemented at the RDBMS; DD & ERD Recent evaluation in the DDI network Study Life-cycle is adopted in DDI data modelling DDI version 3 shall be compatible Detailed analysis is under work of DDI expert groups 2. Data modelling of metadata

2.1 Conceptual Model II. Behavioural model Main process groups along the study life-cycle 1. Study design and documentation Concept, Questionnaire, Field work, Data / Study description 2. Contact and Data acquisition 3. Data & document. control – Process Cross-section 4. Study extension, harmonization & standardization 5. Indexing & Classification 6. Data and Metadata publication & distribution 7. Correction and Update of Metadata Documentation

Conceptual model - Concepts I Model for simple & replicated survey Simple cross-section: Project > Study with > one design > one set of data > a set of Variables > Values Repeated over space and time: Basic object: dataset requires proper definition to be managed by the system single dataset = one sample at one time compound dataset = collection of datasets > many sets of data collected in different time instances (time compound) collected in different spaces (space compound) processed dataset = integration of single data sets to a new one cumulative (covering different time instances) integrated (covering different space instances) cumulated-integrated (covering time and space instances)

Conceptual model Concepts II Types of Metadata (app 7) Definitional –Data definition, methodology, measurement Operational –Researchers, Use of data Contextual –Social background; party information … Documents Classification –Study, variable, question level by variety of vocabularies Administrative –Embargo, access categories, responsibility to change data …

Conceptual model Concepts III Integration & Cumulation processes (app 8) Cumulation (z. B. ISSP) Time series (z. B. Politbarometer; ALLBUS) Trends (Eurobarometer) Harmonisation of different Data sets involved datasets already in the systems Harmonisation over space or time Procedures: ex post overall – or adding new dataset Integration of collections of studies into the system Import already processed metadata to the system

Conceptual model Concepts IV Questionnaire documentation & Question DB (app9) The Questionnaire Questionnaire ‘editor’ Import & export questionnaires Indexing of questions & questionnaire The questions – to be classified Question types Questions forms Relation between dataset and questionnaire & questions

Documentation object (DOID = Document object id (~ ISBN) to identity uniquely every object in system Assign administrational attributes to DO (not to each basic entity) Attributes like: who, when, release, permission level or subtypes of document objects Study Data set Publication Classification Object of observation Attribute Conceptual model Concepts V

2.2 Relational Model E-R-D Schema & DD

3-tier system User interface for the modules Application server Relational database management system 3. System Architecture

Question type 1: Single response on simple Question 4.1 Modules – M3 Q/V editor / example 1 Step1: select Question type and define … Open empty fields > step 2 Step2: enter Question and Variable information Step 3: search and select Answer scale / missing values

Question type 2: Single response on Question with items 4.1 Modules – M3 Q/V editor / example 2 Step2: enter Question and Variable information Step 3: Select Tab for the Answer text / missing value search and select Answer scale / missing values Result > next screen Step1: select Question type and define … Open empty fields > step 2

Question type 2: Single response on Question with items 4.1 Modules – M3 Q/V editor / example 2 Step 3 Tab with select Answer scale / missing values