3 rd Annual European DDI Users Group Meeting, 5-6 December 2011 The Ongoing Work for a Technical Vocabulary of DDI and SDMX Terms Marco Pellegrino Eurostat.

Slides:



Advertisements
Similar presentations
The SDMX Registry Model April 2, 2009 Arofan Gregory Open Data Foundation.
Advertisements

Status on the Mapping of Metadata Standards
SDMX in the Vietnam Ministry of Planning and Investment - A Data Model to Manage Metadata and Data ETV2 Component 5 – Facilitating better decision-making.
1 Metadata Registry Standards: A Key to Information Integration Jim Carpenter Bureau of Labor Statistics MIT Seminar June 3, 1999 Previously presented.
SDMX and DDI: How Do They Fit Together in Practical Terms? Arofan Gregory The Open Data Foundation European DDI User’s Group 2011 Gothenburg, Sweden.
Codebook Centric to Life-Cycle Centric In the beginning….
Modernizing the Data Documentation Initiative (DDI-4) Dan Gillman, Bureau of Labor Statistics Arofan Gregory, Open Data Foundation WICS, 5-7 May 2015.
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
The use and convergence of quality assurance frameworks for international and supranational organisations compiling statistics The European Conference.
Neuchâtel Terminology Model: Classification database object types and their attributes Revision 2013 and its relation to GSIM Prepared by Debra Mair, Tim.
Background Data validation, a critical issue for the E.S.S.
ESCWA SDMX Workshop Session: Role in the Statistical Lifecycle and Relationship with DDI (Data Documentation Initiative)
GSIM Stakeholder Interview Feedback HLG-BAS Secretariat January 2012.
WP.5 - DDI-SDMX Integration
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
Case Studies: Statistics Canada (WP 11) Alice Born Statistics UNECE Workshop on Statistical Metadata.
Survey Data Management and Combined use of DDI and SDMX DDI and SDMX use case Labor Force Statistics.
SDMX and DDI Working Together Technical Workshop 5-7 June 2013
Generic Statistical Information Model (GSIM) Thérèse Lalor and Steven Vale United Nations Economic Commission for Europe (UNECE)
Profiling Metadata Specifications David Massart, EUN Budapest, Hungary – Nov. 2, 2009.
DDI-RDF Discovery Vocabulary A Metadata Vocabulary for Documenting Research and Survey Data Linked Data on the Web (LDOW 2013) Thomas Bosch.
Background to the Generic Statistical Information Model (GSIM) Briefing Pack December
Terminology and Standards Dan Gillman US Bureau of Labor Statistics.
CHRIS NELSON METADATA TECHNOLOGY WORK SESSION ON STATISTICAL METADATA GENEVA 6-8 MAY 2013 Designing a Metadata Repository Metadata Technology Ltd.
UNECE METIS work session on statistical metadata Luxembourg, 9 to 11 April SDMX as a source of standardised terminology: MCV and cross-domain concepts.
CountryData Technologies for Data Exchange SDMX Information Model: An Introduction.
SDMX Standards Relationships to ISO/IEC 11179/CMR Arofan Gregory Chris Nelson Joint UNECE/Eurostat/OECD workshop on statistical metadata (METIS): Geneva.
Technical Overview of SDMX and DDI : Describing Microdata Arofan Gregory Metadata Technology.
SDMX and DDI working together Technical workshop, Luxembourg, June 2013 Use cases for DDI and SDMX.
Describing Statistical registers in SDMX and DDI: A Comparison Arofan Gregory Metadata Technology Eurostat, June 4-6, 2013 Luxembourg.
United Nations Economic Commission for Europe Statistical Division Part B of CMF: Metadata, Standards Concepts and Models Jana Meliskova UNECE Work Session.
Metadata Models in Survey Computing Some Results of MetaNet – WG 2 METIS 2004, Geneva W. Grossmann University of Vienna.
Statistics Portugal/ Metadata Unit Monica Isfan « Joint UNECE/ EUROSTAT/ OECD Work Session on Statistical Metadata.
Eurostat Expression language (EL) in Eurostat SDMX - TWG Luxembourg, 5 Jun 2013 Adam Wroński.
Subcommittee 3D DATA SETS FOR LIBRARIES. SC 3D Experience report for implementing IEC – Conventions and guidelines Cape Town, (Cape.
Environment Change Information Request Change Definition has subtype of Business Case based upon ConceptPopulation Gives context for Statistical Program.
Lois Fritts SAIC January 17, 2000 Open Forum on Metadata Registries Santa Fe, NM SDC JE-2022.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
DDI Discovery: An Overview of Current RDF Vocabularies Arofan Gregory Metadata Technologies NA Joachim Wackerow GESIS.
Survey Data Management and the Combined Use of DDI and SDMX Arofan Gregory Chris Nelson Metadata Technology Eurostat, June
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Metadata Registries Workshop Metadata Registries Workshop U.S. Bureau of Labor Statistics Conference Center April 15-17, 1998.
Eurostat 4. SDMX: Main objects for data exchange 1 Raynald Palmieri Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October.
SDMX IT Tools Introduction
2.An overview of SDMX (What is SDMX? Part I) 1 Edward Cook Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October 2015.
Strategic Priorities for DDI Spring 2013 Mary Vardigan Director, DDI Alliance METIS -- Geneva, Switzerland May 6, 2013.
GSIM, DDI & Standards- based Modernisation of Official Statistics Workshop – DDI Lifecycle: Looking Forward October 2012.
OECD Expert Group on Statistical Data and Metadata Exchange (Geneva, May 2007) Update on technical standards, guidelines and tools Metadata Common.
Eurostat 1 3.An overview of the SDMX implementation process Edward Cook Eurostat Unit B5: “Central data and metadata services” SDMX Basics course,
1 Item 2.1.b of the agenda IT Governance in the ESS and related issues Renewal of mandates STNE Adam WROŃSKI Eurostat, Unit B5.
Statistical Data and Metadata Exchange SDMX Metadata Common Vocabulary Status of project and issues ( ) Marco Pellegrino Eurostat
1 Joint UNECE/EUROSTAT/OECD METIS Work Session (Geneva, March 2010) The On-Going Review of the SDMX Technical Specifications Marco Pellegrino, Håkan.
Eurostat Sharing data validation services Item 5.1 of the agenda.
Session 2: Developing a Comprehensive M&E Work Plan.
SDMX Basics course, March 2016 Eurostat SDMX Basics course, March Introducing the Roadmap Marco Pellegrino Eurostat Unit B5: “Data and.
>> Metadata What is it, and what could it be? EU Twinning Project Activity E.2 26 May 2013.
Publishing DDI-Related Topics Advantages and Challenges of Creating Publications Joachim Wackerow EDDI16 - 8th Annual European DDI User Conference Cologne,
DDI and GSIM – Impacts, Context, and Future Possibilities
SISAI STATISTICAL INFORMATION SYSTEMS ARCHITECTURE AND INTEGRATION
11. The future of SDMX Introducing the SDMX Roadmap 2020
2. An overview of SDMX (What is SDMX? Part I)
2. An overview of SDMX (What is SDMX? Part I)
SDMX Information Model: An Introduction
Statistical Information Technology
SISAI STATISTICAL INFORMATION SYSTEMS ARCHITECTURE AND INTEGRATION
Presentation to SISAI Luxembourg, 12 June 2012
The role of metadata in census data dissemination
Item 7.11 SDMX Progress report
… Two-step approach Conceptual Framework Annex I Annex II Annex III
DDI and GSIM – Impacts, Context, and Future Possibilities
Presentation transcript:

3 rd Annual European DDI Users Group Meeting, 5-6 December 2011 The Ongoing Work for a Technical Vocabulary of DDI and SDMX Terms Marco Pellegrino Eurostat 1 Background Work Products Inputs to the Joint Vocabulary The Challenge Current Status Looking Forward

3 rd Annual European DDI Users Group Meeting, 5-6 December Background At the EDDI 2010 conference, an informal dialogue between SDMX, the DDI Alliance and interested members of the community was held 4 other meetings since then, and some telephone conferences No formal membership: secretariat provided by UN/ECE (more than 40 people on the mailing list) Goal of this work: to help the standards bodies coordinate to better serve their users

3 rd Annual European DDI Users Group Meeting, 5-6 December Background (Continued) Several areas of work: the different terminology between the SDMX and DDI communities was identified as one of the problems in the dialogue A joint SDMX-DDI Vocabulary is being created to help address this issue All relevant documents and information for the SDMX-DDI Dialogue can be found at DDI+Dialogue+-+Overview+Page DDI+Dialogue+-+Overview+Page

3 rd Annual European DDI Users Group Meeting, 5-6 December Work Products So far, a small number of work products have been identified: – Joint SDMX-DDI Vocabulary – Business Case for using SDMX and DDI – A proposed coordinated approach for using the standards in an interoperable way (register data use case) Other documents are envisaged: – DDI, SDMX and the GSBPM to support statistical quality improvements – Detailed examples Each of the work products is being created by a small team of volunteers from the SDMX and DDI communities

3 rd Annual European DDI Users Group Meeting, 5-6 December Work Products (Continued) The team working on the initial drafting of the Joint SDMX-DDI Vocabulary includes: – Marco Pellegrino (Eurostat) – Arofan Gregory (Open Data Foundation) – Chris Nelson (Metadata Technology) – Mary Vardigan (DDI Alliance) – Joachim Wackerow (GESIS/DDI Alliance) We anticipate many more participants as we get further along in the process, especially in a review capacity

3 rd Annual European DDI Users Group Meeting, 5-6 December The terminology challenge  Definitions and descriptions are often insufficient to support a correct use of a standard  Names are often not definitive for concepts  Standardization must focus on definitions rather than names

3 rd Annual European DDI Users Group Meeting, 5-6 December ISO/IEC Part 4: Rules and Guidelines for the Formulation of Data Definitions The purpose of a data element definition is to define a data element with words or phrases that describe, explain, or make definite and clear its meaning Good definitions promote the standardization and reuse of data elements, leading to data sharing and integration of information systems

3 rd Annual European DDI Users Group Meeting, 5-6 December Data Definition Rules A data definition shall be: – Unique – Singular – A statement of concept, not its negative – A descriptive phrase or sentence – Commonly understood abbreviations – Without embedded definitions

3 rd Annual European DDI Users Group Meeting, 5-6 December Data Definition Guidelines  State the essential meaning of the concept  Be precise and unambiguous  Be concise  Be able to stand alone  Be expressed without embedding rationale, functional usage, domain information or procedural information  Avoid circular reasoning  Use consistent terminology and structure for related definitions

3 rd Annual European DDI Users Group Meeting, 5-6 December Inputs to the Joint Vocabulary The SDMX Secretariat has been working to develop a comprehensive SDMX Vocabulary for use within that community – SDMX Metadata Common Vocabulary developed as part of the “Content-Oriented Guidelines” (2009) – SDMX Technical Vocabulary based largely on the SDMX Information Model, with other inputs Early draft of a DDI Vocabulary was developed by the DDI alliance for input into this process

3 rd Annual European DDI Users Group Meeting, 5-6 December The Challenge Question: What is a Category Scheme? Answer: That really depends (on which standard you are using…) This is a simple example of how the same term is used to refer to two completely different types of metadata! There are other, similar differences of terminology which could produce confusion.

Data or Metadata Structure Definition SDMX: is everything well described? Category Scheme Category Data or Metadata Flow Data Provider Provision Agreement Data Set or Metadata Set Content Constraint Structure and Item Scheme Maps Registered Data Source or Metadata Source Attachment Constraint Categorisation

3 rd Annual European DDI Users Group Meeting, 5-6 December Study Concepts Concepts measures Survey Instruments using Questions made up of Universes about

3 rd Annual European DDI Users Group Meeting, 5-6 December DDI: everything clear? Category Scheme Code Scheme Concept Scheme Control Construct Scheme GeographicStructureScheme GeographicLocationScheme InterviewerInstructionScheme Question Scheme NCubeScheme Organization Scheme Physical Structure Scheme Record Layout Scheme Universe Scheme Variable Scheme Dataset Dcelements DDI profile Conceptual component Study unit Group Resource package Instance Coverage …

3 rd Annual European DDI Users Group Meeting, 5-6 December T echnical V ocabulary: expected benefits  Support a common understanding of the agreed technical standards by providing a single authoritative list of the technical terms used in the standards, together with a description of each term and, if needed, some context explanations  Facilitate a comparison with other standards and a mapping of concepts with minimum need to determine “semantic equivalence”  Improve visibility for existing definitions (building on existing sources and avoiding a proliferation of “standard” terminologies)  Improve accessibility to a set of standard definitions through a single address

3 rd Annual European DDI Users Group Meeting, 5-6 December Vocabulary STRUCTURE  Term (mandatory)  Definition (mandatory)  Definition source (mandatory)  Context (in SDMX and DDI)  Links to related terms within the glossary (optional)  URL to more detailed information (optional)  Several outputs (doc, html, xml)

3 rd Annual European DDI Users Group Meeting, 5-6 December Current Status The terms in the SDMX Vocabulary are now being evaluated (TWG) so that an appropriate subset can be mapped to DDISDMX Vocabulary The first draft will not be comprehensive – It will only address the main objects in each standard, and those which have very strong similarities between the two standards The initial set of DDI terms, plus their relationship to SDMX objects, has been draftedDDI terms

3 rd Annual European DDI Users Group Meeting, 5-6 December The Initial Draft DDI-SDMX Vocabulary (example)

3 rd Annual European DDI Users Group Meeting, 5-6 December The Initial Draft DDI-SDMX Vocabulary (example)

3 rd Annual European DDI Users Group Meeting, 5-6 December Looking Forward We expect to have the initial draft ready for consideration by the larger group by march 2012 Hopefully, this document can be finalized and then expanded: – We expect it to be a living document as the SDMX- DDI dialogue proceeds – It will be published as a contribution to the integrated use of DDI and SDMX

Generic Process Example Survey/Register Raw Data Set Anonymization, cleaning, recoding, etc. Micro-Data Set/ Public Use Files Tabulation, processing, case selection, etc. Aggregation,harmonization Aggregation,harmonization Aggregate Data Set (Lower level) Aggregate Data Set (Higher Level) DDI SDMX Indicators

3 rd Annual European DDI Users Group Meeting, 5-6 December Business case: a key issue in the DDI-SDMX dialogue Thank you!