1 5a. SDMX and reference metadata exchanges Bogdan ZDRENTU Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, 27-29 October 2015.

Slides:



Advertisements
Similar presentations
Better data quality through global data and metadata sharing
Advertisements

Slide 1 Eurostat Directorate B – Statistical methods and tools; dissemination Towards implementation of SDMX – 9/11 January 2007 SDMX Open Data Interchange.
WP.5 - DDI-SDMX Integration
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
Implementing ESS standards for reference metadata and quality reporting at Istat Work Session on Statistical Metadata Topic (i): Metadata standards and.
Survey Data Management and Combined use of DDI and SDMX DDI and SDMX use case Labor Force Statistics.
Metadata management and statistical business process at Statistics Estonia Work Session on Statistical Metadata (Geneva, Switzerland 8-10 May 2013) Kaja.
REFERENCE METADATA FOR DATA TEMPLATE Ales Capek EUROSTAT.
CountryData Technologies for Data Exchange SDMX Information Model: An Introduction.
Implementation of SDMX for data and metadata exchange SDMX Basics Course 12 April 2013 Daniel Suranyi Eurostat B5 Management of statistical data and metadata.
13-Jul-07 Implementation of SDMX for data and metadata exchange Balance of Payments Working Group 2-3 April 2012 Daniel Suranyi Eurostat B5 Management.
Slide 1 Eurostat Unit B3 – Statistical Information Technologies CoRD Meeting – 4 June 2007 Agenda Item 8 Preliminary ideas for a 2011 census hub Giuseppe.
Eurostat 1 7a. Practical use case 1: Pesticides Use Project Blanaru Cristina Eurostat Unit B5: “Central data and metadata services” SDMX Basics course,
Eurostat achievements and challenges Emanuele Baldacci, Director European Commission - Eurostat Director Methodology; Corporate statistical.
Eurostat 4. SDMX: Main objects for data exchange 1 Raynald Palmieri Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October.
1 Integration of the Eurostat and ESS Metadata Systems A. Götzfried Head of Unit B6 Eurostat.
SDMX IT Tools Introduction
SDMX and Metadata SDMX Basics Course 12 April 2013 Daniel Suranyi Eurostat B5 Management of statistical data and metadata.
2.An overview of SDMX (What is SDMX? Part I) 1 Edward Cook Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October 2015.
Work Session on Statistical Metadata 2013 Session III: Metadata in the Statistical Business Process Better documenting statistical business processes:
SDMX IT Tools SDMX use in practice in NA
7b. SDMX practical use case: Census Hub
1 Enhancing data quality by using harmonised structural metadata within the European Statistical System A. Götzfried Head of Unit B6 Eurostat.
1 Quality reporting within the Eurostat and the ESS metadata systems August Götzfried and Håkan Linden Eurostat Unit B6: Reference databases and metadata.
13 November, 2014 Seminar on Quality Reports QUALITY REPORTS EXPERIENCE OF STATISTICS LITHUANIA Nadiežda Alejeva Head, Price Statistics.
Implementation of SDMX for data and metadata exchange SDMX Basics Course October 2012 Daniel Suranyi Eurostat B5 Management of statistical data and.
SDMX Basics course, March 2016 Eurostat SDMX Basics course, March Introducing the Roadmap Marco Pellegrino Eurostat Unit B5: “Data and.
Quality declarations Study visit from Ukraine 19. March 2015
The Eurostat Metadata Handler Götzfried Eurostat (Head of Unit B6)
Prepared by: Galya STATEVA, Chief expert
5b. SDMX and reference metadata: guideline examples
Item 6 - Introduction to ESS Metadata Handler
Exchanging Reference Metadata using SDMX
The CVD Metadata Handler
SDMX Information Model
Usage of National Reference Metadata Editor (NRME)
ESTP Training Course 8 & 9 April 2014 Fabien JACQUET Eurostat B5
2. An overview of SDMX (What is SDMX? Part I)
Time Use Survey data processing and dissemination 17 July 2014
2. An overview of SDMX (What is SDMX? Part I)
ESS technical standards and tools for quality reporting
Working Group "LAbour MArket Statistics" October 2013
5. Detail: Main SDMX objects for. metadata exchange. (What is SDMX
Data Transmission Tools & Services EDAMIS, SDMX, Validation
Implementation of SDMX in the ESS
What is next? H. Linden Eurostat, Unit B5
Statistical Information Technology
SDMX as basis for water data reporting
ESS VIP ICT Project Task Force Meeting 5-6 March 2013.
National reference metadata and the National Reference Metadata Editor
SDMX : General introduction H. Linden, Eurostat, Unit B5
Item of the Agenda Towards an integrated Eurostat metadata handler – Eurostat SDMX Registry services for Member States Francesco Rizzo Unit B3 13.
Working Group "Education and Training Statistics" April 2013
SDMX Progress and implementation A. Götzfried, Unit B6
Legislative strategy for cross-cutting ESS legislation
9. Practical use case 3: Pesticides Use Project
Standards and guidelines for reference metadata
M. Henrard, B5 N. Buysse and H. Linden, B6 Eurostat
Annegrete Wulff Statistics Denmark
Work Session on Statistical Metadata (Geneva, Switzerland May 2013)
ESTP course on Statistical Metadata – Introductory course –
European Statistical System Metadata Handler ESS MH (Super) Providers
PRODCOM Working Group JMO M November 2012
Petr Elias Czech Statistical Office
ESS technical standards and tools for quality reporting
Introduction to reference metadata and quality reporting
7. Introduction to the main SDMX objects for metadata exchange
ESS conceptual standards for quality reporting
Presentation transcript:

1 5a. SDMX and reference metadata exchanges Bogdan ZDRENTU Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October 2015

Eurostat Structural metadata acting as identifiers and descriptors of the data, such as: dimensions of statistical cubes variables titles of tables Nomenclatures (code lists) always be associated with the data to allow their identification, retrieval and browsing. Types of metadata 2

Eurostat Example for structural metadata 3

Eurostat Reference metadata acting only as descriptors of the data, they don’t help to actually identify the data. They can be of different kinds: conceptual metadata methodological metadata quality metadata (process and output) can be exchanged independently from the data they are related to, but are however often linked to them. Types of metadata 4

Eurostat Example for reference metadata 5

ESS standardisation of reference metadata based on SDMX 6  The SDMX Content Oriented Guidelines (2009), i.e. the Cross- Domain Concepts and the MCV;  The SDMX Technical Standards:  The information model for creation of the Metadata Structure Definitions (MSDs);  The SDMX-ML for documenting the XML format;  The Euro SDMX Registry for storing the MSDs etc.

Eurostat Code lists describe dimensions in data tables, giving a meaning to the data. Code lists are based on: official statistical classifications such as NACE, NUTS, ISCO… the SDMX Content Oriented Guidelines Domain specific codifications A standard code list is a code list already harmonised Standard code lists should be used all along the statistical business process: data design, collection, aggregation, dissemination, archiving… Standardisation of structural metadata 7

Eurostat Example of a harmonised code list (NACE Rev. 1.1) Old version (before harmonisation) New version (after harmonisation) DomainsOld codesOld label_enNew codesNew label_en hrst, htecMA_TOTALManufacturing sector DManufacturing fatsMANManufacturing industries theme3RDManufacturing industry theme4B0200Manufacturing industry theme8SE0_4Manufacturing industry theme9TOT_MANUFManufacturing industry ds, hrst, htecMA_LOW_TECLow technology manufacturing sector D_LTC Low-technology manufacturing fats / innLOT Low Technology (incl. following NACE codes: 15-22; 36, 37) innI_LOW_TEC Low tech industries: NACE Rev.1 codes 15 to 22, 36 and 37 hrst, htecSE_TOTAL Services: NACE Rev. 1.1 sections G to Q = 50 to 99 G-QServices fatsSERServices sector 8

Eurostat Better comparability: same codes for the same concepts Increase efficiency: less transcoding; less code lists; clean lists Improve accuracy: facilitate data management and exchange and reduce the number of errors Re-usability and integration of the data: data warehouse are only possible if codes corresponding to the same concept are the same SDMX implementation: it is essential for the implementation of a SDMX data/metadata exchange process. The ESS standard code lists will also be made available in the Euro SDMX Registry (currently RAMON). Impact on the statistical business processes 9

Eurostat RAMON 10

Eurostat Standard Code Lists in RAMON 11

Eurostat The ESS Reference Metadata Standards ESMS Euro SDMX Metadata Structure ESQRS ESS Standard for Quality Reports Structure EPMS Eurostat Process Metadata Structure 12

Eurostat The Euro SDMX Metadata Structure (ESMS) 13

Eurostat The ESS Standard for Quality Reports Structure (ESQRS) 14

Eurostat Concept name 1Contact 1.1Contact organisation 1.2Contact organisation unit 1.3Contact name 1.4Contact person function 1.5Contact mail address 1.6Contact address 1.7Contact phone number 1.8Contact fax number 2Summary process description 3Workflow 4Statistical processing 4.1Data collection 4.2Source data 4.2.1Source data - integration 4.2.2Source data - coding 4.3Data validation 4.3.1Data validation in Member States 4.3.2Validation rules agreed with Member States 4.3.3Data validation - detection (Eurostat) 4.3.4Data validation - correction (Eurostat) Concept name 4.4Data compilation 4.4.1Data compilation - variables 4.4.2Data compilation - weights 4.4.3Data compilation - aggregates 4.4.4Data compilation - finalisation 4.4.5Data compilation - draftoutput 4.5Data validation - final 4.5.1Data validation final - output 4.5.2Data validation final - explanation 5Confidentiality 5.1Confidentiality - data treatment 6Release policy 6.1User access 7Dissemination format 7.1Publications 7.2On-line database 7.3Micro-data access 7.4Other 8IT applications 8.1IT applications for data reception/collection 8.2IT applications for data processing 8.3IT applications for data validation 8.4IT applications for data confidentiality 8.5IT applications for metadata 8.6Other IT applications The Euro Process Metadata Structure (EPMS) 15

Eurostat Dissemination of reference metadata 16

Eurostat Dissemination of national reference metadata 17

Eurostat The ESS Metadata Handler Common user Interface Output produced for the Eurostat Web Other output for Eurostat or external users ESS-MH IT application RAMON CODED ESS – Metadata Handler Euro SDMX Registry Input from national metadata Metadata from the Eurostat Domain manager Eurostat as main administrator The business process 18

What is the ESS Metadata Handler? 19  The ESS Metadata Handler is a web based application for reference metadata production, exchange and dissemination in the ESS;  It implements the ESS metadata standards (ESMS, ESQRS and EPMS, etc.)  It replaces EMIS (used in Eurostat) and NRME (for countries);  It contains many improvements based on users' feedback (in terms of business process and functionalities);  It is in production since 31 January 2014.

EDAMIS National Statistical Institute EUROSTAT ESS Metadata Handler (ESS MH) ESS MH Database Eurostat Website PRODUCTION TREATMENT & ANALYSIS DISSEMINATION National Metadata File 20

The business process for using the ESS MH for national metadata  Mapping of the existing national reference metadata files to the ESMS and/or ESQRS formats;  Conversion of existing national reference metadata files into standard structure;  Insertion of these files into the ESS MH application;  The NSI’s are asked to complete, enhance their converted files, directly in the ESS MH;  The responsible Domain Managers in Eurostat are asked to validate these ESMS / ESQRS files;  The national metadata are finally disseminated on Eurostat Web site (if decided so).  Time line is approximately 6 months. 21

ESS standards for metadata - Implementation Waste (end of life vehicles, packaging, electronic waste) WINE FARM STRUCTURE MIP STATISTICS HICP/ Compliance monitoring EHIS (Education, health and social protection) R&D (CIS 2012) Annual crops PRAG ESAW AES (Education, Science and Culture) LCI (Labour Cost Index) INFOSOC (Information Society) BUSINESS REGISTER HICP LFS-Q, LFS-A EU-SILC FATS STS (Short Term Statistics ) WASTE AEI (Pesticides) EDUCAT JVC (Job Vacancy Stats) PRODCOM EXTERNAL TRADE (3rd countries) COSAEA URBANREG R&D TOURISM PERMANENT CROPS CENSUS HOUSING PRICES HPS 22

Practical Example 23

Example for reference metadata That source contains metadata about the Tourism datasets 24

Let's select a specific topic from which we'll create a metadata report 25

Content of the selected topic 26

Content of the selected topic 27

Defining a Metadata Structure Definition The Tasks 1.Analysis of the entire set of metadata in order to identify and document the “Concepts” for which metadata are to be reported or disseminated. 2.Determine the structure of the “Metadata Report” in terms of the concepts used, the hierarchy of the concepts when used in the report, and their “representation” (e.g. is a code list used, is the format free text?). 3.Specify the “object type” to which the metadata are to be attached, and how this object type is identified: knowledge of the SDMX Information model is useful here (as the metadata can only be attached to object types that can be identified in terms of the object types that exist in the information model). 28

Metadata Report Structure – Content Metadata BASIC_METH_ISSUES POS_ACC_TOUR STAT_UNIT 29

SCOPE_OBS TOUR_ACC_ESTAB NACE_55_1 30

Metadata Report Structure – Concept Scheme The following concepts are derived from this example: BASIC_METH_ISSUES POS_ACC_TOUR STAT_UNIT SCOPE_OBS TOUR_ACC_ESTAB CONTACT_ORG CONTACT_ORG_UNIT CONTACT_MAIL_ADDRESS CONTACT NACE_55_1 31

Metadata Report Structure – Bringing it Together Report Structure - Contact Report CONTACT_ORG CONTACT_ORG_UNIT CONTACT_MAIL_ADDRESS ESTAT_MSD ESTAT_METADATA_CS CATEGORY_REPORT CONTACT 32

METADATA REPORT STRUCTURE – BRINGING IT TOGETHER Report Structure - Quality Report ESTAT_MSD ESTAT_METADATA_CS CATEGORY_REPORT BASIC_METH_ISSUES POS_ACC_TOUR STAT_UNIT SCOPE_OBS TOUR_ACC_ESTAB NACE_55_1 33

Metadata Set: Structure References to : a Metadata Structure Definition (MSD) a Report Structure a Target Identifier Defines: The actual values of the target objects Comprises: The Reported Attributes and their corresponding Values These Attributes may be: coded text date/time number etc. 34

Metadata Set – General Schematic CONTACT_ORG CONTACT_ORG_UNIT CONTACT_MAIL_ADDRESS CONTACT Unit G3 Short-term statistics; tourism Eurostat, Statistical Office of the European Communities portal/page/portal/help/user_sup port CATEGORY_REPORT BASIC_METH_ISSUES POS_ACC_TOUR 35

METADATA SET – GENERAL SCHEMATIC CATEGORY_REPORT STAT_UNIT SCOPE_OBS 36

METADATA SET – GENERAL SCHEMATIC CATEGORY_REPORT TOUR_ACC_ESTAB NACE_55_1 37

METADATA SET – METADATA FILE 38

METADATA SET – ESMS EXAMPLE 39

METADATA SET – ESMS EXAMPLE 40

Questions?