Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 5a. SDMX and reference metadata exchanges Bogdan ZDRENTU Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, 27-29 October 2015.

Similar presentations


Presentation on theme: "1 5a. SDMX and reference metadata exchanges Bogdan ZDRENTU Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, 27-29 October 2015."— Presentation transcript:

1 1 5a. SDMX and reference metadata exchanges Bogdan ZDRENTU Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, 27-29 October 2015

2 Eurostat Structural metadata acting as identifiers and descriptors of the data, such as: dimensions of statistical cubes variables titles of tables Nomenclatures (code lists) always be associated with the data to allow their identification, retrieval and browsing. Types of metadata 2

3 Eurostat Example for structural metadata 3

4 Eurostat Reference metadata acting only as descriptors of the data, they don’t help to actually identify the data. They can be of different kinds: conceptual metadata methodological metadata quality metadata (process and output) can be exchanged independently from the data they are related to, but are however often linked to them. Types of metadata 4

5 Eurostat Example for reference metadata 5

6 ESS standardisation of reference metadata based on SDMX 6  The SDMX Content Oriented Guidelines (2009), i.e. the Cross- Domain Concepts and the MCV;  The SDMX Technical Standards:  The information model for creation of the Metadata Structure Definitions (MSDs);  The SDMX-ML for documenting the XML format;  The Euro SDMX Registry for storing the MSDs etc.

7 Eurostat Code lists describe dimensions in data tables, giving a meaning to the data. Code lists are based on: official statistical classifications such as NACE, NUTS, ISCO… the SDMX Content Oriented Guidelines Domain specific codifications A standard code list is a code list already harmonised Standard code lists should be used all along the statistical business process: data design, collection, aggregation, dissemination, archiving… Standardisation of structural metadata 7

8 Eurostat Example of a harmonised code list (NACE Rev. 1.1) Old version (before harmonisation) New version (after harmonisation) DomainsOld codesOld label_enNew codesNew label_en hrst, htecMA_TOTALManufacturing sector DManufacturing fatsMANManufacturing industries theme3RDManufacturing industry theme4B0200Manufacturing industry theme8SE0_4Manufacturing industry theme9TOT_MANUFManufacturing industry ds, hrst, htecMA_LOW_TECLow technology manufacturing sector D_LTC Low-technology manufacturing fats / innLOT Low Technology (incl. following NACE codes: 15-22; 36, 37) innI_LOW_TEC Low tech industries: NACE Rev.1 codes 15 to 22, 36 and 37 hrst, htecSE_TOTAL Services: NACE Rev. 1.1 sections G to Q = 50 to 99 G-QServices fatsSERServices sector 8

9 Eurostat Better comparability: same codes for the same concepts Increase efficiency: less transcoding; less code lists; clean lists Improve accuracy: facilitate data management and exchange and reduce the number of errors Re-usability and integration of the data: data warehouse are only possible if codes corresponding to the same concept are the same SDMX implementation: it is essential for the implementation of a SDMX data/metadata exchange process. The ESS standard code lists will also be made available in the Euro SDMX Registry (currently RAMON). Impact on the statistical business processes 9

10 Eurostat RAMON http://ec.europa.eu/eurostat/ramon 10

11 Eurostat Standard Code Lists in RAMON 11

12 Eurostat The ESS Reference Metadata Standards ESMS Euro SDMX Metadata Structure ESQRS ESS Standard for Quality Reports Structure EPMS Eurostat Process Metadata Structure 12

13 Eurostat The Euro SDMX Metadata Structure (ESMS) 13

14 Eurostat The ESS Standard for Quality Reports Structure (ESQRS) 14

15 Eurostat Concept name 1Contact 1.1Contact organisation 1.2Contact organisation unit 1.3Contact name 1.4Contact person function 1.5Contact mail address 1.6Contact email address 1.7Contact phone number 1.8Contact fax number 2Summary process description 3Workflow 4Statistical processing 4.1Data collection 4.2Source data 4.2.1Source data - integration 4.2.2Source data - coding 4.3Data validation 4.3.1Data validation in Member States 4.3.2Validation rules agreed with Member States 4.3.3Data validation - detection (Eurostat) 4.3.4Data validation - correction (Eurostat) Concept name 4.4Data compilation 4.4.1Data compilation - variables 4.4.2Data compilation - weights 4.4.3Data compilation - aggregates 4.4.4Data compilation - finalisation 4.4.5Data compilation - draftoutput 4.5Data validation - final 4.5.1Data validation final - output 4.5.2Data validation final - explanation 5Confidentiality 5.1Confidentiality - data treatment 6Release policy 6.1User access 7Dissemination format 7.1Publications 7.2On-line database 7.3Micro-data access 7.4Other 8IT applications 8.1IT applications for data reception/collection 8.2IT applications for data processing 8.3IT applications for data validation 8.4IT applications for data confidentiality 8.5IT applications for metadata 8.6Other IT applications The Euro Process Metadata Structure (EPMS) 15

16 Eurostat Dissemination of reference metadata 16

17 Eurostat Dissemination of national reference metadata 17

18 Eurostat The ESS Metadata Handler Common user Interface Output produced for the Eurostat Web Other output for Eurostat or external users ESS-MH IT application RAMON CODED ESS – Metadata Handler Euro SDMX Registry Input from national metadata Metadata from the Eurostat Domain manager Eurostat as main administrator The business process 18

19 What is the ESS Metadata Handler? 19  The ESS Metadata Handler is a web based application for reference metadata production, exchange and dissemination in the ESS;  It implements the ESS metadata standards (ESMS, ESQRS and EPMS, etc.)  It replaces EMIS (used in Eurostat) and NRME (for countries);  It contains many improvements based on users' feedback (in terms of business process and functionalities);  It is in production since 31 January 2014.

20 EDAMIS National Statistical Institute EUROSTAT ESS Metadata Handler (ESS MH) ESS MH Database Eurostat Website PRODUCTION TREATMENT & ANALYSIS DISSEMINATION National Metadata File 20

21 The business process for using the ESS MH for national metadata  Mapping of the existing national reference metadata files to the ESMS and/or ESQRS formats;  Conversion of existing national reference metadata files into standard structure;  Insertion of these files into the ESS MH application;  The NSI’s are asked to complete, enhance their converted files, directly in the ESS MH;  The responsible Domain Managers in Eurostat are asked to validate these ESMS / ESQRS files;  The national metadata are finally disseminated on Eurostat Web site (if decided so).  Time line is approximately 6 months. 21

22 ESS standards for metadata - Implementation Waste (end of life vehicles, packaging, electronic waste) WINE FARM STRUCTURE MIP STATISTICS HICP/ Compliance monitoring EHIS (Education, health and social protection) R&D (CIS 2012) Annual crops PRAG ESAW AES (Education, Science and Culture) LCI (Labour Cost Index) INFOSOC (Information Society) BUSINESS REGISTER HICP LFS-Q, LFS-A EU-SILC FATS STS (Short Term Statistics ) WASTE AEI (Pesticides) EDUCAT JVC (Job Vacancy Stats) PRODCOM EXTERNAL TRADE (3rd countries) COSAEA URBANREG R&D TOURISM PERMANENT CROPS CENSUS HOUSING PRICES HPS 22

23 Practical Example 23

24 Example for reference metadata That source contains metadata about the Tourism datasets 24

25 Let's select a specific topic from which we'll create a metadata report 25

26 Content of the selected topic 26

27 Content of the selected topic 27

28 Defining a Metadata Structure Definition The Tasks 1.Analysis of the entire set of metadata in order to identify and document the “Concepts” for which metadata are to be reported or disseminated. 2.Determine the structure of the “Metadata Report” in terms of the concepts used, the hierarchy of the concepts when used in the report, and their “representation” (e.g. is a code list used, is the format free text?). 3.Specify the “object type” to which the metadata are to be attached, and how this object type is identified: knowledge of the SDMX Information model is useful here (as the metadata can only be attached to object types that can be identified in terms of the object types that exist in the information model). 28

29 Metadata Report Structure – Content Metadata BASIC_METH_ISSUES POS_ACC_TOUR STAT_UNIT 29

30 SCOPE_OBS TOUR_ACC_ESTAB NACE_55_1 30

31 Metadata Report Structure – Concept Scheme The following concepts are derived from this example: BASIC_METH_ISSUES POS_ACC_TOUR STAT_UNIT SCOPE_OBS TOUR_ACC_ESTAB CONTACT_ORG CONTACT_ORG_UNIT CONTACT_MAIL_ADDRESS CONTACT NACE_55_1 31

32 Metadata Report Structure – Bringing it Together Report Structure - Contact Report CONTACT_ORG CONTACT_ORG_UNIT CONTACT_MAIL_ADDRESS ESTAT_MSD ESTAT_METADATA_CS CATEGORY_REPORT CONTACT 32

33 METADATA REPORT STRUCTURE – BRINGING IT TOGETHER Report Structure - Quality Report ESTAT_MSD ESTAT_METADATA_CS CATEGORY_REPORT BASIC_METH_ISSUES POS_ACC_TOUR STAT_UNIT SCOPE_OBS TOUR_ACC_ESTAB NACE_55_1 33

34 Metadata Set: Structure References to : a Metadata Structure Definition (MSD) a Report Structure a Target Identifier Defines: The actual values of the target objects Comprises: The Reported Attributes and their corresponding Values These Attributes may be: coded text date/time number etc. 34

35 Metadata Set – General Schematic CONTACT_ORG CONTACT_ORG_UNIT CONTACT_MAIL_ADDRESS CONTACT Unit G3 Short-term statistics; tourism Eurostat, Statistical Office of the European Communities http://epp.eurostat.ec.europa.eu/ portal/page/portal/help/user_sup port CATEGORY_REPORT BASIC_METH_ISSUES POS_ACC_TOUR 35

36 METADATA SET – GENERAL SCHEMATIC CATEGORY_REPORT STAT_UNIT SCOPE_OBS 36

37 METADATA SET – GENERAL SCHEMATIC CATEGORY_REPORT TOUR_ACC_ESTAB NACE_55_1 37

38 METADATA SET – METADATA FILE 38

39 METADATA SET – ESMS EXAMPLE 39

40 METADATA SET – ESMS EXAMPLE 40

41 Questions?


Download ppt "1 5a. SDMX and reference metadata exchanges Bogdan ZDRENTU Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, 27-29 October 2015."

Similar presentations


Ads by Google