1 Enhancing data quality by using harmonised structural metadata within the European Statistical System A. Götzfried Head of Unit B6 Eurostat.

Slides:



Advertisements
Similar presentations
Implementation of GSBPM, DDI and SDMX reference metadata at Statistics Denmark UNECE workshop 5-7 May 2015 Mogens Grosen Nielsen
Advertisements

Slide 1 Eurostat Directorate B – Statistical methods and tools; dissemination Towards implementation of SDMX – 9/11 January 2007 SDMX Open Data Interchange.
Eurostat J OINT UNECE/OECD/E UROSTAT MEETING OF THE GROUP OF EXPERTS ON BUSINESS REGISTERS 3-4 September 2013, Geneva Session 1: Economic globalisation.
The use and convergence of quality assurance frameworks for international and supranational organisations compiling statistics The European Conference.
Quality assurance activities at EUROSTAT CCSA Conference Helsinki, 6-7 May 2010 Martina Hahn, Eurostat.
REFERENCE METADATA FOR DATA TEMPLATE Ales Capek EUROSTAT.
Overview of quality work in Statistics Denmark Kirsten Wismer.
UNECE METIS work session on statistical metadata Luxembourg, 9 to 11 April SDMX as a source of standardised terminology: MCV and cross-domain concepts.
1 Annual National Accounts  1. Situation of OECD annual national accounts database  2. New features of the joint OECD-Eurostat questionnaire  3. COFOG2.
Eurostat Unit B3 – IT and standards for data and metadata exchange SDMX Basics Training – 2012 IT architectures for data exchange SDMX-RI and the Hub approach.
provide information ESSnet on consistency of concepts and applied methods of business and trade related statistics Session 2 : Business.
United Nations Economic Commission for Europe Statistical Division Part B of CMF: Metadata, Standards Concepts and Models Jana Meliskova UNECE Work Session.
BAIGORRI Antonio – Eurostat, Unit B1: Quality; Classifications Q2010 EUROPEAN CONFERENCE ON QUALITY IN STATISTICS Terminology relating to the Implementation.
Eurostat Expression language (EL) in Eurostat SDMX - TWG Luxembourg, 5 Jun 2013 Adam Wroński.
Monitoring public satisfaction through user satisfaction surveys Committee for the Coordination of Statistical Activities Helsinki 6-7 May 2010 Steve.
Developments in European Statistics Challenges in Official Statistics Visit to NSO Romania July 2009 Walter Radermacher, Chief Statistician of the.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
1 Integration of the Eurostat and ESS Metadata Systems A. Götzfried Head of Unit B6 Eurostat.
SDMX and Metadata SDMX Basics Course 12 April 2013 Daniel Suranyi Eurostat B5 Management of statistical data and metadata.
2.An overview of SDMX (What is SDMX? Part I) 1 Edward Cook Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October 2015.
1 5a. SDMX and reference metadata exchanges Bogdan ZDRENTU Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October 2015.
Work Session on Statistical Metadata 2013 Session III: Metadata in the Statistical Business Process Better documenting statistical business processes:
Aim: “to support the enhancement and implementation of the standards needed for the modernisation of statistical production and services”
OECD Expert Group on Statistical Data and Metadata Exchange (Geneva, May 2007) Update on technical standards, guidelines and tools Metadata Common.
1 The GSBPM and ESS statistical business process metadata Session 4 H. Linden, Unit B6 Eurostat Workshop on Statistical Metadata (METIS) (Geneva, 5-7 October.
Statistical Data and Metadata Exchange SDMX Metadata Common Vocabulary Status of project and issues ( ) Marco Pellegrino Eurostat
Page 1 Development of Metadata System at Croatian Bureau of Statistics Development of Metadata System at Croatian Bureau of Statistics Presented by Maja.
1 Quality reporting within the Eurostat and the ESS metadata systems August Götzfried and Håkan Linden Eurostat Unit B6: Reference databases and metadata.
ESTP Course on the EGR November FATS user interface and metadata of final frame.
13 November, 2014 Seminar on Quality Reports QUALITY REPORTS EXPERIENCE OF STATISTICS LITHUANIA Nadiežda Alejeva Head, Price Statistics.
Quality declarations Study visit from Ukraine 19. March 2015
The Eurostat Metadata Handler Götzfried Eurostat (Head of Unit B6)
Towards more flexibility in responding to users’ needs
5b. SDMX and reference metadata: guideline examples
Structural and reference metadata in the European Statistical System
Seminar on ESA 2010 Metadata
Interoperable data formats: SDMX
ESTP TRAINING ON EGR Luxembourg – December 2014
The Vision Infrastructure Project “Enhanced Dissemination Chain”
MSDs and combined metadata reporting
Cross-domain concepts
Goals and objectives of Work package 2 of the ESSnet on Consistency of concepts and applied methods of business and trade-related statistics Norbert Rainer,
The European Statistical System
Statistics Denmark’s presentation of metadata
2. An overview of SDMX (What is SDMX? Part I)
Working Party on Regional Statistics 1-2 October 2012
2. An overview of SDMX (What is SDMX? Part I)
PRESENTATION OF SHORT-TERM ECONOMIC STATISTICS
Draft EP/Council Regulation for processes, standards and
Implementation of SDMX in the ESS
What is next? H. Linden Eurostat, Unit B5
Statistical Information Technology
August Götzfried Eurostat unit B 4
ESS VIP programme: Cross-cutting project on ESS data warehouses for production and dissemination John Allen Agenda point 5 Dissemination Working Group.
Draft EP/Council Regulation on processes, standards and metadata for the exchange and dissemination of European statistics A. Götzfried Head of.
August Götzfried Eurostat unit B 4
SDMX : General introduction H. Linden, Eurostat, Unit B5
A review of the 2011 census round in the EU, including the successful implementation of a detailed European legal base First meeting of the Technical Coordination.
“Enhanced Dissemination Chain”
SDMX Progress and implementation A. Götzfried, Unit B6
Legislative strategy for cross-cutting ESS legislation
Standards and guidelines for reference metadata
SDMX Implementation The National Accounts use case
M. Henrard, B5 N. Buysse and H. Linden, B6 Eurostat
Annegrete Wulff Statistics Denmark
ESTP course on Statistical Metadata – Introductory course –
Petr Elias Czech Statistical Office
Introduction to reference metadata and quality reporting
7. Introduction to the main SDMX objects for metadata exchange
ESS conceptual standards for quality reporting
Presentation transcript:

1 Enhancing data quality by using harmonised structural metadata within the European Statistical System A. Götzfried Head of Unit B6 Eurostat

2 2 The starting point The Commission Communication 404/2009 asks that production methods of European statistics need to be improved in order to enhance the efficiency within the European Statistical System (ESS); that the ‘stove-pipe’ statistical production systems should be replaced by more integrated production processes across statistical domains or statistical organisations (i.e. horizontal and vertical integration). Metadata are accompanying the statistical production process from end-to-end; we therefore see harmonised metadata as one of the main enablers for progressing towards the aim of the abovementioned Communication.

3 3 Harmonisation of structural metadata In the centre of interest: The harmonisation of structural metadata “Structural metadata are needed to identify, use and process data matrices and data cubes, e.g. names of columns or dimensions of statistical cubes. Structural metadata must be associated with the statistical data, otherwise it becomes impossible to identify, retrieve and navigate the data. “ We will present our work on the harmonisation of different types of structural metadata such as code lists, statistical variables and structural metadata linked to data tables. This work should lead to a considerable improvement of the data and metadata produced and disseminated by Eurostat and the ESS, in line with the principles of the European Statistics Code of Practice.

4 4 Harmonising code lists Code lists are used as dimensions and attributes in data structures (data messages) all along the statistical business process; an example: Population statistics with the dimensions age, geographical area, sex and time (including the respective codes used)

5 5 Harmonising code lists Harmonising code lists means more in detail: to produce lists of codes including the underlying statistical concepts which can be broadly used across statistical domains; to harmonise these lists in applying a number of basic principles, such as using as far as possible official classifications, inserting aggregates, etc. This harmonisation will improve the quality of the data produced as codes and concepts are defined uniquely and as data and metadata exchange will be facilitated.

6 6 Harmonising code lists Harmonising code lists means more in detail: an example for such as code list (related to age classes) is shown in the paper submitted; based on existing codes Eurostat produces and releases more and more of those lists with around 400 lists to be disseminated at the end; these lists also need to be maintained over time; the lists will be used in newly created data structure definitions (based on the SDMX technical and statistical standards) and be included in IT production systems; Some of those lists should also be upgraded into the SDMX statistical standards;

7 7 Harmonising statistical variables Harmonising statistical variables means more in detail: to draw up an inventory of the statistical variables used within the ESS in dissemination (around 1300 of those variables were compiled); to add standard characteristics to these statistical variables such as definitions, statistical domains, units of measure, etc.; to use this inventory for the harmonisation of the statistical variables in cases where unnecessary and unjustified differences exist between them; to improve the statistical variables themselves in order to make them better accessible and understandable for producers and users.

8 8 Harmonising statistical variables This activity is one more activity providing the pre-condition for a better integration of statistical business processes as promoted by the abovementioned Commission Communication 404/2009. However: Alignments of the statistical variables requires a change of the respective data sets; this can be seen more as a medium term task.

9 9 Harmonising structural metadata linked to data tables Structural metadata linked to data tables are table headings, titles, subtitles and short descriptions and similar metadata; harmonising this structural metadata means more in detail: to write guidelines for this structural metadata linked to the data tables (= around 4000 multi-dimensional tables and around 1000 so-called pre-defined tables are in dissemination); to improve and harmonise the headings of the data tables disseminated in applying those guidelines; to improve and harmonise the short descriptions explaining the main contents of the pre-defined tables;

10 Harmonising structural metadata linked to data tables Example for an old and new table heading: Old title of a multi-dimensional table: 'Relative incidence rate of accidental injuries at work by severity, permanency of the job, length of service in the enterprise and economic activity of the employer (EU mean rate = 100 for each severity)’ Revised title of a multi-dimensional table: –Title: 'Incidence rate of accidental injuries at work by severity, job status and NACE' –Subtitle:'Index EU=100‘

11 Harmonising structural metadata linked to data tables An improvement of the structural metadata linked to the data tables should increase the accessibility and clarity of our disseminated data considerably. We could also envisage in a second step that these guidelines are improved further and that they get advanced towards general guidelines for this type of metadata for the ESS.

12 All over Many harmonisation and improvement efforts related to structural metadata are ongoing at Eurostat and within the ESS. This work should improve the quality of our data and metadata disseminated, mainly in terms of consistency, accessibility and clarity of the data sets produced. The harmonisation work also considerably contributes to the improvement and integration of the statistical business processes as defined in the Commission Communication 404/2009.