UNECE-CES Work session on Statistical Data Editing

Slides:



Advertisements
Similar presentations
Data Sharing Werner Bier Deputy Director-General Statistics European Central Bank Inter-Agency Group on Economic and Financial Statistics (IAG) G-20 Data.
Advertisements

ESS VIP project on Validation
Background Data validation, a critical issue for the E.S.S.
Eurostat Expression language (EL) in Eurostat SDMX - TWG Luxembourg, 5 Jun 2013 Adam Wroński.
© OECD/IEA th Meeting of the Oslo Group Energy Statistics Baku, September 2013 Looking ahead InterEnerStat and Oslo Group Jean-Yves Garnier.
SDMX Implementation in the European Statistical System P. Everaers, A. Götzfried Eurostat.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
1 Integration of the Eurostat and ESS Metadata Systems A. Götzfried Head of Unit B6 Eurostat.
2.An overview of SDMX (What is SDMX? Part I) 1 Edward Cook Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October 2015.
Statistical data editing - UNECE work session – OSLO September 2012 Proposal of a revised approach for data validation within the European Statistical.
SDMX IT Tools SDMX use in practice in NA
1 The GSBPM and ESS statistical business process metadata Session 4 H. Linden, Unit B6 Eurostat Workshop on Statistical Metadata (METIS) (Geneva, 5-7 October.
Eurostat Sharing data validation services Item 5.1 of the agenda.
1 Recent developments in quality related matters in the ESS High level seminar for Eastern Europe, Caucasus and Central Asia countries Claudia Junker,
United Nations Economic Commission for Europe Statistical Division CSPA: The Future of Statistical Production Steven Vale UNECE
Validation Architecture in the ESS CSPA Workshop, Geneva June 2016 Geneva June 2016 Eurostat, Vincent TRONET, Unit B1.
Theme (iv): Standards and international collaboration
The Eurostat Metadata Handler Götzfried Eurostat (Head of Unit B6)
Business Case National Accounts Production System – Services (NAPS-S)
Achievements in 2016 Data Integration Linked Open Metadata
The ESS vision, ESSnets and SDMX
Cost analysis of key statistical products
Progress on ESS Validation Project
ESS Validation State of Play and next steps
ESS Vision 2020 Recent developments Addressing the skill gaps
ESS Vision 2020 Validation: Implementation of deliverables
11. The future of SDMX Introducing the SDMX Roadmap 2020
SISAI STATISTICAL INFORMATION SYSTEMS ARCHITECTURE AND INTEGRATION
SIMSTAT IS progress report
Validation Break-out sessions
The European Statistical System
ESSnet SCFE – Wiesbaden workshop - Introduction
Towards a European validation architecture
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
2. An overview of SDMX (What is SDMX? Part I)
ESS Vision 2020: ESS.VIP Validation
2. An overview of SDMX (What is SDMX? Part I)
Data Validation in the ESS Context
Task Force on Annual Financial Accounts
ESS Validation Project State of Play and next steps
Business and IT Architecture for ESS validation
Data Validation in the ESS Context
Statistical Information Technology
Giuliano Amerini Unit E6 (Transport)
CORA ESSNet COmmon Reference Architecture starting ...
Draft Methodology for impact analysis of ESS.VIP Projects
3rd WGM Meeting 3 May 2018 Item 2.3 Possible standards for ESS Validation.
Validation services developed in the ESS
ESS VIP ICT Project Task Force Meeting 5-6 March 2013.
ESS Validation Project State of Play and next steps
3. An overview of the SDMX implementation process
Applying the ESS EARF in a VIP project: The ESS.VIP Validation example
SDMX : General introduction H. Linden, Eurostat, Unit B5
Dissemination Working Group John Allen
Data integration methods
SDMX Implementation: the process
2.4 Business Architecture For ESS Validation
SDMX Implementation The National Accounts use case
M. Henrard, B5 N. Buysse and H. Linden, B6 Eurostat
Statistical cooperation
CSPA in National Accounts Demonstration
Modernisation of Validation in the ESS Status report
3. An overview of the SDMX implementation process
Presentation of Project Joint meeting of the ESS.VIP.BUS ICT Project
Modernisation of Validation in the ESS Collaboration with countries
The migration to the new EDAMIS
ESS Vision and VALIDATION
Project objectives and benefits
SDMX in AFRICA SDMX Roadmap th SDMX Global Conference
Presentation transcript:

Modernisation of validation in the European Statistical System (ESS) A multidimensional approach UNECE-CES Work session on Statistical Data Editing The Hague, 24-26 April 2017 Topic (ii) – Theme (iv) vincent.tronet@ec.europa.eu Eurostat, Unit B1

ESS Objective: providing comparable statistics at EU level Focus on EU policies, harmonisation extended to nearly all statistical fields Partnership between Eurostat and NSI in EU and EFTA countries Cooperation with candidate countries, Commission services, ECB and International Organisations

ESS Vision 2020 Adopted in May 2014 European response to challenges of official statistics 5 key areas of work: Focus on users Strive for quality Harness new data sources Promote efficiency in production processes Improve dissemination and communication

Data validation in production chain Focus on data sent to Eurostat Member State Eurostat Validation can take place in several points of the ESS statistical production process

Current situation

Main shortcomings in current situation No clear picture of who in the ESS is doing what as regards validation: Risk of validation gaps Time-consuming validation "ping-pong" Potential subjectivity in assessment of data quality Lack of standards and of a common architecture for shareable and reusable validation services: Duplication of IT development costs within the ESS Manual work due to low integration between the different tools

Medium-term goals for ESS validation Business Outcomes Goal 1: Ensure the transparency of the validation procedures applied to the data sent to Eurostat by the ESS Member States. Increase in the quality and credibility of European statistics   Reduction of costs related to the time-consuming validation cycle in the ESS ("validation Ping-Pong") Goal 2: Enable sharing and re-use of validation services across the ESS on a voluntary basis. Reduction of costs related to IT development and maintenance

A multidimensional approach Methodology Processes/governance Information standards IT resources Human resources

A. Methodology Need for a common framework for validation in the ESS => methodological handbook: Definitions Classifications Metrics

B. Processes/Governance Business architecture for ESS Validation Eurostat and Member States agree at domain specific Working Groups on validation rules, severity and roles Validation principles Standard process for determining acceptability of data Main data flows needed for efficient validation process Possible scenarios for reusing validation services in countries to validate data prior to official transmission

Key principles for validation in business architecture The sooner, the better (Eurostat cannot correct country data) Trust, but verify Well-documented and appropriately communicated validation rules Well-documented and appropriately communicated validation errors Comply or explain Good enough is the new perfect

To-be situation

3 optional IT scenarios for countries Connect to central repositories Installation of service or tool in own environment Only access to repositories (SDMX DSDs, validation rules) Connect to CSPA compliant service Integration of service into own workflow Connect to the process Transmission to Eurostat required Eurostat runs service and delivers report

C. Information standards Need for standardization of the 3 types of objects exchanged with Member States: Data: SDMX information model (any structured data) Validation rules: VTL (Validation and Transformation Language) for statisticians and machines Standard validation reports for statisticians and machines

D. Information Technology in ESS => 3 CSPA services, 2 Registries

D. Information Technology in ESS => 3 CSPA services, 2 Registries Structural Validation service based on a Data Structure Registry (e.g. SDMX Registry) Content Validation service that checks consistency and plausibility of data based on rules stored in VTL in a Validation Rule Registry Validation Rule Manager to view, create, modify and manage VTL validation rules stored in the Validation Rule Registry.

E. Human resources Communication (e.g. Validation Portal) Training (ESS course on 21-22 Nov and 4 Regional conferences planned between end 2017 and Q1/2018) Work to describe validation rules, agree on them and keep them up to date

Concrete use case: National Accounts Objective: Re-design of the National Accounts statistical Production System using Eurostat corporate (CSPA compliant) services for managing and validating incoming data files.

Initial situation (3 parallel processes) FAME Oracle RDBMS Oracle OLAP

Phase 1: Common structural validation

Phase 2: Common structural and content validation

Future: More common statistical services

Conclusions Modernisation of validation in the ESS will contribute to lower the burden, increase quality and transparency Cooperation and International standards are essential Components are progressively being put in place Limited focus in a first stage: Data sent to Eurostat Deliverables potentially extended in future to cover the whole statistical process in countries and at Eurostat First results already visible

QUESTIONS ?