Metadata driven application for data processing – from local toward global solution Rudi Seljak Statistical Office of the Republic of Slovenia.

Slides:



Advertisements
Similar presentations
Building a Knowledge Management System as a Life Cycle
Advertisements

STANDARD ERRORS PRESENTATION AND DISEMINATION AT THE STATISTICAL OFFICE OF THE REPUBLIC OF SLOVENIA Rudi Seljak Statistical Office of the Republic of Slovenia.
Database Processing: Fundamentals, Design and Implementation, 9/e by David M. KroenkeChapter 1/1 Copyright © 2004 Please……. No Food Or Drink in the class.
CZECH STATISTICAL OFFICE | Na padesatem 81, Prague 10 | Jitka Prokop, Czech Statistical Office SMS-QUALITY The project and application.
Lecture 13 Revision IMS Systems Analysis and Design.
A Data Curation Application Using DDI: The DAMES Data Curation Tool for Organising Specialist Social Science Data Resources Simon Jones*, Guy Warner*,
Fundamentals, Design, and Implementation, 9/e Chapter 1 Introduction to Database Processing.
Chapter 1 Program Design
Modernisation of Statistical Processing at SURS Andreja Smukavec, SURS Rudi Seljak, SURS Workshop on Modernisation of Statistical Production Geneva, 15–17.
International Seminar on Modernizing Official Statistics:
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
Metadata driven application for aggregation and tabular protection Andreja Smukavec SURS.
9 Feb 2004Mikko Mäkinen & Saija Ylönen Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Geneva, 9-11 February 2004, Topic (ii): Metadata.
The Edit Anders Norberg, Statistics Sweden (SCB) Work Session on Statistical Data Editing Ljubljana, Slovenia, 9-11 May 2011.
Implementing ESS standards for reference metadata and quality reporting at Istat Work Session on Statistical Metadata Topic (i): Metadata standards and.
Rudi Seljak, Metka Zaletel Statistical Office of the Republic of Slovenia TAX DATA AS A MEANS FOR THE ESSENTIAL REDUCTION OF THE SHORT-TERM SURVEYS RESPONSE.
M ETADATA OF NATIONAL STATISTICAL OFFICES B ELARUS, R USSIA AND K AZAKHSTAN Miroslava Brchanova, Moscow, October, 2014.
The value added of a national statistical institute Max Booleman Marleen Verbruggen.
Introduction and key issues identified in the papers UNECE Conference of European Statisticians June 2015 Second Seminar, Session I.
Chapter 2: Software Process Omar Meqdadi SE 2730 Lecture 2 Department of Computer Science and Software Engineering University of Wisconsin-Platteville.
1 The availability, timeliness and quality of rapid estimates UNCTAD experience Henri Laurencin INTERNATIONAL SEMINAR ON TIMELINESS, METHODOLOGY AND COMPARABILITY.
Development of metadata in the National Statistical Institute of Spain Work Session on Statistical Metadata Genève, 6-8 May-2013 Ana Isabel Sánchez-Luengo.
Register-Based Census 2011 in Slovenia – Some Quality Aspects Danilo Dolenc Statistical Office of the Republic of Slovenia UNECE-Eurostat Expert Group.
Population census micro data for research: the case of Slovenia Danilo Dolenc Statistical Office of the Republic of Slovenia Ljubljana, First Regional.
CZECH STATISTICAL OFFICE 1 The Quality Metadata System In the Czech Statistical Office Work Session on Statistical Metadata (METIS)
Quality Reporting at SORS – Experiences and Future Perspectives Rudi Seljak, Tina Ostrež Statistical Office of the Republic of Slovenia.
Current and Future Applications of the Generic Statistical Business Process Model at Statistics Canada Laurie Reedman and Claude Julien May 5, 2010.
BAIGORRI Antonio – Eurostat, Unit B1: Quality; Classifications Q2010 EUROPEAN CONFERENCE ON QUALITY IN STATISTICS Terminology relating to the Implementation.
Eurostat Expression language (EL) in Eurostat SDMX - TWG Luxembourg, 5 Jun 2013 Adam Wroński.
USING THE METADATA IN STATISTICAL PROCESSING CYCLE – THE PRODUCTION TOOLS PERSPECTIVE Matjaž Jug, Pavle Kozjek, Tomaž Špeh Statistical Office of the Republic.
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
Sampling Error Estimation – SORS practice Rudi Seljak, Petra Blažič Statistical Office of the Republic of Slovenia.
Developing Statistical Information Systems and XML Information Technologies - Possibilities and Practicable Solutions Geneva,
1 C. ARRIBAS, D. LORCA, A. SALINERO & A. COLMENERO Measuring statistical quality at the Spanish National Statistical Institute.
Cmpe 589 Spring 2006 Lecture 2. Software Engineering Definition –A strategy for producing high quality software.
Process Description and Quality Guidelines – Two Birds with One Stone European Conference on Quality in Official Statistics Q2014 Rudi Seljak, Tina Steenvoorden.
Business model Transformation Strategy (BmTS) John Pearson and Tracey Savage Statistics NZ’s.
Outlining a Process Model for Editing With Quality Indicators Pauli Ollila (part 1) Outi Ahti-Miettinen (part 2) Statistics Finland.
Developing and applying business process models in practice Statistics Norway Jenny Linnerud and Anne Gro Hustoft.
The Application for Statistical Processing at SURS Andreja Smukavec, SURS Rudi Seljak, SURS UNECE Statistical Data Confidentiality Work Session Helsinki,
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Workforce Scheduling Release 5.0 for Windows Implementation Overview OWS Development Team.
Open GSBPM compliant data processing system in Statistics Estonia (VAIS) 2011 MSIS Conference Maia Ennok Head of Data Warehouse Service Data Processing.
Developing the prototype Longitudinal Business Database: New Zealand’s Experience Julia Gretton IAOS Conference Shanghai, China, October 2008
Integrated metadata systems History Status Vision Roadmap
RECENT DEVELOPMENT OF SORS METADATA REPOSITORIES FOR FASTER AND MORE TRANSPARENT PRODUCTION PROCESS Work Session on Statistical Metadata 9-11 February.
QUALITY ASSESSMENT OF THE REGISTER-BASED SLOVENIAN CENSUS 2011 Rudi Seljak, Apolonija Flander Oblak Statistical Office of the Republic of Slovenia.
United Nations Economic Commission for Europe Statistical Division Standards-based Modernization of Official Statistics Steven Vale UNECE
The views expressed herein are those of the author and should not be attributed to the IMF, its Executive Board, or its management Meeting the Future Demands.
1 Case Study Integrated Metadata Driven Statistical Data Management System (IMD SDMS) CSB of Latvia METIS 2010.
The business process models and quality issues at the Hungarian Central Statistical Office (HCSO) Mr. Csaba Ábry, HCSO, Methodological Department Geneva,
Program Design. Simple Program Design, Fourth Edition Chapter 1 2 Objectives In this chapter you will be able to: Describe the steps in the program development.
Using SAS Stored Processes and the SAS Portal for Delivering Statistics to Drug Discovery Volker Harm PhUSE/PSI One-day Event 2009, Marlow.
United Nations Economic Commission for Europe Statistical Division GSBPM in Documentation, Metadata and Quality Management Steven Vale UNECE
Statistics Estonia's new system for statistical data activity processing (VAIS) ITDG Luxembourg 2010 Allan Randlepp.
Analysis and Reporting Toolset (A&RT): Lessons on how to develop a system with an external partner David Smith AstraZeneca.
The Role of service Granularity in Successful CSPA Realization Zvone Klun, Tomaž Špeh Geneve, 22 June 2016.
Information Systems Development
CASE Tools and Joint and Rapid Application Development
Computer Aided Software Engineering (CASE)
UNITED NATIONS ECONOMIC COMMISSION FOR EUROPE CONFERENCE OF EUROPEAN STATISTICIANS Work Session on Statistical Data Editing April 2017 The Hague,
Rudi Seljak, Aleš Krajnc
Information Systems Development
Tomaž Špeh, Rudi Seljak Statistical Office of the Republic of Slovenia
Integrated Statistical Information System (ISIS) in Croatia By Maja Ledić Blažević, Senior Advisor, Research & Development Dept. and Branka Cimermanović,
Data validation in Statistical Office of the Republic of Serbia
Chapter 1 Introduction to Database Processing
Business architecture
Presentation of Project Joint meeting of the ESS.VIP.BUS ICT Project
Technical Coordination Group, Zagreb, Croatia, 26 January 2018
Presentation transcript:

Metadata driven application for data processing – from local toward global solution Rudi Seljak Statistical Office of the Republic of Slovenia

Summary of presentation Introduction Current generic application – main characteristics Development of global solution Changes in the statistical process Conclusions

Introduction Statistical data processing: –Demanding, time consuming and very expensive task –Constant pressure for budget cuts Rationalisation of the statistical process: –Take advantage of the rapid IT development –Movement from domain oriented to process oriented production –Stove-pipe IT solutions replaced by general applications Statistical Office of the Republic of Slovenia (SURS) –SURS began systematic development of generic solutions 6 years ago –Prototype solutions for several parts of the process were developed –These solutions were already used for several large surveys (e.g Agriculture Census and the 2011 Population Census) –The prototype generic solutions are now upgraded to a more global solutions

Generalised solutions – main characteristics Small, generic solutions for small parts of the statistical process, called the building blocks: –Enable easy and flexible linking of inputs and outputs of the individual components to the whole statistical process –Can be plugged to different databases in different environments (e.g. ORACLE, SAS) if the input database follows few basic conditions –They are designed as fully metadata driven (MDD) systems: one program code → the parameters for the execution of the processing for the concrete survey are provided through the special metadata tables –The process metadata can be provided in different environments (SAS, MS Access, ORACLE) → the metadata organisation must follow the strict rules of its structure (tables and variables)

Building blocks - functioning … Different microdata databases General SAS program Ad-hoc program Ad-hoc program Building block Different databases of process metadata

Linking bulding blocks into the process Building block 1 Microdata Building block 2 Ad-hoc program Building block n Transformed data … Ad-hoc program Transformed data Ad-hoc program Transformed data

Process metadata The system is to a very large extent based on the process metadata: –Processing rules which enable adjustment of the general program for different surveys. The process metadata are at the moment inserted directly into MS Access database –High probability of syntax errors –Users must be thoroughly instructed in order to correctly fill the metadata TableVariableConditionCorr_ruleStep TABLE1XX/Y >1000Round(X/100)1 TABLE1ZZ NE XX2

Building blocks The basic tool of the whole system are the building blocks, which cover the particular processing phase. SAS macros which is able to operate on the basis of the process metadata. So far the building blocks for following phases are created: –Data validation (logical controls) –Deterministic corrections –Data imputations –Standard error estimation –Aggregation –Tabulation –Calculation of quality indicators –Disclosure control (testing phase)

Building a global solution The developed system is very open and flexible tool. However certain re-integration would be needed to increase its functionality: –To move the process metadata in ORACLE environment –To create single, unique database of process metadata where process metadata for all the surveys are stored and maintained –To develop the graphical interfaces for user friendly management of process metadata –To link the system with the metadata repository

The new system … Different microdata databases General SAS program Ad-hoc program Database of processing metadata Metadata repository Ad-hoc program Application for metadata management Data on tables and variables

Application for metadata management Deterministic corrections

Application for metadata management Execution of the particular process step

New application and statistical process Generic MDD application introduces changes in the implementation of data processing on general level: –Essentially different distribution of work between IT specialists, general methodologists and IT experts –Change in the role of subject-matter statisticians → changed expectations of their skills and capabilities –The work organisation of the IT Department and the General Methodology Department will have to be changed from domain oriented to process oriented. –Different approach of IT and methodology experts will be needed. Experts capable of thinking and operating at a much more general level Survey is just one of the realisations of the general statistical process.

Conclusions SURS developments in recent years: flexible, metadata driven generic solutions for different phases of data processing. Very open system will be replaced with more integrated and centralised system Main goal: Transition from the stove-pipe oriented production to the more integrated processing systems Two main challenges: –To build the generic IT solutions, which would „cover“ the wide diversity of statistical surveys –To change the very „domain oriented state of mind “ among the employees

Thank you for your attention