Implementing ModernStats Standards Linked Open Metadata

Slides:



Advertisements
Similar presentations
United Nations Economic Commission for Europe Statistical Division High-Level Group Achievements and Plans Steven Vale UNECE
Advertisements

GSBPM and GSIM as the basis for the Common Statistical Production Architecture Steven Vale UNECE
The European Statistical System Vision Infrastructure Programme Daniel Defays, Director Directorate B, Eurostat Eurostat Workshop on the Modernisation.
Luxembourg January CORE ESSnet (COmmon Reference Environment) final meeting Carlo Vaccari Istat - Italy.
Jenny Linnerud, 27/10/2011, Cologne1 ESSnet CORE Common Reference Environment ESSnet workshop in Cologne 27th and 28th of October 2011.
Recent Developments in CLARIN-NL Jan Odijk P11 LREC, Istanbul, May 23,
United Nations Economic Commission for Europe Statistical Division High-Level Group Achievements and Plans Steven Vale UNECE
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
SDMX IT Tools Introduction
Modernization Committee on Products and Sources: Report th High -Level Group Workshop on the modernization of Production and Services, Den Haag.
United Nations Economic Commission for Europe Statistical Division International Collaboration to Modernise Official Statistics Steven Vale UNECE
Aim: “to support the enhancement and implementation of the standards needed for the modernisation of statistical production and services”
OECD Expert Group on Statistical Data and Metadata Exchange (Geneva, May 2007) Update on technical standards, guidelines and tools Metadata Common.
1of 20 MSIS 2014 – Dublin Panel on CSPA Monica Scannapieco – Carlo Vaccari (Istat – Italy)
Renovation of Eurostat dissemination chain
Modernization Committee on Products and Sources: Proposal for HLG project on Data Integration 5 th High -Level Group Workshop on the modernization of Production.
Eurostat Standardisation DIME-ITDG 2015 Item 6 DIME-ITDG February
Modernisation Committee on Standards Priorities and future plans for 2015 and 2016 October 23, 2015.
Advancing statistics for development Marko Javorsek ESCAP Statistics Division Modernization Working Group on Production, Methods, and Standards (MWG) First.
United Nations Economic Commission for Europe Statistical Division Standards-based Modernisation Steven Vale UNECE
Modernisation Story of Statistics Slovenia
Data Integration in Official Statistics 2017 Project Proposal
DDI and GSIM – Impacts, Context, and Future Possibilities
Achievements in 2016 Data Integration Linked Open Metadata
The ESS vision, ESSnets and SDMX
Structural and reference metadata in the European Statistical System
Modernization Maturity Model and Roadmap
Istituto Nazionale di Statistica – Istat
GSIM Implementation at Statistics Finland Session 1: ModernStats World - Where to begin with standards based modernisation? UNECE ModernStats World Workshop.
Metadata Standards for Statistical Classifications
Modernization Committee on Products and Sources
DIME ITDG, Luxembourg 28 June 2016
Eurostat activities update
ESSnet Linked Open Statistics Update
SISAI STATISTICAL INFORMATION SYSTEMS ARCHITECTURE AND INTEGRATION
GSBPM, GSIM, and CSPA.
GSIM The Generic Statistical Information Model
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
How can DDI make the most of RDF?
2. An overview of SDMX (What is SDMX? Part I)
SISAI STATISTICAL INFORMATION SYSTEMS ARCHITECTURE AND INTEGRATION
2. An overview of SDMX (What is SDMX? Part I)
Statistical organisations should use standardised and industrial processes for the production of statistics in order to be more efficient. The statistical.
SISAI STATISTICAL INFORMATION SYSTEMS ARCHITECTURE AND INTEGRATION
ESS Standardisation State of play
WP7 – COMBINING BIG DATA - STATISTICAL DOMAINS
RAMON Re-engineering An Update
CORA ESSNet COmmon Reference Architecture starting ...
ITDG meeting of of October 2011
August Götzfried Eurostat unit B 4
ESS.VIP ADMIN EssNet on Quality in Multi-source Statistics, progress report 19TH WORKING GROUP ON QUALITY IN STATISTICS, 6 December 2016 Fabrice Gras,
Item 7.11 SDMX Progress report
SOA initiatives at Istat
Implementing the “Vision” within the ESS
WP 1 Management and Coordination
Standards and guidelines for reference metadata
Business architecture
Generic Statistical Information Model (GSIM)
The future of Statistical Production
ESTP Training Course “Enterprise Architecture and the different EA layers, application to the ESS context ” Rome, 16 – 19 October 2017.
COmmon REference Environment - CORE:
SCFE WP1 guidelines and procedures
Data Architecture project
DDI and GSIM – Impacts, Context, and Future Possibilities
ESS Enterprise Architecture
Implementing the “Vision” within ESS
Classifications and Linked Open Data Formalizing the structure and content of statistical classifications Item 9.1 Standards Working Group Luxembourg,
High-Level Group for the Modernisation of Official Statistics
Pilot use of Linked Open Data technologies for publishing official statistics: current status in the ESS and Eurostat April 17th, 2018 GISCO WG.
Presentation transcript:

Implementing ModernStats Standards Linked Open Metadata Direct Result of Sprint Franck Cotton, Insee Monica Scannapieco, Istat

Implementing ModernStats Standards Project proposed by the MC on Products and Sources at HLG Workshop on November 2015 Launched in 2016 with three workpackages WP1 Classifications and Vocabularies Linked Open Metadata WP2 Models and Services WP3 Maturity Model and Roadmap

Linked Open Metadata: Objectives Building proofs to show the value of Linked metadata Complete IT systems implementing significant use cases Learn by doing Design Guidelines Feasibility of application of Linked metadata to statistical domain Evaluation and sustainability plans

Linked Open Metadata: What OWL Semantic Web technologies OFFICIAL

Linked Open Metadata: Why Insee LOD Portal: http://rdf.insee.fr/sparql Comparable and interoperable statistics OWL Istat LOD Portal: http://datiopen.istat.it Japan e-Stats LOD Portal:http://data.e-stat.go.jp/lodw/

Project Workflow and Milestones Set up of sandbox technological environment Sharing of initial design guidelines RDF artefacts database: setup and integration Sprint in Rome to finalize results SemStats workshop to communicate results Jan-Feb March-April May-August September October

Rome Linked Open Metadata Sprint Duration: 3 days, 12-14 September 2016 Participants: 15 Organizations: Physical participation: Istat, Insee, CBS, Eurostat, UNECE Virtual participation: Mexico Location: SAPIENZA - University of Rome (neutral) Sprint type: Agile/SCRUM development (Sprint Master:Taeke Gjaltema) Hard programming… …but also blackboard thinking!

Project Outputs: RDF Artifacts Classifications UN classifications (with history): ISIC and CPC Eurostat classifications (with history): NACE and CPA National classifications: NAICS, Ateco, NAF and CPF Correspondence Tables SDMX measure_unit code list Ontologies: GSIM, CSPA, GSBPM Data on CSPA services RDF Triple Store (Stardog) on the Sandbox

Project Outputs: Web clients Awarded by Oracle at SemStats 2016 Best Challenge Classification Explorer Browsing Cross-Navigation btw classifications Searching/Exporting RDF Triple Store (Stardog) on the Sandbox Model Explorer Browsing Cross-Navigation btw models Edit CSPA services

Project Outputs: reporting for design, communication, sustainability Design Guidelines How to model and implement Linked data classifications Two papers at SemStats 2016 (reviewed by Academic and Official statistics representatives) An OWL Ontology for the Common Statistical Production Architecture by A.Dreyer, F.Cotton, G.Duffès An OWL Ontology for the Generic Statistical Information Model (GSIM): Design and Implementation by A.Dreyer, G.Duffès, D.Gillman, M. Scannapieco, L. Tosco Recommendations for projects results’ sustainability

DETAILS ON WORKPACKAGES Franck Cotton

Project Follow-Up (1) Issue 1: Maintainance of project artefacts RDF Classifications and correspondence tables GSIM, GSBPM, CSPA ontologies Data on CSPA services How to solve Issue 1: suggestions Sandbox platform could remain accessible until new facilities for sharing RDF artefacts are available HLG involvement, e.g. through the Supporting Standards group and the Sharing tools group Coordination with Eurostat DIGICOM project

Project Follow-Up (2) Issue 2: Extending the work on design guidelines How to solve Issue 2: suggestions HLG involvement, e.g. through the Supporting Standards group and the Sharing tools group Coordination with Eurostat DIGICOM project European ISA2 program (SEMIC - Semantics Interoperability Community) The ISA² programme supports the development of digital solutions that enable public administrations, businesses and citizens in Europe to benefit from interoperable cross-border and cross-sector public services. ISA² will run from 1 January 2016 until 31 December 2020.

Project Follow-Up (3) Details on project resources Main contribution for manpower by Insee and Istat Both technical management and implementation Contributions by CBS, Eurostat and UNECE at the Rome sprint Organizational aspects (Webex organization and reporting) by UNECE Sandbox fee

Wrapping-up: Advantages Statistical artefacts represented as Linked data have several advantages Defined once for all and shared for actual semantic interoperability Structure and content harmonization «Is the definition of the Business Process concept consistent and complete among GSIM, GSBPM and CSPA?» Correctness (consistency and completeness) checks: Formal representation of statistical metadata and models World wide technological standards Tools available off-the-shelf, technology independence, etc.

Final Remarks (1) However, besides the shown and discussed advantages, there are some risks involved by the adoption of the linked data paradigm: Dedicated technological stack and specific skills needed Technological standards still keep on (slowly) changing: changes in versions, performance issue, etc. A question could be: We are talking about metadata harmonization and sharing since many years, even decades, so… what’s new now?

Technology as an enabler Consolidate metadata management experience Final Remarks (2) Technology as an enabler + Consolidate metadata management experience Insee LOD Portal: http://rdf.insee.fr/sparql Japan e-Stats LOD Portal:http://data.e-stat.go.jp/lodw/ Ready to reap the benefits Istat LOD Portal: http://datiopen.istat.it

Thank you for your attention!