Case Studies: Statistics Canada (WP 11) Alice Born Statistics UNECE Workshop on Statistical Metadata.

Slides:



Advertisements
Similar presentations
Guidelines on Integrated Economic Statistics United Nations Statistics Division Regional Seminar on Developing a Programme for the Implementation Programme.
Advertisements

Input Data Warehousing Canada’s Experience with Establishment Level Information Presentation to the Third International Conference on Establishment Statistics.
Making the Case for Metadata at SRS-NSF National Science Foundation Division of Science Resources Statistics Jeri Mulrow, Geetha Srinivasarao, and John.
Metadata to Support the Survey Life Cycle Alice Born, Statistics Canada Joint UNECE/Eurostat/OECD Work Session on Statistical Metadata (METIS) Geneva,
ESSnet on SDMX phase II Laura Vignola ISTAT Rome, 3-4 December 2012.
Is Your Data Facility ISO Compliant? Progress Towards Harmonizing the DDI and ISO/IEC Dan Gillman Information Scientist US Bureau of Labor Statistics.
Implementation of GSBPM, DDI and SDMX reference metadata at Statistics Denmark UNECE workshop 5-7 May 2015 Mogens Grosen Nielsen
Reducing Metadata Objects Dan Gillman November 14, 2014.
LEVERAGING THE ENTERPRISE INFORMATION ENVIRONMENT Louise Edmonds Senior Manager Information Management ACT Health.
Environment Change Information Request Change Definition has subtype of Business Case based upon ConceptPopulation Gives context for Statistical Program.
ESCWA SDMX Workshop Session: Role in the Statistical Lifecycle and Relationship with DDI (Data Documentation Initiative)
Metadata: Integral Part of Statistics Canada Quality Framework International Conference on Agriculture Statistics October 22-24, 2007 Marcelle Dion Director.
WP.5 - DDI-SDMX Integration
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
Implementing ESS standards for reference metadata and quality reporting at Istat Work Session on Statistical Metadata Topic (i): Metadata standards and.
Survey Data Management and Combined use of DDI and SDMX DDI and SDMX use case Labor Force Statistics.
Metadata management and statistical business process at Statistics Estonia Work Session on Statistical Metadata (Geneva, Switzerland 8-10 May 2013) Kaja.
M ETADATA OF NATIONAL STATISTICAL OFFICES B ELARUS, R USSIA AND K AZAKHSTAN Miroslava Brchanova, Moscow, October, 2014.
SDMX and DDI Working Together Technical Workshop 5-7 June 2013
3 rd Annual European DDI Users Group Meeting, 5-6 December 2011 The Ongoing Work for a Technical Vocabulary of DDI and SDMX Terms Marco Pellegrino Eurostat.
Judy Lee Enterprise Statistics Division Statistics Canada I 1 Developing Metadata Standards in an Integration Project at Statistics Canada United Nations.
4 April 2007METIS Work Session1 Metadata Standards and Their Support of Data Management Needs Daniel W. Gillman Bureau of Labor Statistics Paul Johanis.
CASE STUDY: STATISTICS NORWAY (SSB) Jenny Linnerud and Anne Gro Hustoft Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg.
IMDB Registration of Survey Variables Dec 19, 2005.
Metadata Registries Workshop April 15, 1998 Slide 1 of 20 ANSI X Douglas D. Mann Stewardship Naming & Identification Classification.
« 8-11 July 2008 « Metadata Life Cycle « STATISTICS PORTUGAL.
Statistics Portugal/ Metadata Unit Monica Isfan « Joint UNECE/ EUROSTAT/ OECD Work Session on Statistical Metadata.
February 17, 1999Open Forum on Metadata Registries 1 Census Corporate Statistical Metadata Registry By Martin V. Appel Daniel W. Gillman Samuel N. Highsmith,
Metadata Architecture at StatCan MSIS 2008 Luxembourg, April 7-9, 2008 Karen Doherty Director General Informatics Branch Statistics Canada.
Statistical Metadata System in the State Statistical Committee Baku, Azerbaijan, 2013 State Statistical Committee of the Republic of Azerbaijan 1.
9 th Open Forum on Metadata Registries Harmonization of Terminology, Ontology and Metadata 20th – 22nd March, 2006, Kobe Japan. Presentation Title: Day:
South Africa Case Study Update Matile Malimabe Executive Manager: Standards Acting Executive Manager: Data Management & Technology.
Environment Change Information Request Change Definition has subtype of Business Case based upon ConceptPopulation Gives context for Statistical Program.
Statistical Metadata Strategy and GSIM Implementation in Canada Statistics Canada.
SDC JE What is a Data Registry? v A place to keep facts about characteristics of data that are necessary to clearly describe, inventory,
1 1 Developing a framework for standardisation High-Level Seminar on Streamlining Statistical production Zlatibor, Serbia 6-7 July 2011 Rune Gløersen IT.
Pilot Census in Poland Some Quality Aspects Geneva, 7-9 July 2010 Janusz Dygaszewicz Central Statistical Office POLAND.
Overview of SC 32/WG 2 Standards Projects Supporting Semantics Management Open Forum 2005 on Metadata Registries 14:45 to 15:30 13 April 2005 Larry Fitzwater.
Regional Seminar on Promotion and Utilization of Census Results and on the Revision on the United Nations Principles and Recommendations for Population.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Tutorial on XML Tag and Schema Registration in an ISO/IEC Metadata Registry Open Forum 2003 on Metadata Registries Tuesday, January 21, 2003; 4:45-5:30.
SDMX IT Tools Introduction
2.An overview of SDMX (What is SDMX? Part I) 1 Edward Cook Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October 2015.
United Nations Oslo City Group on Energy Statistics OG7, Helsinki, Finland October 2012 ESCM Chapter 8: Data Quality and Meta Data 1.
May 2007 Registration Status Small Group Meeting 1: August 24, 2009.
Metadata Framework for a Statistical Data Warehouse
RECENT DEVELOPMENT OF SORS METADATA REPOSITORIES FOR FASTER AND MORE TRANSPARENT PRODUCTION PROCESS Work Session on Statistical Metadata 9-11 February.
Role of the IMDB in the CBA and IM Strategy Presented to Information Management Committee Standards Division June
Statistical Data and Metadata Exchange SDMX Metadata Common Vocabulary Status of project and issues ( ) Marco Pellegrino Eurostat
Use of Standardized Metadata to Find, Select and Access Statistical Data - Experience of Statistics Canada - Joint UNECE/Eurostat/OECD Work Session on.
METIS 2011 Workshop Session III – National Implementation of the GSBPM Alice Born and Tim Dunstan Thursday October 6, 2011 Implementation of the GSBPM.
Presented By Margaret Hellen Atiro Uganda Bureau of Statistics at the United Nations Regional Seminar on Census Data Archiving 20 – 23 Sep 2011, Addis.
Relationship between Short-term Economic Statistics Expert Group Meeting on Short-Term Statistics February 2016 Amman, Jordan.
Statistical process model Workshop in Ukraine October 2015 Karin Blix Quality coordinator
United Nations Statistics Division Developing a short-term statistics implementation programme Expert Group Meeting on Short-Term Economic Statistics in.
METADATA MANAGEMENT AT ISTAT: CONCEPTUAL FOUNDATIONS AND TOOLS Istituto Nazionale di Statistica ITALY.
Metadata requirements for archiving structured data Alice Born Statistics Canada Joint UNECE/Eurostat/OECD Work Session on Statistical Metadata (9-11 April.
Metadata models to support the statistical cycle: IMDB
Prepared by: Galya STATEVA, Chief expert
Guidelines on Integrated Economic Statistics
Generic Statistical Business Process Model (GSBPM)
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
Guidelines on Integrated Economic Statistics
2. An overview of SDMX (What is SDMX? Part I)
SDMX in the S-DWH Layered Architecture
Guidelines on Integrated Economic Statistics
The role of metadata in census data dissemination
M. Henrard, B5 N. Buysse and H. Linden, B6 Eurostat
Work Session on Statistical Metadata (Geneva, Switzerland May 2013)
Joint UNECE/Eurostat/OECD
Presentation transcript:

Case Studies: Statistics Canada (WP 11) Alice Born Statistics UNECE Workshop on Statistical Metadata July 4 to 6, 2007

Outline 1.Overview 2.Statistical metadata systems and the statistical cycle 3.Statistical metadata in each phase of the statistical cycle 4.Systems and design issues 5.Organizational and cultural issues

Overview of Integrated Metadatabase (IMDB) To support interpretation of the data – dissemination phase Responsibility of Standards Division (metadata, classifications and standard definitions) Adherence to Policy on Informing Users on Data Quality and Methodology, Policy on Standards and Quality Assurance Framework In general, metadata goes back November 2001

Overview of Integrated Metadatabase (IMDB) Contains metadata on 350 active and 250 inactive surveys and statistical programs –Purpose –Methodology used to produce the data –Measures of data accuracy –Variables, classifications for the data –Location of clean master datafile –Contacts Survey managers cannot release data without the prescribed metadata – mandatory

Overview of Integrated Metadatabase (IMDB) Next priorities: Complete documentation of variables Complete questionnaire model determine metadata for archived datafiles – may require additional metadata Lessons learned: Opportunities in collecting metadata in the first phase of the statistical cycle – not at the time of dissemination

Statistical metadata systems and the statistical cycle Relationship with survey planning and design phase IMDB expanded its role as part of the Household Survey Content Harmonization Standardize concepts, questions, question blocks across household surveys Variables follow the ISO-IEC Questions and question blocks, associated response choices linked to variables and classifications are stored in the IMDB at the beginning Survey Specification Manager pulls metadata from the IMDB but contains specifications and code

Statistical metadata systems and the statistical cycle Relationship to dissemination systems Metadata for information modules on the STC website – mandatory Information for survey respondents – requires metadata prior to release of data Data Liberation Initiative – public-use microdata files documented in DDI Metadata to support data exchange – SDMX, DDI, XBRL, Wiki, HTML, etc….

Statistical metadata systems and the statistical cycle Relationship to aggregation - analysis phase Analytical datawarehouses use IMDB to organize their tables (variables and classifications) Relationship to archive phase IMDB contains location of master datafile, record layout, contact information Currently developing business rules for archived datafiles

Statistical metadata systems and the statistical cycle Relationship with management systems Software Register – registry of Agency’s software and applications organized by survey and statistical program – IMDB is the inventory Quality management assessment and questionnaire – based on inventory of surveys in the IMDB; reuse of existing metadata

Operations Management Quality Assurance AnalysisDissemination CollectEditEstimateTabulatePublish Operational Data Registers Survey Data Administrative Data Data Warehouses Operational Data Stores IMDB in the survey life cycle Design Metadata IMDB Archive IMDB

Statistical metadata for phases in the statistical cycle Metadata describing statistical business processes –Data dissemination for interpretation of data –IMDB serves as the corporate inventory of all surveys and statistical programs, questionnaires, master datafiles –metadata or paradata resides in other metainformation systems – SSM, IQMS

Statistical metadata for phases in the statistical cycle Metadata for data elements –Supports: Survey planning and design; Analysis; Dissemination; Archiving –Metadata objects tracked over time for changes (versioning) and validity (registration) –Output to online data tables and STC products –For discovery – inventory of DE on STC website and STCWiki (internal review before going public) –Links to questions, question blocks, datafiles

STCWiki – Type of marital status of person

Statistical metadata for phases in the statistical cycle Metadata for survey planning and design –Questions, standard questions blocks and standard response choices in IMDB –Mapped to value domains, data elements and surveys in the IMDB –These metadata assembled into collection instruments in other metainformation systems outside the IMDB

Systems and design issues IMDB started in 1998 –Phase 1 Consolidation of existing metadata stores –Phase 2 Metadata describing statistical business processes –Phase 3 Metadata for data elements, etc. MetaStat system – Statistical activity, survey, instance, frame, universe, instrument, datafiles, survey methodology, documentation, data accuracy MetaWeb system – object class, property, data element, value domain, question, response choices, question block, value meaning manager

Phase 2 Input Screens Text strings related to data components Directives Resource Bundle Key Value SurveySDDS Statistical Data Doc… …... Labels Resource Bundle KeyValue SurveySDDS SDDS …... IMDB database

Phase 2 Input Screen Administered Item

Phase 2 - Identification Tab

Systems and design issues Dissemination and information discovery systems Web publication from IMDB is through HTML, dynamically generated with Perl scripts Conforms to government standards – CLF Survey-centric view and developing DE-centric view Discovery from Wiki solution – non-linear view of Phase 2 and 3 metadata Allows users to view links among administered items in the IMDB

Organizational and cultural issues Information management Assist in harmonization / usage of standards Knowledge sharing Corporate memory Reuse of our metainformation assets

Knowledge Sharing/Corporate Memory Survey Life Cycle IMDB CollectEditEstimateTabulatePublishDesign Survey Universe Frame Instance Collection Instrument Methodology Data Files Enterprise Architecture Concepts (Object Class, Property, Data Element Concept) Data Elements Questions Questions Blocks Classifications (Conceptual Domain Value Domain)

Corporate Memory Data Files IMDB Operational Data Registers Survey Data Administrative Data Operational Data Stores Clean Master File Public Use Master File Archival information Archived Data

IMDB Reuse of Information Assets Information Discovery/Dissemination Wiki HTML SDMX DDI ? One meta data source many uses for the information many output formats

Reuse of Information Assets Applications Development IMDB Classification coding Collection instrument development Publishing Other applications

Reuse of Information Assets Integration with Data IMDB Data Warehouses CANSIM

Organizational and cultural issues STC is one of the most integrated statistical systems in the world As part of its Enterprise Architecture strategy – moving towards centralized and generalized systems, including the IMDB IMDB was built initially to support interpretation of disseminated data Pressure is to provide metadata up (and down) the statistical value chain and into management systems Opportunities at the Survey planning and design phase – reuse of existing metadata (variables, classifications, questions, etc) registered in the IMDB – coherence