Role of the IMDB in the CBA and IM Strategy Presented to Information Management Committee Standards Division June 7 2010.

Slides:



Advertisements
Similar presentations
Guidelines on Integrated Economic Statistics United Nations Statistics Division Regional Seminar on Developing a Programme for the Implementation Programme.
Advertisements

Input Data Warehousing Canada’s Experience with Establishment Level Information Presentation to the Third International Conference on Establishment Statistics.
Making the Case for Metadata at SRS-NSF National Science Foundation Division of Science Resources Statistics Jeri Mulrow, Geetha Srinivasarao, and John.
Metadata to Support the Survey Life Cycle Alice Born, Statistics Canada Joint UNECE/Eurostat/OECD Work Session on Statistical Metadata (METIS) Geneva,
ESSnet on SDMX phase II Laura Vignola ISTAT Rome, 3-4 December 2012.
United Nations Oslo City Group on Energy Statistics 8 th Oslo Group Meeting, Baku, Azerbaijan September 2013 ESCM Chapter 8: Data Quality and Metadata.
CZECH STATISTICAL OFFICE | Na padesatem 81, Prague 10 | Jitka Prokop, Czech Statistical Office SMS-QUALITY The project and application.
Fitting a survey life cycle in the DDI Irene Wong Chuck Humphrey IASSIST Edinburgh May 2005.
Statistics and Data for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 27, 2008.
EAS 293 Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 14, 2008.
An Integrated Approach to Economic Statistics “ The Canadian Experience” UNSD – IBGE Workshop on Manufacturing Statistics Kevin Roberts Rio de Janeiro,
André Loranger New York, June 2014 The Integrated Business Statistics Program at Statistics Canada Presentation to the UNCEEA Assistant Chief Statistician.
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
Environment Change Information Request Change Definition has subtype of Business Case based upon ConceptPopulation Gives context for Statistical Program.
Metadata: Integral Part of Statistics Canada Quality Framework International Conference on Agriculture Statistics October 22-24, 2007 Marcelle Dion Director.
WP.5 - DDI-SDMX Integration
FCM Quality of Life Reporting System Metadata By: Acacia Consulting and Research June 2002.
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
Case Studies: Statistics Canada (WP 11) Alice Born Statistics UNECE Workshop on Statistical Metadata.
The Canadian Integrated Approach to Economic Surveys Marie Brodeur, Peter Koumanakos, Jean Leduc, Éric Rancourt, Karen Wilson Statistics Canada International.
Data Warehousing at STC MSIS 2007 Geneva, May 8-10, 2007 Karen Doherty Director General Informatics Branch Statistics Canada.
Using ISO/IEC to Help with Metadata Management Problems Graeme Oakley Australian Bureau of Statistics.
M ETADATA OF NATIONAL STATISTICAL OFFICES B ELARUS, R USSIA AND K AZAKHSTAN Miroslava Brchanova, Moscow, October, 2014.
SDMX and DDI Working Together Technical Workshop 5-7 June 2013
Judy Lee Enterprise Statistics Division Statistics Canada I 1 Developing Metadata Standards in an Integration Project at Statistics Canada United Nations.
4 April 2007METIS Work Session1 Metadata Standards and Their Support of Data Management Needs Daniel W. Gillman Bureau of Labor Statistics Paul Johanis.
Use of Administrative Data in Statistics Canada’s Annual Survey of Manufactures Steve Matthews and Wesley Yung May 16, 2004 The United Nations Statistical.
Development of metadata in the National Statistical Institute of Spain Work Session on Statistical Metadata Genève, 6-8 May-2013 Ana Isabel Sánchez-Luengo.
The Adoption of METIS GSBPM in Statistics Denmark.
Data and Social Research Chuck Humphrey Data Library Rutherford North Library.
Eurostat Overall design. Presented by Eva Elvers Statistics Sweden.
Statistics Portugal/ Metadata Unit Monica Isfan « Joint UNECE/ EUROSTAT/ OECD Work Session on Statistical Metadata.
Current and Future Applications of the Generic Statistical Business Process Model at Statistics Canada Laurie Reedman and Claude Julien May 5, 2010.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
Metadata Architecture at StatCan MSIS 2008 Luxembourg, April 7-9, 2008 Karen Doherty Director General Informatics Branch Statistics Canada.
United Nations Economic Commission for Europe Statistical Division Mapping Data Production Processes to the GSBPM Steven Vale UNECE
Editing of linked micro files for statistics and research.
Statistical Metadata Strategy and GSIM Implementation in Canada Statistics Canada.
SNA seminar in the Caribbean Integrated questionnaires Marie Brodeur Director General, Industry Statistics Branch, Statistics Canada St. Lucia February,
Pilot Census in Poland Some Quality Aspects Geneva, 7-9 July 2010 Janusz Dygaszewicz Central Statistical Office POLAND.
Developing and applying business process models in practice Statistics Norway Jenny Linnerud and Anne Gro Hustoft.
Regional Seminar on Promotion and Utilization of Census Results and on the Revision on the United Nations Principles and Recommendations for Population.
SDMX IT Tools Introduction
Integrated metadata systems History Status Vision Roadmap
RECENT DEVELOPMENT OF SORS METADATA REPOSITORIES FOR FASTER AND MORE TRANSPARENT PRODUCTION PROCESS Work Session on Statistical Metadata 9-11 February.
Unified Enterprise Survey New Horizons International Conference on Establishment Surveys Daniela Ravindra and Marie Brodeur Montreal, June 2007 Statistics.
5.8 Finalise data files 5.6 Calculate weights Price index for legal services Quality Management / Metadata Management Specify Needs Design Build CollectProcessAnalyse.
METIS 2011 Workshop Session III – National Implementation of the GSBPM Alice Born and Tim Dunstan Thursday October 6, 2011 Implementation of the GSBPM.
The business process models and quality issues at the Hungarian Central Statistical Office (HCSO) Mr. Csaba Ábry, HCSO, Methodological Department Geneva,
Relationship between Short-term Economic Statistics Expert Group Meeting on Short-Term Statistics February 2016 Amman, Jordan.
Metadata requirements for archiving structured data Alice Born Statistics Canada Joint UNECE/Eurostat/OECD Work Session on Statistical Metadata (9-11 April.
Quality declarations Study visit from Ukraine 19. March 2015
Metadata models to support the statistical cycle: IMDB
Navigating Your Way Through the EFT, Nesstar and Beyond 20/20 (WDS)
MANAGEMENT OF STATISTICAL PRODUCTION PROCESS METADATA IN ISIS
Guidelines on Integrated Economic Statistics
Generic Statistical Business Process Model (GSBPM)
YTY − an integrated production system for business statistics
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
Guidelines on Integrated Economic Statistics
2. An overview of SDMX (What is SDMX? Part I)
SDMX in the S-DWH Layered Architecture
Guidelines on Integrated Economic Statistics
Mapping Data Production Processes to the GSBPM
The role of metadata in census data dissemination
Data Liberation Initiative (DLI)
The Role of Metadata in Census Data Dissemination
Metadata on quality of statistical information
METIS 2011 Workshop Session III – National Implementation of the GSBPM
Étienne Saint-Pierre, Statistics Canada
Presentation transcript:

Role of the IMDB in the CBA and IM Strategy Presented to Information Management Committee Standards Division June

Outline  Current content  IMDB and the GSBPM  Questions and questionnaires  Variables  Quality indicators  Quality review of the IMDB  Links to datawarehouses  Role of the IMDB in the CBA  Mapping to other metadata standards

Content in IMDB  Inventory of statistical metadata for all surveys and statistical programs  Inventory of all questionnaires (XHTML, PDF) – Information for Survey Participants, DLI/PUMF  Inventory of all variables, statistical units and classifications used for collection and dissemination  Potentially inventory of all questions, response choices, interviewer notes  Inventory of many documents for surveys and statistical programs – to support DDI and SDMX documentation and other international reporting IMF’s DQAF

Types of statistical metadata in and not in the IMDB Definitional metadata (or structural metadata – SDMX term) description of statistical data variables (statistical units, property and representation), their definitions and related classifications questions, record layouts Reference metadata Describes statistical datasets and processes Data sources, data collection, survey methodology, imputation, estimation Not part of “work flow” Operational metadata Measures of data accuracy – response rates, CVs sample size, limited statistical release information Metadata not included in IMDB Other operational metadata (edit failures, quality metrics, sign-offs) Systems metadata (edit rules, derivation rules, coding rules, imputation and estimation rules) Dataset metadata (structure, footnotes, titles) but IMDB does contain links to some record layouts, disseminated products and master data files. Paradata (not included in IMDB) Information related to statistical data and production process linked to a person, business or organization (i.e., unit in sample, unit has responded, number of attempts to reach unit) “PASSIVE METADATA” “ACTIVE METADATA”

Input dataMicro-data Confidential aggregate data Public output data 3 Build 4 Collect 5 Process 6 Analyze 7 Disseminate Operational data Registers Survey Data Administrative Data Datastores Operational Data Stores IMDB in the survey life cycle 1 Specify needs Metadata/paradata IMDB 8 Archive IMDB 2 Design Quality management and metadata management

Questions and questionnaires Many uses:  Harmonized content – approved questions and variables are stored in the IMDB (STCwiki and STCwebsite)STCwikiSTCwebsite IMDB-generated questionnaires  Of the 467 questionnaires on the Internet - 45% CLF2 compliant  DDI 3 (DLI) – pull questions, interviewer notes and other metadata from IMDB in RDC through Oracle forms)  Question inventory could be used by QDRC for testing and quality – reuse of concepts and questions (CBA)

Variables  comprehensive inventory of variables and related classifications  systematically evaluating 1,400 active CANSIM tables to build variables and classifications  to date, variables and classifications have been developed for prices, 37 out 42 tables produced by the UES survey programs and 33 out of 230 tables from the SNA tables  Annual Survey of Services Annual Survey of Services  Variables for harmonized content – part of HSS Variables for harmonized content  Resource intensive – need to validate variables with SMOs for economic and social statistics – what model should StatCan follow?

Variables  Pilot project – Statistics by variableStatistics by variable Prototype developed with 5 variables from harmonized content and links to CANSIM Working with Client Services and Dissemination Usability testing completed – March 2010 Expand to 30 variables on Analysts and researchers portal (June 2010) Present to Dissemination and Communication Committee for approval  ‘under construction’ approach – incrementally populate variable portal

Data quality indicators  Need to improve coherence across surveys and statistical programs in data accuracy section of IMDB  Integrate DPR indicators (accuracy, relevance, organizational efficiency) and indicators from 2009 Quality Guidelines – see 2010 METIS paper2010 METIS  Approach – ABS has Quality declarations for each survey and ISTAT has indicators for GSBPM processesQuality declarations  Crosscutting – Corporate Planning, Quality Secretariat and Standards Division work together  Are there additional DQ indicators required from an IM perspective?

Data quality indicators Quality declaration

Links to datawarehouses (“data centres”)  T2 TAX DW uses IMDB classifications (value domains) and its value domain loader tool: Enterprise Complexity Categories TDD Processing Environment TDD Name of Geographic Location and Geography – Canada, Region, Province TDD Imputation Categories TDD NAICS Groups Reference Year TDD Survey Universe Categories TDD  All warehouses have metadata menu option and links to IMDB content via STCwiki but not used all the time

Links to datawarehouses (“data centres”)  Involved with the prototype of information sharing between the ASM and SNA datawarehouses (CBA) SNA is the lead on this project Connectivity between data centres and processes both upstream and downstream Pull passive metadata from the IMDB (June 2010) May require other types of metadata from other systems or expand the IMDB  Gain practical understanding on how to organize data centres and make recommendations to the IMC and CBA Management Committee

Role of the IMDB in CBA  Report circulated to ARB and CBA Task Force recommends that the IMDB: Authoritative source of definitional and reference metadata for Statistics Canada metadata be “reused” to support GSBPM phases Other metadata/metainformation systems ‘pull’ from existing metadata from the IMDB Use international metadata standards for ‘meta-driven systems’  Review of IMDB architecture and other meta-information systems commissioned by ARB/Classification Systems Branch Preliminary results presented to IM Committee  Social survey processing environment will be linked to the IMDB

14 Statistics Canada Statistique Canada Metadata Environment: TO-BE

Mapping IMDB to other metadata standards Thesauri/search resources XBRL ISO11179 CMR DDI SDMX CWM Data Collection Data Dissemination Data Transfer between Organizations and Organizational Units Database Interoperability Data Collection Data Dissemination IMDB metamodel Tax, Health and SNA datawarehouses National Accounts BOP Trade in Services Social statistics: Data liberation initiative (DLI); STC microdata files Financial data from businesses

Mapping IMDB to other metadata standards  DDI (Data Documentation Initiative) already used in STC PUMF and “analytic” microdata files are DDI-XML tagged by DLI – STC and universities, and CRDCN (only until 2012) CRDCN has asked STC to continue DDI tagging after 2012 WG on Social Survey Metadata Environment is ensuring the ability to generate DDI output from generalized processing systems  Should the IMC recommend that DDI be a standard output of microdata files?

IMDB systems development  3-year work plan in place  one interface – METAWEB – for entering and updating metadata  Metadata entry done by Standards and only some divisions  Can the responsibility for entering and updating metadata (and variables) be pushed out to SM divisions or the IM Secretariat?