Metadata Architecture at StatCan MSIS 2008 Luxembourg, April 7-9, 2008 Karen Doherty Director General Informatics Branch Statistics Canada.

Slides:



Advertisements
Similar presentations
National Institute of Statistics, Geography and Informatics (INEGI) Implementation of SDMX in Mexico.
Advertisements

April, 2004 Lars Thygesen International Trade Expert meeting Whats going on at OECD: statistical information management.
Metadata to Support the Survey Life Cycle Alice Born, Statistics Canada Joint UNECE/Eurostat/OECD Work Session on Statistical Metadata (METIS) Geneva,
Information Infrastructure: Foundations for ABS Transformation Stuart Girvan, Australian Bureau of Statistics MSIS Paris, April 2013.
Reducing Metadata Objects Dan Gillman November 14, 2014.
Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010.
IPUMS to IHSN: Leveraging structured metadata for discovering multi-national census and survey data Wendy L. Thomas 4 th Conference of the European Survey.
10 April 2014 The Redesigned WSDOT Data Catalog Andy Everett, Metadata Repository Librarian, Washington State DOT.
WP.5 - DDI-SDMX Integration
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
Case Studies: Statistics Canada (WP 11) Alice Born Statistics UNECE Workshop on Statistical Metadata.
Survey Data Management and Combined use of DDI and SDMX DDI and SDMX use case Labor Force Statistics.
Data Warehousing at STC MSIS 2007 Geneva, May 8-10, 2007 Karen Doherty Director General Informatics Branch Statistics Canada.
Using ISO/IEC to Help with Metadata Management Problems Graeme Oakley Australian Bureau of Statistics.
Meeting on the Management of Statistical Information Systems (MSIS 2012) Washington, DC, May 2012 Development of Inter-Ministry Information System.
Using the SAS® Information Delivery Portal
Overview of SDMX: Statistical Data and Metadata eXchange Technical and Content Standards for Statistical Data Ann McPhail, Division Chief Statistics Department,
Judy Lee Enterprise Statistics Division Statistics Canada I 1 Developing Metadata Standards in an Integration Project at Statistics Canada United Nations.
4 April 2007METIS Work Session1 Metadata Standards and Their Support of Data Management Needs Daniel W. Gillman Bureau of Labor Statistics Paul Johanis.
Architecture for a Database System
CASE STUDY: STATISTICS NORWAY (SSB) Jenny Linnerud and Anne Gro Hustoft Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg.
Chapter 6 SAS ® OLAP Cube Studio. Section 6.1 SAS OLAP Cube Studio Architecture.
GSIM implementation in the Istat Metadata System: focus on structural metadata and on the joint use of GSIM and SDMX Mauro Scanu
United Nations Economic Commission for Europe Statistical Division Part B of CMF: Metadata, Standards Concepts and Models Jana Meliskova UNECE Work Session.
Statistics Portugal/ Metadata Unit Monica Isfan « Joint UNECE/ EUROSTAT/ OECD Work Session on Statistical Metadata.
February 17, 1999Open Forum on Metadata Registries 1 Census Corporate Statistical Metadata Registry By Martin V. Appel Daniel W. Gillman Samuel N. Highsmith,
ISO/IEC : Framework for a Metadata Registry By Daniel W. Gillman Bureau of Labor Statistics USA.
Interoperability & Knowledge Sharing Advisor: Dr. Sudha Ram Dr. Jinsoo Park Kangsuk Kim (former MS Student) Yousub Hwang (Ph.D. Student)
Enabling Reuse-Based Software Development of Large-Scale Systems IEEE Transactions on Software Engineering, Volume 31, Issue 6, June 2005 Richard W. Selby,
Statistical Metadata Strategy and GSIM Implementation in Canada Statistics Canada.
SDC JE What is a Data Registry? v A place to keep facts about characteristics of data that are necessary to clearly describe, inventory,
1 1 Developing a framework for standardisation High-Level Seminar on Streamlining Statistical production Zlatibor, Serbia 6-7 July 2011 Rune Gløersen IT.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Tutorial on XML Tag and Schema Registration in an ISO/IEC Metadata Registry Open Forum 2003 on Metadata Registries Tuesday, January 21, 2003; 4:45-5:30.
SDMX IT Tools Introduction
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
SDMX and Metadata SDMX Basics Course 12 April 2013 Daniel Suranyi Eurostat B5 Management of statistical data and metadata.
2.An overview of SDMX (What is SDMX? Part I) 1 Edward Cook Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October 2015.
User-Driven Integrated Statistical Solutions: Government for the People by the People Open Forum on Metadata Registries Santa Fe, New Mexico January 20,
Metadata Framework for a Statistical Data Warehouse
7b. SDMX practical use case: Census Hub
CENSUS OUTPUTS Dissemination Plans Chris Ashford 2011 Census Outputs : Technical Delivery.
Data in context Chapter 1 of Data Basics. Frameworks Today, we will be presenting two frameworks for thinking about the content of data services. A.Statistics.
Statistical Metadata Extensions to the X3.285 Metamodel By Daniel W. Gillman Chairman, NCITS/L8 U.S. Bureau of the Census.
Role of the IMDB in the CBA and IM Strategy Presented to Information Management Committee Standards Division June
Use of Standardized Metadata to Find, Select and Access Statistical Data - Experience of Statistics Canada - Joint UNECE/Eurostat/OECD Work Session on.
Metadata Driven Integrated INFORMATION SYSTEM of CSB of LATVIA Version Central Statistical Bureau of Latvia April 5 – 9, 2008 / Luxembourg Presentation.
Eurostat Report on SDMX Reference Infrastructure User Group 1 st meeting in Luxembourg Sept 2012 Item 5.2 of the agenda November 2012IT Director's.
Eurostat 6. SDMX: A non-technical overview of the SDMX architecture and IT tools 1 Raynald Palmieri Eurostat Unit B5: “Central data and metadata services”
>> Metadata What is it, and what could it be? EU Twinning Project Activity E.2 26 May 2013.
Metadata requirements for archiving structured data Alice Born Statistics Canada Joint UNECE/Eurostat/OECD Work Session on Statistical Metadata (9-11 April.
Metadata models to support the statistical cycle: IMDB
The evolution of the SDMX infrastructure and services
Census Hub in practice Working Group "European Statistical Data Support" Luxembourg, 29 April 2015.
Cross-domain concepts
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
2. An overview of SDMX (What is SDMX? Part I)
2. An overview of SDMX (What is SDMX? Part I)
SDMX in the S-DWH Layered Architecture
SDMX Tools Overview and architecture
Metadata The metadata contains
ESS VIP ICT Project Task Force Meeting 5-6 March 2013.
Prepared by Peter Boško, Luxembourg June 2012
The role of metadata in census data dissemination
The Role of Metadata in Census Data Dissemination
Integrated Statistical Production System WITH GSBPM
Palestinian Central Bureau of Statistics
Presentation transcript:

Metadata Architecture at StatCan MSIS 2008 Luxembourg, April 7-9, 2008 Karen Doherty Director General Informatics Branch Statistics Canada

2 Table of Contents Building Blocks Viewing Metadata Via STCWiki Metadata Architecture Conclusion

3 Background Large collection of data offerings led to the need for metadata on the surveys and administrative data sources A lot of effort was spent initially on defining the surveys to support the dissemination function Focus has shifted to the descriptions of the variables and code sets used

4 Business Drivers Many programs now load their data into warehouses to support analysis and data confrontation Users were asking for the supporting metadata to be available from within the warehouses Users wanted to be able to submit corrections to the metadata as they were working on the data

5 Building Blocks Metadata collection –Integrated Metadata Base (IMDB) Data manipulation and data display tools –Data Warehouse Framework –EzWeb –STCwiki

6 Integrated Metadata Base Contains information on our 590 active surveys Benefits: –Improves the interpretability of our surveys –Assists in assuring the coherence of our data –Promotes knowledge sharing within StatCan and with external users –Preserves corporate memory –Promotes the reuse and standardization of metadata assets including definitions and code sets

7 Original Vision for the IMDB

8 Integrated Metadata Base Architecture: –Oracle Database –Java Model based on ISO/IEC Metadata Registries and the Corporate Metadata Repository (CMR) from the US Census Bureau

9 Integrated Metadata Base Data dimension model : –Describes the data –Based on ISO/IEC –Data elements (variables) specified by an object class with properties and values (code sets, classifications) –Can be used to describe any set of objects not just metadata

10 Integrated Metadata Base ISO data element definition : –Object class: type of object being described (person, establishment, household, etc.) –Property: attribute that describes the object (sex, age, etc.) –Conceptual Domain: list of possible settings (value meanings) for a property (property Sex has two possible meanings Male and Female) –Value Domain: code set used to represent the value meanings (1 = Male, 2 = Female, etc.)

11 Data Warehouse Framework Standard development framework for data warehouses at Statcan Based on Microsoft.net / SQLServer Users can access metadata from within the warehouse

12 EZWeb & STCwiki EzWeb –used to develop Intranet sites in a very standardized way –incorporates MS Excel pivot tables to allow users to view data in a warehouse via OLAP –can navigate easily from one OLAP report to another STCwiki –used to display metadata from the IMDB –allows users to collaborate and to submit proposed changes to the IMDB –based on MediaWiki (used by Wikipedia)

13 Access from a Data Warehouse A user viewing data in a warehouse by age range and year can see the associate metadata by clicking on the Metadata icon (top menu).

14 Metadata Browsing in STCwiki The user can view any of the metadata associated with the table they were looking at – in this case the definitions of the age range variable.

15 Metadata Browsing in STCwiki The user can view any metadata stored in IMDB – all is accessible from STCwiki.

16 Put it All Together So now we have the building blocks, how do we put it together into a coherent architecture?

17 Objectives Leverage investment in IMDB to describe not only statistical metadata but also metadata on IT systems and the enterprise architecture Develop user-friendly interfaces to metadata from within a warehouse Improve support for coding activities and classifications systems Allow users to submit proposed changes to the metadata in the IMDB

18 The IMDB is Expanding Expansion plans for this year are highlighted in blue.

19 Multiple Formats Warehouse data and IMDB metadata can be combined and rendered in any format via converter modules.

20 Metadata Architecture Target architecture Take data and metadata and render/publish it in a variety of modes via software based format converters. Most of this architecture exists today at StatCan.

21 Conclusion IMDB has the design required to support a full range of metadata functionality at StatCan A significant amount of variable-related information already exists in the IMDB Work for 2008/09 –Completing the process for reviewing and approving metadata changes from wiki users –Completing the Classification Management System –Mapping out a proposal for funding to develop a format converter (DDI or SDMX)