The Role of Metadata in Census Data Dissemination

Slides:



Advertisements
Similar presentations
Development of Strategies for Census Data Dissemination Introduction United Nations Regional Seminar on Census Data Dissemination and Spatial Analysis.
Advertisements

Input Data Warehousing Canada’s Experience with Establishment Level Information Presentation to the Third International Conference on Establishment Statistics.
Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
Harnessing the Power of Microdata Standards, tools and best practices for microdata dissemination and management International Household Survey Network.
POLICIES AND PROCEDURES FOR ARCHIVING DATA IN BURUNDI.
Regional Seminar on Census Data Archiving for Africa, Addis Ababa, Ethiopia, September 2011 Overview of Archiving of Microdata Session 4 United Nations.
Case Studies: Statistics Canada (WP 11) Alice Born Statistics UNECE Workshop on Statistical Metadata.
GSIM implementation in the Istat Metadata System: focus on structural metadata and on the joint use of GSIM and SDMX Mauro Scanu
United Nations Economic Commission for Europe Statistical Division Part B of CMF: Metadata, Standards Concepts and Models Jana Meliskova UNECE Work Session.
Metadata Models in Survey Computing Some Results of MetaNet – WG 2 METIS 2004, Geneva W. Grossmann University of Vienna.
Assessing The Development Needs of the Statistical System NSDS Workshop, Trinidad and Tobago, July 27-29, 2009 Presented by Barbados.
1 Interoperability of Spatial Data Sets and Services Data quality and Metadata: what is needed, what is feasible, next steps Interoperability of Spatial.
Documenting and disseminating census and survey data sets Ilpo Survo, United Nations ESCAP, Bangkok, for UNECE.
United Nations Regional Seminar on Census Data Archiving for Africa, Addis Ababa, Ethiopia, September, 2011 Documentation and Cataloguing in Data.
United Nations Regional Seminar on Census Data Dissemination and Spatial Analysis for Arabic Speaking Countries, Amman, Jordan May 2011 Identification.
ADP SUPPORT IN UGANDA BUILDING A NATIONAL DATA ARCHIVE Presented by Kizito Kasozi Director Information Technology Uganda Bureau of Statistics PARIS21.
Secure Epidemiology Research Platform (SERPent) Kick Start Meeting - April 15 th, 2010 Pascal Heus
Archiving microdata Standards and good practices United Nations Statistics Commission New York, February 26, 2009 Olivier Dupriez World Bank, Development.
The Role of International Standards for National Statistical Offices Andrew Hancock Statistics New Zealand Prepared for 2013 Meeting of the UN Expert Group.
United Nations Regional Seminar on Census Data Dissemination and Spatial Analysis for Arabic Speaking Countries, Amman, Jordan May 2011 Identification.
General Recommendations on STS Carsten Boldsen Hansen Economic Statistics Section, UNECE UNECE Workshop on Short-Term Statistics (STS) and Seasonal Adjustment.
Statistical Data and Metadata Exchange SDMX Metadata Common Vocabulary Status of project and issues ( ) Marco Pellegrino Eurostat
Copyright 2010, The World Bank Group. All Rights Reserved. Principles, criteria and methods Part 1 Quality management Produced in Collaboration between.
The Role of service Granularity in Successful CSPA Realization Zvone Klun, Tomaž Špeh Geneve, 22 June 2016.
Role of Metadata in dissemination of census data Regional Seminar on dissemination and spatial analysis of census data, Nairobi, September, 2010.
Census Planning and Management for next Nigerian Census
Setting Up a National Data Archive: The Ugandan Experience
Data sharing at Deutsche Bundesbank: The House of Microdata
Namibia Experience in the use of Administrative Data
Development of Strategies for Census Data Dissemination
Experiences Informal Sector in National Accounts
Olivier Dupriez World Bank / IHSN
MANAGEMENT OF STATISTICAL PRODUCTION PROCESS METADATA IN ISIS
Pietro Gennari FAO Chief Statistician co-Chair CCSA
Quality assurance in official statistics
Streamlining the Statistical Production in TurkStat Metadata Studies in TURKSTAT High Level Seminar for Eastern Europe, Caucasus and Central Asia Countries.
International Statistical Classifications for SDG Indicators
Dissemination Workshop for African countries on the Implementation of International Recommendations for Distributive Trade Statistics May 2008,
"Development of Strategies for Census Data Dissemination".
WORKSHOP GROUP ON QUALITY IN STATISTICS
Data Management: Documentation & Metadata
Documentation of statistics
Sub-regional workshop on integration of administrative data, big data
Measuring Data Quality and Compilation of Metadata
Francesca Perucci United Nations Statistics Division
Institutional Framework, Resources and Management
Statistics Governance and Quality Assurance: the Experience of FAO
Tomaž Špeh, Rudi Seljak Statistical Office of the Republic of Slovenia
Idendification of and Consultation with Census Data Users
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
2. An overview of SDMX (What is SDMX? Part I)
2. An overview of SDMX (What is SDMX? Part I)
Role of Metadata in Census Data Dissemination
Modernization of Statistical data processes
BETTER AND PROPER ACCESS TO PACIFIC MICRODATA
High level seminar on the implementation of the
Sub-Regional Workshop on International Merchandise Trade Statistics Compilation and Export and Import Unit Value Indices 21 – 25 November Guam.
Energy Statistics Compilers Manual
Jan Byfuglien Statistics Norway
Statistical System in India
Presentation to SISAI Luxembourg, 12 June 2012
Part B of CMF: Metadata, Standards Concepts and Models Jana Meliskova
The role of metadata in census data dissemination
The Role of Metadata in Census Data Dissemination
1. SDMX: Background and purpose
Generic Statistical Information Model (GSIM)
Introduction to reference metadata and quality reporting
7. Introduction to the main SDMX objects for metadata exchange
Presentation transcript:

The Role of Metadata in Census Data Dissemination United Nations Regional Seminar on Census Data Dissemination and Spatial Analysis Amman, Jordan 16-19 May 2011 Background When talking about strategies for data dissemination, we cannot avoide the subject of MD. It is critical for dissemination and corrct use of data. -Also, as the exchange of statistical information grows, there are stronger calls for comprehensive and accessible metadata systems. -

Outline What is metadata? What does metadata do? Types of metadata Metadata in census data dissemination Metadata standards: current situation Conclusion

What is metadata? It is structured information or documentation about data which: informs users about the content, quality and condition of data; describes the structure of datasets, explains, locates, or makes it easier to retrieve, use, or manage data; provides information on the processes of data production; guides on proper usage or interpretation of data. “Data that define and describe other data” (ISO definition) “Information about information” (Dion, 2006) What is metadata? There are many statements about MD, but no single agreed conceptual definition. - MD is structured information (or documentation) about statistical data which: -informs users about the content, quality and condition of the data. -describes the structure of dataset, explains, locates, or otherwise makes it easier to retrieve, use, or manage an information resource. -provides information on the processes of production -guides on proper usage or interpretation of data, and is instrumental in transforming data into meaningful information. Putting it simply.... Organization for Standardization (ISO) which defines metadata as “data that defines and describes other data” 3

What does metadata do? Assist in retrieving and processing data Support correct use of data Provide transparency in data Enhance interoperability Improve archiving, preservation, institutional memory over time Statistical metadata systems have several basic roles, some of which support end-users and others support statistical production. 1) In a computerized environment, metadata assists in retrieving and processing data in various statistical applications. 2) Support correct use of data , because Good documentation reduces the likelihood of misuse of data. 3) Comments on data quality, coverage, difficulties in the process of data collection, any deviations from recommended definitions/classifications enhance transparency in data. 4) Enhance interoperability Describing a data resource with structured metadata allows to exchange data with minimal loss of content and functionality. 5) Archiving, preservation, institutional memory Metadata ensures that data resources will survive and continue to be accessible into the future. THINK that dDigitally available information may become unusable in the future as hardware and software technologies change. A well-structured metadata system however can help in migrating data properly across various technologies, thus preserving data. 4

Types of metadata Structural metadata provide information about the structure of the dataset act as identifiers and descriptors of the data, making it possible to properly identify, retrieve and browse the data Reference metadata allow a thorough understanding and interpretation of the corresponding statistical data Describe the concepts, definitions, methodology and quality of data; production and dissemination process, data access conditions, release policy, confidentiality, etc Broadly speaking..... There are two types of metadata: Structural metadata – Structural metadata are those metadata which provide information about the structure of the dataset. They act as identifiers and descriptors of the data structure, making it possible to properly identify, retrieve and browse the data. Reference metadata -allow a thorough understanding and interpretation of the corresponding statistical data - Reference metadata (also known as explanatory metadata) are generally in a textual format and describe the content of the data, including the concepts, methodology and quality, and so on.. 5

Metadata in Census Data Dissemination All tabulations of census data should include: Census questions Reason why the questions are asked Conceptual definitions Geographic hierarchies used Changes since the previous census, regarding content, operational methods or geographic boundaries Quality indicators (ex. coverage rates and item non-response) Methodological note on the rules and methods applied If a long-form sample is used... Sampling design, size, sampling variability of the results (P&R, 2008) Metadata is a key element of census dissemination to ensure that the underlying concepts on the issues are well understood and that the results are properly interpreted. According to the P&R.....

Metadata standards: current situation  Currently, data dissemination and exchange take place in ad-hoc manner, using all kinds of non-standard format. Common standards and guidelines are needed to enable more efficient exchange of statistical data. Standard metadata system will: ensures consistency and comparability of content avoids duplication and diversity of definitions ensures reduction in cost of data development - Currently, data dissemination and exchange take place in ad-hoc manner, using all kinds of non-standard format. Common standards and guidelines are needed to enable more efficient exchange of statistical data. - We should realize that considerable gains from adopting common approaches to metadata management. Standard metadata system will: ensures consistency and comparability of content avoids duplication and diversity of definitions ensures reduction in cost of data development 7

Metadata standards (cont’d) Two international metadata standards are becoming well established: - SDMX (Statistical Data and Metadata Exchange) a number of international agencies have endorsed SDMX; supported by the UN Statistical Commission -DDI (Data Dissemination Initiative) Microdata Management Toolkit of WB uses the DDI metadata standard At the international level, many different metadata schems are being developed. Among these, Two international metadata standared for data exchanges. 1.SDMX (Statistical Data and Metadata Exchange 2.DDI (Data Dissemination Initiative ) - a number of international agencies, including IMF, OECD, FAO, have moved to use SDMX for standerdized data exchange. 8DSMX ensures MD always come along with the data, making the information immediately understandable, - aggregate level?) - IHSN (International Household Survey Network) of WB offers developing countries a *Macrodata Management toolkit”, in which DDI metadata standared is used. (microdata documentation – unit base??) 8

Conclusion Metadata are at the heart of the information management. They should be an integral part of statistical dissemination strategies. Standards for metadata management are important to develop. Regional and international collaboration between NSOs is an important consideration 9

Thank you !

Users of metadata Metadata support the knowledge of potential user of statistical information. The major users include: Users of statistical data Producers of statistical data Researchers on the development of statistical systems Who needs metadata? -Users and producers of statistical data are obvious users of statistical metadata. -Users of statistical data: The user of statistical data will need some metadata to analyze and interpret the statistical data. The level of detail of the metadata needed will vary depending on the type of the user. Different categories of users have different needs and requirements for the level of detail. As a result statistical services will have to offer a wide range of metadata to suit different users. - Producers of statistical data: A broad range of people involved in the production of statistical data and information products including -- designers of data collections programs, subject matter statisticians, statistical methodologists, and information syst em specialists as well as providers of input data and respondents of surveys – all need metadata in their work. -Researchers on the development and methodology of statistical systems also need metadata 11

Points for discussion - What are the success factors of metadata? Strategies? Infrastrucure? Resources? - How to ensure the security of metadata? - How “open” or “transparent” the metadata can be? - How to improve standerdization of metadata?