Update and Thoughts on Directions for Metadata Work Carol Hert March 17, 2003.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
Status on the Mapping of Metadata Standards
Nesstar, ESDS International and ESDS Qualidata online demonstrations ASLIB visit to the UK Data Archive Wednesday 24 November 2004 Louise Corti, Associate.
SDMX in the Vietnam Ministry of Planning and Investment - A Data Model to Manage Metadata and Data ETV2 Component 5 – Facilitating better decision-making.
Metadata at ICPSR Sanda Ionescu, ICPSR.
Web Service Ahmed Gamal Ahmed Nile University Bioinformatics Group
Introduction to ZPORTAL Prepared by Houeida K. Charara Electronic Resources Librarian LAU Libraries ©2010.
Data Catalogue Service Work Package 4. Main Objective: Deployment, Operation and Evaluation of a cataloguing service for scientific data. Why: Potential.
Is Your Data Facility ISO Compliant? Progress Towards Harmonizing the DDI and ISO/IEC Dan Gillman Information Scientist US Bureau of Labor Statistics.
NESSTAR Limitedw w w. n e s s t a r. c o m DDI-Publishing Made Easy- the Nesstar Way Jostein Ryssevik Nesstar Ltd.
Information Retrieval in Practice
Issues in the Transfer of Help Tools to Government Agencies: The Example of the Statistical Interactive Glossary (SIG) Stephanie W. Haas School of Information.
Environmental Terminology System and Services (ETSS) June 2007.
The Statistical Knowledge Network: Glossary and Metadata at the EIA Stephanie W. Haas & Sheila O. Denn The GovStat Project NSF.
“Reverse Engineering” Statistical Metadata through User Studies Carol A. Hert Syracuse University January 23, 2003.
The GovStat Project ils.unc.edu/govstat Integration of Data and Interfaces to Enhance Human Understanding of Government Statistics: Toward the National.
BUSINESS DRIVEN TECHNOLOGY
Metadata for the SKN: Philosophy, Progress, and Future Directions Sheila Denn, Dan Gillman, Carol Hert, Jung Sun Oh, and Cristina Pattuelli.
Overview of Search Engines
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
ISO as the metadata standard for Statistics South Africa
WP.5 - DDI-SDMX Integration
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
Case Studies: Statistics Canada (WP 11) Alice Born Statistics UNECE Workshop on Statistical Metadata.
Distributed Access to Data Resources: Metadata Experiences from the NESSTAR Project Simon Musgrave Data Archive, University of Essex.
The 21 st Century Library Collaborative Services, Standards, and Interoperability William E. Moen School of Library and Information Sciences Texas Center.
4 April 2007METIS Work Session1 Metadata Standards and Their Support of Data Management Needs Daniel W. Gillman Bureau of Labor Statistics Paul Johanis.
NCSU Libraries Kristin Antelman NCSU Libraries June 24, 2006.
10/18/2015 NORTEL NETWORKS CONFIDENTIAL – FOR TRAINING PURPOSES ONLY Global Documentation Evolution System Overview and End-to-End Process Training.
February 17, 1999Open Forum on Metadata Registries 1 Census Corporate Statistical Metadata Registry By Martin V. Appel Daniel W. Gillman Samuel N. Highsmith,
19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick.
Towards Web Semantics Spreadsheets and the US Government Lee Feigenbaum, Cambridge Semantics Brand Niemann, U.S. EPA SICoP Special Conference February.
Metadata Architecture at StatCan MSIS 2008 Luxembourg, April 7-9, 2008 Karen Doherty Director General Informatics Branch Statistics Canada.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
EPA’s Environmental Terminology System and Services (ETSS) Michael Pendleton Data Standards Branch, EPA/OEI Ecoiformatics Technical Collaborative Indicators.
Linking Tasks, Data, and Architecture Doug Nebert AR-09-01A May 2010.
IS 325 Notes for Wednesday August 28, Data is the Core of the Enterprise.
Implementation Experiences METIS – April 2006 Russell Penlington & Lars Thygesen - OECD v 1.0.
InSPIRe Australian initiatives for standardising statistical processes and metadata Simon Wall Australian Bureau of Statistics December
XML Engr. Faisal ur Rehman CE-105T Spring Definition XML-EXTENSIBLE MARKUP LANGUAGE: provides a format for describing data. Facilitates the Precise.
N NESSTAR: A Semantic Web Application for Statistical Data and Metadata Pasqualino “Titto” Assini Nesstar Ltd - UK.
GBIF Data Access and Database Interoperability 2003 Work Programme Overview Donald Hobern, GBIF Programme Officer for Data Access and Database Interoperability.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Shawn Jones INDUS Corporation January 18, 2000 Open Forum on Metadata Registries Santa Fe, NM SDC JE-2029.
1 Registry Services Overview J. Steven Hughes (Deputy Chair) Principal Computer Scientist NASA/JPL 17 December 2015.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
Eurostat November 2015 Eurostat Unit B3 – IT and standards for data and metadata exchange Jean-Francois LEBLANC Christian SEBASTIAN SDMX IT Tools SDMX.
Copyright (c) 2014 Pearson Education, Inc. Introduction to DBMS.
1 Database Environment. 2 Objectives of Three-Level Architecture u All users should be able to access same data. u A user’s view is immune to changes.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Distributed Archives Interoperability Cynthia Y. Cheung NASA Goddard Space Flight Center IAU 2000 Commission 5 Manchester, UK August 12, 2000.
Semantics and the EPA System of Registries Gail Hodge IIa/ Consultant to the U.S. Environmental Protection Agency 18 April 2007.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
International Planetary Data Alliance Registry Development and Coordination Project Report 7 th IPDA Steering Committee Meeting July 13, 2012.
JAFER Toolkit Project Oxford University 1 JAFER Java-based high level Z39.50 toolkit Matthew Dovey; Colin Tatham; Antony Corfield; Richard Mawby Oxford.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
International Planetary Data Alliance Registry Project Update September 16, 2011.
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
An Overview of Data-PASS Shared Catalog
Census Technology: Processing architecture and data analysis
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
2. An overview of SDMX (What is SDMX? Part I)
Metadata The metadata contains
SDMX IT Tools SDMX Registry
Palestinian Central Bureau of Statistics
Presentation transcript:

Update and Thoughts on Directions for Metadata Work Carol Hert March 17, 2003

Our Metadata Activities User study to understand metadata necessary for integration tasks (we’re finding needs for metadata not available in agencies) Ongoing efforts to understand DDI and ISO11179 for deploying in end-user tools Identification of host of other relevant standards (open archives, business XML, Z39.50, …) Marked-up tables using DDI Attempting to acquire particular metadata

Metadata Aspects for GovStat Conceptual Tasks Determining elements and attributes to be used in wrapping data and contextual info (an XML DTD presumably) User study et al. to determine appropriate content “thought” experiments with implementations related to elements, attributes, and their values Developing conceptual metadata model for SKN Practical Tasks Finding the actual metadata content to be “wrapped” via the elements finding data with metadata to port into tools

Today’s Presentation Focus on the Conceptual Tasks Status report on potentially relevant standards and projects Considering the user tools and the public intermediary Start strategizing on directions to pursue further

Concept. Task 1: Identifying Elements, Attributes, and Values Current Contenders for Elements, Attributes (and some values) DDI (and its implementations) ISO11179 (and its implementations) Hybrids Corporate Metadata Repository (CMR) from Oracle Data cubes for Tables from NESSTAR, DDI

DDI Data set is the basic element Data archives perspective-designed primarily for people who archive data sets and those who will retrieve and reuse those datasets Does capture information on variables, values, etc. Still actively working on specifications for tables (see Ryssevik memo 3/6/2003)

DDI Issues Doesn’t have good mechanism for relating surveys and instances of those surveys- each data set is considered as stand-alone Hard to compare across variables and time-series Elements for tables still in development and other data presentations (such as news releases, graphics) not well developed Currently working backwards to a conceptual model for the metadata

DDI Implementations of Note Counting California Virtual Data Center (Harvard/MIT) NESSTAR/FASTER Developed CRISTAL datacubes and FasterCubes Minnesota Population Center Developed WendyCubes for data cubes WendyCubes and FasterCubes being merged Data Ferrett (Census)

ISO11179 from the data producers’ perspective (Dan argues that it doesn’t take any perspective) Able to relate survey instances, etc. Isn’t capable of handling the full range of metadata we might need, nor can it handle data representations such as news releases, webpages, etc. (same problem with DDI)

ISO11179 Implementations StatCanada Dan G. has reservations about this implementation and feels it doesn’t meet the standard (more as I understand the problem better)

Is CMR the answer? CMR as a registry to describe data, data processes, data quality and which links to datasets and data CMR incorporates all of ISO11179, and DDI, in addition can support a variety of metadata types (those news releases) CMR not open source, cost unknown (software cost and Oracle consultants) Two good contacts for us Dan has gotten for BLS Sarah Nusser acquiring for Iowa State

Seque to Conceptual Task 2 My original goal was to determine what metadata elements would be necessary for a given end- user tool (e.g. the SIG) and determine which standard(s) could provide necessary functionality (enabling metadata to get from agencies to the user tools) I started by looking at the SIG and also at DDI implementations to see what functionalities we could acquire

The Plot Thickens Two new questions emerged from these activities What functions/information (data & metadata) would be necessary in SKN What other standards efforts should be considered in creating the SKN?

The SKN Architecture

INTERNAL TO AGENCIESPUBLIC INTERMEDIARY POSSIBLE SKN USER TOOLS/FUNCTIONS TRANSFERS Agency data production Data archives standards, projects and their functions CMR; Proprietary metadata repositories; Presentation formats (html, xml, pdf, etc.); Database formats (ACCESS, ALMIS ); DDI Datacubes NESSTAR/Faster CRISTAL; XML for Analysis; Common Warehouse Metadata Model; Statistical disclosure (SDC in Nesstar); StatCan ISO imp. DDI (and DDI for datacubes) NESSTAR /Faster CRISTAL Middleware (whatever that includes) NEOOM from Nesstar/Faster From Virtual Data Center (VDC): federated metadata harvesting, repository exchange and caching, federated authentication and authorization, naming Searching: Z39.50 Data analysis, Bookmarking, Downloading datasets (nesstar); Cataloging, archiving functions (VDC); Online search, data conversion, exploration, data analysis (VDC); Glossary (The Neuchatel Group) Statistical Interactive Glossary (SIG—our project) Ontologies (ISI/Columbia for gas); Relation Browsers; Online Help Z39.50 (used by VDC) Open Archives (VDC) DC, MARC, DDI metadata import and export (VDC) SOAP HTTP RDF (Nesstar) ASN.1

New Strategic Direction for Us? Specification of metadata necessary throughout SKN? Will require specification of interactions among components of SKN And perhaps the specification of specific standards

An example of a possible interaction User via interface “I want data on gasoline price indices in the state of MD” Query transferred to intermediary. Intermediary query agent has business rule requiring check of terms so forwards the term “indices” to the SIG

Example continued SIG responds with 3 definitions of index (specificity of definition) and multiple display options Intermediary business rule indicates to take most general and to use the term “index” in queries sent to agency data sources Etc.

New Strategic Direction for Us? Specification of functions (and related information) necessary throughout SKN? Will require specification of interactions among components of SKN (possible queries, acceptable responses, bindings among agents, etc.) And perhaps the specification of specific standards