Chuck Humphrey Data Library Co-ordinator University of Alberta May 16, 2007 1 Capitalising on Metadata Tool development plans IASSIST 2007.

Slides:



Advertisements
Similar presentations
1 of 15 Information Access Internal Information © FAO 2005 IMARK Investing in Information for Development Information Access Internal Information.
Advertisements

Chuck Humphrey, Alberta Research Data Centre Canadian RDC Report Where are we now?
DLI & Research Data Centres Creating a better understanding of these two programs Chuck Humphrey Data Library University of Alberta April 2004.
Directions The DLI External Advisory Committee has been promoting DDI as a standard for DLI data. This appears in the DLI Strategic Plan following feedback.
DDI for the Uninitiated ACCOLEDS /DLI Training: December 2003 Ernie Boyko Statistics Canada Chuck Humphrey University of Alberta.
Chuck Humphrey University of Alberta RDC CFI Projects Building the next generation of metadata tools.
National Report - Canada - Sustaining Arctic Observing Networks Board Meeting – Potsdam, Germany October 2012.
Administrative Data Research Centre for England 1.
Data Access and Data Use: the Missing Link? Elizabeth Hamilton University of New Brunswick Chuck Humphrey University of Alberta Data and Knowledge Transfer.
Holyoke Public Schools Professional Development By, Judy Taylor
Meeting of CAUL/CONZUL and CREPUQ Sub-Committee of Libraries Montréal, Québec, October 10, 2001 October 10, 2001 A Research Digital Library : a Proposal.
Fitting a survey life cycle in the DDI Irene Wong Chuck Humphrey IASSIST Edinburgh May 2005.
Open Library Environment Designing technology for the way libraries really work November 19, 2008 ~ ASERL, Atlanta Lynne O’Brien Director, Academic Technology.
1 Canada’s National Data Archive Consultations Chuck Humphrey University of Alberta IASSIST 2005.
Chuck Humphrey University of Alberta The Canadian Research Data Centre Network’s DDI Tools Development Project IASSIST 2009.
Lecture 13 Revision IMS Systems Analysis and Design.
Chapter 6 Systems Development.
1 CES IASSIST 2002, June 2002 University of Connecticut MetaNet: Standardising Statistical Metadata Methodology Karen Brannen University of Edinburgh,
Statistics Canada Statistique Canada mai 2005 / 1.
Research and IR Cohabitating Chuck Humphrey University of Alberta IASSIST 2006.
1 Preserving Research Data The Canadian Experience Charles Humphrey University of Alberta February 2005.
NARA – Roper Center Collaboration: USIA Office of Research Surveys Michael Carlson National Archives and Records Administration Marc Maynard.
Welcome to CMPE003 Personal Computer Concepts: Hardware and Software Winter 2003 UC Santa Cruz Instructor: Guy Cox.
Taking SMEs from Word-based narrative to topic-based structure.
1 Open Library Environment Designing technology for the way libraries really work December 8, 2008 ~ CNI, Washington DC Lynne O’Brien Director, Academic.
NSI 1 Collect Process AnalyseDisseminate Survey A Survey B Historically statistical organisations have produced specialised business processes and IT.
Case Studies: Statistics Canada (WP 11) Alice Born Statistics UNECE Workshop on Statistical Metadata.
Implementing ESS standards for reference metadata and quality reporting at Istat Work Session on Statistical Metadata Topic (i): Metadata standards and.
Distributed Access to Data Resources: Metadata Experiences from the NESSTAR Project Simon Musgrave Data Archive, University of Essex.
Research Data Management Services Katherine McNeill Social Sciences Librarians Boot Camp June 1, 2012.
Chapter 13: Developing and Implementing Effective Accounting Information Systems
DDI-RDF Discovery Vocabulary A Metadata Vocabulary for Documenting Research and Survey Data Linked Data on the Web (LDOW 2013) Thomas Bosch.
DDI-RDF Leveraging the DDI Model for the Linked Data Web.
February 17, 1999Open Forum on Metadata Registries 1 Census Corporate Statistical Metadata Registry By Martin V. Appel Daniel W. Gillman Samuel N. Highsmith,
U.S. Department of Agriculture eGovernment Program eGovernment Working Group Meeting February 11, 2004.
1 Strategic Plan for Digital Archives Programme DAP PROJECT SCOPE OVERVIEW STATUS.
Background Cornell Institute for Social and Economic Research (CISER): Data and Computing Support for Social and Economic Researchers at Cornell University.
The Changing Data Environment Train the Trainers Montreal February 2010 Chuck Humphrey University of Alberta.
Research Group – ResRA RRA Network Review Workshop - 9 May 2013.
Secure Epidemiology Research Platform (SERPent) Kick Start Meeting - April 15 th, 2010 Pascal Heus
Any data..! Any where..! Any time..! Linking Process and Content in a Distributed Spatial Production System Pierre Lafond HydraSpace Solutions Inc
12/6/2005DINO – Dec 2005 CANDDI DINO OCUL Data Group Ryerson – Dec
Regional Seminar on Promotion and Utilization of Census Results and on the Revision on the United Nations Principles and Recommendations for Population.
The Data Documentation Initiative: more discussion Chuck Humphrey University of Alberta Atlantic DLI Workshop 2005, Acadia University.
SDMX IT Tools Introduction
United Nations Oslo City Group on Energy Statistics OG7, Helsinki, Finland October 2012 ESCM Chapter 8: Data Quality and Meta Data 1.
Generic Statistical Information Model (GSIM) Jenny Linnerud
CANDDI Bo Wandschneider CAPDU/DLI April 13, 2005 Queen’s University.
1 Dataset Builder Tool Canadian Research Data Centre Network Statistics Canada NADDI 2014.
United Nations Economic Commission for Europe Statistical Division GSBPM and Other Standards Steven Vale UNECE
Communities of Practice & L ESSONS L EARNED Budget, Finance, and Award Management Large Facilities Office May 2016 Large Facilities Workshop 2016 S. Dillon.
Navigating Your Way Through the EFT, Nesstar and Beyond 20/20 (WDS)
RDA US Science workshop Arlington VA, Aug 2014 Cees de Laat with many slides from Ed Seidel/Rob Pennington.
Summit 2017 Breakout Group 2: Data Management (DM)
CFI John R Evans Leaders Fund Digital Data Management
Data stewardship life cycle
DDI for the Uninitiated
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
The Open Data Environment
SYSTEMS ANALYSIS & DESIGN
HingX Project Overview
Systems Construction and Implementation
System Construction and Implementation
Systems Construction and Implementation
Item 2.2 of the Agenda Remote access to confidential data for researchers: possible actions under the 7th Framework Programme Pascal JACQUES Unit B 5 15.
Exchanging Data Management Plans with DDI
Capitalising on Metadata
Data Liberation Initiative (DLI)
(Project) SIGN OFF PROCESS MONTH DAY, YEAR
Introducing the Data Documentation Initiative
Presentation transcript:

Chuck Humphrey Data Library Co-ordinator University of Alberta May 16, Capitalising on Metadata Tool development plans IASSIST 2007

2 Outline Three problems needing metadata solutions Three metadata projects Two meetings focused on metadata Two phases to implement solutions The Research Data Centre (RDC) Life Cycle

3 Three Metadata Problems The Academic Directors of the RDC Network sought to solve three problems: 1. How to solve the unusable state of the data documentation in the RDC’s, which can be best characterised as huge PDF files repeated for every cycle of a survey. The scale of the data documentation makes a print-based format impractical. 2. How to capture and make available user knowledge about the data acquired from working with the data. How to build a user knowledge database about the data. 3. How to discover related topics across cycles within or across surveys. How to discover comparable variables across surveys more easily. Proposed to the Canadian Foundation for Innovation (CFI) in December 2005

4 Project 1: Creating Metadata Create DDI-compliant metadata for the existing 50 survey titles in the RDCs now and another 50 titles anticipated over the next four years. This project involves separate metadata production and software development tasks. The metadata will be generated through the production task; tools supporting the migration of DDI 1/2 to DDI 3 will be a contract software development task.

5 Project 2: Capturing User Feedback The concept of user feedback fits within the life cycle model of research and needs to be described in the metadata. This project will involve two tasks. The first will be to specify the elements of user feedback and building them into DDI 3. The second will be to develop simple tools that are not intrusive for capturing and tagging user feedback.

6 Project 3: Discovering Comparability Enriched metadata will facilitate the discovery of related variables across cycles within or across surveys. This project will develop metadata analysis tools that allow researchers to maintain their focus on the conceptual nature of their research questions while letting the tools explore the operational details of working with the data. Funded by CFI and announced on November 27, 2006

7 Nineteen participants representing organizations interested or involved in metadata tools development met in Montreal on January 25-26,  Eight participants from outside Canada, including Germany, Norway, Switzerland, UK and US.  Five from the RDC Network.  Six from elsewhere in Canada, including CISTI, HRSDC, Institut de la statistique Québec, Social Sciences Network Data Services (Western University) and Statistics Canada. Montreal Metadata Meeting

8 Maximizing Our Returns from Social Science Data: the case for Data Documentation Initiative 3.0  A report prepared by Raymond Currie and Chuck Humphrey following the meeting. Circulated to participants in February  Provides responses to the questions: Why should we invest in metadata? How might we collaborate in developing metadata tools?  Gives a summary of discussions about the principles underlying such collaborations. Montreal Metadata Meeting

9 Three projects organized in two phases:  After reviewing the details of the three metadata projects, they were organized into two general development phases. Phase one will build foundational architecture for metadata tool development and tools to facilitate the migration from DDI 1.0/2.1 to DDI 3.0. Phase Two:  The specifications for phase two will be developed in a subsequent consultation occurring approximately six months after the March 2007 Toronto meeting. The tools in this phase will support mining the metadata for concepts and exploring the metadata for comparable data and variables. Metadata RFP Consultation

10 Project Application Project Approval Project Creation Access to Data Generate Analysis Files Output Disclosure Analysis Research Commun- icatons Stages in the life cycle The RDC Research Life Cycle

11 Application processes Managing RDC Projects Contract process Project account process Master files Managing Data Work/analysis files File system backups Disclosure process Managing Research Outputs Publications Knowledge management Three RDC Processes Requiring Information Management

12 Each process generates information:  Managing RDC projects: receiving project applications, identifying researchers, conducting peer-reviews, conducting security approval, signing contracts, conducting orientations, assigning project numbers, enabling security access and granting LAN accounts and file space.  Managing data: handling master files and their documentation, creating working or analysis files from master files (including syntax files) and backing up the file workspace for projects.  Managing research outputs: conducting disclosure analysis, identifying research communications based on RDC research, capturing research outputs and organizing and communicating newly produced knowledge. Information Needing Management

13 Project Application Project Approval Project Creation Access to Data Generate Analysis Files Output Disclosure Analysis Research Commun- icatons Managing Data Stages The RDC Research Life Cycle

14 Generate DDI metadata to 1.0/2.0 standards in a retrospective conversion project Statistics Canada Master Files Develop tools to convert DDI 1.0/2.0 to DDI 3.0 and incorporate the Questionnaire module Access to master files Analysis through Multiple Working Files Repurpose master files Subset Recode Compute Merge Multiple Versions Metadata in the Data Stages

15 Analysis through Multiple Working Files Access to master files Repurpose master files Multiple Versions Generate DDI metadata for working files using tools that read statistical system and syntax / log files. These metadata files can be used to document the working files, to produce products from the metadata (e.g., a codebook listing or Powerpoint slides), to be linked to research communications and to recreate the working file from its master data file. A validation tool will compare a working file against a virtually generated working file. Metadata in the Data Stages

16 Analysis through Working Files Output from Working Files Working Data Files Metadata Disclosure Analysis Tables Reports Research Communications Website Journals Conferences Repository Supporting metadata Metadata in the Data Stages

17 The RFP describing the tools to mine and to identify comparable data from the master metadata files will be developed in approximately six months. This phase of software development will exploit the architecture built during phase one as well as the DDI metadata for the master microdata files. Phase Two of the Metadata Project

18 Next Steps Call for bids on the first phase. Staff the Metadata Project Manager position. Meet to draft the RFP for the second phase. Call for bids on the second phase.