Looking into the future… Providing Social Science Data Services Jim Jacobs.

First principles
- Metadata are data about data -- information about information.
- It’s all about having complete, accurate, re-usable metadata.
- Software to process the metadata is secondary.
- We should be able to have metadata today that we know will be usable in unforeseeable computing environments (operating systems, software, hardware).

First principles
Metadata should be…
- Comprehensive
- Complete
- Uncompromised
- Consistent
- Flexible
- Sharable
- Usable and re-usable
- Preservable
- Parseable by computer
- Documented
- Non-proprietary

How XML fits in…
- XML is designed to be parseable with generic tools.
- XML can encode meaning and can be self-documenting.
- XML is non-proprietary, open, flexible.

How XML fits in…
XML is designed to make it easy to find and use just the elements you need from a large document: “cherry picking”.

How XML fits in…
Great Power Wars, Levy, Jack S. National Science Foundation. SES <distrbtr abbr="ICPSR" affiliation="Institute for Social Research, University of Michigan" URI="http;// Consortium for Political and Social Research Levy, Jack S. GREAT POWER WARS, [Computer file]. New Brunswick, NJ and Houston, TX: Jack S. Levy and T. Clifton Morgan … Great Power Wars,
You can cherry-pick just what you need from a large XML document…
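As a rough illustration of cherry-picking, the sketch below pulls three values out of a small XML document with Python's standard `xml.etree.ElementTree` parser. The fragment is a simplified stand-in written for this example, not the file shown on the slide: it borrows DDI Codebook element names (`titl`, `AuthEnty`, `fundAg`, `distrbtr`) and omits namespaces and most required structure. The function name `cherry_pick` is hypothetical.

```python
import xml.etree.ElementTree as ET

# Simplified, hand-written fragment for illustration only; a real DDI
# codebook is far larger and normally declares an XML namespace.
DDI_FRAGMENT = """\
<codeBook>
  <stdyDscr>
    <citation>
      <titlStmt><titl>Great Power Wars</titl></titlStmt>
      <rspStmt><AuthEnty>Levy, Jack S.</AuthEnty></rspStmt>
      <prodStmt><fundAg>National Science Foundation</fundAg></prodStmt>
      <distStmt>
        <distrbtr abbr="ICPSR">Inter-university Consortium for Political and Social Research</distrbtr>
      </distStmt>
    </citation>
  </stdyDscr>
</codeBook>
"""

def cherry_pick(xml_text):
    """Return only the title, author, and distributor abbreviation."""
    root = ET.fromstring(xml_text)
    distributor = root.find(".//distrbtr")  # first matching element, or None
    return {
        "title": root.findtext(".//titl"),
        "author": root.findtext(".//AuthEnty"),
        "distributor": distributor.get("abbr") if distributor is not None else None,
    }

picked = cherry_pick(DDI_FRAGMENT)
print(picked)
```

The rest of the document is never touched, which is the point: a generic parser and a path expression are enough, with no software specific to the metadata format.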

From legacies to the future
Legacy formats: SAS, SPSS, OSIRIS, PDF, paper, data dictionary, etc.
DDI
- HTML
- PDF
- Any stat package
- Nesstar, SDA, Dataverse
- Library OPAC
- Google
- OAI, METS, etc.
- RSS, RDF
- GIS
- DDI 3, 4…

From many contributors to many uses
Contributors: researcher; data collector; analyst; data producer, distributor; data archivist; data librarian; users of statistics; government agency
DDI
- The web
- Live documents
- Databases
- Publications
- Data archives
- Data libraries
- Institutional repositories
- Secondary analysis
- New research
- New knowledge

OAIS Functional Model
Ingest, Archival Storage, Access

OAIS Information Model
Information Packages: SIP (Submission Information Package), AIP (Archival Information Package), DIP (Dissemination Information Package)

Data stewardship life cycle
Data Repurposing, Data Production, Data Repository, Data Dissemination, Data Discovery

DDI Production
Data Repurposing, Data Production, Data Repository, Data Dissemination, Data Discovery

DDI Use
Data Repurposing, Data Production, Data Repository, Data Dissemination, Data Discovery

DDI will enable transformation
- New kinds of data discovery (beyond “indexing”)
- Metadata as a primary resource (metadata as data)

Metadata for data discovery
- ICPSR already uses DDI metadata to create its Variables database.
- Nesstar and Dataverse software use metadata to produce searchable indexes of data repositories.
- In the future we should see the harvesting of DDI from many repositories to create indexes across collections. (oclc.org/oaister/)
- In the future we’ll see data discovery by concept and methodology and geography and time period, not just keyword.
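One way to picture discovery across collections is a small faceted index built from harvested records. The sketch below is only an illustration: the records, repository names, and field names (`concept`, `geography`, `years`) are invented for the example and are neither DDI element names nor real holdings, and the harvesting step itself is left out.

```python
from collections import defaultdict

# Hypothetical harvested records; repository names, ids, and fields are
# invented for illustration and do not describe real holdings.
RECORDS = [
    {"repo": "archive-a", "id": "a1", "concept": "employment",
     "geography": "Denmark", "years": (1990, 1995)},
    {"repo": "archive-b", "id": "b7", "concept": "employment",
     "geography": "Canada", "years": (1985, 1992)},
    {"repo": "archive-b", "id": "b9", "concept": "health",
     "geography": "Denmark", "years": (2000, 2004)},
]

def build_index(records, facets=("concept", "geography")):
    """Map each (facet, value) pair to the set of (repo, id) keys carrying it."""
    index = defaultdict(set)
    for rec in records:
        for facet in facets:
            index[(facet, rec[facet])].add((rec["repo"], rec["id"]))
    return index

def search(records, index, concept=None, geography=None, year=None):
    """Intersect facet matches; `year` must fall inside a record's time span."""
    hits = {(r["repo"], r["id"]) for r in records}
    if concept is not None:
        hits &= index.get(("concept", concept), set())
    if geography is not None:
        hits &= index.get(("geography", geography), set())
    if year is not None:
        hits &= {(r["repo"], r["id"]) for r in records
                 if r["years"][0] <= year <= r["years"][1]}
    return sorted(hits)

INDEX = build_index(RECORDS)
print(search(RECORDS, INDEX, concept="employment", year=1991))
```

A query here combines concept and time period across two repositories at once, which a keyword index over a single collection cannot do.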

Metadata as data
By structuring metadata according to a methodology (the lifecycle-of-data approach), we create metadata that we can treat as data. We can analyze metadata the way we would analyze any data file. As more metadata of this kind are created, we are accumulating a body of information that makes it possible to study trends across time and geography.

Metadata as data
"The technical documentation for the Army's Korean conflict casualty electronic records file has casualty codes that were never used in the data files. The presence of codes in the metadata for injury by lethal gas and by radiation exposure suggests that Army personnel who designed this record-keeping system expected the possible use of those as weapons. Examination of the data alone would have missed this suggestion. The codes for 'place of casualty' included, in addition to South Korea Sector and North Korea Sector, the Indo-China Sector, Tibet Sector, Mongolia Sector, Honan Sector (sic), Manchuria Sector, North Japan Sector, South Japan Sector, South China Sector, and Formosa Sector."
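The comparison described here, codes defined in the documentation but never used in the data, amounts to a set difference between the codebook and the observed values. The sketch below assumes a codebook held as a plain dictionary. The numeric codes, two of the labels, and the data values are invented for illustration; only the lethal gas and radiation exposure labels come from the text above.

```python
# Hypothetical codebook and data. Codes and values are invented; only the
# two labels marked "from the text" appear in the source.
CODEBOOK = {
    "1": "killed in action",              # invented label
    "2": "wounded in action",             # invented label
    "3": "injury by lethal gas",          # from the text
    "4": "injury by radiation exposure",  # from the text
}
OBSERVED = ["1", "2", "2", "1", "2"]  # invented data values

def unused_codes(codebook, observed_values):
    """Codes the documentation defines but the data never uses."""
    seen = set(observed_values)
    return {code: label for code, label in codebook.items() if code not in seen}

def undocumented_values(codebook, observed_values):
    """Values present in the data that the documentation does not define."""
    return sorted(set(observed_values) - set(codebook))

print(unused_codes(CODEBOOK, OBSERVED))
print(undocumented_values(CODEBOOK, OBSERVED))
```

The first function can only be written if the metadata are machine-readable; the data file alone contains no trace of the unused categories.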

Metadata as data
A researcher at the Danish Data Archive is doing a qualitative analysis of the questionnaires used in seven surveys about ethnic minorities in Danish society, "with the purpose of showing how surveys... mirror and project societal understandings of the subjects under investigation."

Metadata as data
Wendy Thomas of the Minnesota Population Center examined U.S. Census metadata from 1790 through 2000 and compared the changing concepts of race and ethnicity as embodied in the categories used in Census Bureau questions over time. Those concepts are documented only in the metadata, not in the Census data files themselves.