White Paper on Establishing an Infrastructure for Open Language Archiving Steven Bird and Gary Simons.

Slides:



Advertisements
Similar presentations
Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
Advertisements

OAI from 50,000 Feet OAI develops and promotes interoperability solutions that aim to facilitate the efficient dissemination of content. Begun in 1999.
DC8 Registries Breakout. Goals of the session Discuss and clarify : Requirements for registry Framework for policy Relate issues raised to EOR prototype.
Registry breakout group DC-8, National Library of Canada 5 October 2000.
DC2001, Tokyo DCMI Registry : Background and demonstration DC2001 Tokyo October 2001 Rachel Heery, UKOLN, University of Bath Harry Wagner, OCLC
OLAC Metadata Steven Bird University of Melbourne / University of Pennsylvania OLAC Workshop 10 December 2002.
Accessing Distributed Resources Information: An OLAC perspective Steven Bird Gary Simons Chu-Ren Huang Melbourne SIL Academia Sinica ENABLER/ELSNET Workshop.
The Seven Pillars of Open Language Archiving: A Vision Statement Gary Simons and Steven Bird Workshop on Web-based Language Documentation and Description.
Outreach Jeff Good UC Berkeley. OLAC's Needs Maximal involvement from the whole community –The more data providers involved the more useful the services.
The Open Language Archives Community: Building a worldwide library of digital language resources Gary Simons, SIL International LSA Tutorial on Archiving.
OLAC Process and OLAC Protocol: A Guided Tour Gary F. Simons SIL International ___________________________ OLAC Workshop 10 Dec 2002, Philadelphia.
An Overview of OLAC: The Open Language Archives Community Gary Simons and Steven Bird Workshop on The Digitization of Language Data: The Need for Standards.
The OLAC Metadata Set Gary Simons Workshop on The Digitization of Language Data: The Need for Standards June 2001.
IRCS Workshop on Open Language Archives, 12/02 1 Revised OLAC Vocabulary for Language Technology.
Getting Involved in OLAC Steven Bird University of Pennsylvania LREC Symposium: The Open Language Archives Community 29 May 2002.
Getting Involved in OLAC Steven Bird University of Pennsylvania LSA Symposium: The Open Language Archives Community 4 January 2002.
Helen Dry & Anthony Aristar LINGUIST List: LREC Symposium: The Open Language Archives Community 29 May 2002http://linguistlist.org.
The Seven Pillars of Open Language Archiving: Introducing the OLAC Vision Gary Simons SIL International LREC Symposium: The Open Language Archives Community.
Helen Dry & Anthony Aristar LINGUIST List: LSA Symposium: The Open Language Archives Community 4 January 2002http://linguistlist.org.
The Seven Pillars of Open Language Archiving: Introducing the OLAC Vision Gary Simons SIL International LSA Symposium: The Open Language Archives Community.
The Dryad Data Repository Ryan Scherle 1, Hilmar Lapp 1, Amol Bapat 2, Sarah Carrier 2, Jane Greenberg 2, Peggy Schaeffer 1, Todd Vision 1,3, Hollie White.
A centre of expertise in digital information management IMS Digital Repositories Interoperability Andy Powell UKOLN,
February Harvesting RDF metadata Building digital library portals with harvested metadata workshop EU-DL All Projects concertation meeting DELOS.
Metadata and the UK Data Archive CESSDA Expert Seminar Odense September 2008 Margaret Ward Lenin Ageer.
Collections and services in the information environment JISC Collection/Service Description Workshop, London, 11 July 2002 Pete Johnston UKOLN, University.
UCLA Digital Library UC Digital Library Forum August 5, 2002 UCLA Digital Library Presenter: Curtis Fornadley Senior Programmer/Analyst.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
Introduction to Implementing an Institutional Repository Delivered to Technical Services Staff Dr. John Archer Library University of Regina September 21,
OAI Standards for Sheet Music Meeting March 28-29, 2002 Basic OAI Principals How They Apply to Sheet Music Presenter: Curtis Fornadley, Senior Programmer/Analyst.
The Open Archives Initiative Simeon Warner (Cornell University) Open Archives seminar “Facilitating Free and Efficient Scientific.
The Open Archives Initiative Simeon Warner Cornell University, Ithaca, NY, USA CREPUQ 2002, Montréal, Canada 14:00, 24 October 2002.
Introduction to ebXML Mike Rawlins ebXML Requirements Team Project Leader.
Educause October 29, 2001 A GEM of a Resource: The Gateway to Educational Materials Copyright Nancy Virgil Morgan, This work is the intellectual.
By Carrie Moran. To examine the Metadata Object Description Schema (MODS) metadata scheme to determine its utility based on structure, interoperability.
Digital Library Architecture and Technology
Malaysian Grid for Learning October DC 2004, Shanghai, China. © 2004 MIMOS Berhad. All Rights Reserved Metadata Management System DC2004: International.
Resource Discovery (metadata and searching) Working Group Report.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Electronic Theses at Rhodes University presented by Irene Vermaak Rhodes University Library National ETD Project CHELSA Stakeholder Workshop 5 November.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA 1 Metadata Helen Aristar Dry Eastern Michigan University LINGUIST List.
LIS 654 BUILDING DIGITAL LIBRARIES FALL 2011 NOVEMBER 03, 2011 The OAI-PMH Harvester Plugin for The Omeka Content Management System JAMES R. GRIFFIN III.
1 NDLTD Welcome and Introduction ETD 2011: 14 th Int. Symp. on ETDs Cape Town, South Africa Edward A. Fox Executive Director, NDLTD,
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
It’s all semantics! The premises and promises of the semantic web. Tony Ross Centre for Digital Library Research, University of Strathclyde
The NCAR Community Data Portal (CDP) Experiences with OAI metadata record federation presented by Michael Burek (NCAR/SCD/VETS) Acknowledgments:
1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
Integrating Access to Digital Content Sarah Shreeves University of Illinois at Urbana-Champaign Visual Resources Association 23 rd Annual Conference Miami.
SPASE and the VxOs Jim Thieman Todd King Aaron Roberts.
Metadata : an overview XML and Educational Metadata, SBU, London, 10 July 2001 Pete Johnston UKOLN, University of Bath Bath, BA2 7AY UKOLN is supported.
Metadata and OAI DLESE OAI Workshop April 29-30, 2002 Katy Ginger Presentation available at:
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
1 Video Message: Welcome ETD 2015: 18 th Int’l Symposium on ETDs New Delhi, India Edward A. Fox Executive Director, Chairman of the Board NDLTD,
The Open Archives Initiative Marshall Breeding Director for Innovative Technologies and Research Vanderbilt University
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Metadata-based Discovery: Experience in Crystallography UKOLN is supported by: Monica Duke UKOLN, University of Bath, UK A centre of.
How to Implement an Institutional Repository: Part IV A NASIG 2006 Pre-Conference May 4, 2006 Policy Issues.
ISWG / SIF / GEOSS OOS - August, 2008 GEOSS Interoperability Steven F. Browdy (ISWG, SIF, SCC)
IPDA Architecture Project International Planetary Data Alliance IPDA Architecture Project Report.
International Planetary Data Alliance Registry Project Update September 16, 2011.
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
MICHAEL and the European Digital Library: promoting teaching, learning and research The MICHAEL Project is funded under the European Commission eTEN Programme.
Introduction to Implementing an Institutional Repository
The JISC IE Metadata Schema Registry
Session 2: Metadata and Catalogues
Open Archive Initiative
Some Options for Non-MARC Descriptive Metadata
Presentation transcript:

White Paper on Establishing an Infrastructure for Open Language Archiving Steven Bird and Gary Simons

The Open Archives Initiative Began with e-prints Now covers digital repositories of scholarly materials, regardless of type Each participating archive implements a repository: Item: identifier + metadata Specifies entry point WP:1

OAI Repositories and Archives WP:1

Built on Two Standards The OAI shared metadata set: Dublin Core core set of 15 metadata elements represent a broad, interdisciplinary consensus widely useful for resource discovery WP:1 OAI Metadata Harvesting Protocol software services can query a repository retrieve item identifiers and metadata records

OAI Service and Data Providers WP:1

Definition of the OAI Community The OAI is a community of archives which: supply Dublin Core metadata support the OAI Metadata Harvesting Protocol register with the OAI Any compliant repository can register No other notion of community membership WP:2

The OAI Community WP:2

OAI Supports Specialist Communities The community can define metadata formats other than Dublin Core Specific to a particular domain DPs serve the new format SPs harvest the new format Result: an OAI subcommunity WP:2

What does OAI provide us? Data Providers Service Provider Community- specific metadata WP:2

Proposed OLAC Metadata Set Metadata is what makes OLAC a distinct subcommunity of the OAI Through metadata, our community describes the resources which are fundamental to the enterprise of language documentation Minimally extend Dublin Core to express what is fundamental about: Open Language Archiving But how? WP:3

Back to the Requirements 4. Identify the languages that archived items relate to 5. Identify how open or restricted an item is 6. Identify format and encoding details for digital resources 7. Identify other resources required for using an item 8. Match data resources with appropriate software tools WP:3.2

OLAC metadata elements Subject.language Rights.openness Format.openness Format.encoding Format.markup Type.data Relation.requires Rights.openness Format.language Type.functionality Type.os Type.osversion Type.cpu WP:3.2-3

Controlled vocabulary servers Many elements have a restricted range of values: Rights.openness: open, published, restricted, unknown Subject.language: Ethnologue codes Controlled vocabulary server: Network-accessible service Maintains and documents a vocabulary SIL has agreed to be a C.V.S. for language id WP:3.5

Subcommunities with richer metadata standards Just as OLAC is a subcommunity of the OAI, there are other subcommunities in the scope of OLAC Language data centers (LDC, ELRA, GSK) ISLE Meta Data Initiative – detailed metadata for describing recorded speech events These subcommunities would support DC and OLAC metadata, plus their own set Specialized service provider Focussed searching based on richer metadata WP:3.6

Founding the Open Language Archives Community Standards OLAC definition OLAC Gateway Primary OLAC service provider Peer review Defining recommended best practice WP:4

Standards The framework that allows the core infrastructure to function: Gatewaygoverned by a protocol for harvest- ing metadata from participating archives Metadatagoverned by an XML schema that ensures uniformity across all archives Reviewgoverned by a process that promotes draft to candidate and then to best practice WP:4.1

OLAC Definition Definition: The Open Language Archives Community (OLAC) is the community of language archives and associated services which implement the OLAC standards. Purpose: to support the language documentation community, by fostering the sharing of language resources. Advisory council: each OLAC archive will be asked to select a representative to serve on an advisory council. WP:4.2

OLAC Gateway: This site will host information for the community of people: OLAC standards documents index of service providers collection of best practice recommendations …plus information for the community of machines: OLAC metadata schema registry of data providers controlled vocabulary servers (local or remote) WP:4.3

Primary OLAC Service Provider Qualifications: foremost electronic network of linguists, with over 13,000 members worldwide a decade of experience worldwide mirrors Roles: Provides a complete union catalog Institutes an informal, open, peer-review process WP:4.4

Peer Review How can you judge the quality of a digital resource? scale, quality, openness of the resource / support information may be misleading, outdated, erroneous access delayed/blocked by unadvertised restrictions problems with data, tools, formats, best practices An informal, open, peer review process Users of a data or service provider can report their experience using a form on the OLAC Gateway Review forwarded to the provider, post a response Visitors to the Gateway could peruse them WP:4.5

Defining recommended best practice Anyone could submit an RFC, posted on Gateway RFC: existing practice; experience; case for wider adoption RFCs would be reviewed by the community and the advisory council Accepted RFCs promoted to the status of Recommended Best Practice Not standards, but recommendations To limit the needless incompatibilities of format Encourage genuine innovation WP:4.6

Next steps: This week Working group discussions, leading to revised requirements Working group discussions, leading to a revised white paper Identify alpha test group Endorsement and announcement WP:5