The OLAC Metadata Set Gary Simons Workshop on The Digitization of Language Data: The Need for Standards 21-24 June 2001.

Slides:



Advertisements
Similar presentations
The OLAC Metadata Set and Controlled Vocabularies Steven Bird Gary Simons Penn SIL.
Advertisements

Ali Alshowaish. dc.coverage element articulates limitations in the scope of the resource, typically along the following lines: geographical, temporal,
Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
Dublin Core for Digital Video: Overview of the ViDe Application Profile.
THE DONOR PROJECT Titia van der Werf-Davelaar. Project Financed by: Innovation of Scientific Information Provision (IWI) Duration: –phase 1: 1 may 1998.
T. Baker / 23 Sep 2000 Dublin Core Qualifiers and A Grammar for Dublin Core Thomas Baker DC-8, National Library of Canada, Ottawa 4 October 2000.
OLAC State of the Archives A summary of implementation practices during the first year.
OLAC Metadata Steven Bird University of Melbourne / University of Pennsylvania OLAC Workshop 10 December 2002.
Accessing Distributed Resources Information: An OLAC perspective Steven Bird Gary Simons Chu-Ren Huang Melbourne SIL Academia Sinica ENABLER/ELSNET Workshop.
OLAC: The Open Language Archives Community Steven Bird Gary Simons Penn SIL.
White Paper on Establishing an Infrastructure for Open Language Archiving Steven Bird and Gary Simons.
The Open Language Archives Community: Building a worldwide library of digital language resources Gary Simons, SIL International LSA Tutorial on Archiving.
An Overview of OLAC: The Open Language Archives Community Gary Simons and Steven Bird Workshop on The Digitization of Language Data: The Need for Standards.
IRCS Workshop on Open Language Archives, 12/02 1 Revised OLAC Vocabulary for Language Technology.
Helen Dry & Anthony Aristar LINGUIST List: LREC Symposium: The Open Language Archives Community 29 May 2002http://linguistlist.org.
The Seven Pillars of Open Language Archiving: Introducing the OLAC Vision Gary Simons SIL International LREC Symposium: The Open Language Archives Community.
Helen Dry & Anthony Aristar LINGUIST List: LSA Symposium: The Open Language Archives Community 4 January 2002http://linguistlist.org.
The Seven Pillars of Open Language Archiving: Introducing the OLAC Vision Gary Simons SIL International LSA Symposium: The Open Language Archives Community.
Treasury Board of Canada Secretariat Secrétariat du Conseil du Trésor du Canada IM Standards for E-government The Canadian Experience Managing Information.
OLAC: Open Language Archives Community OLAC : The Open Language Archives Community Gary F. Simons SIL International and Graduate Institute of Applied Linguistics.
Metadata vocabularies and ontologies Dr. Manjula Patel Technical Research and Development
UKOLN, University of Bath
February Harvesting RDF metadata Building digital library portals with harvested metadata workshop EU-DL All Projects concertation meeting DELOS.
Metadata 8/7/2012 Katie Moss Digital Metadata Technician, Digital Library Services
Developing a Metadata Exchange Format for Mathematical Literature David Ruddy Project Euclid Cornell University Library DML 2010 Paris 7 July 2010.
Digital Media Technology Week 4: The TEI Header Peter Verhaar.
Content and Systems Week 3. Today’s goals Obtaining, describing, indexing content –XML –Metadata Preparing for the installation of Dspace –Computers available.
Natalia Wehler: Dublin Core Requirements on Metadata  multiple softwares to use metadata  management of changing standards  needs to be functional,
Reusable!? Or why DDI 3.0 contains a recycling bin.
Kristin Eberle Monica Hampton Carmen Velasquez Kristin Eberle Monica Hampton Carmen Velasquez Knowledge Management.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
Resource Description Framework ( RDF ) Xinxia An.
Dublin Core as a tool for interoperability Common presentation of data from archives, libraries and museums DC October 2006 Leif Andresen Danish.
Digital Encoding What’s behind E-text Resources?.
DUBLIN CORE: BEYOND THE LIBRARY David Hirsch LIS Knowledge Organization Dr. Selenay Aytac Spring 2013.
Guest Lecture LIS 656, Spring 2011 Kathryn Lybarger.
UKOLUG - July Metadata for the Web RDF and the Dublin Core Andy Powell UKOLN, University of Bath UKOLN.
ISO as the metadata standard for Statistics South Africa
Publishing Digital Content to a LOR Publishing Digital Content to a LOR 1.
1 © Netskills Quality Internet Training, University of Newcastle Metadata Explained © Netskills, Quality Internet Training.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
Metadata Xiangming Mu. What is metadata? What is metadata? (cont’) Data about data –Any data aids in the identification, description and location of.
1 CS/INFO 430 Information Retrieval Lecture 20 Metadata 2.
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA 1 Metadata Helen Aristar Dry Eastern Michigan University LINGUIST List.
Content and Computer Platforms Week 3. Today’s goals Obtaining, describing, indexing content –XML –Metadata Preparing for the installation of Dspace –Computers.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
LIS654 lecture 5 DC metadata and omeka tables Thomas Krichel
Basics of Information Retrieval W Arms Digital Libraries 1999 Manuscript as background reading.
Metadata and Documentation Iain Wallace Performing Arts Data Service.
Metadata Bridget Jones Information Architecture I February 23, 2009.
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA 1 Comparability of language data and analysis Using an ontology for linguistics Scott Farrar, U.
Aug 2-5, 2002 EMELD Workshop Overview & Update Helen Aristar Dry The LINGUIST List & Eastern Michigan University EMELD Workshop on The Digitization.
A Quick Introduction to Metadata Michael Day UKOLN: The UK Office for Library and Information Networking, University of Bath
1 Dublin Core & DCMI – an introduction Some slides are from DCMI Training Resources at:
Content and Systems Week 3. Today’s goals Obtaining, describing, indexing content –XML –Metadata Preparing for the installation of Dspace –Computers available.
21 June 2001Managing Information Resources for e-Government1 The Dublin Core Makx Dekkers, Managing Director, Dublin Core Metadata Initiative
A centre of expertise in digital information managementwww.ukoln.ac.uk DCMI Affiliates: Implications for Institutions Rosemary Russell UKOLN University.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
1 Dublin Core and its implementation in RDF/XML Paul Miller Interoperability Focus UK Office for Library & Information Networking (UKOLN)
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Application Profiles Application profiles -- are schemas which consist of data elements drawn from one or more namespaces, combined together by implementers,
1 Educational Metadata Paul Miller Interoperability Focus UKOLN U KOLN is funded by Resource: the Council for.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
Metadata & Repositories Jackie Knowles RSP Support Officer.
Dublin Core Basics Workshop Lisa Gonzalez KB/LM Librarian.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Session 3 Metadata & Workflow
prepared by Dr. Ammar Yakan
Metadata Standards - Types
Some Options for Non-MARC Descriptive Metadata
Presentation transcript:

The OLAC Metadata Set Gary Simons Workshop on The Digitization of Language Data: The Need for Standards June 2001

What is metadata? Structured data about data" Descriptive information about a resource whether it be physical or electronic Content designed for resource discovery Format designed for automated searching

An OLAC metadata description Derbyshire, Desmond C. Topic continuity and OVS order in Hixkaryana In Joel Sherzer and Greg Urban (eds.), Native South American discourse, Berlin: Mouton. Word order Topic Typology

Foundational design decisions We need a low overhead metadata set. N.B. The Open Archives Initiative support for multiple metadata formats allows subcom- munities to develop richer metadata sets. We should build on the Dublin Core metadata set. We should extend DC by using the qualifi- cation mechanisms recognized by DC.

The XML implementation All elements are optional and repeatable Use attributes for DC qualifications Refinements: … Encoded values: Language of element content: Die Bremer Stadtmusikanten Refinements with encoding schemes go in element name:

The fifteen Dublin Core elements Contributor Coverage Creator Date Description Format Identifier Language Publisher Relation Rights Source Subject Title Type

Additional elements for DATA Subject.language A language the resource is about Use for a language the resource is in Type.data The nature of the content from a linguistic point of view E.g. transcription, annotation, description, lexicon

Additional elements for TOOLS For matching DATA with TOOLS Format.encoding Format.markup For describing TOOLS Format.cpu Format.os Format.sourcecode Type.functionality

Controlled vocabularies Closed enumerations of allowed values for refine, code, and lang attributes To improve success of resource discovery Recall – % of relevant resources that are found Precision – % of found resources that are relevant Use element content as an escape hatch When the right term is not in controlled vocabulary When the term needs refinement or explanation

Elements with DC vocabularies ElementRefine attributeCode attribute DateDC-Qualifiers RelationDC-Qualifiers TypeDC-Type

OLAC-Language Used for Lang attribute on all elements Code attribute on Terms in the vocabulary follow RFC 3066 Unambiguous codes from ISO 639: en, fr, eng All codes from Ethnologue: x-sil-HIX Ancient languages at LINGUIST : x-LL-???

Other OLAC vocabularies ElementRefine attributeCode attribute Contributor, CreatorOLAC-Role FormatOLAC-Format Format.cpuOLAC-CPU Format.encodingOLAC-Encoding Format.osOLAC-OS Format.sourcecodeOLAC-Sourcecode RightsOLAC-Rights Type.dataOLAC-Data Type.functionalityOLAC-Functionality