Non-MARC Cataloging Standards Overview: TEI & EAD, MODS, METS, XML- based MARC Eric Childress OCLC Eric Childress OCLC February 10, 2003 OCLC.

Slides:



Advertisements
Similar presentations
Making and Moving Metadata: Two Library of Congress Initiatives Sally McCallum NDMSO, Library of Congress NISO/BISG Forum - June 22, 2012.
Advertisements

Interoperability: the value of recombinant potential Lorcan Dempsey VP Research and Chief Strategist ARLIS 2004, New York, April 2004.
FEDLINK OCLC Users Group Meeting
METS Awareness Training An Introduction to METS Digital libraries – where are we now? Digitisation technology now well established and well-understood.
Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
Standards showcase: MODS, METS, MARCXML ALA Annual 2006 Rebecca Guenther and Jackie Radebaugh Network Development and MARC Standards Office Library of.
METS: An Introduction Structuring Digital Content.
MODS, METS, and other metadata standards
An Introduction to MODS: The Metadata Object Description Schema Tech Talk By Daniel Gelaw Alemneh October 17, 2007 October 17, 2007.
The Future of Technical Services: Metadata: Standards and New Developments NLA Annual Conference, October, 2005 Reno, Nevada Vicki Toy Smith University.
From EAD to METS An overview and history of METS Rick Beaubien UC Berkeley.
3. Technical and administrative metadata standards Metadata Standards and Applications.
Merrilee Proffitt e(X)literature / Digital Cultures Project April 2003 News from the Digital Library The Metadata Encoding and Transmission Standard; the.
Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in.
© Tefko Saracevic, Rutgers University1 metadata considerations for digital libraries.
A Practical Introduction to XML in Libraries Marty Kurth NYLA October 22, 2004.
Presented by Karen W. Gwynn LS – Metadata University of Alabama Prof. Steven MacCall Spring 2011.
METS: Metadata Encoding and Transmission Standard Richard Gartner Oxford University Library Services
Use of METS in CDL Digital Special Collections Brian Tingle.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
OCLC Online Computer Library Center Two Paths to Interoperable Metadata Jean Godby, Devon Smith, Eric Childress DC-2003 September 29, 2003.
Digital Encoding What’s behind E-text Resources?.
Guest Lecture LIS 656, Spring 2011 Kathryn Lybarger.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
By Carrie Moran. To examine the Metadata Object Description Schema (MODS) metadata scheme to determine its utility based on structure, interoperability.
METS Intro & Overview Mets Opening Day Germany May 7, 2007 Nancy J. Hoebelheinrich Stanford University Libraries.
Addressing Metadata in the MPEG-21 and PDF-A ISO Standards NISO Workshop: Metadata on the Cutting Edge May 2004 William G. LeFurgy U.S. Library of Congress.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
13 Oct DC2004--IFLA New and traditional descriptive formats in the library environment DC2004: IFLA session 13 Oct Rebecca Guenther
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
Metadata Considerations Implementing Administrative and Descriptive Metadata for your digital images 1.
TEXT ENCODING INITIATIVE (TEI) Inf 384C Block II, Module C.
Metadata: Essential Standards for Management of Digital Libraries ALI Digital Library Workshop Linda Cantara, Metadata Librarian Indiana University, Bloomington.
An Introduction to METS Morgan Cundiff Network Development and MARC Standards Office Library of Congress Metadata Encoding and Transmission Standard.
JENN RILEY METADATA LIBRARIAN IU DIGITAL LIBRARY PROGRAM Introduction to Metadata.
Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context.
Extensible Markup Language (XML) Extensible Markup Language (XML) is a simple, very flexible text format derived from SGML (ISO 8879).ISO 8879 XML is a.
Creating an Open Archives Metadata Harvesting Protocol Compliant Repository for the American Memory Online Collections OAI Open Meeting, Washington, DC.
Lifecycle Metadata for Digital Objects September 11, 2002 Major archival and digital library metadata schemes.
Roy Tennant California Digital Library escholarship.cdlib.org/rtennant/presentations/2003cil/ Achieving Together What None Can Do Alone: Interoperability.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
Discovery Metadata for Special Collections Concepts, Considerations, Choices William E. Moen School of Library and Information Sciences Texas Center for.
Metadata Bridget Jones Information Architecture I February 23, 2009.
Cataloging Compound Digital Objects: Using METS for Digitized Sanborn Maps Christopher Cronin Head of Digital Resources Cataloging University of Colorado.
OCLC Online Computer Library Center Preservation Metadata Standards PREMIS & METS Taylor Surface, OCLC.
CEAL 2003 XML for CJK Wooseob Jeong School of Information Studies University of Wisconsin - Milwaukee.
METS: Implementing a metadata standard in the digital library Richard Gartner Oxford University Library Services
Introduction to metadata
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
Integrating Access to Digital Content Sarah Shreeves University of Illinois at Urbana-Champaign Visual Resources Association 23 rd Annual Conference Miami.
5. Applying metadata standards: Application profiles Metadata Standards and Applications Workshop.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Digitization – Basics and Beyond workshop Interoperability of cultural and academic resources New services for digitized collections Muriel Foulonneau.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Metadata Standards in Various Environments Spring January, 2006 Bharat Mehra IS 520 Organization and Representation of Information School of Information.
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
A RCHIVAL COLLECTIONS IN A D IGITAL W ORLD Cheryl Walters Nov. 6, 2008.
A centre of expertise in digital information management UKOLN is supported by: Metadata – what, why and how Ann Chapman.
Digitizing Historical Newspapers South Carolina Digital Newspaper Program's participation with the Library of Congress' Chronicling America: Historic American.
Metadata & Repositories Jackie Knowles RSP Support Officer.
Updated :02 Hong Kong University of Science & Technology Library Workshop on XML-Based Library Applications 1. What is XML?
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Metadata - what works, what doesn’t?
Introduction to Metadata
Oya Y. Rieger Cornell University Library May 2004
Presentation transcript:

Non-MARC Cataloging Standards Overview: TEI & EAD, MODS, METS, XML- based MARC Eric Childress OCLC Eric Childress OCLC February 10, 2003 OCLC

Overview Fundamentals –Metadata and content –Types of metadata –Document mark-up languages & character encoding The Big Picture Metadata formats: –MARC –MODS –METS –MIX –TEI –EAD –ONIX

Fundamentals Metadata and content 33 Metadata linked to content object MARC record with URL for ftp object 22 Metadata separate from content object Book + catalog card Book + MARC record 11 Metadata embedded in content object Title page / CIP HTML header in HTML document 44 Metadata embedded and linked MARC record with URL for HTML document PDF document linked to DC-XML record Aggregation of discrete objects linked to record

Fundamentals Types of metadata Administrative metadata: Data about the metadata (e.g. record number) Descriptive metadata: Description of the object for discovery and retrieval (e.g. Title) Technical metadata: Technical characteristics of the object (e.g. file size)

Fundamentals Markup languages: –Address the structure of a document –Convey instructions to software that will process text to: Index the text for searching To render the text (e.g., for screen display or print) Transform the text (e.g., for a voice synthesizer) for some output device(s) –The markup is generally invisible to end-users Extensible Markup Language (XML): –XML is metalanguage: agencies define their own XML to suit their task by creating Document Type Definitions (DTDs) or XML schema –Data separate from presentation instructions (recorded in a style sheet) –Offers just the right mix of flexibility and structure Character encoding: –Used for communicating text characters in a computing environment –Hundreds of character encoding standards exist –Character conversion is complex and expensive Unicode: –A single, “comprehensive” global encoding standard –Includes characters from scripts of all major modern, most minor, and selected ancient languages Markup languages & Character encoding

The Big Picture Standards in a grid Rich Description Simple Description ItemCollections Dublin Core RSLP OAI set record TEI VRA Core ONIX MARC 8 CSDGM

Library-related standards MARC 21 (ISO 2709) MARC 8: –Library metadata communications format based on ISO 2709 –Strengths: Mature standard Widely adopted by libraries (U.S., Canada, and beyond) Large universe of records available Wide choice of software vendors –Weaknesses (in the present & future): Virtually unused outside of libraries Field and record size limitations Restricted range of scripts supported (MARC 8 repertoire only) Limited ability to convey hierarchical & complex relationships, attributes No ability to embed related objects (e.g., book cover GIF) Cannot be directly processed by widely-used web applications MARC 21 (ISO 2709) Unicode: –MARC 21 with Unicode character encoding –Limited to 16K characters equivalent to MARC 8 repertoire MARC 21 (ISO 2709) MARC

Library-related standards MARC 21 and XML: –Library of Congress’ MARCXML: LC’s schema provides a lossless conversion of MARC 21 (ISO2709) to XML LC’s XML framework positions MARCXML as both an end format and as an intermediate format to non-MARC formats –Stanford University’s Lane Medical School’s XMLMARC: Developed before LC’s MARCXML schema Ignores/simplifies some MARC 21 data UNIMARC and XML: –Ministère de la culture et de la communication (France), Board of Research and Technology BiblioML DTD for converting UNIMARC to XML Conversion tools in development MARC and XML MARC « BiblioML »

Library-related standards Metadata Object Description Schema (MODS) –Essentially MARC 21 recast in an XML-native framework Text-based tags rather than numeric ones, Selected clusters of related MARC 21 attributes condensed into single MODS element –MARC 21 readily converts to MODS, but can’t do a lossless reverse conversion of MODS to MARC 21 Value of MODS: –A rich, library-metadata-oriented XML metadata schema –Optimized for from-MARC conversion of legacy records –Selectively “improves” some of MARC’s mechanisms for representing resource type –Well-suited as a metadata format for OAI harvesting –Maintained by the same agency (LC) that maintains MARC 21 Applications of MODS: –LC planning to convert 100K American Memory records –Minerva project, U of Chicago Press, California Digital Library, others using or planning to use for records for web sites, e-texts. MODS

Library-related standards Metadata Encoding and Transmission Standard (METS) –Standard for encoding descriptive, administrative, structural, rights and other data essential for retrieving, preserving, and serving up digital resources –Six modules (header, descriptive metadata, administrative metadata, file section, structural map, behavior section) –Header and structural map are required; descriptive, administrative, behavior metadata may reside in METS object or be external. Value of METS: –Need for METS identified at DLF metadata experts meetings – varied local approaches to non-descriptive metadata not scaling well nor supporting interoperability between agencies –Can be used to collect digital resource metadata for submission to repository, hold metadata in the repository, inform user access applications Applications of METS: –LC using for moving images, audio recordings, folk life mixed media collections –OCLC DPR, RLG, Harvard, National Library of Wales exploring or using for variety of projects METS

Library-related standards Metadata for Images in XML (MIX) –Collaboration of LC and NISO Technical Metadata for Digital Still Images Standards Committee –XML schema for a set of technical data elements required to manage digital image collections –Format for interchange and/or storage of the data specified in the NISO Draft Standard Data Dictionary: Technical Metadata for Digital Still Images (version 1.2) –Still in early development and testing phases Value of MIX: –Provides a common XML schema for expressing technical data particular to still and moving digital images –Can be used with other schema such as METS and MODS as part of a comprehensive approach to managing and preserving digital images Applications of MIX: –OCLC DPR, LC, others planning or testing –MIX still in nascent stage of development and testing MIX

E-text-related standard Text Encoding Initiative (TEI): –For complex markup of literary texts –Both SGML & XML [new] DTDs available –TEI “header” (TEIH) can be used as a descriptive metadata record –Maintenance agency: TEI Consortium TEI Consortium has executive offices in Bergen, Norway, and is hosted at four university sites worldwide: the University of Bergen, Brown University, Oxford University, and the University of Virginia Consortium maintains “P4” Guidelines for Electronic Text Encoding and Interchange Value of TEI: –Designed to meet the needs of scholarly research community (esp. in the humanities) for a variety of activities including: Adding in-line academic commentary in e-texts As an aid to research through supporting special indexing points, etc. Applications of TEI: –Widely used by major humanities electronic text collections such as CETH, UVa e-text center, many others. TEI

Archives-related standard Encoded Archival Description (EAD) –A format for expressing electronic archival finding aids –Created by LC and the Society of American Archivists (SAA) –EAD DTD (Version 2002) is designed to function as both an SGML and XML DTD Value of EAD: –Effectively an organized presentation of a collection of documents EAD header carries metadata for the finding aid Provides for simple or complex mark-up to support varying levels of indexing Well-suited for interweaving narrative with links to specific objects in a collection (either directly to the object or via a record for the object that may link to the object). Applications of EAD: –Conversion of existing paper finding aids to electronic form –Widely used by academic institutions and archives in North America –RLG Archival Resources database host copies of many EADs EAD

Publishing-related standard ONIX International (Online Information Exchange): –Standard format for publishers to use to distribute electronic information about their publications. –XML schema with Unicode encoding –Based on EPICS (EDItEUR Product Information Communication Standards) –Maintenance agency: EDItEUR working with input from the Book Industry Communication (BIC) and the Book Industry Study Group (BISG) Value of ONIX: –Designed to meet needs of publishers, jobbers, retail sellers for richer book data online (including cover art) a common data exchange format that will allow players to be rid of the burden of costly, custom programming to handle data from individual suppliers –Offers two levels of richness (level 1 & level 2) Applications of ONIX: –Primarily oriented towards jobbers and publishers – Most major players (Amazon, Baker & Taylor, etc.) now using/supporting –Some interest in implementation in library systems ONIX

& Q uestions A A nswer s

Links MARC 21: MARCXML: XMLMARC: BiblioML (UNIMARC XML): MODS: METS: MIX: TEI: EAD: ONIX: Further reading on MARCXML, MODS, METS: “New Metadata Standards for Digital Resources,” Bulletin of the American Society for Information Science and Technology. Dec/Jan 2003, pp Major emphasis in this presentation

Links SCORM: RSLP: VRA Core: IMS LOM: CSDGM: GEM: CIMI: Also appearing (in Big Picture)