Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program
Many definitions of metadata “Data about data” “Structured information about an information resource of any media type or format.” (Caplan) “Structured information that describes, explains, locates, or otherwise makes it easier to retrieve, use, or manage an information resource.” (NISO) … 8/4/08 S504 Summer 2008
More definition, in libraries Structure Control Origin Machine-generated Human-generated The difference between data, metadata, and meta-metadata is often one of perspective 8/4/08 S504 Summer 2008
Some uses of metadata By information specialists By novices Describing “non-traditional” materials Cataloging Web sites Navigating within digital objects Managing digital objects over the long term By novices Preparing Web sites for search engines Depositing materials into an institutional repository Managing citation lists iTunes Tagging – flickr, del.icio.us, etc. LibraryThing 8/4/08 S504 Summer 2008
Metadata and cataloging Depends on what you mean by: metadata, and cataloging! But, in general: Metadata is broader in scope than cataloging Much metadata creation takes place outside of libraries Good metadata practitioners use key cataloging principles in non-MARC environments Metadata created for many different types of materials Metadata is NOT only for Internet resources! 8/4/08 S504 Summer 2008
Creating metadata HTML <meta> tags Spreadsheets Databases XML Library catalogs Digital library/content management systems ContentDM DigiTool DSpace 8/4/08 S504 Summer 2008
Types of metadata Descriptive metadata Administrative metadata Technical metadata Preservation metadata Rights metadata Structural metadata 8/4/08 S504 Summer 2008
How metadata is used 8/4/08 S504 Summer 2008
Descriptive metadata Purpose Some common general schemas Discovery Description to support use and interpretation Some common general schemas MARC MARCXML MODS Dublin Core LOTS of domain-specific schemas 8/4/08 S504 Summer 2008
MODS “Metadata Object Description Schema” Developed and maintained by the Library of Congress Network Development and MARC Standards Office For encoding bibliographic information Influenced by MARC, but not equivalent Quickly gaining adoption 8/4/08 S504 Summer 2008
Dublin Core (1) “Core” across all knowledge domains National and international standard 2001: Released as ANSI/NISO Z39.85 2003: Released as ISO 15836 No element required All elements repeatable 1:1 principle 8/4/08 S504 Summer 2008
Dublin Core (2) Two “flavors” Unqualified – 15 elements Qualified Additional elements Element refinements Encoding schemes (vocabulary and syntax) All qualifiers must follow “dumb-down” principle Unqualified DC required for sharing metadata via the Open Archives Initiative 8/4/08 S504 Summer 2008
DCMI Abstract Model New direction for the Dublin Core Metadata Initiative An “information model which is independent of any particular encoding syntax” RDF-inspired, but not RDF DCMI resource model DCMI description set model DCMI vocabulary model Full abstract model recommendation Still too early to really know where this is going 8/4/08 S504 Summer 2008
Comparing descriptive metadata formats MARC [example] MARCXML MODS Dublin Core [DC example] [QDC example] Record format Binary (ISO 2709) XML RDF, XML, HTML, et al. Field labels Numeric Text Reliance on AACR Strong Implied None Common method of creation By specialists By derivation By specialists and by derivation By novices and by specialists 8/4/08 S504 Summer 2008
Levels of control Data structure standards (e.g., MARC) Data content standards (e.g., AACR2r) Controlled vocabularies (e.g., LCSH) Very few metadata standards include a counterpart to the AACR “chief source of information” 8/4/08 S504 Summer 2008
Data content standards Anglo-American Cataloging Rules, 2nd edition (AACR2) Describing Archives: A Content Standard (DACS) Cataloging Cultural Objects (CCO) Also many format-specific guidelines Descriptive Cataloging of Rare Materials (DCRM) series Archival Moving Image Materials: A Cataloging Manual Betz: Graphic Materials … 8/4/08 S504 Summer 2008
Vocabulary encoding schemes aka, controlled vocabularies TGM I TGM II TGN GeoNet AAT LCSH LCNAF DCMI Type MIME Types …etc. 8/4/08 S504 Summer 2008
Syntax encoding schemes ISO8601 W3CDTF URI …etc. 8/4/08 S504 Summer 2008
Technical metadata One type of administrative metadata For recording technical aspects of digital objects Of use for long-term maintenance of data Some examples: NISO Z39.87: Data Dictionary – Technical Metadata for Digital Still Images & MIX Schema for Technical Metadata for Text 8/4/08 S504 Summer 2008
Structural metadata For creating a logical structure between digital objects Locating the same intellectual content on multiple representations Noting points of interest within a single resource Grouping and sequencing multiple files that make up a logical whole METS is the current primary schema 8/4/08 S504 Summer 2008
How do I pick a metadata format? Genre/format of materials being described Nature of holding institution Use and audience for the metadata What others in the community are doing Describing analog vs. digitized item Relationships between records Plan for interoperability, including repeatability of elements More information on handout 8/4/08 S504 Summer 2008
Further information jenlrile@indiana.edu These presentation slides: <http://www.dlib.indiana.edu/~jenlrile/presentations/slis/ 08summer/s504/s504.ppt> Metadata librarians listserv: <http://metadatalibrarians.monarchos.com> Priscilla Caplan: Metadata Fundamentals for all Librarians, 2003 8/4/08 S504 Summer 2008