Presentation on theme: "1 Demystifying metadata Ann Chapman UKOLN University of Bath UKOLN is funded by Resource: The Council for Museums, Archives and Libraries, the Joint Information."— Presentation transcript:
1 Demystifying metadata Ann Chapman UKOLN University of Bath UKOLN is funded by Resource: The Council for Museums, Archives and Libraries, the Joint Information Systems Committee (JISC) of the Higher Education Funding Councils, as well as by project funding from the JISC and the European Union. UKOLN also receives support from the University of Bath where it is based.
2 What is metadata? Structured data about resources Library catalogues Abstracting and indexing services Archival finding aids Museum documentation Community information Carriers: MARC, HTML, SGML, XML
3 Markup languages SGML - Standard Generalised Markup Language - controls document formatting for publication XML - Extensible Markup Language - next generation SGML HTML - Hyper Text Markup Language - SGML subset, controls display of web pages Tags (usually paired) structure text into elements e.g. headings, paragraphs, lists, etc.
4 MARC - structure Structured format Numeric and alpha tags Fixed fields Leader, 001-008, 010-099 Variable fields
5 MARC – elements 1XX Main entry 2XX Title, SR, edition, publication 3XX Physical description 4XX Series 5XX Notes 6XX Subject access 7XX Added entries 8XX Added entries for series 9XX References and local fields
6 ONIX - structure Carrier - XML Primary use publishers to Internet booksellers rich product information In use first version 1999 current version Release 2.0 (2001) Elements – XML reference name and tag
7 ONIX - elements Message header Product record identifiers, author, title, edition, language, subject, audience, descriptions, publisher, dates territorial rights, dimensions, suppliers, availability, promotions Main series and sub series records
8 ONIX record 0123456789 Alice in Wonderland Author Carroll, Lewis Collins 2000
9 Dublin Core - structure Simple resource discovery DCMES – Dublin Core Metadata Element Set HTML the most common carrier Comprises 15 elements with element qualifiers element encoding schemes optional/mandatory elements Application profiles
10 Dublin Core - elements Title Creator Subject Description Publisher Contributor Date Resource Type Format Resource Identifier Source Language Relation Coverage Rights
11 Dublin Core - record Alice in Wonderland Lewis Carroll Fiction Project Gutenberg 2000 ASCII file via FTP http://promo.net/pg/…..
12 Encoded Archival Description EAD 1993 project to develop standard for machine-readable finding aids, Version 1 1998 SGML (and XML compliant) Hierarchical structure of archives repository, management group, fonds, series, file, item Possible to embed MARC elements
14 EAD - elements [ id + bibliographic inf. for finding aid ] [ data on a body of archival materials ] [container, physical description, physical location, repository, date and title of unit ] [ biography, scope, access, arrangement ] [name, place, genre, subject, title]
15 EAD record - LKX-3042
"name": "15 EAD record - LKX-3042
16 EAD record - collection A collection of materials in and about shorthand collected by Sir Isaac Pitman and James Pitman Shorthand
17 Collection Description Schema developed May 2000 Access version for RSLP – summer 2001 Web version for Reveal – spring 2002 General attributes Subject Dates Associated agents External relationships
19 Coll. Desc. - record Title: Pitman Collection Strength: Shorthand – national collection Phys. Desc: Printed texts and manuscripts Lang: English, Spanish, Esperanto, …… Access: Written request to the Librarian, Bath Univ. Accrual: passive, deposit Location: The Library, Bath University, Bath Subject: Shorthand, Sir Isaac Pitman Owner: Pitman Publishing Co. Catalogue: Bath University OPAC
20 M21 Community Information Same principles as MARC Bibliographic Leader individual/organization/program/event/other Fixed fields 001-008, 010-099 fixed fields 007 disability facilities 008 special aspects Variable fields
21 M21 Comm. Inf. - elements 1XX Name 2XX Title and Address 3XX Physical description 4XX Series (for events) 5XX Notes 6XX Subject access 7XX Added entries 8XX Other variable fields
22 M21 Comm. Inf. - record 110 $a CILIP 245 $a CILIP HQ 247 $a LA HQ $f 19?? - 2002 270 $a 7 Ridgmount St, London, WC1E 7AE $k 020 7255 0505 $m email@example.com $r 9am to 6pm 311 $a Ewart Room $d seats 50 $g £100 per day 312 $a Overhead projector $f £10 per day 581 $a Library + Information Update 856 $a http://www.cilip.org.uk
23 Metadata – fit for purpose MARC Bibliographic ONIX Dublin Core EAD Collection description M21 Community Information