1 CS 430: Information Discovery Lecture 5 Descriptive Metadata 1: Library Catalogs, Dublin Core.

2 Course Administration

3 Descriptive Metadata Catalog: metadata records that have a consistent structure, organized according to systematic rules. Abstract: a free text record that summarizes a longer document. Indexing record: less formal than a catalog record, but more structured than a simple abstract. Some methods of information discovery search descriptive metadata about the objects. Metadata typically consists of a catalog or indexing record, or an abstract, one record for each object.

4 Descriptive Metadata Usually stored separately from the objects that it describes, but sometimes is embedded in the objects. Usually the metadata is a set of text fields. Textual metadata can be used to describe non-textual objects, e.g., software, images, music

5 Descriptive metadata Information discovery is often most effective when applied to metadata rather than raw information. Allows fielded searching: author = "Goethe". Suitable for non-textual material: type = "picture" and subject = "Ithaca". Can be used with controlled vocabulary: language = "en".
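
Fielded searching can be illustrated with a minimal sketch. The records, the field names other than those on the slide, and the helper `fielded_search` are hypothetical and only serve to show how a query such as type = "picture" and subject = "Ithaca" is matched against metadata rather than against the objects themselves.

```python
# Hypothetical in-memory catalog: each record maps a field name to a list of values.
RECORDS = [
    {"author": ["Goethe"], "title": ["Faust"], "type": ["text"], "language": ["de"]},
    {"title": ["Cascadilla Gorge"], "type": ["picture"], "subject": ["Ithaca"], "language": ["en"]},
    {"title": ["Ithaca street map"], "type": ["map"], "subject": ["Ithaca"], "language": ["en"]},
]


def fielded_search(records, **criteria):
    """Return the records in which every named field contains the requested value."""
    return [
        record
        for record in records
        if all(value in record.get(name, []) for name, value in criteria.items())
    ]


# author = "Goethe"
goethe = fielded_search(RECORDS, author="Goethe")

# type = "picture" and subject = "Ithaca"
pictures_of_ithaca = fielded_search(RECORDS, type="picture", subject="Ithaca")

print([r["title"][0] for r in goethe])
print([r["title"][0] for r in pictures_of_ithaca])
```

The picture is found through its textual metadata alone, which is the point made on the slide about non-textual material.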

6 Origins of Library Catalogs Bibliographic Objective: to bring together like items; to differentiate among similar ones. Sir Anthony Panizzi, Keeper of Books at the British Museum ( ). His Ninety-One Rules (1841) were the basis of modern catalogue rules.

7 Origins of Library Catalogs Information Discovery: to enable a person to find a book of which either the author, title or subject is known; to show what the library has by a given author, on a given subject, or in a given kind of literature; to assist in the choice of a book as to its edition (bibliographically) or to its character (literary or topical). Charles Ammi Cutter, Librarian of the Boston Athenaeum, Rules for a Dictionary Catalog, 1874

8 Origins of Library Catalogs Classification: Division of subject matter into a hierarchy. Typically used in libraries to provide a subject-based order for shelving books. Melvil Dewey, Acting Librarian of Amherst College (1874). The Dewey Decimal system of book classification uses the numbers 000 to 999 to cover the general fields of knowledge and decimals to fit special subjects.
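
The hierarchy can be sketched in a few lines: the hundreds digit of a Dewey number selects one of ten main classes, and the decimals refine the subject within it. The class labels below paraphrase the present-day main class headings and are not taken from the slide; `main_class` is a hypothetical helper.

```python
# The ten main classes, keyed by the hundreds digit of the class number.
# Labels are paraphrased from the current DDC summaries (an assumption, not from the slide).
DDC_MAIN_CLASSES = {
    0: "Computer science, information and general works",
    1: "Philosophy and psychology",
    2: "Religion",
    3: "Social sciences",
    4: "Language",
    5: "Science",
    6: "Technology",
    7: "Arts and recreation",
    8: "Literature",
    9: "History and geography",
}


def main_class(class_number: str) -> str:
    """Map a Dewey number such as '025.3' to the main class it falls under."""
    whole_part = int(class_number.split(".")[0])
    if not 0 <= whole_part <= 999:
        raise ValueError(f"not in the range 000 to 999: {class_number!r}")
    return DDC_MAIN_CLASSES[whole_part // 100]


print(main_class("025.3"))
print(main_class("510"))
```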

9 Technology Materials to be catalogued: originally books; extended to serials, maps, music, etc., but concepts still rely heavily on experience with books. Form of catalog: entries in books (Panizzi), index cards (Cutter), online databases (Kilgour). [Library Cataloguing will be continued in Lecture 6.]

10 Catalogs as Investments Costs: Conventional catalog records are created by skilled librarians (cost estimate: $100 per record). OCLC's catalog has 43 million records. Total investment is several billion dollars. Cataloguing standards: enable libraries to share records; combine records of the past with records created today; allow readers and librarians to move between libraries.
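
The "several billion dollars" figure follows directly from the two numbers on the slide. The short calculation below uses only those numbers; the variable names are illustrative.

```python
COST_PER_RECORD_USD = 100        # cost estimate from the slide
OCLC_RECORD_COUNT = 43_000_000   # size of OCLC's catalog from the slide

total_investment_usd = COST_PER_RECORD_USD * OCLC_RECORD_COUNT
print(f"${total_investment_usd / 1e9:.1f} billion")  # prints "$4.3 billion"
```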

11 Dublin Core Simple set of metadata elements for online information: 15 basic elements; intended for all types and genres of material; all elements optional; all elements repeatable. Developed by an international group chaired by Stuart Weibel since (Diane Hillmann and Carl Lagoze of Cornell are very active in this group.)
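
One way to picture "all elements optional, all elements repeatable" is a record that maps each of the 15 element names to a list that may be empty or hold several values. The class `DublinCoreRecord` below is a hypothetical sketch, not an established library; the element names are the 15 listed later in this lecture.

```python
from collections import defaultdict

# The 15 Dublin Core elements described in this lecture.
DC_ELEMENTS = (
    "title", "creator", "subject", "description", "publisher",
    "contributor", "date", "type", "format", "identifier",
    "source", "language", "relation", "coverage", "rights",
)


class DublinCoreRecord:
    """Hypothetical container: every element is optional and repeatable."""

    def __init__(self):
        self._values = defaultdict(list)

    def add(self, element, value):
        if element not in DC_ELEMENTS:
            raise ValueError(f"not a Dublin Core element: {element!r}")
        self._values[element].append(value)  # repeatable: values accumulate

    def get(self, element):
        if element not in DC_ELEMENTS:
            raise ValueError(f"not a Dublin Core element: {element!r}")
        return list(self._values.get(element, []))  # optional: may be empty


record = DublinCoreRecord()
record.add("publisher", "OCLC")
record.add("creator", "Weibel, Stuart L.")
record.add("creator", "Miller, Eric J.")
record.add("title", "Dublin Core Reference Page")
record.add("language", "en")

print(record.get("creator"))  # two values for one element
print(record.get("rights"))   # an element that was never supplied
```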

12

13 Dublin Core publisher: OCLC creator: Weibel, Stuart L. creator: Miller, Eric J. title: Dublin Core Reference Page date: format: text/html (MIME type) language: en (English) identifier:

14 Dublin Core with Meta Tags
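
The markup from this slide is not preserved in the transcript. As a rough illustration only, the sketch below renders element/value pairs as HTML `meta` tags using the widely used `DC.Element` naming convention; the function `to_meta_tags` and the exact capitalization of the names are assumptions.

```python
from html import escape


def to_meta_tags(pairs):
    """Render (element, value) pairs as HTML meta tags, one tag per pair.

    Repeated elements simply produce repeated tags.
    """
    return "\n".join(
        f'<meta name="DC.{element.capitalize()}" content="{escape(value, quote=True)}">'
        for element, value in pairs
    )


tags = to_meta_tags([
    ("title", "Dublin Core Reference Page"),
    ("creator", "Weibel, Stuart L."),
    ("creator", "Miller, Eric J."),
    ("language", "en"),
])
print(tags)
```

Values are escaped before being placed in the `content` attribute so that quotation marks or ampersands in a title cannot break the surrounding HTML.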

15 Dublin Core elements 1. Title The name given to the resource by the creator or publisher. 2. Creator The person or organization primarily responsible for the intellectual content of the resource. For example, authors in the case of written documents, artists, photographers, or illustrators in the case of visual resources. 3. Subject The topic of the resource. Typically, subject will be expressed as keywords or phrases that describe the subject or content of the resource. The use of controlled vocabularies and formal classification schemes is encouraged.

16 Dublin Core elements 4. Description A textual description of the content of the resource, including abstracts in the case of document-like objects or content descriptions in the case of visual resources. 5. Publisher The entity responsible for making the resource available in its present form, such as a publishing house, a university department, or a corporate entity. 6. Contributor A person or organization not specified in a creator element who has made significant intellectual contributions to the resource but whose contribution is secondary to any person or organization specified in a creator element (for example, editor, transcriber, and illustrator).

17 Dublin Core elements 7. Date A date associated with the creation or availability of the resource. 8. Type The category of the resource, such as home page, novel, poem, working paper, preprint, technical report, essay, dictionary. 9. Format The data format of the resource, used to identify the software and possibly hardware that might be needed to display or operate the resource. 10. Identifier A string or number used to uniquely identify the resource. Examples for networked resources include URLs and URNs.

18 Dublin Core elements 11. Source Information about a second resource from which the present resource is derived. 12. Language The language of the intellectual content of the resource. 13. Relation An identifier of a second resource and its relationship to the present resource. This element permits links between related resources and resource descriptions to be indicated. Examples include an edition of a work (IsVersionOf), or a chapter of a book (IsPartOf).

19 Dublin Core elements 14. Coverage The spatial locations and temporal durations characteristic of the resource. 15. Rights A rights management statement, an identifier that links to a rights management statement, or an identifier that links to a service providing information about rights management for the resource.

20 Qualifiers Element qualifier Example: Date DC.Date -> Created: DC.Date -> Issued: DC.Date -> Available: / DC.Date -> Valid: /

21 Qualifiers Value qualifiers Example: Subject DC.Subject -> DDC: DC.Subject -> LCSH: Digital libraries-United States
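
The two kinds of qualifier can be modelled as two optional attributes on a statement: an element qualifier narrows the meaning of the element (Date, Created), while a value qualifier names the scheme the value is drawn from (Subject, LCSH). The `Statement` class, the dotted label format, and the date value are hypothetical; the LCSH heading is the one on the slide.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Statement:
    element: str                       # one of the 15 elements, e.g. "Date"
    value: str
    refinement: Optional[str] = None   # element qualifier, e.g. "Created"
    scheme: Optional[str] = None       # value qualifier, e.g. "LCSH" or "DDC"

    def label(self) -> str:
        """Dotted name such as DC.Date.Created (an assumed display convention)."""
        name = f"DC.{self.element}"
        if self.refinement:
            name += f".{self.refinement}"
        return name

    def unqualified(self) -> "Statement":
        """Drop both qualifiers, leaving a plain element/value pair."""
        return Statement(self.element, self.value)


created = Statement("Date", "2000-01-15", refinement="Created")  # date is made up
subject = Statement("Subject", "Digital libraries-United States", scheme="LCSH")

print(created.label(), "=", created.value)
print(subject.label(), f"({subject.scheme})", "=", subject.value)
print(created.unqualified().label())
```

Keeping the qualifiers separate from the element name means a reader that ignores them still sees a usable unqualified record.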

22

23 Dublin Core with qualifiers Digital Libraries and the Problem of Purpose David M. Levy Corporation for National Research Initiatives January 2000 article /january2000-levy English Copyright (c) David M. Levy

24 Limits of Dublin Core Complex objects: article within a journal; a thumbnail of another image; the March 28 final edition of a newspaper. Complete object, sub-objects, metadata records.

25 Flat v. linked records Flat record: all information about an item is held in a single Dublin Core record, including information about related items. Convenient for access and preservation, but information is repeated (a maintenance problem). Linked record: related information is held in separate records with a link from the item record. Less convenient for access and preservation, but information is stored once. Compare with normal forms in relational databases.
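
The trade-off can be sketched with two articles from the same journal. All field names, article titles, and the journal rename are hypothetical; the sketch only shows that a change to shared information touches every flat record but a single linked record, while reading a linked record needs an extra lookup.

```python
# Flat: journal details are repeated inside every article record.
FLAT = [
    {"title": "Article A", "journal_title": "D-Lib Magazine"},
    {"title": "Article B", "journal_title": "D-Lib Magazine"},
]

# Linked: journal details are stored once and referenced by an identifier.
JOURNALS = {"j1": {"title": "D-Lib Magazine"}}
LINKED = [
    {"title": "Article A", "journal_id": "j1"},
    {"title": "Article B", "journal_id": "j1"},
]


def rename_journal_flat(records, old_title, new_title):
    """Update every record that repeats the journal title; return how many changed."""
    changed = 0
    for record in records:
        if record["journal_title"] == old_title:
            record["journal_title"] = new_title
            changed += 1
    return changed


def rename_journal_linked(journals, journal_id, new_title):
    """Update the single journal record; return how many records changed."""
    journals[journal_id]["title"] = new_title
    return 1


def resolve(article, journals):
    """Follow the link to rebuild the complete view of a linked article."""
    return {**article, "journal_title": journals[article["journal_id"]]["title"]}


flat_changes = rename_journal_flat(FLAT, "D-Lib Magazine", "D-Lib")
linked_changes = rename_journal_linked(JOURNALS, "j1", "D-Lib")
print(flat_changes, linked_changes)
print(resolve(LINKED[0], JOURNALS))
```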

26 Dublin Core with flat record extension Continuation D-Lib Magazine

27 Events Version 1, new material, Version 2. Should Version 2 have its own record, or should extra information be added to the Version 1 record? How are these represented in Dublin Core?

28 Minimalist versus structuralist Minimalist: 15 elements, no qualifiers; suitable for non-professionals; encourages creators to provide metadata. Structuralist: 15 elements, qualifiers, RDF, detailed coding rules; will require trained metadata experts. [For an example of how complex Dublin Core can become, see the source of: htm#]

29 Dublin Core in many languages See: Thomas Baker, Languages for Dublin Core, D-Lib Magazine December 1998,