2003.09.11 - SLIDE 1IS 202 - FALL 2003 Lecture 06: Metadata Introduction Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30.

Slides:



Advertisements
Similar presentations
Future of Cataloging RDA and other innovations Pt. 2.
Advertisements

Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.
SLIDE 1IS 257 – Fall 2007 Codes and Rules for Description: History 2 University of California, Berkeley School of Information IS 245: Organization.
Bibliographic Records, Data Structures and Databases (Cont.)
8/28/97Information Organization and Retrieval Metadata and Data Structures University of California, Berkeley School of Information Management and Systems.
SLIDE 1IS 257 – Fall 2009 Organization of Information in Collections: Introduction University of California, Berkeley School of Information.
Using Metadata in CONTENTdm Diana Brooking and Allen Maberry Metadata Implementation Group, Univ. of Washington Crossing Organizational Boundaries Oct.
10/23/2001Information Organization and Retrieval Information Structures and Metadata University of California, Berkeley School of Information Management.
10/26/2000Information Organization and Retrieval Metadata and Description University of California, Berkeley School of Information Management and Systems.
SLIDE 1IS FALL 2004 Lecture 18: Metadata & Controlled Vocabulary Introduction Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday.
1 CS 502: Computing Methods for Digital Libraries Lecture 17 Descriptive Metadata: Dublin Core.
10/24/2000Information Organization and Retrieval Information Structures and Metadata University of California, Berkeley School of Information Management.
OLC Spring Chapter Conferences Metadata, Schmetadata … Tell Me Why I Should Care? OLC Spring Chapter Conferences, 2004 Margaret.
SLIDE 1IS Fall 2002 Lecture 05: Metadata: Introduction Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30.
SLIDE 1IS 245 – Spring 2009 Codes and Rules for Description: History University of California, Berkeley School of Information IS 245: Organization.
IMT530- Organization of Information Resources1 Feedback Like exercises –But want more instructions and feedback on them –Wondering about grading on these.
SLIDE 1IS 257 – Fall 2007 Codes and Rules for Description: History University of California, Berkeley School of Information IS 245: Organization.
Use of METS in CDL Digital Special Collections Brian Tingle.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Guest Lecture LIS 656, Spring 2011 Kathryn Lybarger.
By Carrie Moran. To examine the Metadata Object Description Schema (MODS) metadata scheme to determine its utility based on structure, interoperability.
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
1 © Netskills Quality Internet Training, University of Newcastle Metadata Explained © Netskills, Quality Internet Training.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
CONSER RDA Bridge Training [date] Presenters : [names] 1.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
1 herbert van de sompel CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel
Search Engines. Search Strategies Define the search topic(s) and break it down into its component parts What terms, words or phrases do you use to describe.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
Introduction to metadata
AACR2 Pt. 1, Monographic Description LIS Session 2.
Evidence from Metadata INST 734 Doug Oard Module 8.
RDA and Special Libraries Chris Todd, Janess Stewart & Jenny McDonald.
RDA DAY 1 – part 2 web version 1. 2 When you catalog a “book” in hand: You are working with a FRBR Group 1 Item The bibliographic record you create will.
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
Intellectual Works and their Manifestations Representation of Information Objects IR Systems & Information objects Spring January, 2006 Bharat.
Description of Bibliographic Items. Review Encoding = Markup. The library cataloging “markup” language is MARC. Unlike HTML, MARC tags have meaning (i.e.,
1 CS 430: Information Discovery Lecture 5 Descriptive Metadata 1 Libraries Catalogs Dublin Core.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
FRBR: Cataloging’s New Frontier Emily Dust Nimsakont Nebraska Library Commission NCompass Live December 15, 2010 Photo credit:
Functional Requirements for Bibliographic Records The Changing Face of Cataloging William E. Moen Texas Center for Digital Knowledge School of Library.
AACR 2 –Rules for Descriptive Cataloguing
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Information Retrieval
Digitization – Basics and Beyond workshop Interoperability of cultural and academic resources New services for digitized collections Muriel Foulonneau.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Subject Description LIS 571 The Organization and Control of Recorded Information.
8/28/97Information Organization and Retrieval Introduction University of California, Berkeley School of Information Management and Systems SIMS 245: Organization.
Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
The ___ is a global network of computer networks Internet.
A centre of expertise in digital information management UKOLN is supported by: Metadata – what, why and how Ann Chapman.
Definition, purposes/functions, elements of IR systems Lesson 1.
MARC Tags to BIBFRAME Vocabulary: a new view of metadata Sally McCallum Library of Congress ALA - January 2014.
Queensland University of Technology Faculty of Information Technology Michael Middleton 1 CRICOS No J Bibliographic description.
Organization of Information LSIS Summer II (2005)
Some basic concepts Week 1 Lecture notes INF 384C: Organizing Information Spring 2016 Karen Wickett UT School of Information.
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Catherine Lai MUMT-611 MIR January 27, 2005
Metadata for research outputs management
Introduction to Semantic Metadata & Semantic Web
Metadata to fit your needs... How much is too much?
Introduction to Metadata
Introduction to Information Retrieval
Database Design Hacettepe University
FRBR and FRAD as Implemented in RDA
Presentation transcript:

SLIDE 1IS FALL 2003 Lecture 06: Metadata Introduction Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am - 12:00 am Fall 2002 SIMS 202: Information Organization and Retrieval

SLIDE 2IS FALL 2003 Lecture Contents Review –Lexical Relations –WordNet Organization of Information Metadata Kinds of Metadata Dublin Core Discussion

SLIDE 3IS FALL 2003 Lecture Contents Review –Lexical Relations –WordNet Organization of Information Metadata Kinds of Metadata Dublin Core Discussion

SLIDE 4IS FALL 2003 Syntax The syntax of a language is to be understood as a set of rules which accounts for the distribution of word forms throughout the sentences of a language These rules codify permissible combinations of classes of word forms

SLIDE 5IS FALL 2003 Semantics Semantics is the study of linguistic meaning Two standard approaches to lexical semantics (cf., sentential semantics; and, logical semantics): –(1) compositional –(2) relational

SLIDE 6IS FALL 2003 Pragmatics Deals with the relation between signs or linguistic expressions and their users Deixis (literally “pointing out”) –E.g., “I’ll be back in an hour” depends upon the time of the utterance Conversational implicature –A: “Can you tell me the time?” –B: “Well, the milkman has come.” [I don’t know exactly, but perhaps you can deduce it from some extra information I give you.] Presupposition –“Are you still such a bad driver?” Speech acts –Constatives vs. performatives –E.g., “I second the motion.” Conversational structure –E.g., turn-taking rules

SLIDE 7IS FALL 2003 Lexical Relations Conceptual relations link concepts –Goal of Artificial Intelligence Lexical relations link words –Goal of Linguistics

SLIDE 8IS FALL 2003 Major Lexical Relations Synonymy Polysemy Metonymy Hyponymy/Hypernymy Meronymy/Holonymy Antonymy

SLIDE 9IS FALL 2003 WordNet Started in 1985 by George Miller, students, and colleagues at the Cognitive Science Laboratory, Princeton University –Miller also known as the author of the paper “The Magical Number Seven, Plus or Minus Two: Some Limits on our Capacity for Processing Information” (1956) Can be downloaded for free: –

SLIDE 10IS FALL 2003 Miller on WordNet “In terms of coverage, WordNet’s goals differ little from those of a good standard college-level dictionary, and the semantics of WordNet is based on the notion of word sense that lexicographers have traditionally used in writing dictionaries. It is in the organization of that information that WordNet aspires to innovation.” –(Miller, 1998, Chapter 1)

SLIDE 11IS FALL 2003 Structure of WordNet

SLIDE 12IS FALL 2003 Structure of WordNet

SLIDE 13IS FALL 2003 Structure of WordNet

SLIDE 14IS FALL 2003 Lecture Contents Review –Lexical Relations –Wordnet Organization of Information Metadata Kinds of Metadata Dublin Core Discussion

SLIDE 15IS FALL 2003 Organization of Information Is there a basic human need to put things into some sort of order? –Much of natural language concerns categories of things rather than individual things –Why do we organize things and information? Why do spoons go in THAT drawer in the kitchen and not in a can in the garage? Why do your favorite books go on one shelf and not-so-favorite on another?

SLIDE 16IS FALL 2003 Why Organize Information? The main reason –So that you can find things more effectively I.e., effective retrieval is predicated on some sort of organization applied to information resources Historically there have been many institutions and tools devoted to information organization –Libraries –Museums –Archives –Indexes and catalogs, dictionaries, phone books, etc.

SLIDE 17IS FALL 2003 Why Organize Information? A question of scale –Using your own ad hoc set of categories and methods to organize your own collection of books or CDs seems to work fine… –What if your collection grew to 10 Times the size? How would you organize it? 100 Times? 1000 Times? times?

SLIDE 18IS FALL 2003 What is Information Organization? Identifying the existence of all types of information-bearing entities as they are made available Identifying the works contained within those information-bearing entities or as parts of them Systematically pulling together these information-bearing entities into collections in libraries, archives, museums, Internet communications files and other such depositories From Hagler via Taylor, Chap. 1

SLIDE 19IS FALL 2003 What is Information Organization? Producing lists of these information- bearing entities prepared according to standard rules for citation Providing name, title, subject and other useful access to these information-bearing entities Providing the means of locating each information-bearing entity or a copy of it

SLIDE 20IS FALL 2003 Organizing Information Libraries Archives Museums and galleries Internet Corporate and office environments

SLIDE 21IS FALL 2003 Key Issues in This Course How to describe information resources or information-bearing objects in ways so that they may be effectively used by those who need to use them –Organizing How to find the appropriate information resources or information-bearing objects for someone’s (or your own) needs –Retrieving

SLIDE 22IS FALL 2003 Key Issues Creation UtilizationSearching Active Inactive Semi-Active Retention/ Mining Disposition Discard Using Creating Authoring Modifying Organizing Indexing Storing Retrieval Distribution Networking Accessing Filtering

SLIDE 23IS FALL 2003 Organizing/Indexing Collecting and integrating information Affects data, information and metadata “Metadata” describes data and information –More on this later Organizing information –Types of organization? Indexing

SLIDE 24IS FALL 2003 Accessing/Filtering Using the organization created in the O/I stage to –Select desired (or relevant) information –Locate that information –Retrieve the information from its storage location (often via a network)

SLIDE 25IS FALL 2003 Structure of an IR System Interest profiles & Queries Documents & data Rules of the game = Rules for subject indexing + Thesaurus (which consists of Lead-In Vocabulary and Indexing Language Storage Line Potentially Relevant Documents Comparison/ Matching Store1: Profiles/ Search requests Store2: Document representations Indexing (Descriptive and Subject) Formulating query in terms of descriptors Storage of profiles Storage of Documents Information Storage and Retrieval System

SLIDE 26IS FALL 2003 Lecture Contents Review –Lexical Relations –WordNet Organization of Information Metadata Kinds of Metadata Dublin Core Discussion

SLIDE 27IS FALL 2003 Metadata Metadata is –“Data about Data” (database systems) –Information about Information First used (to the best we can discover) in 1978 (meta-data) Used for databases in (Meta-Data Base) –“a data base which itself contains the structural and semantic data of other data bases” »Thomas R. Cousins & Wayne D. Dominick, “The Management of Data Bases of Data Bases” ASIS Proceedings, 1978.

SLIDE 28IS FALL 2003 Metadata Structures and languages for the description of information resources and their elements (components or features) “Metadata is information on the organization of the data, the various data domains, and the relationship between them” (Baeza-Yates p. 142)

SLIDE 29IS FALL 2003 Metadata Often two main types of metadata are distinguished –Descriptive metadata Describes the information/data object and its properties May use a variety of descriptive formats and rules –Topical metadata Describes the topic or “aboutness” of an information/data object May include a variety of vocabularies for describing, subjects, topics, categories, etc.

SLIDE 30IS FALL 2003 Types of Metadata Element names Element description Element representation Element coding Element semantics Element classification

SLIDE 31IS FALL 2003 Metadata Systems and Standards Naming and ID systems Bibliographic description –Texts Music Images and objects Numeric data Geospatial data Collections Video and motion pictures

SLIDE 32IS FALL 2003 The Same Item in Different Metadata Systems ISBD RFC 1807 TEI Header MARC Record Dublin Core (a bit later)

SLIDE 33IS FALL 2003 ISBD Punctuation Title Proper (GMD) = Parallel title : other title info / First statement of responsibility ; others. -- Edition information. -- Material. -- Place of Publication : Publisher Name, Date. -- Material designation and extent ; Dimensions of item. -- (Title of Series / Statement of responsibility). -- Notes. -- Standard numbers: terms of availability (qualifications).

SLIDE 34IS FALL 2003 Bibliographic Record Introduction to cataloging and classification / Bohdan S. Wynar. -- 8th ed. / Arlene G. Taylor. -- Englewood, Colo. : Libraries Unlimited, (Library science text series).

SLIDE 35IS FALL 2003 RFC 1807 BIB-VERSION:: CS-TR-v2.1 ID:: UCB// ENTRY:: September 9, 1997 TYPE:: BOOK TITLE:: Introduction to cataloging and classification AUTHOR:: Wynar, Bohdan S. AUTHOR:: Taylor, Arlene G. DATE:: 1992 PAGES:: 633 COPYRIGHT:: Libraries Unlimited, 1992 SERIES:: Library Science Text Series END:: UCB//123456

SLIDE 36IS FALL 2003 Minimal TEI Header Introduction to cataloging and classification Bohdan S. Wynar 8th edition by Arlene G. Taylor Libraries Unlimited Introduction to cataloging and classification / Bohdan S. Wynar. -- 8th ed. / Arlene G. Taylor. -- Englewood, Colo. : Libraries Unlimited, 1992.

SLIDE 37IS FALL 2003 MARC Record (Display) ID:DCLC B RTYP:c ST:p FRN: MS:c EL: AD: CC:9110 BLT:am DCF:a CSC: MOD: SNR: ATC: UD: CP:cou L:eng INT: GPC: BIO: FIC:0 CON:b PC:s PD:1992/ REP: CPI:0 FSI:0 ILC:a II:1 MMD: OR: POL: DM: RR: COL: EML: GEN: BSE: (cloth) (paper) 040 DLC$cDLC$dDLC Z693$b.W $ Wynar, Bohdan S Introduction to cataloging and classification /$cBohdan S. Wynar th ed. /$bArlene G. Taylor. 260 Englewood, Colo. :$bLibraries Unlimited,$c xvii, 633 p. :$bill. ;$c24 cm Library science text series 504 Includes bibliographical references (p ) and index Cataloging Subject cataloging Classification$xBooks Anglo-American cataloguing rules Taylor, Arlene G.,$d1941-

SLIDE 38IS FALL 2003 Lecture Contents Review –Lexical Relations –WordNet Organization of Information Metadata Kinds of Metadata Dublin Core Discussion

SLIDE 39IS FALL 2003 Dublin Core Simple metadata for describing internet resources For “Document-Like Objects” 15 Elements (in base DC)

SLIDE 40IS FALL 2003 Dublin Core TITLE: Introduction to cataloging and classification CREATOR: Taylor, Arlene G. OTHER CONTRIBUTOR: Wynar, Bohdan S. DATE: 1992 FORMAT: BOOK LANGUAGE: ENG PAGES: 633 PUBLISHER: Libraries Unlimited SUBJECT: Cataloging. SUBJECT: subject cataloging. SUBJECT: Classification -- Books DESCRIPTION: Textbook on cataloging and classification RESOURCE TYPE: text.monograph RESOURCE IDENTIFIER: (ISBN)

SLIDE 41IS FALL 2003 Dublin Core Elements Title Creator Subject Description Publisher Other Contributors Date Resource Type Format Resource Identifier Source Language Relation Coverage Rights Management

SLIDE 42IS FALL 2003 Mega-Metadata Standards METS - Metadata Encoding and Transmission Standard ( –Developed by the Digital Library Federation as an implementation strategy for preservation metadata –"XML document format for encoding metadata necessary for both management of digital library objects within a repository and exchange of such objects between repositories (or between repositories and their users)” –Provides a flexible mechanism for encoding descriptive, administrative, and structural metadata for a digital library object, and for expressing the complex links between these various forms of metadata

SLIDE 43IS FALL 2003 Metadata Resources Check the Links section from the class home page Best site is the “Digital Library: Metadata Resources” page from IFLA at For another good source of information on metadata standards see

SLIDE 44IS FALL 2003 Lecture Contents Review –Lexical Relations –WordNet Organization of Information Metadata Kinds of Metadata Dublin Core Discussion

SLIDE 45IS FALL 2003 Chap 3 Questions: Tu Tran In the 1830s, there was debate and opposition to having a non-Englishman (Italian born Anthony Panizzi) be the Keeper of the Printed Books of the British Museum. In today’s society, who is qualified to categorize information? For instance, should it be done by the creators of the works, experts in the fields, information science professionals, or a combination of people? How is categorization open to interpretation?

SLIDE 46IS FALL 2003 Tu Tran What would be some pros and cons of using a standard list of subject headings (such as the Library of Congress Subject Headings) to organize the Internet? See ml for the LOC list.

SLIDE 47IS FALL 2003 Chap 5 Questions: Hong Qu As more information gets produced and published everyday, would an extensible metadata standard such as the Dublin Core work better than a fixed metadata standard? What arguments are there for fixed versus extensible metadata systems? Taylor notes that the newer standards do not prescribe content. What exact does that mean? And what reasons would the metadata creators have for not prescribing “form of content”?

SLIDE 48IS FALL 2003 Next Time Controlled vocabularies (Introduction) Readings/discussion (all in PDF) –Paper by Elaine Svenonius on controlled vocabularies Sarah –Paper by Chris Borgman on online catalogs Paul –Paper by Marcia Bates on a design model for subject access Matt