Metadata for the Web A Necessary Evil? CS 431 – March 2, 2005 Carl Lagoze – Cornell University.

Slides:



Advertisements
Similar presentations
Dublin Core for Digital Video: Overview of the ViDe Application Profile.
Advertisements

T. Baker / 23 Sep 2000 Dublin Core Qualifiers and A Grammar for Dublin Core Thomas Baker DC-8, National Library of Canada, Ottawa 4 October 2000.
Dublin Core Metadata Tutorial July 9, 2007 Stuart Weibel Senior Research Scientist OCLC Programs and Research.
UKOLN, University of Bath
February Harvesting RDF metadata Building digital library portals with harvested metadata workshop EU-DL All Projects concertation meeting DELOS.
Alexandria Digital Library Project The ADEPT Bucket Framework.
DC 2004, Shanghai, October 2004D. Hillmann, Slide 1 An Introduction to Dublin Core Diane I. Hillmann National Science Digital Library DC2004 Tutorial,
IFLA Namespaces Gordon Dunsire Chair, IFLA Namespaces Technical Group Session 204 — IFLA library standards and the IFLA Committee on Standards – how can.
Natalia Wehler: Dublin Core Requirements on Metadata  multiple softwares to use metadata  management of changing standards  needs to be functional,
Where are the Semantics in the Semantic Web? Michael Ushold The Boeing Company.
© Tefko Saracevic, Rutgers University1 metadata considerations for digital libraries.
RDF Kitty Turner. Current Situation there is hardly any metadata on the Web search engine sites do the equivalent of going through a library, reading.
Kristin Eberle Monica Hampton Carmen Velasquez Kristin Eberle Monica Hampton Carmen Velasquez Knowledge Management.
Cornell CS 502 Metadata for the Web From Discovery to Description CS 502 – Carl Lagoze – Cornell University.
1 CS 502: Computing Methods for Digital Libraries Lecture 17 Descriptive Metadata: Dublin Core.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
Basic Dublin Core Semantics DC 2006 Tutorial 1, 3 October 2006 Marty Kurth Head of Metadata Services Cornell University Library.
Module 2b: Modeling Information Objects and Relationships IMT530: Organization of Information Resources Winter, 2007 Michael Crandall.
Some URLs JODI Paper – Harmony project –
Everything Around the Core Practices, policies, and models around Dublin Core Thomas Baker, Fraunhofer-Gesellschaft DC2004, Shanghai Library
Stuart Weibel OCLC, Inc. October, 1997 Dublin Core Metadata Stuart Weibel Consulting Research Scientist OCLC Office of Research purl.org/net/weibel October.
UKOLUG - July Metadata for the Web RDF and the Dublin Core Andy Powell UKOLN, University of Bath UKOLN.
Metadata and identifiers for e- journals Copenhagen Juha Hakala Helsinki University Library
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
Metadata: first principles Pat Bell Knowledge, Analysis and Intelligence.
1 Metadata for Citizens’ Information UKOLN is funded by the Library and Information Commission, the Joint Information Systems Committee (JISC) of the Higher.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
1 CS/INFO 430 Information Retrieval Lecture 20 Metadata 2.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
JENN RILEY METADATA LIBRARIAN IU DIGITAL LIBRARY PROGRAM Introduction to Metadata.
Semantics and Syntax of Dublin Core Usage in Open Archives Initiative Data Providers of Cultural Heritage Materials Arwen Hutt, University of Tennessee.
Metadata Modularization Concepts and Tools Carl Lagoze CS
Cornell CS 502 Metadata for the Web Issues and Simple Answers CS 502 – Carl Lagoze – Cornell University.
I Never Met a Data I Didn’t Like Metadata Issues in Local and Shared Digital Collections Presentation to ALCTS Electronic Resources Interest Group January.
Modularization and Interoperability: Dublin Core and the Warwick Framework Sandra D. Payette Digital Library Research Group Cornell University November.
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
1 Discussion Class 4 The Dublin Core Metadata Initiative.
Metadata and Documentation Iain Wallace Performing Arts Data Service.
Discovery Metadata for Special Collections Concepts, Considerations, Choices William E. Moen School of Library and Information Sciences Texas Center for.
Metadata Bridget Jones Information Architecture I February 23, 2009.
A Logical Framework for Metadata Interoperability 16th August 2007 The Advanced Digital Library Seminar 2007 Guilin, China.
A Quick Introduction to Metadata Michael Day UKOLN: The UK Office for Library and Information Networking, University of Bath
1 Dublin Core & DCMI – an introduction Some slides are from DCMI Training Resources at:
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
A Whirlwind Tour Through Part of the Metadata Landscape Jenn Riley Metadata Librarian IU Digital Library Program.
21 June 2001Managing Information Resources for e-Government1 The Dublin Core Makx Dekkers, Managing Director, Dublin Core Metadata Initiative
A centre of expertise in digital information managementwww.ukoln.ac.uk DCMI Affiliates: Implications for Institutions Rosemary Russell UKOLN University.
1 CS 430: Information Discovery Lecture 5 Descriptive Metadata 1 Libraries Catalogs Dublin Core.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Metadata : an overview XML and Educational Metadata, SBU, London, 10 July 2001 Pete Johnston UKOLN, University of Bath Bath, BA2 7AY UKOLN is supported.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Metadata Interaction, Integration, and Interoperability MODS, MARC and Metadata Interoperability, ALA Conference, June 27, 2005, Chicago, IL William E.
Cornell CS 502 Metadata for the Web Issues and Simple Answers CS 502 – Carl Lagoze – Cornell University.
Metadata for the Web Beyond Dublin Core? CS 431 – March 9, 2005 Carl Lagoze – Cornell University Acknowledgements to Liz Liddy and Geri Gay.
Metadata Interaction, Integration, and Interoperability NISO Workshop: Metadata Practices on the Cutting Edge, May 20, 2004, Washington, DC William E.
Metadata (and cataloging?) Jenn Riley Metadata Librarian IU Digital Library Program.
1 RDF, XML & interoperability Metadata : a reprise Communities, communication & XML An introduction to RDF RDF, XML and interoperability.
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
Attributes and Values Describing Entities. Metadata At the most basic level, metadata is just another term for description, or information about an entity.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Metadata: It’s everywhere! It’s everywhere!
Metadata for the Web From Discovery to Description
Introduction to Metadata
Applications of IFLA Namespaces
Attributes and Values Describing Entities.
Introduction to Semantic Metadata & Semantic Web
A Whirlwind Tour Through Part of the Metadata Landscape
Some Options for Non-MARC Descriptive Metadata
Attributes and Values Describing Entities.
Presentation transcript:

Metadata for the Web A Necessary Evil? CS 431 – March 2, 2005 Carl Lagoze – Cornell University

“Metadata is data about data”

Metadata is semi-structured data conforming to commonly agreed upon models, providing operational interoperability in a heterogeneous environment

Are metadata and data distinguishable? Objectivity? Intellectual property? Structure? Aboutness?

Some untested hypotheses Metadata is useful for… –People –Machines More metadata is better Simple metadata created by non-experts (Joe Sixpack and his smarter friends) is useful

Metadata Quality as function of Creator Expertise Hoped Actual?

Some known facts Number and variety of metadata vocabularies will continue to increase The Tower of Babel is a franchise –There is not one common view of reality “The one thing I know about metadata is that it is expensive” (Bill Arms) “I hate metadata projects because they make every other digital library project more expensive” (Michael Lesk)

Metadata Triage

Metadata Takes Many Forms

Metadata Challenges Accommodate multiple varieties of metadata –community-specific functionality, creation, administration, access –Lowest common denominator Tensions –functionality and simplicity –extensibility and interoperability –human and machine creation and use

The fiction of classification …there is no classification of the universe that is not fictional and conjectural. Jorge Luis Borges

Lenses and Views All classification does and should provide a biased lens or view of reality Each view emphasizes certain characteristics and hides others Geospatial Rights Museum

Reality is Complex Created by: George Castaldo Created on: 1994 Created by: Leonardo da Vinci Created on: 1506 Relationship?

Objects are Related IFLA Entity Model

Haven’t we done metadata already?

What’s wrong with this model? Expensive –Complex (even for its original goal?) –Professional intervention (assumes single community of expertise) Monolithic –One size fits all approach –Reflects its centralized system origins Bias towards physical artifacts –Fixed resources –Incomplete handling of resource evolution and other resource relationships Anglo-centric

Why hasn’t metadata worked on the Web? Its all about trust People are lazy Metadata is hard No perceived benefit –“Reverse tragedy of the commons” No agreement on one way to describe things “Metacrap” -

The fifteen Dublin Core Elements

A Pidgin for Digital Tourists Metadata is language Dublin Core is a small and simple language -- a pidgin -- for finding resources across domains. Speakers of different languages naturally "pidginize" to communicate –E.g., tourists using simple phrases to order beer ("zwei Bier bitte" "dva pivo prosim" "biru o san bai kudasi"...) We are all "tourists" on the global Internet.

What is the Dublin Core (1) A simple set of properties to support resource discovery on the web (fuzzy search buckets)? Domain Independent view

What is Dublin Core (2)? An extensible ontology for resource desciption? Greater Functionality & Cost

Progressive Metadata Models: Drill-Down Searching Paradigm Moving along a specificity spectrum Inter-domain vs. intra-domain terms, models, query mechanisms

Drill-down search paradigm Domain Independent view Domain Specific View

What is the Dublin Core (3)? A cross-domain switchboard for interoperable metadata? Switchboard Dublin Core MARC INDECSIMS

Dublin Core Qualifiers From fuzzy buckets to more specific description Model of “graceful degradation” –Support both simplicity and specificity –Intra-domain and inter-domain semantics

Varieties of qualifiers: Element Refinements Make the meaning of an element narrower or more specific. Narrowing implies an is a relationship –a "date created“ is a "date“ –an "is part of relation“ is a "relation“ If your software does not understand the qualifier, you can safely ignore it.

Varieties of Qualifiers: Value Encoding Schemes Says that the value is –a term from a controlled vocabulary (e.g., Library of Congress Subject Headings) –a string formatted in a standard way (e.g., " " means May 3, not February 5) Even if a scheme is not known by software, the value should be "appropriate" and usable for resource discovery.

A Grammar of Dublin Core r.html By design not as subtle as mother tongues, but easy to learn and extremely useful in practice Pidgins: small vocabularies (Dublin Core: fifteen special nouns and lots of optional adjectives) Simple grammars: sentences (statements) follow a simple fixed pattern...

Example Dublin Core statements Resource has Title 'Grammar of Dublin Core'. Resource has Creator 'Tom Baker'. Resource has Subject 'Metadata'. Resource has Relation

Resourcehasproperty DC:Creator DC:Title DC:Subject DC:Date... X implied subject implied verb one of 15 properties property value (an appropriate literal)

Resourcehasproperty DC:Creator DC:Title DC:Subject DC:Date... X implied subject implied verb one of 15 properties property value (an appropriate literal) [optional qualifier] qualifiers (adjectives)

ResourcehasDate" " Revised ISO8601 ResourcehasSubject"Languages -- Grammar" LCSH

Dumb-Down Principle for Qualifiers The fifteen elements should be usable and understandable with or without the qualifiers Qualifiers refine meaning (but may be harder to understand) Nouns can stand on their own without adjectives If your software encounters an unfamiliar qualifier, look it up -- or just ignore it! "has a“ relations break the model –E.g., a creator has a hair color

ResourcehasDate" " Revised ISO8601 ResourcehasSubject"Languages -- Grammar" LCSH Test for “good““ qualifiers: cover and ask: -- Does the statement still make sense? -- Is it still correct?

Resourcehassubject audience Resourcehascreator affiliation “Incorrect” Qualification “Cornell University” “pre-schoolers”