O N T O P E D I A The Identity of Everything psi.ontopedia.net Expressing Dublin Core.

Slides:



Advertisements
Similar presentations
Ali Alshowaish. dc.coverage element articulates limitations in the scope of the resource, typically along the following lines: geographical, temporal,
Advertisements

Agents and the DC Abstract Model Andy Powell UKOLN, University of Bath DC Agents WG Meeting DC-2005, Madrid.
DC2001, Tokyo DCMI Registry : Background and demonstration DC2001 Tokyo October 2001 Rachel Heery, UKOLN, University of Bath Harry Wagner, OCLC
DC Architecture WG meeting Monday Sept 12 Slot 1: Slot 2: Location: Seminar Room 4.1.E01.
OLAC Metadata Steven Bird University of Melbourne / University of Pennsylvania OLAC Workshop 10 December 2002.
Andreas Schuck Metadata schema, keywords and guidelines NEFIS WP5 Meeting November, 2005 Hamburg, Germany.
A centre of expertise in digital information management Approaches To The Validation Of Dublin Core Metadata Embedded In (X)HTML Documents Background The.
Metadata vocabularies and ontologies Dr. Manjula Patel Technical Research and Development
UKOLN, University of Bath
The Dublin Core Collection Description Application Profile Pete Johnston UKOLN, University of Bath Chair, DC CD WG Collection Description Schema Forum,
Andy Powell, Eduserv Foundation Feb 2007 The Dublin Core Abstract Model – a packaging standard?
February Harvesting RDF metadata Building digital library portals with harvested metadata workshop EU-DL All Projects concertation meeting DELOS.
A centre of expertise in digital information management UKOLN is supported by: XML and the DCMI Abstract Model DC Architecture WG Meeting,
Introduction to RDF Based on tutorial at
Developing a Metadata Exchange Format for Mathematical Literature David Ruddy Project Euclid Cornell University Library DML 2010 Paris 7 July 2010.
Digital Libraries Models and Content. Goals for tonight Finish up from last week – the 5 S model more formally – Status of the systems available Obtaining,
DL:Lesson 4 Metadata: Dublin Core Luca Dini
Dr. Alexandra I. Cristea RDF.
Reusable!? Or why DDI 3.0 contains a recycling bin.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
1 CS 502: Computing Methods for Digital Libraries Lecture 17 Descriptive Metadata: Dublin Core.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
Dublin Core as a tool for interoperability Common presentation of data from archives, libraries and museums DC October 2006 Leif Andresen Danish.
Dublin Core Metadata Initiative Dr. donna Bair-Mundy Adding metadata to a web page.
A centre of expertise in digital information management UKOLN is supported by: XML Schema for DC Libraries AP DC Libraries WG Meeting,
Everything Around the Core Practices, policies, and models around Dublin Core Thomas Baker, Fraunhofer-Gesellschaft DC2004, Shanghai Library
ISO Standards: Status, Tools, Implementations, and Training Standards/David Danko.
UKOLUG - July Metadata for the Web RDF and the Dublin Core Andy Powell UKOLN, University of Bath UKOLN.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
Dublin Core Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
Metadata in Digital (Music) Libraries L631 April 7, 2003 Mary Wallace Davidson.
1 CS/INFO 430 Information Retrieval Lecture 20 Metadata 2.
Presenter: Sam Oh Tutorial:. Presentation Outline Representing DCMI Metadata Terms in Topic Maps Topic Maps-Driven Semantic Services.
JENN RILEY METADATA LIBRARIAN IU DIGITAL LIBRARY PROGRAM Introduction to Metadata.
1  Bob Hager Director of Publishing Standards Metadata Specification.
Creating an Application Profile Tutorial 3 DC2004, Shanghai Library 13 October 2004 Thomas Baker, Fraunhofer Society Robina Clayphan, British Library Pete.
9 th Open Forum on Metadata Registries Harmonization of Terminology, Ontology and Metadata 20th – 22nd March, 2006, Kobe Japan. Presentation Title: Day:
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
LIS654 lecture 5 DC metadata and omeka tables Thomas Krichel
RDF and XML 인공지능 연구실 한기덕. 2 개요  1. Basic of RDF  2. Example of RDF  3. How XML Namespaces Work  4. The Abbreviated RDF Syntax  5. RDF Resource Collections.
WI 4 (CWA1): Guidelines for machine-processable representation of Dublin Core Application Profiles Pete Johnston, UKOLN, University of Bath Thomas Baker,
Standards-Based Knowledge Systems using NewsML and Topic Maps Presented by Daniel Rivers-MooreDaniel Rivers-Moore Director of New Technologies, RivComRivCom.
Evidence from Metadata INST 734 Doug Oard Module 8.
1 Dublin Core & DCMI – an introduction Some slides are from DCMI Training Resources at:
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
21 June 2001Managing Information Resources for e-Government1 The Dublin Core Makx Dekkers, Managing Director, Dublin Core Metadata Initiative
A centre of expertise in digital information management UKOLN is supported by: Metadata for the People’s Network Discovery Service PNDS.
Overview of SC 32/WG 2 Standards Projects Supporting Semantics Management Open Forum 2005 on Metadata Registries 14:45 to 15:30 13 April 2005 Larry Fitzwater.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
1 Schemas or Vocabularies? April 26, 2005 OASIS Symposium on the Future of XML Vocabularies Bob DuCharme LexisNexis.
Pete Johnston, Eduserv Foundation 16 April 2007 An Introduction to the DCMI Abstract Model JISC.
Application Profiles Application profiles -- are schemas which consist of data elements drawn from one or more namespaces, combined together by implementers,
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
DCMI Abstract Model Analysis Resource Model Jorge Morato– Information Ingeneering Universidad Carlos III de Madrid
DC Architecture WG meeting Wednesday Seminar Room: 5205 (2nd Floor)
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
Metadata & Repositories Jackie Knowles RSP Support Officer.
Dublin Core Basics Workshop Lisa Gonzalez KB/LM Librarian.
Attributes and Values Describing Entities. Metadata At the most basic level, metadata is just another term for description, or information about an entity.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
prepared by Dr. Ammar Yakan
AGLS Metadata Standard
Introduction to Metadata
Attributes and Values Describing Entities.
Indicator structure and common elements for information flow
Session 2: Metadata and Catalogues
Some Options for Non-MARC Descriptive Metadata
Proposal of a Geographic Metadata Profile for WISE
Attributes and Values Describing Entities.
Presentation transcript:

O N T O P E D I A The Identity of Everything psi.ontopedia.net Expressing Dublin Core Metadata in Topic Maps Steve Pepper* Topic Maps Research and Applications * founder member of WSSSL

O N T O P E D I A The Identity of Everything psi.ontopedia.net Agenda Dublin Core Metadata Initiative – Brief introduction Dublin Core in Topic Maps – Timeline – Rationale – Proposed approach – Issues Conclusion – The Subject-Centric Advantage

O N T O P E D I A The Identity of Everything psi.ontopedia.net Dublin Core Metadata Initiative An increasingly widely used metadata standard for Web documents The Dublin Core Metadata Initiative is an open organization engaged in the development of interoperable online metadata standards that support a broad range of purposes and business models. – Originally sponsored by OCLC, Dublin, Ohio – Run by a Directorate, overseen by a Board of Trustees Defines sets of abstract metadata “elements” – Supports multiple concrete representations – HTML/XHTML, RDF/XML, XML

O N T O P E D I A The Identity of Everything psi.ontopedia.net DC Approved Element Sets The Dublin Core Metadata Element Set Other Elements and Element Refinements Encoding Schemes The DCMI Type Vocabulary

O N T O P E D I A The Identity of Everything psi.ontopedia.net Dublin Core in Topic Maps A work item in SC34 / WG3 for a Technical Report Timeline – Initial draft already available ISO/IEC DTR 29111, 2007 (see references) – CD balloting already underway – Results to be reviewed in Kyoto, December 8-11 – Stable specification expected by Xmas ’07

O N T O P E D I A The Identity of Everything psi.ontopedia.net DCinTM – The Rationale 1. In order to add (standard) metadata to topic maps 2. In order to enable Topic Maps-based collation of Dublin Core information – Dublin Core description sets have limited usefulness unless they can be aggregated based on meaning – Topic Maps makes such aggregation possible 3. In order to combine DC data with other, non-DC data

O N T O P E D I A The Identity of Everything psi.ontopedia.net DCinTM – The Proposed Approach Basic Principles – Start from the Dublin Core Abstract Model (DCAM) – Map to the Topic Maps Data Model (TMDM) – Reuse the DC identifiers – Use minimum additional vocabulary – Only map what’s really there: don’t infer stuff – Topic Maps representation should be as natural as possible – Take a subject-centric approach

O N T O P E D I A The Identity of Everything psi.ontopedia.net The Dublin Core Abstract Model See the more detailed presentation by Lutz In short: – Resources are described by property-value pairs in which the value may be a literal or non-literal:

O N T O P E D I A The Identity of Everything psi.ontopedia.net Basic DCAM to TMDM Mapping Dublin Core (DCAM) – described resource – property-value pair – property – property URI – literal value S – non-literal value S Topic Maps (TMDM) – subject / topic – statement name, association, or occurrence – statement type name type, association type, or occurrence type – subject identifier of typing topic – string (name or occurrence) – Either topic (when association) URI (when occurrence)

O N T O P E D I A The Identity of Everything psi.ontopedia.net Minimum of Additional Vocabulary The solution reuses the DC vocabulary – Property URIs become the subject identifiers of association types, etc. – There is no unnecessary duplication of identifiers However, TMs require role types in addition to name, occurrence and association types – Vocabulary extensions handle this:

O N T O P E D I A The Identity of Everything psi.ontopedia.net Dublin Core Metadata Element Set DCMES: A set of 15 core elements: – contributor, coverage, creator, date, description, format, identifier, language, publisher, relation, rights, source, subject, title, type Each element has 7 properties – URI, Label, Definition, Comment, Type of Term, Status, Date Issued The URI is an identifier, e.g. – (dc:contributor)

O N T O P E D I A The Identity of Everything psi.ontopedia.net Mappings (DCMES) The 15 core DCMES elements map to: A. Names: title because it is clearly a naming property B. Occurrences: date, description, identifier, rights because their values consist of descriptive text or structured data types C. Associations: contributor, coverage, creator, format, language, publisher, relation, source, subject, type because associations are the default...

O N T O P E D I A The Identity of Everything psi.ontopedia.net DCinTM – CTM example #prefix and template declarations omitted *myTM # named wildcard creates reference-able item identifier – title: "Norwegian Opera" creator: Huchu date: description: "A topic map about Norwegian opera, originally created as a term assignment for the University of Oslo course MUS2420." identifier: language: English rights: "Copyright (c) 2007, Ontopedia" subject: Norwegian_Opera type: topic_map N  A  O  A  O  A 

O N T O P E D I A The Identity of Everything psi.ontopedia.net Elements and Element Refinements A set of 40 additional elements and refinements – abstract, accessRights, accrualMethod, accrualPeriodicity, accrualPolicy, alternative, audience, available, bibliographicCitation, conformsTo, created, dateAccepted, dateCopyrighted, dateSubmitted, educationLevel, extent, hasFormat, hasPart, hasVersion, instructionalMethod, isFormatOf, isPartOf, isReferencedBy, isReplacedBy, isRequiredBy, issued, isVersionOf, license, mediator, medium, modified, provenance, references, replaces, requires, rightsHolder, spatial, tableOfContents, temporal, valid Each element has up to 8 properties – URI, Label, Definition, Comment, Type of Term, Refines, Status, Date Issued The URI is an identifier, e.g. – (dct:abstract)

O N T O P E D I A The Identity of Everything psi.ontopedia.net Mappings (Other Elements, etc.) The 40 elements map to A. Names: alternative because it is clearly a naming property B. Occurrences: abstract, accessRights, available, bibliographicCitation, created, dateAccepted, dateCopyrighted, dateSubmitted, educationLevel, extent, instructionalMethod, issued, license, medium, modified, provenance, tableOfContents, valid because their values consist of descriptive text or structured data types C. Associations: accrualMethod, accrualPeriodicity, accrualPolicy, audience, conformsTo, hasFormat, hasPart, hasVersion, isFormatOf, isPartOf, isReferencedBy, isReplacedBy, isRequiredBy, isVersionOf, mediator, references, replaces, requires, rightsHolder, spatial, temporal because associations are the default...

O N T O P E D I A The Identity of Everything psi.ontopedia.net Encoding Schemes A set of 18 encoding schemes: – Box, DCMIType, DDC, IMT, ISO3166, ISO639-2, LCC, LCSH, MESH, NLM, Period, Point, RFC1766, RFC3066, TGN, UDC, URI, W3CDTF Each term has up to 8 properties – URI, Label, Definition, See, Type of Term, Qualifies, Status, Date Issued – See property references documentation – Qualifies property specifies which property (or properties) the encoding scheme is intended to be used with The URI is an identifier, e.g. – (dct:box)

O N T O P E D I A The Identity of Everything psi.ontopedia.net Mappings (Encoding Schemes) Each encoding scheme is represented by a topic Qualifiers for occurrence properties: – DCMI Period, W3C-DTF (Date), URI (Identifier) ► Datatype of the occurrence value Qualifiers for association properties: – DCMI Box (Spatial), DCMI Period (Temporal), DCMI Point (Spatial), DCMI Type Vocabulary (Type), DDC (Subject), IMT (Format), ISO 3166 (Spatial), ISO (Language), LCC (Subject), LCSH (Subject), MeSH (Subject), NLM (Subject), RFC 1766 (Language), RFC 3066 (Language), TGN (Spatial), UDC (Subject), URI (Relation, Source), W3C-DTF (Temporal) – ► belongs_to association between the “value” and the encoding scheme

O N T O P E D I A The Identity of Everything psi.ontopedia.net The DCMI Type Vocabulary A set of 12 media types: – Collection, Dataset, Event, Image, InteractiveResource, MovingImage, PhysicalObject, Service, Software, Sound, StillImage, Text Each term has up to 9 properties – Broader Than, Comment, Date Issued, Definition, Label, Narrower Than, Status, Type of Term, URI – Broader Than & Narrower Than properties apply to Image, MovingImage and StillImage

O N T O P E D I A The Identity of Everything psi.ontopedia.net Mappings (Type Vocabulary) Each type is represented as a topic These topics are candidates for the “value” role in dc:type associations

O N T O P E D I A The Identity of Everything psi.ontopedia.net Topic Maps included in the TR The TR includes two topic maps in annexes: – Dublin Core Topic Map (normative) “This annex specifies a topic map that contains the topic declarations necessary in order to express Dublin Core metadata using Topic Maps in accordance with this Technical Report.” One topic per element or term, plus topics for the role types resource and value For inclusion in other topic maps – DCMT Topic Map (informative) “This annex specifies a topic map that encodes the core content of [DCMT], including definitions and explicit relationships between elements.” Fully browsable representation of the official specification DCMI Metadata Terms For ease of reference

O N T O P E D I A The Identity of Everything psi.ontopedia.net RELAX-NG schemas Two RELAX-NG schemas planned To be included when the TR is stable 1. Loose schema – “allow the semantic validation of Topic Maps representations of Dublin Core metadata that conform to the requirements of this Technical Report.” 2. Strict schema – “allow the semantic validation of Topic Maps representations of Dublin Core metadata that follow recommended best practice as well as conforming to the requirements of this Technical Report.”

O N T O P E D I A The Identity of Everything psi.ontopedia.net Issues Identifiers Simplicity vs. naturalness Strict vs. loose

O N T O P E D I A The Identity of Everything psi.ontopedia.net The Issue of Identifiers Note that dc:identifier is represented as an occurrence – Subject identifiers or subject locators would actually be more “topic-mappish” – But how would we know when to use which? – Dublin Core (like RDF) does not make the distinction between direct and indirect addressing – The only solution is to use occurrences as the default and allow either a subject identifier or a subject locator to be specified in addition, if the nature of the resource in question is known.

O N T O P E D I A The Identity of Everything psi.ontopedia.net Simplicity vs. naturalness We could have chosen a simpler route – “Just use occurrences for everything” Why didn’t we? – Partly because we wanted to leverage the built-in semantics of the Topic Maps model distinction between names, occurrences, and associations this is more “topic mappish” and more intuitive – Mostly because we want to encourage the adoption of a subject-centric approach wherever possible This means using associations, not occurrences, wherever it makes sense

O N T O P E D I A The Identity of Everything psi.ontopedia.net Strict vs. loose This is also why we chose the more draconian approach: – “Thou SHALT NOT use occurrences” We could have given the choice: – Either an association creator: huchu – or an occurrence creator: "Huchu" But we didn’t. The final short example will hopefully explain why.

O N T O P E D I A The Identity of Everything psi.ontopedia.net A Blog Entry (2 legs) *e21 isa: entry - title: "About Dublin Core in Topic Maps" date: subject: "Subject-centric computing" subject: "Dublin Core" creator: "Steve Pepper" contributor: "Dmitry Bogachev" description: "Topic Maps definitely needs Dublin Core, but the reverse also holds..."

O N T O P E D I A The Identity of Everything psi.ontopedia.net A Blog Entry (4 legs) *e41 isa: entry - title: "About Dublin Core in Topic Maps" date: subject: Subject-centric_computing subject: Dublin_Core creator: Steve_Pepper contributor: Dmitry_Bogachev description: "Topic Maps definitely needs Dublin Core, but the reverse also holds..."

O N T O P E D I A The Identity of Everything psi.ontopedia.net 2- and 4-legged merging (demo) Demo uses second entry in 2- and 4-legs versions – 2 legs: occurrences only – 4 legs: associations where appropriate 2-legged and 4-legged merging shown w/ Omnigator – files: 2legs1.ltm, 2legs2.ltm, 4legs1.ltm, 4legs2.ltm – location: Conclusion: Two legs bad, Four legs good!

O N T O P E D I A The Identity of Everything psi.ontopedia.net The Subject Centric Advantage Explicit identification of subjects – Enables collation of information based on what it is about This is what end-users (mostly) want – They are interested in a subject and want to find out more about it – If they could go to one place and find “everything” there is to know, the problem of infoglut would be under control This is the goal – It is the central message of subject centric computing

O N T O P E D I A The Identity of Everything psi.ontopedia.net References ISO/IEC DTR 29111: 2007: Information technology – Topic Maps – Expressing Dublin Core Metadata using Topic Maps; International Organization for Standardization: Geneva, 2007, Pepper, S.; Schwab, S. Curing the Web’s Identity Crisis: Subject Indicators for RDF; Ontopia: 2003, Pepper, S. The Case for Published Subjects; Ontopia: 2006, AToMS 2007; Kyoto, Japan, December Topic Maps 2008; Oslo, Norway, April