Dublin Core, OAI-PMH and the eBank UK schema
Monica Duke, UKOLN, University of Bath, UK

Presentation transcript:

Dublin Core, OAI-PMH and the eBank UK schema
Monica Duke, UKOLN, University of Bath, UK
eBank UK workshop on Chemistry schemas, University of Bath, 18th February 2005

2 Contents
- Whirlwind guide to DC
- DC abstract model
- Encoding in XML
- OAI-PMH and the eBank UK project
- eBank UK XML schema definition
- Crystallography suggestions
Note: you are going to see some angle brackets

3 Acknowledgement
- Andy Powell (UKOLN), for donation of slides
- Recommendation: Dublin Core Conference Tutorials (Oct 04) ntations/ecdl-2004/dc-tutorial/

4 Bluffer's guide to DC
1. DC is short for Dublin Core
2. a simple metadata standard, supporting cross-domain resource discovery
3. the original focus was on Web resources, but that is no longer the case, e.g. it is used to describe physical artefacts in museums
4. current usage spans a wide range of sectors: academic, e-government, museums, libraries, business, semantic Web

5 Bluffer's guide to DC
5. simple DC provides 15 elements (metadata properties):
   dc:title        dc:contributor  dc:source
   dc:creator      dc:date         dc:language
   dc:subject      dc:type         dc:relation
   dc:description  dc:format       dc:coverage
   dc:publisher    dc:identifier   dc:rights
6. multiple encoding syntaxes, including HTML meta tags, XML and RDF/XML (XML schemas are available)

6 Bluffer's guide to DC
7. relatively slow programme of adding new terms to qualified DC:
   - new elements (e.g. dcterms:audience)
   - element refinements (e.g. dcterms:dateCopyrighted)
   - encoding schemes (e.g. dcterms:LCSH and dcterms:W3CDTF)
   - 48 elements and 17 encoding schemes

7 Bluffer's guide to DC
8. DC can be embedded into HTML pages, but almost none of the big search engines will use it. Why? Lack of trust:
   - meta-spam
   - meta-crap
   however, embedding DC in HTML may be worthwhile if your own site search engine uses it
9. simple DC nevertheless forms the baseline metadata format for the OAI protocol

8 Important DCMI documents
- DCMI Abstract Model (DRAFT)
- Expressing Dublin Core in HTML/XHTML meta and link elements
- Guidelines for implementing Dublin Core in XML
- Expressing Simple Dublin Core in RDF/XML
- Expressing Qualified Dublin Core in RDF/XML
- Namespace Policy for the DCMI
- DCMI Metadata Terms

9 Abstract models for DC

10 Why an abstract model?
Before we start creating DCMI descriptions we need to understand:
- what kinds of things we want to say about resources
- the DCMI view of the world/resources we want to describe (the DCMI resource model)
- the DCMI view of the descriptions we make about that world (the DCMI description model)
Together these are known as the DCMI abstract model. A simplified view is presented here.

11

12 What is a resource?
The W3C/IETF definition of a resource is "anything that has identity. Familiar examples include an electronic document, an image, a service (e.g., "today's weather report for Los Angeles"), and a collection of other resources. Not all resources are network "retrievable"; e.g., human beings, corporations, and bound books in a library can also be considered resources."
i.e. a resource is anything:
- physical things (books, cars, people)
- digital things (Web pages, digital images)
- conceptual things (colours, points in time)

13 DC and resources
- but this seems to be too wide for the things we can describe with DC
- can we really describe people using DC? do people have titles and subjects? no…
- in general we only use DC to describe a sub-set of all resources: anything covered by the DCMIType list
- Collection, Dataset, Event, Image (Still or Moving), Interactive Resource, Service, Software, Sound, Text, Physical Object

14 DCMI resource model
- each resource that we want to describe has zero or more properties
- a property is a specific aspect, characteristic, attribute or relation used to describe a resource
- each property has one or more values
- each value is a resource (the physical or conceptual entity that is associated with a property when it is used to describe a resource)

15 DCMI description model
- a description is made up of:
  - one or more statements (about one, and only one, resource)
  - zero or one resource URI (a URI reference that identifies the resource being described)
- each statement is made up of:
  - a property URI (that identifies a property)
  - zero or one value URI (that identifies a value of the property)
  - zero or one encoding scheme URI (that identifies the class of the value)
  - zero or more value representations of the value

16 DCMI description model (2)
- each property is an attribute of the resource being described
- each property URI may be repeated in multiple statements
- the value representation may take the form of a value string, a rich value or a related description
- a value string is a simple human-readable string
- a value string may have an associated language (e.g. en-GB)

17 The 1:1 principle
- notice that the model indicates that each property used in a description must be an attribute of the resource being described
- this is commonly referred to as the 1:1 principle: a DCMI metadata description describes one, and only one, resource
- however…

18 Description sets
- real-world metadata applications tend to be based on loosely grouped sets of descriptions (where the described resources are typically related in some way), known here as description sets
- for example, a description set might comprise descriptions of both a painting and the artist

19 DCMI records
- description sets are instantiated, for the purposes of exchange between software applications, in the form of metadata records
- each record conforms to one of the DCMI encoding guidelines (XHTML meta tags, XML, RDF/XML, etc.)
[Figure labels: a document; andy powell]

20 Model summary
record (encoded as XHTML, XML or RDF/XML)
  description set
    description (about a resource (URI))
      statement
        property (URI)
        value (URI)
        representation: value string OR rich value OR related description
        vocabulary encoding scheme (URI)
        syntax encoding scheme (URI)
        language (e.g. en-GB)
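The description model summarised above can be sketched as a few Python data classes. This is only an illustration of the cardinalities stated on the slides, not an official DCMI data binding: the class names, the example URIs under example.org and the title "Sunflowers" are all assumptions, and rich values and related descriptions are left out for brevity.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ValueString:
    """A simple human-readable string, optionally tagged with a language."""
    text: str
    language: Optional[str] = None  # e.g. "en-GB"


@dataclass
class Statement:
    """One statement: a property URI plus optional value information."""
    property_uri: str                           # identifies the property
    value_uri: Optional[str] = None             # zero or one
    encoding_scheme_uri: Optional[str] = None   # zero or one
    value_strings: List[ValueString] = field(default_factory=list)  # zero or more


@dataclass
class Description:
    """Statements about one, and only one, resource (the 1:1 principle)."""
    statements: List[Statement]
    resource_uri: Optional[str] = None          # zero or one

    def __post_init__(self) -> None:
        # The slides require one or more statements per description.
        if not self.statements:
            raise ValueError("a description needs at least one statement")


DC = "http://purl.org/dc/elements/1.1/"

# Hypothetical description of a painting; the artist would get a separate
# description, and the two together would form a description set.
painting = Description(
    resource_uri="http://example.org/painting/1",
    statements=[
        Statement(DC + "title", value_strings=[ValueString("Sunflowers", "en-GB")]),
        Statement(DC + "creator", value_uri="http://example.org/artist/1"),
    ],
)
```

Keeping the artist as a value URI rather than a nested set of properties is what keeps the painting's description about a single resource.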

21 Simple and qualified
A simple DC record is a record that:
- conforms to the abstract model
- comprises only a single description
- uses only the 15 properties in the Dublin Core Metadata Element Set
- makes no use of value URIs, encoding schemes, rich values or related descriptions

23 Qualified DC model
A qualified DC record is a record that:
- conforms to the DCMI abstract model
- contains at least one property taken from the DCMI Metadata Terms recommendation

24 A couple of notes…
- everything in DC is optional
- dumb-down: the process of translating a qualified DC metadata record into a simple DC metadata record
  - informed dumb-down
  - uninformed dumb-down
- …
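A rough sketch of the two dumb-down variants, under the assumption that "informed" means the software knows which element each refinement refines, while "uninformed" software simply ignores any property outside the 15 simple DC elements. The refinement table below is a deliberately tiny hand-picked subset, and the function names and sample values are hypothetical.

```python
# The 15 elements of simple DC.
SIMPLE_DC = {
    "dc:title", "dc:creator", "dc:subject", "dc:description", "dc:publisher",
    "dc:contributor", "dc:date", "dc:type", "dc:format", "dc:identifier",
    "dc:source", "dc:language", "dc:relation", "dc:coverage", "dc:rights",
}

# Illustrative subset only: element refinement -> the element it refines.
REFINES = {
    "dcterms:dateCopyrighted": "dc:date",
    "dcterms:abstract": "dc:description",
    "dcterms:alternative": "dc:title",
}


def informed_dumb_down(statements):
    """Replace known refinements by the refined element, then drop the rest."""
    result = []
    for prop, value in statements:
        prop = REFINES.get(prop, prop)
        if prop in SIMPLE_DC:
            result.append((prop, value))
    return result


def uninformed_dumb_down(statements):
    """With no knowledge of refinements, keep only simple DC properties."""
    return [(prop, value) for prop, value in statements if prop in SIMPLE_DC]


qualified = [
    ("dc:title", "Crystal structure report"),
    ("dcterms:dateCopyrighted", "2005"),
    ("dcterms:audience", "chemists"),  # a new element, not a refinement
]
simple_informed = informed_dumb_down(qualified)
simple_uninformed = uninformed_dumb_down(qualified)
```

In this sketch the copyright date survives informed dumb-down as a plain dc:date, while dcterms:audience is lost either way because it has no simple DC counterpart.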

25 Encoding DC in XML

26 DCMI recommendations
- for the full details: Guidelines for implementing Dublin Core in XML
- nine recommendations for encoding DC in XML

27 General recommendations
- implementers should base their XML applications on XML Schemas rather than XML DTDs
- use of upper and lower case in property names and encoding schemes, e.g. property names for the 15 DCMES elements should be lower-case

28 Properties and values implementers should encode properties as XML elements and values as the content of those elements the name of the XML element should be an XML qualified name (QName) of the property Dublin Core in XML do not use constructs like

29 Repeating properties
- multiple value strings should be encoded by repeating the XML element for that property
- example values: First title; Second title

30 Value string language
- where the language of the value is indicated, it should be encoded using the xml:lang attribute
- example values: seafood; fruits de mer
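The XML markup for the three preceding slides did not survive in this transcript, so the following is a hedged reconstruction rather than the original slide content. The values (First title, Second title, seafood, fruits de mer) come from the slides; the element names dc:title and dc:subject, the language tags "en" and "fr", and the container element name "metadata" are assumptions made for this sketch.

```python
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"
XML_NS = "http://www.w3.org/XML/1998/namespace"

# Properties are XML elements named by QName; values are element content.
# Repeated values repeat the element; languages use xml:lang.
SAMPLE = """\
<metadata xmlns:dc="http://purl.org/dc/elements/1.1/">
  <dc:title>First title</dc:title>
  <dc:title>Second title</dc:title>
  <dc:subject xml:lang="en">seafood</dc:subject>
  <dc:subject xml:lang="fr">fruits de mer</dc:subject>
</metadata>
"""

root = ET.fromstring(SAMPLE)

# ElementTree addresses namespaced names as "{namespace-uri}local-name".
titles = [el.text for el in root.findall(f"{{{DC_NS}}}title")]
subjects = {
    el.get(f"{{{XML_NS}}}lang"): el.text
    for el in root.findall(f"{{{DC_NS}}}subject")
}
```

Because each value is element content rather than an attribute, a consumer can read every property the same way regardless of which property it is.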

31 Container elements
- note that it is anticipated that records will be encoded within one or more container XML element(s) of some kind
- candidate container element names include [...]

32 Element refinements
- element refinements should be treated in the same way as other properties
- the name of the XML element should be an XML qualified name (QName): [...]
- do not use any of the following: [...]

33 Encoding schemes
- encoding schemes should be implemented using the 'xsi:type' attribute of the XML element for the property
- the name of the encoding scheme should be given as the attribute value, and should be in the form of an XML qualified name (QName): [...]
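The examples for element refinements and encoding schemes are also missing from the transcript. The sketch below shows one way the two recommendations could look together, assuming a refinement is written as its own element (dcterms:dateCopyrighted) and an encoding scheme as an xsi:type QName. The container name "metadata", the date and the subject heading are illustrative assumptions, not values from the slides.

```python
import xml.etree.ElementTree as ET

XSI_NS = "http://www.w3.org/2001/XMLSchema-instance"

SAMPLE = """\
<metadata xmlns:dc="http://purl.org/dc/elements/1.1/"
          xmlns:dcterms="http://purl.org/dc/terms/"
          xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <dcterms:dateCopyrighted xsi:type="dcterms:W3CDTF">2005-02-18</dcterms:dateCopyrighted>
  <dc:subject xsi:type="dcterms:LCSH">Crystallography</dc:subject>
</metadata>
"""

root = ET.fromstring(SAMPLE)

# Collect (element name, encoding scheme QName, value) for each property.
# ElementTree keeps the xsi:type value as a plain string; it does not
# resolve the "dcterms" prefix inside the attribute value.
properties = [
    (el.tag, el.get(f"{{{XSI_NS}}}type"), el.text)
    for el in root
]
schemes = [scheme for _, scheme, _ in properties]
```

A harvester that does not recognise a scheme can still fall back to the element content as a plain value string.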

34 OAI-PMH

35 OAI-PMH
- OAI Protocol for Metadata Harvesting
- simple protocol for sharing metadata records between applications
- currently at version 2.0
- based on HTTP, XML, XML Schema and XML namespaces
- allows a harvester to ask a remote repository for some or all of its metadata records
- where "some" is based on date-stamps, sets and metadata formats

36 OAI-PMH (2)
- OAI-PMH carries only metadata
- content (e.g. full text or image) is made available separately, typically at a URL given in the metadata
- simple DC is the default (mandatory) record format
- supports any record format, provided it can be encoded using XML (e.g. DC, IMS, MARC, ODRL, …)
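To make selective harvesting concrete, here is a minimal sketch of how a harvester might build a ListRecords request. The verb and the metadataPrefix, from, until and set arguments are defined by OAI-PMH 2.0, and oai_dc is the prefix for the mandatory simple DC format; the base URL, the set name and the helper function itself are hypothetical.

```python
from typing import Optional
from urllib.parse import urlencode


def list_records_url(
    base_url: str,
    metadata_prefix: str = "oai_dc",
    from_date: Optional[str] = None,
    until_date: Optional[str] = None,
    set_spec: Optional[str] = None,
) -> str:
    """Build an OAI-PMH ListRecords request URL for selective harvesting."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    # Optional selection criteria: date-stamp range and set membership.
    if from_date:
        params["from"] = from_date
    if until_date:
        params["until"] = until_date
    if set_spec:
        params["set"] = set_spec
    return f"{base_url}?{urlencode(params)}"


# Hypothetical repository and set name, for illustration only.
url = list_records_url(
    "http://example.org/oai",
    from_date="2005-01-01",
    set_spec="crystallography",
)
```

Leaving out all three optional arguments asks the repository for every record it holds in the given format.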

37 OAI-PMH model
[Diagram labels: resource (Crystal Structure Report); item; OAI-PMH identifier = entry point to all records pertaining to the resource; records: Dublin Core metadata, MARC metadata; jump-off page (HTML)]
Model adapted from:

38 Data flow in eBank UK
[Diagram labels: Submit; Store/link; Data files; Metadata; present HTML; Institutional repository; OAI-PMH Harvest (XML); Index and Search; present HTML; eBank aggregator; create]

39 OAI-PMH model
[Diagram labels: resource (Crystal Structure Report); item; OAI-PMH identifier = entry point to all records pertaining to the resource; records: Dublin Core metadata, IMS metadata; jump-off page (HTML); Linking; Dublin Core metadata: type, date created; 1:1 principle]

40 OAI-PMH outline record oai:ecrystals.chem.soton.ac.uk: D

41 OAI-PMH outline record

42 OAI-PMH outline record
<!-- Need a wrapper here -->
<!-- insert end wrapper here -->
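The outline record itself is missing from the transcript. As a stand-in, the sketch below uses the general OAI-PMH 2.0 record shape (a header carrying an identifier and datestamp, followed by a metadata element) with a comment where the wrapper would go. The identifier and datestamp are made-up placeholders, not the eBank values, and a valid record would need a real child element inside metadata.

```python
import xml.etree.ElementTree as ET

OAI_NS = "http://www.openarchives.org/OAI/2.0/"

RECORD = """\
<record xmlns="http://www.openarchives.org/OAI/2.0/">
  <header>
    <identifier>oai:example.org:record-1</identifier>
    <datestamp>2005-02-18</datestamp>
  </header>
  <metadata>
    <!-- wrapper element and the packaged metadata would go here -->
  </metadata>
</record>
"""

record = ET.fromstring(RECORD)
ns = {"oai": OAI_NS}

identifier = record.findtext("oai:header/oai:identifier", namespaces=ns)
datestamp = record.findtext("oai:header/oai:datestamp", namespaces=ns)

# The default parser discards comments, so the placeholder metadata
# element has no child elements yet.
metadata = record.find("oai:metadata", ns)
has_payload = len(metadata) > 0
```

Whatever wrapper is chosen, it sits as the single child of the metadata element, which is why the choice of packaging format matters to harvesters.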

43 Wrapper choices
- invent our own
- re-use a packaging standard
- choice from MPEG_DIDL, METS, IMS
- METS in preliminary use:
  - free
  - innate support for DC
  - dig-lib currency
- increasing interest in packaging formats in the OAI-PMH community
For links see: tml

44 Using packaging
<!-- Need a wrapper here -->

45 OAI-PMH model
[Diagram labels: resource (Crystal Structure Report); item; OAI-PMH identifier = entry point to all records pertaining to the resource; records: Dublin Core metadata, METS metadata; jump-off page (HTML); Linking; Dublin Core metadata (eBank_dc); DC 1:1 principle; Crystal Structure; CIFDataset]

46 Anatomy of an eBank UK record

47 eBank_dc schema
Crystal Structure
Hursthouse, Michael B.
Coles, Simon J.
C14H22O6
(5,2 -Dimethyl-5 -oxo-octahydro-[2, 2 ]bifuranyl-5-yl)-hydroxy-acetic acid ethyl ester
HUZDEL
Organic
bin/getarticleid?issn= &volume=59&fpage=o501&details=yes

48 eBank_dc schema
Crystal Structure
Hursthouse, Michael B.
Coles, Simon J.
???????
C14H22O6
(5,2 -Dimethyl-5 -oxo-octahydro-[2, 2 ]bifuranyl-5-yl)-hydroxy-acetic acid ethyl ester
HUZDEL
Organic
bin/getarticleid?issn= &volume=59&fpage=o501&details=yes

49 eBank_dc schema
Crystal Structure
Hursthouse, Michael B.
Coles, Simon J.
C14H22O6
(5,2 -Dimethyl-5 -oxo-octahydro-[2, 2 ]bifuranyl-5-yl)-hydroxy-acetic acid ethyl ester
HUZDEL
Organic
bin/getarticleid?issn= &volume=59&fpage=o501&details=yes
- Add general subject terms, e.g. Chemistry, Crystallography
- Are there existing ontologies that can be re-used for general subject terms?

53 eBank_dc schema (cont.)
[Diagram labels: CIFDataset; METS metadata; Dublin Core metadata (ebank_dc)]

54 Suggestions
- it is useful to design two sets of metadata:
  - a core set expressible within the OAI-PMH Dublin Core manifestation
  - an extended set specific to value-adding agents
- use the OAI-PMH friends facility
- define OAI-PMH sets for crystallography data

55 Questions?