A Web-Based Resource Model for eScience: Object Reuse & Exchange 2008 Microsoft eScience Conference Indianapolis, December 8, 2008.

Slides:



Advertisements
Similar presentations
Adding OAI-ORE Support to Repository Platforms Alexey Maslov, Adam Mikeal, Scott Phillips, John Leggett, Mark McFarland Texas Digital Library TCDL09.
Advertisements

Interoperability and semantics in RDF representations of FRBR, FRAD and FRSAD Gordon Dunsire Presented at the Cologne Conference on Interoperability and.
OMV Ontology Metadata Vocabulary April 10, 2008 Peter Haase.
Resource description and access for the digital world Gordon Dunsire Centre for Digital Library Research University of Strathclyde Scotland.
Enhanced Publications Presentation for ODaF Europe 2009 Thomas Place 2 April 2009.
DLM-Forum - Barcelona, 7-8 May 2002 Promoting and Supporting Open Archives in Europe: The Open Archives Forum Project Donatella Castelli IEI-CNR
Digital Repositories – Linked Open Data – the possible Role of D4Science Workshop, December 2010, FAO use cases A tool to create Linked Data providers.
UKOLN is supported by: JISC Information Environment update Repositories and Preservation Programme meeting, October 24-25, 2006 Rachel Heery UKOLN
Andy Powell, Eduserv Foundation Feb 2007 The Dublin Core Abstract Model – a packaging standard?
W3C and RDF. Why OCLC is a W3C Member Access to networked information resources –the browser and online access –the breath and depth of networked information.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Creating Linked Data Juan F. Sequeda Semantic Technology Conference June 2011.
A centre of expertise in digital information management UKOLN is supported by: If you don’t remember anything else, remember these… Peter.
Metadata Descriptions statements descriptions records.
Semantic Web Thanks to folks at LAIT lab Sources include :
Developing a Metadata Exchange Format for Mathematical Literature David Ruddy Project Euclid Cornell University Library DML 2010 Paris 7 July 2010.
E © 2002 Dario Aganovic Resource Description Framework Schema (RDFS) Dario Aganovic Industrial PhD-student NPI Production Kista, Ericsson AB and Production.
RDFa: Embedding RDF Knowledge in HTML Some content from a presentation by Ivan Herman of the W3c, Introduction to RDFa, given at the 2011 Semantic Technologies.
OCLC Research TAI CHI Webinar 5/27/2010 A Gentle Introduction to Linked Data Ralph LeVan Sr. Research Scientist OCLC Research.
Semantic Web Introduction
Repositories: Disruptive Technology or Disrupted Technology? Sandy Payette, Executive Director DORSDL Workshop at ECDL 2008 September 2008.
Object Re-Use and Exchange Mellon Retreat, Nassau Inn, Princeton, NJ, March Herbert Van de Sompel, Carl Lagoze The OAI Object Re-Use & Exchange.
UKOLN is supported by: OAI-ORE a perspective on compound information objects ( Defining Image Access.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
UKOLN is supported by: A non-technical introduction to: OAI-ORE ( Defining Image Access project meeting.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
Open Annotation Collaboration Rob Sanderson, Herbert Van de Sompel DMSS Meeting, May 14-15, Stanford, CA Robert Sanderson –
UKOLN is supported by: OAI-ORE : Object Reuse and Exchange an introduction ( UKOLN staff seminar UKOLN,
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
The Open Archives Initiative Simeon Warner (Cornell University) Open Archives seminar “Facilitating Free and Efficient Scientific.
The Open Archives Initiative Simeon Warner Cornell University, Ithaca, NY, USA CREPUQ 2002, Montréal, Canada 14:00, 24 October 2002.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
The OAI-ORE based data model of Europeana and the Digital Public Library of America: implications for educational publishing Dov Winer MAKASH – Advancing.
Interoperability through Library APIs Library Technology Services Open House 7/30/15.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Metadata Modularization Concepts and Tools Carl Lagoze CS
Interoperability Fundamentals: OAI-PMH and OAI-ORE SUETr Interoperability Event 9 th December 2008 London School of Economics Library Dr Robert Sanderson.
RDF, XML and interoperability Managing networks : understanding new technologies, Birmingham, 13 September 2001 Pete Johnston UKOLN, University of Bath.
Antoine Isaac 1 st PRELIDA Workshop Pisa, June 26, 2013.
The OAI: overview and historical context OAI Open Meeting – Washington DC – January 23 rd 2001 Herbert Van de Sompel & Carl Lagoze Cornell University --
Ricardo Pereira Software Engineer TDWG Infrastructure Project (TIP)
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
RELATORS, ROLES AND DATA… … similarities and differences.
Breakout session OAI The future of scholarly communication: Enhanced Publications Saskia Woutersen University of Amsterdam.
An Update on the OAI-ORE Project CNI Spring 2007 Task Force Meeting, Phoenix AZ, April 17, 2007 Lagoze, Nelson & Van de Sompel An Update on the Open Archives.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
Metadata and Technology/Architecture Working Groups DLF Aquifer Project DLF Fall Forum Providence, RI November 14, 2008.
OAI Object Reuse & Exchange: Atom Serialization Nordbib Workshop, September , Stockholm, Sweden OAI-ORE: Atom Serialization The ORE Editors are:
EDM Europeana Data Model Guus Schreiber with input from Carlo Meghini, Antoine Isaac, Stefan Gradmann, Maxx Dekkers et al. from Europeana V1.
Open Archives Initiative Gail McMillan Digital Library and Archives, Virginia Tech Society for Scholarly Publishing: June 1, 2000.
KAnOE: Research Centre for Knowledge Analytics and Ontological Engineering Managing Semantic Data NACLIN-2014, 10 Dec 2014 Dr. Kavi Mahesh Dean of Research,
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
1 RDF, XML & interoperability Metadata : a reprise Communities, communication & XML An introduction to RDF RDF, XML and interoperability.
Carl Lagoze Digital Library Service Registry Workshop Services in a Scholarly Communication Framework.
CNI Task Force Meeting April 7, 2008 OAI-ORE Project Briefing David Reynolds Tim DiLauro Sayeed Choudhury Library Digital Programs Sheridan Libraries Johns.
RDFa Primer Bridging the Human and Data webs Presented by: Didit ( )
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
OAI Object Reuse & Exchange: Discovery Nordbib Workshop, September , Stockholm, Sweden OAI-ORE: Discovery The ORE Editors are: Carl Lagoze (Cornell.
Linked Data Publishing on the Semantic Web Dr Nicholas Gibbins
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Subjects in the FR family
Linked Data Web that can be processed by machines
Systems for scholarly communication
Jenn Riley Metadata Librarian Digital Library Program
An Architecture for Complex Objects and their Relationships
An OAI-ORE Aggregation for the National Virtual Observatory
Introduction to Digital Libraries Week 13: Reference Linking & OpenURL
Jenn Riley Metadata Librarian Digital Library Program
Presentation transcript:

A Web-Based Resource Model for eScience: Object Reuse & Exchange 2008 Microsoft eScience Conference Indianapolis, December 8, 2008

OAI-ORE Editors Carl Lagoze o Cornell University Herbert Van de Sompel o Los Alamos National Laboratory Pete Johnston o Eduserv Foundation Michael Nelson o Old Dominion University Rob Sanderson o University of Liverpool Simeon Warner o Cornell University

Joint work with …

OAI Object Reuse and Exchange: Support The Andrew W. Mellon Foundation The Coalition for Networked Information Joint Information Systems Committee Microsoft Corporation The National Science Foundation

OAI Object Reuse and Exchange Subject: Aggregations of Web resources Approach: Publish Resource Maps to the Web that Instantiate, Describe, and Identify Aggregations

Instantiate, Describe, and Identify Aggregations Aggregations

At one time it was possible to convey all scientific information about a topic in a single convenient medium. Babylonian Astronomical Catalogue

Aggregations But quickly the limitations of that medium became obvious. textdata 1857 Astrophysics paper

Aggregations Those limitations seem to live on.

Aggregations Solving the problem with ad hoc methods. Photo plate kept separate from text (digitized version of original plate shown) text 1890 Astrophysics paper

Hubble optical observation Baltimore, MD Basic object information Strasbourg, France Aggregations Objects of interest in eScience are by nature compound. text 2006 Astrophysics paper X-MM-Newton X-ray observation Vilspa, Spain Chandra X-ray observation Cambridge, MA A1795

Aggregations! FormatsVersionsIdentifiersRelationshipsSplash page

Object Reuse and Exchange: A Web-Centric Approach The Web Architecture as the platform for interoperability De-facto integration with existing Web applications Potential of adoption by other communities Potential of tools created by other communities Incorporating the social web (Web 2.0) in eScience

Foundations of OAI-ORE o Web Architecture - o Semantic Web, RDF - o Linked Data o Cool URIs for the Semantic Web -

W3C Web Architecture Resource URI Representation 2 Represents Representation 1 Represents Identifies Content Negotiation The tools we have to solve the interoperability problem are: Resource URI Representation

Semantic Web The tools we have to solve the interoperability problem are: URI RDF Vocabularies Semantic Web URIRDF Vocabularies

Linked Data Linked Data principles: 1. Use URIs as names for things. 2. Use HTTP URIs so that people can look up those names. 3. When someone looks up a URI, provide useful information. 4. Include links to other URIs. So that they can discover more things.

OAI Object Reuse and Exchange: The Approach Subject: Aggregations of Web resources Approach: Publish Resource Maps to the Web that Instantiate, Describe, and establish identity of Aggregations Approach: Instantiate Aggregations as Resources with unique URIs on the Web

An Aggregation and the Web Resources of an Aggregation are distinct URI-identified Web resources Missing are: o The boundary that delineates the Aggregation in the Web o An identity (URI) for the Aggregation

Publish a Resource Map to the Web

The Resource Map Describes the Aggregation

The Resource Map and the Aggregation integrate into the Web

ORE Data Model

We want to have our cake and to eat it too (don't we all?): o ORE should be simple and easy to use without deep understanding -Use simple tools and rules to create Atom Resource Maps o ORE should have well crafted data model that enables interoperability through well defined semantics -Separate design from implementation -Future-proof ORE – today's technologies will be replaced (even HTTP?) -Don't need to understand Data Model fully to do ORE

Aggregation: Resource that is a set of resources This resource is an Aggregation This resource is an Aggregated Resource A Relationship defined in the ORE vocabulary

Resource Map: Describes an Aggregation: This resource is a Resource Map Resource Map Serialization The resource has a representation HTTP GET ore:isDescribedBy Implied as inverse of describes

Based on Resource Description Framework (RDF) Resource #1 Resource #2 relatedTo Resource #3 relatedTo hasChapter follows SubjectPredicateObject R1hasChapterR2 R1hasChapterR3 followsR2 R1createdByCarl Lagoze Triples Carl Lagoze createdBy RDF model – multiple serializations: RFD/XML, Atom, RDFa

Recommend use if HTTP URIs HTTP is technology of today's web Want to be able to cite of refer to Aggregation but get Resource Map describing it o Follow Linked Data strategies to link: access URI-A, get redirected to URI-R (HTTP 303) or simple # URI o Provides notion of Authority Multiple Resource Maps o An Aggregation MAY be asserted and described by multiple Resource Maps o The purpose of multiple Resource Maps is to provide descriptions of the Aggregation in multiple serializations (e.g., Atom, RDF/XML, RDFa, etc.) o Each Resource Map MUST have only one representation

Authority o Authoritative Resource Maps o Get to Resource Map via Aggregation, usually created by same authority o Multiple: MUST be minimally equivalent (same Aggregated Resources and Proxies), SHOULD assert mutual existence o Non-authoritative Resource Maps o Best practice is to not create them o Assert your own Aggregation instead o Use rdfs:seeAlso to assert relationship between two Aggregation

Multiple Resource Maps Atom RDFa Atom RDF/XML ore:describes These are authoritative Resource Maps These are non-authoritative Resource Maps

Not much else

Dont overload URI-A These resources mean something already. Dont use one URI for multiple information objects.

Association with another resource/identifier

Adding other properties to the core The ReM makes the assertions Metadata about the ReM Metadata about the Aggregation Required

Asserting other Relationships Aggregation is a journal Aggregation has another version A Aggregated Resources are articles AR-3 is by Stephen Hawking The ReM makes the assertions Assertions about the Aggregation. Assertions about Aggregated Resources.

Limits of Assertions thus Far The meaning of an RDF triple is independent of the context in which it is stated Think of the difference: o Carl is a man o Carl is visiting Indianapolis All the triples described thus far are context independent o Therefore they can have the URI of an aggregated resource as subject or object o But remember that is just the URI of the Resource and is not exclusive of it being an Aggregated Resource Introduce proxy URI

Proxy: Stands for resource in context of other resource hasNext might have meaning only in context

lineage: this came from Reuse of data set AR-1 in Aggregation A-2. ore:lineage predicate expressed origin or provenance of data. Needs proxies because statement depends on contexts

ORE Deployment

arXiv.org: ORE possibilities arXiv is an e-print archive of 500k scholarly articles Express: Structure of arXiv: archives, sub-categories, articles Versioning: article (concept) and specific versions and formats Articles by Joe Smith – somewhat like a result set Constituents of an article (metadata, PDF, source, video, data, extracted references) Describe internal and external components (e.g. external video associated with article but on Perimeter Institute server) Use as part of workflow for ingest – assembly of components, possible combination with SWORD

SCOPE Architecture