Afraz Jaffri, Hugh Glaser, Ian Millard Electronics and Computer Science University of Southampton.

Slides:



Advertisements
Similar presentations
On The Evolution of Terms
Advertisements

Presented to the ALCTS FRBR Interest Group, ALA Annual, 24 June 2011
May 23, 2004OWL-S straw proposal for SWSL1 OWL-S Straw Proposal Presentation to SWSL Committee May 23, 2004 David Martin Mark Burstein Drew McDermott Deb.
©euroCRIS/Keith G JefferyOA Workshop May 2010 CNR Roma The euroCRIS view of the Rome OA Workshop Keith G Jeffery President, euroCRIS
Chapter 1: The Database Environment
Author: Graeme C. Simsion and Graham C. Witt Chapter 12 Physical Database Design.
September, 2005What IHE Delivers 1 Key Image Notes Evidence Documents Simple Image & Numeric Report Access to Radiology Information IHE Vendors Workshop.
June 28-29, 2005IHE Interoperability Workshop 1 Integrating the Healthcare Enterprise Cross-enterprise Document Sharing for Imaging (XDS-I) Rita Noumeir.
Copyright Management for the LUISA Semantic Learning Content Management System Roberto García Universitat de Lleida, Spain Tomas Pariente ATOS Origin SAE,
OMV Ontology Metadata Vocabulary April 10, 2008 Peter Haase.
1 Ontolog OOR Use Case Review Todd Schneider 1 April 2010 (v 1.2)
…to Ontology Repositories Mathieu dAquin Knowledge Media Institute, The Open University From…
Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
Mirror Mirror on the wall does your repository reflect it all? Peter West and Timothy Miles-Board EPrints Services University of Southampton Southampton,
© Keith G Jeffery, Anne G S Asserson GL 11 Washington Keith G Jeffery Director, IT & International Strategy, STFC
1 An Update on XML.org Registry and Repository Una Kearns Documentum, Inc.
1 Web Search Environments Web Crawling Metadata using RDF and Dublin Core Dave Beckett Slides:
28 April 2004Second Nordic Conference on Scholarly Communication 1 Citation Analysis for the Free, Online Literature Tim Brody Intelligence, Agents, Multimedia.
Programs and Research Public Private Agreements for Mass Digitisation Ricky Erway JISC Digitisation Conference July 2007.
Copyright © 2006 Data Access Technologies, Inc. Open Source eGovernment Reference Architecture Approach to Semantic Interoperability Cory Casanave, President.
Ontological Resources and Top-Level Ontologies Nicola Guarino LADSEB-CNR, Padova, Italy
1 ICS-FORTH EU-NSF Semantic Web Workshop 3-5 Oct Christophides Vassilis Database Technology for the Semantic Web Vassilis Christophides Dimitris Plexousakis.
Digital Repositories – Linked Open Data – the possible Role of D4Science Workshop, December 2010, FAO use cases A tool to create Linked Data providers.
0 - 0.
Addition Facts
Copyright 2006 Digital Enterprise Research Institute. All rights reserved. MarcOnt Initiative Tools for collaborative ontology development.
A Semantic Web Browser for Supporting Open-Corpus Linking and Adaptive Hypermedia Melike Şah Intelligence, Agents and Multimedia Group School of Electronics.
RKBExplorer.com: A Knowledge Driven Infrastructure for Linked Data Providers Hugh Glaser, Ian C. Millard, Afraz Jaffri ESWC 2008 Demo.
Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009.
RKBExplorer: Repositories, Linked Data and Research Support Hugh Glaser, Ian Millard & Les Carr At Eprints User Group, Open Repositories 2009.
RKBExplorer Jean-Claude Laprie, Hugh Glaser, Ian Millard.
Prototype Knowledge Base: an on-line information service in dependability and security Hugh Glaser Electronics & Computer Science University of Southampton.
1/ 26 AGROVOC and the OWL Web Ontology Language: the Agriculture Ontology Service - Concept Server OWL model NKOS workshop Alicante,
UKOLN, University of Bath
An overview of collection-level metadata Applications of Metadata BCS Electronic Publishing Specialist Group, Ismaili Centre, London, 29 May 2002 Pete.
EBankII Workshop 1 Making Scientific Data Openly Available Simon Coles School of Chemistry, University of Southampton.
George Anadiotis, Spyros Kotoulas and Ronny Siebes VU University Amsterdam.
Week 2 The Object-Oriented Approach to Requirements
Configuration management
Collections and services in the information environment JISC Collection/Service Description Workshop, London, 11 July 2002 Pete Johnston UKOLN, University.
12/03/ Second International Workshop on New Generation Enterprise and Business Innovation NGEBIS 2013 Cross Domain Crawling for Innovation Pieruigi.
1 ISWC-2003 Sanibel Island, FL IMG, University of Manchester Jeff Z. Pan 1 and Ian Horrocks 1,2 {pan | 1 Information Management.
Sunday October 28, www.eprints.org Tim Brody - Stevan Harnad -
2 Artificial Intelligence Applications Institute, University of Edinburgh, UK Institute for Human and Machine Cognition, Pensacola, Florida CoSAR-TS Coalition.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Who are the Experts?Simon KampaSlide 1 Who are the Experts? Simon Kampa IAM Group University of Southampton
Korean Place Name Information Service on the Web 2.0 Environment
A framework for Linked Data business models Michalis Vafopoulos vafopoulos.org 1/10/2011.
Cómo seleccionar estrategias de codificación adecuados para la producción de LOD habilitado para datos bibliográficos Marcia L.Zeng Kent State University.
Creating Linked Data Juan F. Sequeda Semantic Technology Conference June 2011.
Addition 1’s to 20.
10-Sep-02 Page 1 Gadjah Mada University - Yogyakarta - Indonesia Gadjah Mada University10-Sep-02 Page 1 Gadjah Mada University - Yogyakarta - Indonesia.
An Adaptive System for User Information needs based on the observed meta- Knowledge AKERELE Olubunmi Doctorate student, University of Ibadan, Ibadan, Nigeria;
Week 1.
RDF Tutorial.
Current Trends in Databases Network Effects (co-presentation) Gert Nelissen UHasselt
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
You Cannot ReSIST Hugh Glaser Electronics & Computer Science University of Southampton DSSE, 28th February 2007.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Research on Linked Data and Co-reference Resolution Hugh Glaser, Ian Millard: University of Southampton, UK 성원경, 이승우, 김평, 류범종 Won-Kyung Sung, Seungwoo.
Deploying Trust Policies on the Semantic Web Brian Matthews and Theo Dimitrakos.
URI Disambiguation in the Context of Linked Data Afraz Jaffri, Hugh Glaser, Ian MillardECS, University of Southampton
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
LOD for the Rest of Us Tim Finin, Anupam Joshi, Varish Mulwad and Lushan Han University of Maryland, Baltimore County 15 March 2012
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
RELATORS, ROLES AND DATA… … similarities and differences.
An Integrated Knowledge Base for European Dependability Research © Hugh Glaser, Ian Millard et al. Electronics & Computer Science University of Southampton.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Big Data Quality the next semantic challenge
Analyzing and Securing Social Networks
Presentation transcript:

Afraz Jaffri, Hugh Glaser, Ian Millard Electronics and Computer Science University of Southampton

2SSWS07 - Vilamoura, Potugal URI Identity Management for Semantic Web Data Integration and Linkage 1. Linked Data 2. URI Multiplicity 3. The Problem of Coreference 4. URI Identity Management Approaches 5. The Problem with owl:sameAs 6. The Consistent Reference Service (CRS) 7. CRS Architecture 8. A CRS Application: The RKB Explorer 9. Summary and Future Work

3SSWS07 - Vilamoura, Potugal URI Identity Management for Semantic Web Data Integration and Linkage DBpedia has URIs for approximately 2 million entities Linked datasets contain many overlapping entities A single entity can have a number of URIs Entities are linked using owl:sameAs Example

4SSWS07 - Vilamoura, Potugal URI Identity Management for Semantic Web Data Integration and Linkage Contains URIs for more than 10 million entities Data relating to people, projects, papers and institutions A single entity has a number of URIs (even within the same repository) Entities are linked using CRSs DBLP

5SSWS07 - Vilamoura, Potugal URI Identity Management for Semantic Web Data Integration and Linkage URIs for Spain: URIs for Hugh Glaser:

6SSWS07 - Vilamoura, Potugal URI Identity Management for Semantic Web Data Integration and Linkage Tom Anderson – Is dc:creator of dc:creator is dc:creator of dc:creator is dc:creator of dc:creator is dc:creator of dc:creator is dc:creator of dc:creator is dc:creator of dc:creator is dc:creator of dc:creator is dc:creator of dc:creator is dc:creator of dc:creator is dc:creator of dc:creator is dc:creator of dc:creator is dc:creator of dc:creator is dc:creator of dc:creator is dc:creator of dc:creator is dc:creator of dc:creator is dc:creator of dc:creator is dc:creator of dc:creator is dc:creator of dc:creator is dc:creator of dc:creator is dc:creator of dc:creator is dblp:editor of dblp:editor Vice President O-in Design Automation inc. USAProfessor, University of NewcastleProfessor, Heriot Watt UniversityUniversity of WashingtonUniversity of California, BerkelyTom Andersen - University of DenmarkLucent Technologies, Illinois

7SSWS07 - Vilamoura, Potugal URI Identity Management for Semantic Web Data Integration and Linkage The problem of coreference has existed for many years Physical Libraries disambiguate authors through Date of Birth Digital Libraries still have the problem of author disambiguation Problems caused by variations in naming schemes e.g. Glaser, H. H. Glaser Glaser, Hugh H. Glazer

8SSWS07 - Vilamoura, Potugal URI Identity Management for Semantic Web Data Integration and Linkage Coreference Problem referred to as Record Linkage Matching entities between records similar to matching entities between datasets Database linkage is easier due to imposed schema Formal theory of Record Linkage proposed by Fellegi & Sunter (1969) Uses coded agreements between each field (property) to give the probability of record (instance) equivalence Can be adapted for use on the Semantic Web

9SSWS07 - Vilamoura, Potugal URI Identity Management for Semantic Web Data Integration and Linkage Coreference on the Semantic Web is defined as being the situation where two or more URIs are used for a single non- information resource URI usage can change with context Non-Information resources are hard to define precisely Examples Hugh Glaser at Southampton vs. Hugh Glaser at Imperial Harry Potter and the Order of the Phoenix in Hardback vs. Softback ISBN:

10SSWS07 - Vilamoura, Potugal URI Identity Management for Semantic Web Data Integration and Linkage Use a centralised naming authority to issue URIs for every entity in the world Let everyone create their own URIs and link them to official URIs (using owl:sameAs) Let everyone create their own URIs and register them at a centralised repository Let everyone create their own URIs and let them be managed by many decentralised repositories In all of the above encourage reuse and linking as far as possible

11SSWS07 - Vilamoura, Potugal URI Identity Management for Semantic Web Data Integration and Linkage owl:sameAs was designed for a specific purpose Resources linked with owl:sameAs have the same identity i.e. The subject and object are exactly the same resource owl:sameAs has been misused for Linking Open Data Linking can occur between two very different resources, e.g. Tom Anderson Reasoning with LOD will have unintended consequences

12SSWS07 - Vilamoura, Potugal URI Identity Management for Semantic Web Data Integration and Linkage Hugh Glaser Hugh Glaser Reader Lecturer Assert SELECT ?x WHERE { vcard: ?x} Returns Which belongs to which role? Using owl:sameAs means that both URIs become indistinguishable even though they may refer to different entities according to the context in which they are used.

13SSWS07 - Vilamoura, Potugal URI Identity Management for Semantic Web Data Integration and Linkage Data (Knowledge) providers publish data (knowledge) Resources from one provider cannot be guaranteed to be the same as resources from another provider Knowledge will be published and made dereferenceable at the domain that the publisher has control over URIs will be constructed from the domain name of the publishers site An intermediate service groups URIs of resources that may be the same This knowledge is made available upon dereferencing the URI of a resource

14SSWS07 - Vilamoura, Potugal URI Identity Management for Semantic Web Data Integration and Linkage Can be seen as a conventional Knowledge Base Contains knowledge about the URIs in a repository URIs referring to the same resource are grouped together in Bundles A Bundle has properties: Coref:hasEquivalentReference – The URIs in a bundle are grouped together using this predicate Coref:hasCanonicalReference – One URI in a bundle can be made to be the canonical representation i.e. The preferred URI Coref:updatedOn – The date of the last update to the bundle

15SSWS07 - Vilamoura, Potugal URI Identity Management for Semantic Web Data Integration and rdf:. a coref:Bundle ; coref:hasCanonicalReference ; coref:hasEquivalentReference,,,.

16SSWS07 - Vilamoura, Potugal URI Identity Management for Semantic Web Data Integration and Linkage RESOLVE RETRIEVE RDF KB CRS Non-Information Resource Information Resource Text/HtmlRDF/XML Application

17SSWS07 - Vilamoura, Potugal URI Identity Management for Semantic Web Data Integration and Linkage Finding all equivalences (bundles) is up to the application A separate activity from coreferencing a single data source Services such as Sindice can perform this function for free To perform the equivalence closure just follow the crs:hasCRS links Scalability is ensured by not including all possible bundles in every CRS

18SSWS07 - Vilamoura, Potugal URI Identity Management for Semantic Web Data Integration and Linkage The Resilience Knowledge Base Explorer displays communities of practice for people, projects and publications from the RKB Uses multiple CRSs to disambiguate people and publications One CRS per knowledge base ensures scalability Multiple SPARQL queries Look yourself up!

19SSWS07 - Vilamoura, Potugal URI Identity Management for Semantic Web Data Integration and Linkage Equivalence Mining is a difficult task that requires multiple algorithms Adding policies to determine the trust level of a CRS Establishing the authority of a CRS over a KB Establishing performance metrics Collaborating with LOD community for wide scale deployment Formalising the linking methodology

20SSWS07 - Vilamoura, Potugal URI Identity Management for Semantic Web Data Integration and Linkage Coreference exists in many disciplines and will exist on the Semantic Web The equivalence of non-information resources depends on context The semantics of owl:sameAs do not fit with the current usage in Linked Data The CRS is a solution that is being deployed on a large knowledge-based infrastructure Its my knowledge, so let me name it!

SSWS07 - Vilamoura, Potugal21 Questions? URI Identity Management for Semantic Web Data Integration and Linkage