© Copyright 2011 TopQuadrant Inc. Slide 1 Evolving Practices of Linked Data Irene Polikoff, TopQuadrant June 29-30, 2011 W3C Government Linked Data Working.

Slides:



Advertisements
Similar presentations
Presented to the ALCTS FRBR Interest Group, ALA Annual, 24 June 2011
Advertisements

Copyright © 2003 Pearson Education, Inc. Slide 7-1 Created by Cheryl M. Hughes, Harvard University Extension School Cambridge, MA The Web Wizards Guide.
Copyright © 2003 Pearson Education, Inc. Slide 6-1 Created by Cheryl M. Hughes, Harvard University Extension School Cambridge, MA The Web Wizards Guide.
© Copyright 2009 TopQuadrant Inc. Slide 1 QUDT: An OWL Ontology for Measurable Quantities, Units, Dimension Systems, and Dimensional Data Types James Chip.
Resource description and access for the digital world Gordon Dunsire Centre for Digital Library Research University of Strathclyde Scotland.
OLAC Metadata Steven Bird University of Melbourne / University of Pennsylvania OLAC Workshop 10 December 2002.
LIS650lecture 1 XHTML 1.0 strict Thomas Krichel
Requirements. UC&R: Phase Compliance model –RIF must define a compliance model that will identify required/optional features Default.
XPointer and HTTP Range A possible design for a scalable and extensible RDF Data Access protocol. Bryan Thompson Presented to the RDF Data Access.
XPointer and HTTP Range A possible design for a scalable and extensible RDF Data Access protocol. Bryan Thompson draft Presented to the RDF.
Metadata vocabularies and ontologies Dr. Manjula Patel Technical Research and Development
UKOLN, University of Bath
Dr. Alexandra I. Cristea CS 253: Topics in Database Systems: C3.
Last update: (2) (3) The Dutch airline.
4. Internet Programming ENG224 INFORMATION TECHNOLOGY – Part I
Semantic Descriptions for RESTful Services SA-REST by Knoesis Service Research Lab Tomas Vitvar WSMO Phone Conference January 09,
SPARQL Dimitar Kazakov, with references to material by Noureddin Sadawi ARIN, 2014.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
XML: Extensible Markup Language
Web Development & Design Foundations with XHTML
Getting Familiar with Web Pages 1 2 The Internet Worldwide collection of interconnected computer networks that enables businesses, organizations, governments,
Creating Linked Data Juan F. Sequeda Semantic Technology Conference June 2011.
1 An inference engine for the semantic web Naudts Guido Student at the Open University Netherlands.
Steffen Staab 1WeST Web Science & Technologies University of Koblenz ▪ Landau, Germany Structured Data on the Web Introduction to.
Microdata and schema.org. Basics Microdata is a simple semantic markup scheme that’s an alternative to RDFa Microdata Developed by WHATWG and supported.
Semantic Web Thanks to folks at LAIT lab Sources include :
The Semantic Web – WEEK 4: RDF
Introduction to RDF Based on tutorial at
1 RDF Tutorial. C. Abela RDF Tutorial2 What is RDF? RDF stands for Resource Description Framework It is used for describing resources on the web Makes.
Developing a Metadata Exchange Format for Mathematical Literature David Ruddy Project Euclid Cornell University Library DML 2010 Paris 7 July 2010.
CS570 Artificial Intelligence Semantic Web & Ontology 2
RDFa: Embedding RDF Knowledge in HTML Some content from a presentation by Ivan Herman of the W3c, Introduction to RDFa, given at the 2011 Semantic Technologies.
4.01 How Web Pages Work.
Supported by EU projects 12/12/2013 Athens, Greece Open Data in Agriculture Hands-on with data infrastructures that can power your agricultural data products.
HTML/XML XHTML Authoring. Creating Tables  Table: An arrangement of horizontal rows and vertical columns. The intersection of a row and a column is called.
RDF Tutorial.
Semantic Web Introduction
Linked Data for Libraries, Archives, Museums. Learning objectives Define the concept of linked data State 3 benefits of creating linked data and making.
RDF formats for Linked Data by Mabi Harandi. RDF is not a format, it is a model for data So: It will provide supports for different formats like :  Turtle.
Embedding Knowledge in HTML Some content from a presentations by Ivan Herman of the W3c.
The Web of data with meaning... By Michael Griffiths.
SKOS and Other W3C Vocabulary Related Activities Gail Hodge Information International Assoc. NKOS Workshop Denver, CO June 10, 2005.
LINKED DATA COMS E6125 Prof. Gail Kaiser Presented By : Mandar Mohe ( msm2181 )
RDF: Building Block for the Semantic Web Jim Ellenberger UCCS CS5260 Spring 2011.
Semantic Web Presented by: Edward Cheng Wayne Choi Tony Deng Peter Kuc-Pittet Anita Yong.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
RDA and Linking Library Data VuStuff III Conference Villanova University, Villanova, PA October 18, 2012 Dr. Sharon Yang Rider University.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
The Semantic Web Web Science Systems Development Spring 2015.
RDF and XML 인공지능 연구실 한기덕. 2 개요  1. Basic of RDF  2. Example of RDF  3. How XML Namespaces Work  4. The Abbreviated RDF Syntax  5. RDF Resource Collections.
Access and Query Task Force Status at F2F1 Simon Miles.
1 SPARQL A. Emrah Sanön. 2 RDF RDF is quite committed to Semantic Web. Data model Serialization by means of XML Formal semantics Still something is missing!
RELATORS, ROLES AND DATA… … similarities and differences.
Embedding Knowledge in HTML Some content from a presentations by Ivan Herman of the W3c.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Access and Query Task Force Status at F2F1 Simon Miles.
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
Semantic Web 06 T 0006 YOSHIYUKI Osawa. Problem of current web  limits of search engines Most web pages are only groups of character strings. Most web.
Linked Open Data for European Earth Observation Products Carlo Matteo Scalzo CTO, Epistematica epistematica.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
4.01 How Web Pages Work.
Data.gov: Web, Data Web, Social Data Web 7/22/2010 #health2stat.
Embedding Knowledge in HTML
Embedding Knowledge in HTML
Resource Description Framework (RDF)
W3C Recommendation 17 December 2013 徐江
Linked Data Ryan McAlister.
Presentation transcript:

© Copyright 2011 TopQuadrant Inc. Slide 1 Evolving Practices of Linked Data Irene Polikoff, TopQuadrant June 29-30, 2011 W3C Government Linked Data Working Group

© Copyright 2011 TopQuadrant Inc. Slide 2 What is data? Data has: value type structure units of measure encoding bit and byte order Not a topic of this presentation but many questions relevant to interpretation of data depend on the attributes of the data

© Copyright 2011 TopQuadrant Inc. Slide 3 What is Linked Data? A set of best practices for publishing and connecting structured data on the Web A method of publishing structured data so that it can be interlinked and become more useful. It builds upon standard Web technologies such as HTTP and URIs, but rather than using them to serve web pages for human readers, it extends them to share information in a way that can be read automatically by computers. This enables data from different sources to be connected and queried.HTTPURIs

© Copyright 2011 TopQuadrant Inc. Slide 4 How is LD publishing being done today? SPARQL endpoints Making static serialized RDF available at a URL URL that corresponds to the base namespace? Content negotiation (person gets HTML document, machine gets RDF) Structured markup embedded in HTML (RDFa, microdata, microformats) Provided as a meta tag link in an HTML page pointing to the corresponding RDF file Zipped RDF files downloadable from the web ???

© Copyright 2011 TopQuadrant Inc. Slide 5 What factors influence LD publishing decisions? Available infrastructure and its constraints Cost Data consumers preferences Size of the data being published Frequency of change Skills and knowledge of the data publisher W3C recommendations ???

© Copyright 2011 TopQuadrant Inc. Slide 6 A data consumer viewpoint – in favor of SPARQL endpoint At the latest EBI Industry Day industry reps requested EBI curated content to be made available as SPARQL endpoint as opposed to, e.g., published as a large download, or being re-hosted by a third party The following arguments were made: Ease of access. The datasets are very large, and are updated regularly. Download of an entire dataset is time consuming and costly. A (high-performance) SPARQL endpoint allows a client to specify just what data they want, and get it in a just-in-time manner. Currency. The datasets change often, users want to know that they have the latest version, without having to perform tedious checks at every access. Authority. The users of this data trust the EBI curation for this data, and don't know if they can trust a third party. Was the data corrupted? Is it the version it claims to be?

© Copyright 2011 TopQuadrant Inc. Slide 7 Each publishing approach requires guidance on best practices For example, for Content negotiation: How does a client identify its requirements (RDF/XML, Turtle, HTML, SPARQL endpoint)? The Turtle submission suggests mimetype text/turtle for turtle. What types of content can be negotiated? (SPARQL endpoint? RDF/XML? Turtle? NTriples? OWL/XML?) Must all negotiated variants contain the same information? What does this mean, when different formats have different interpretations (e.g., OWL/XML vs. Turtle)? Must all negotiated variants have the same prefix definitions? What about forms that don't have a notion of prefixes (NTripes, HTML)? And in a more general sense: How are versions managed? (e.g, using owl:versionInfo)? How are the URLs for various versions managed? If one dataset uses resources from another, how does it indicate this? Just use it? rdfs:seeAlso? owl:imports? What is the appropriate behavior of a client in these situations? Is there any relationship between the location at which a file is found and the URIs it describes? How about its base URI, owl:Ontology or default namespace?

© Copyright 2011 TopQuadrant Inc. Slide 8 An example of what we may see when we look at the published data " Recoveries Recouvrement Bankruptcies Debtors Seizure (of property) The regaining of something of value, such as property or funds lent, as a result of special efforts by the owner or creditor. EC Economics and Industry " Recovery plans (Environment) Environmental management NE Nature and Environment " Recreation Loisir Entertainment Hobbies Leisure Recreational activities Games Recreational facilities Sports Tourism Toys Outdoor recreation An activity that diverts, amuses or stimulates usually done in one's spare time. SO Society and Culture Government of Canada Core Subject Thesaurus

© Copyright 2011 TopQuadrant Inc. Slide 9 Issues with the example Minting new URIs in someone elses namespace e.g., skos:UsedFor, skos:SubjectCategory, etc. Providing no type definitions for the new URIs (Possibly) making errors in URIs did they mean skos:scopeNote or skos:ScopeNote? (Potentially) misusing URIs did they meant skos:narrower when they said skos:NarrowerTerm, if so, it is an object property Inventing a way to do language tags, e.g., skos:French perhaps, because they are not aware of how to do this correctly Not following a convention of lower camel case for properties Not linking their own data skos:NarrowerTerm and skos:RelatedTerm are all strings

© Copyright 2011 TopQuadrant Inc. Slide 10 One possible guideline or test Assuming that information about a resource should be found at the place it resolves to, then a resource like: skos:RelatedTerm should be available at which it isn't

© Copyright 2011 TopQuadrant Inc. Slide 11 Looking at examples helps There will be issues the working group would not have thought possible Understanding these will provide the needed scope/level of details for best practices One good resource is the Pedantic Web Group:

© Copyright 2011 TopQuadrant Inc. Slide 12 More questions to address - 1 If you use someone elses vocabulary, do you include type declarations, effectively replicating information? It is very common to see included something like: foaf:Person a owl:Class What role do imports play, if any, in Linked Data publishing? What name do we give to a set of graphs (ontologies) that belong together e.g., skos and skos-xl, QUDT ontology collection What should be a relationship between their URIs/namespaces? TQ has build grammars to resolve this

© Copyright 2011 TopQuadrant Inc. Slide 13 More questions to address - 2 What information should be returned for a resource? All triples that it is a subject of? What about back links? What about if a resource is a class? How to express vocabulary and data mappings? owl:sameAs, owl:equivalentClass, etc. are commonly used, sometimes, without understanding semantic commitment SKOS mapping properties are an alternative What about more complex mappings – at TQ, we use SPIN (SPARQL) maps

© Copyright 2011 TopQuadrant Inc. Slide 14 Thank You Irene Polikoff