A water information R & D alliance between the Bureau of Meteorology and CSIRO’s Water for a Healthy Country Flagship Vocabulary Services, RDF, SKOS and.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Resource description and access for the digital world Gordon Dunsire Centre for Digital Library Research University of Strathclyde Scotland.
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
ESDSWG2011 – Semantic Web session Semantic Web Sub-group Session ESDSWG 2011 Meeting – Semantic Web sub-group session Wednesday, November 2, 2011 Norfolk,
Semantic Web Introduction
(1) Standardizing for Open Data Ivan Herman, W3C Open Data Week Marseille, France, June Slides at:
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
International Workshop Linked Open Data & the Jewish Cultural Heritage Rome, 20 th January 2015 International Workshop Linked Open Data & the Jewish Cultural.
Thesauri, Terminologies and the Semantic Web
Ontology Notes are from:
CSCI 572 Project Presentation Mohsen Taheriyan Semantic Search on FOAF profiles.
SKOS and Other W3C Vocabulary Related Activities Gail Hodge Information International Assoc. NKOS Workshop Denver, CO June 10, 2005.
Sensemaking and Ground Truth Ontology Development Chinua Umoja William M. Pottenger Jason Perry Christopher Janneck.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Intelligent Systems Semantic Web. Aims of the session To introduce the basic concepts of semantic web ontologies.
RDF: Building Block for the Semantic Web Jim Ellenberger UCCS CS5260 Spring 2011.
A Registry for controlled vocabularies at the Library of Congress
Linked Vocabulary API for the Earth Sciences Community AGU Fall Meeting 2014 Stephan Zednik Peter Fox Linyun Fu Patrick West Xiaogang Ma.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Vocabulary Services “Huuh - what is it good for…” (in WDTS anyway…) 4 th September 2009 Jonathan Yu CSIRO Land and Water.
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
Towards validating observation data in WaterML 2.0 WATER FOR A HEALTHY COUNTRY You can change this image to be appropriate for your topic by inserting.
Practical RDF Chapter 1. RDF: An Introduction
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
A J Miles Rutherford Appleton Laboratory SKOS Standards and Best Practises for USING Knowledge Organisation Systems ON THE Semantic Web NKOS workshop ECDL.
Using Vocabulary Services in Validation of Water Data May 2010 Simon Cox, JRC Jonathan Yu & David Ratcliffe, CSIRO.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
Logics for Data and Knowledge Representation
The Semantic Web Web Science Systems Development Spring 2015.
The MMI Tools Carlos Rueda Monterey Bay Aquarium Research Institute OOS Semantic Interoperability Workshop Marine Metadata Interoperability Project Boulder,
Incorporating ARGOVOC in DSpace-based Agricultural Repositories Dr. Devika P. Madalli & Nabonita Guha Documentation Research & Training Centre Indian Statistical.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Ontology Summit2007 Survey Response Analysis Ken Baclawski Northeastern University.
Semantic Web - an introduction By Daniel Wu (danielwujr)
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
It’s all semantics! The premises and promises of the semantic web. Tony Ross Centre for Digital Library Research, University of Strathclyde
RELATORS, ROLES AND DATA… … similarities and differences.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
1.Registration block send request of registration to super peer via PRP. Process re-registration will be done at specific period to info availability of.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
ELIS – Multimedia Lab PREMIS OWL Sam Coppens Multimedia Lab Department of Electronics and Information Systems Faculty of Engineering Ghent University.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
RDF & SPARQL Introduction Dongfang Xu Ph.D student, School of Information, University of Arizona Sept 10, 2015.
Trait ontology approach Marie-Angélique LAPORTE NCEAS June 7 th 2010.
KAnOE: Research Centre for Knowledge Analytics and Ontological Engineering Managing Semantic Data NACLIN-2014, 10 Dec 2014 Dr. Kavi Mahesh Dean of Research,
® Using (testing?) the HY_Features model, 95th OGC Technical Committee Boulder, Colorado USA Rob Atkinson 3 June 2015 Copyright © 2015 Open Geospatial.
UNEP Terminology Workshop - Geneva, April 15, Environmental Terminology & Thesaurus Workshop UN Environment Programme Regional Office of Europe.
“New Dimensions in KOS” CENDI/NKOS Workshop September 11, 2008 Washington, DC, USA An international conference to share and advance knowledge and experience.
Summer of Vocabs: Knowledge Organisation Water Resources Management - Environmental Information Infrastructures Megan Williams| Vacation Scholar 29 January.
Linked Vocabulary API for the Earth Sciences Community AGU Fall Meeting 2014 Stephan Zednik Peter Fox Linyun Fu Patrick West Xiaogang Ma
EXtended Knowledge Organization System (XKOS) Prepared by Franck Cotton, Institut National de la Statistique et des Études Économiques Daniel W. Gillman,
SKOS : A language to describe simple knowledge structures for the web
® Sponsored by Hosted by HY_Features Part 3 - OWL encoding: rhyme and reason 96th OGC Technical Committee Nottingham, UK Rob Atkinson 17 September 2015.
Validating water resources described in WaterML 2.0 Jonathan Yu, CSIRO Land and Water Peter Taylor, Gavin Walker (CSIRO ICT Centre), Simon Cox (CSIRO ESRE)
SysML v2 Model Interoperability & Standard API Requirements Axel Reichwein Consultant, Koneksys December 10, 2015.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
knowledge organization for a food secure world
Analyzing and Securing Social Networks
Session 2: Metadata and Catalogues
LOD reference architecture
Presentation transcript:

A water information R & D alliance between the Bureau of Meteorology and CSIRO’s Water for a Healthy Country Flagship Vocabulary Services, RDF, SKOS and REST Peter Fitch

Outline Outline the problem Background on Linked data RDF SKOS REST – Linked Data API Vocabulary Service What is it How to develop a vocabulary service Test case with USGS code list Demo

Warning Frequent use of XML

Motivations Xlink is all well and good, but the real problem is what is at the end of the link and how to use it. Agreed, and I wish I knew more about the semantic technologies.

The need for semantic context From Lemon OSDM Linked Data workshop 2010 Semantic Context Black and White Bessie Good Milker

Machines need it to. From Lemon OSDM Linked Data workshop 2010

Information Needed Internal Structure-the information model Supported functions – the operations Semantics What are the concepts What are the vocabularies How are they related Where are they defined Where did it come from? How was it created? Current Metadata Adapted from Lemon OSDM Linked Data workshop 2010 Semantic Context

The need for semantic information in Hydro-Domain data exchange

Don’t Information Models solve the problem?

Take a closer look

The O word You need an ontology! O What? I know one O word and its not that. I better find out more.

4 Rules of Linked Data TBL – key take home! 1.Use URIs as names for things 2.Use HTTP URIs so that people can look up those names. 3.When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL) 4.Include links to other URIs. so that they can discover more things. Comment: So by following rule 3, we might be able to get some useful information, we still need semantic context though. Tim Berners-Lee

Linked data quality scheme Sir Tim BL RatingDescription ★ Available on the web (whatever format), but with an open license ★★ Available as machine-readable structured data (e.g. excel instead of image scan of a table) ★★★ as (2) plus non-proprietary format (e.g. CSV instead of excel) ★★★★ as (2) plus non-proprietary format (e.g. CSV instead of excel) All the above plus, Use open standards from W3C (RDF and SPARQL) to identify things, so that people can point at your stuff ★★★★★ All the above, plus: Link your data to other people’s data to provide context Tim Berners-Lee

Intro to RDF RDF is a data model for describing resources Resource Description Framework SubjectObject Predicate ThingsHave properties Property Value The object of one statement can become the subject in another. The set of linked statements, forms a directed graph Subject, Object and Predicate are all Resources* A set of Subject, Predicate, Object entities is called a Triple

RDF Example Remember - Resources are URI’s Peter – hasColleague – Nate - Peter Nate hasColleague Subject: Peter Predicate: hasColleague Object: Nate

RDF Landscape Basic resource descriptions RDF Express resources as classes, with properties and class relationships RDFS OWL Web Ontology Language Exact description and relationships SKOS Simple KOS Simple description and relationships Expressivity Basic Building Blocks SPARQL RDF Query SPARQL RDF Query

Intro to SKOS SKOS : Simple Knowledge Organizational System. KOS- provides semantic context. Built on RDF and RDFS Designed to bridge current chaotic, poorly described web, and full sematic web – OWL. See SKOS primer at Limited vocabulary eg: skos:ConceptScheme skos:Concept skos:prefLabel skos:scopeNote And some limited standard relationships skos:exactMatch skos:narrower, skos:broader Allows for limited inference Because of its limited vocabulary, really useful for Thesauri, classification lists, taxonomies etc.

SPARQL Queries Purpose: query a RDF triple store, works by matching triples to patterns. example: select ?concept where { ?concept rdf:type skos:Concept} Return me all concepts which are of rdf:type skos:Concept Other Queries CONSTRUCT – returns a rdf graph ASK – returns bool if triple is matched DESCRIBE – returns a graph describing a resource.

Intro to Linked Data API Familiar with RESTful services right?? LD API designed as a bridge between the complexity of SPARQL endpoints, and a standard REST API Provides standard URI matching patterns and additional specification for behaviors eg: /doc/school/12345 should respond with a document that includes information about /id/school/12345’ /doc/school should respond with a document of schools /doc/school/12345.JSON should return with a JSON document.

Intro to linked data API URI pattern SPARQL for Result Set SPARQL for view on Result Set Response in RDF, Turtle, etc.

Vocabulary Service In semantic web, vocabulary is defined as a set of URI’s Functionally we want: Ability to look up definitions of terms and or code lists – skos:Concept, skos:definition, skos:prefLabel Ability to resolve synonyms skos:exactMatch, skos:broader, skos:narrower Ability to deal with different langauages skos:preflabel lang=en Standard API – Linked data API and REST Standard Information/Data Model - SKOS

Simon Cox vocab proposal Proposal by S. Cox

Vocab development process 1.Select code list or vocab for service 2.Map code list to skos 3.Check code list for web compatibility and harmonise with other code lists or vocabs 1.eg: use a standards units vocab 2.remove any non conforming content. 4.Convert code list to SKOS RDF 5.Validate RDF using W3C RDF validator 6.Import to Triple Store 7.Publish Service 8.Use: Link to in documents!

Case study USGS Parameter Code List Proof of Concept Code List is a CSV table of parameter codes.

Mapping code list to SKOS Parameter CodeSKOS Mapping Parameter Code Listskos:ConceptScheme Parameter Codeskos:Concept Group Nameskos:broader Parameter Nameskos:prefLabel cas Nameskos:exactMatch srs Nameskos:broadMatch Unitsneed additional relationship usgs:hasUnits

Content conformance-harmonization Issues need skos:Definition – Parameter Name? Invalid characters for web in Parameter Name eg &, - & < > Units – non standard representation, eg Mi2 (square Miles),mgd (Million G per day),%, nu (number of bad characters TX by DCP) Fix up Leave as a literal?

Comments on code list Conflation of information Chromium(VI), water, unfiltered, recoverable, micrograms per liter Observable phenomena – Chromium(VI) Procedure – unfiltered/recoverable media – water Units – ug/L Phosphorus, suspended sediment, total digestion, dry weight, percent Observable phenomena - Phosphorus procedure total digestion (but not linked to standard method) media – suspended sediment units dry weight percent. Some meaningless codes - Precipitation, cumulative at given time, location 6, inches

Duplication Turbidity, water, unfiltered, broad band light source ( nm), detection angle 90 +/- 30 degrees to incident light, nephelometric turbidity units (NTU) Turbidity, water, unfiltered, laboratory, Hach 2100AN, nephelometric turbidity units USEPA method 180.1??? Are they the same? Why not link to standard methods? Needs work by domain experts to resolve.

Conversion to SKOS-EXcel2SKOS.Net utility to convert Table into skos using Nvelocity Spreadsheet Template Office Interop Formatter Excel2Skos

Nvelocity Template-Mapping to SKOS #foreach($row in $excelsheet) Definition of parameter code $row.get($code) $row.get($name) #if($row.get($casrn)!= "") #end #if($row.get($srsname) != "") #end #end Classes passed in Globals – ConceptScheme definitions excelsheet – 2D table of values.

Conversion to skos

Converting list to RDF

RDF Validation

Import to Triple store

Test Services Developed REST services using Microsoft WCF dotnetRDF and NVelocity libraries Test API /Vocab/ParameterCodeList – respond with a document of skos:ConceptScheme /Vocab/ParameterCodeList/ParameterCode – respond with the first page of parameter codes – my implementation returns all! /Vocab/ParameterCodeList/ParameterCode/{ID}

Process reminder Sesame RDF Triple store SPARQL API USGS Code List Harmonise and Map Excel2RDF RDF Validator ValidateRDF Load dotnetRDF WCF REST TestServices

Demo

Next Steps Try the Auscope tooling the process is the same, uses sesame rdf store has diferent tooling for Excel to RDF Different service interface, LD not quite ready. If have time, we should set up a test service before I leave. Below is example of what LD vocabs in WaterML2.0 might look like.

Conclusions The need for sematic context to assist with data integration is pressing. Vocabularies are foundation services and need to be put in place for data mediation. Technologies and approaches are now mature enough to use RDF, SKOS, SPARQL, LD API There is tooling available through AUSCOPE, but it needs assessment. USGS & CIDA has the opportunity to make a range of standard vocabularies available for the hydro community.

Pillars and foundations of Interoperability System of Systems Interoperablity Identity and Registration Service Standards Application Schema Network Standards Community Profiles Feature Catalogs Agreed Vocabularies and Ontologies Semantic Brokering

Final word I don’t look it, but I’m so happy, I know what is at the end of the xlink! I don’t look it, but I’m so happy, I know what is at the end of the xlink!

Thank you Business Unit Name Peter Fitch Program Leader Environmental Information Systems Phone: Web: Contact Us Phone: or Web:

Lessons of Climate Gate Theft of s from UEA Nov s indicated manipulation of data, and suppression of raw data Investigations found methods dis-organised bunker mentality lack of transparency Researchers promised to improve scientific data management open access to data Improve transparency climatic research unit, University of East Anglia

From RDF Primer W3C

SWT in other domains Eco-Informatics Bio-Informatics Eco-Informatics Bio-Informatics HWB – WIRADA Symposium August 2011

If Insert presentation title

Terminology Activity – single process block which can perform a useful task and which can be linked to another process block Workflow – a linked set of process blocks HWB – WIRADA Symposium August 2011