Using Darwin Core as a Model: An Ontologically Minimalist Approach to Publishing Occurrence Data in RDF Joel Sachs Formal Models track of the Semantics.

Slides:



Advertisements
Similar presentations
A Proposal for Addressing Issues Related to RDF Mapping.
Advertisements

Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
1 © 2006 Adam Pease, Articulate Software - apease [at] articulatesoftware [dot] com SUMO Applications Adam Pease Articulate Software
The Semantic Web – WEEK 4: RDF
An Introduction to RDF(S) and a Quick Tour of OWL
CS570 Artificial Intelligence Semantic Web & Ontology 2
OCLC Research TAI CHI Webinar 5/27/2010 A Gentle Introduction to Linked Data Ralph LeVan Sr. Research Scientist OCLC Research.
The Semantic Web. The Web Today Designed for Human to read Cannot express meaning Architecture: URL –Decentralized: Link structure Language: html.
John Deck, University of California, Berkeley Brian Stucky, University of Colorado, Boulder Lukasz Ziemba, University of Florida, Gaineseville Nico Cellinese,
Entomological Collections Network Meeting, Indianapolis, IN 13 December 2009 Darwin Core Ratified in the Year of Darwin Gail E. Kampmeier Illinois Natural.
Ontology Notes are from:
Chapter 8: Web Ontology Language (OWL) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.
SKOS and Other W3C Vocabulary Related Activities Gail Hodge Information International Assoc. NKOS Workshop Denver, CO June 10, 2005.
Introduction to Protégé AmphibiaTree 2006 Workshop Sunday 8:45–9:15 J. Leopold & A. Maglia.
1 Copyright © 2005 Access Innovations, Inc. OWL Mapping Thesaurus Format NEXT GENERATION KNOWLEDGE ORGANIZATION SYSTEMS: INTEGRATION CHALLENGES AND STRATEGIES.
CSE 428 Semantic Web Topics Introduction Jeff Heflin Lehigh University.
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
Amarnath Gupta Univ. of California San Diego. An Abstract Question There is no concrete answer …but …
Fourth Annual Summit | Feb | Tucson, AZ Scratchpads for community involvement for natural history collections Dr Dimitris Koureas Biodiversity.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
IDs in and out of the database Entomological Collections Network (ECN) 2012 November 10 – 11, Knoxville, TN Debbie Paul, Greg Riccardi.
Practical interoperability across semantic stores of data for blah blah
IDigBio is funded by a grant from the National Science Foundation’s Advancing Digitization of Biodiversity Collections Program (Cooperative Agreement EF ).
Chapter 6 Understanding Each Other CSE 431 – Intelligent Agents.
Introduction to OBIS-USA Biological Data, Applications, & Relationships March 14, 2011.
Training course on biodiversity data publishing and fitness-for-use in the GBIF Network, 2011 edition How Darwin Core Archives have changed the landscape.
The Semantic Web Web Science Systems Development Spring 2015.
Imports, MIREOT Contributors: Carlo Torniai, Melanie Courtot, Chris Mungall, Allen Xiang.
Chapter 6 Understanding Each Other CSE 431 – Intelligent Agents.
ONTOLOGY ENGINEERING Lab #1 - August 25, Lab Syllabus 2  Lab 1 – 8/25: Introduction and Overview of Protégé  Lab 2 – 9/8: Building an ontology.
Integrating Live Plant Images with Other Types of Biodiversity Records Steve Baskauf Vanderbilt Dept. of Biological Sciences
© Copyright 2008 STI INNSBRUCK NLP Interchange Format José M. García.
Standards and tools for publishing biodiversity data Yu-Huang Wang June 25, 2012.
1 © 2006 Adam Pease, Articulate Software - apease [at] articulatesoftware [dot] com Perspectives on UOS Adam Pease Articulate Software
19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick.
Resource Description Framework (RDF) Presented by: Jonathan Catlett.
Michael Eckert1CS590SW: Web Ontology Language (OWL) Web Ontology Language (OWL) CS590SW: Semantic Web (Winter Quarter 2003) Presentation: Michael Eckert.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Wrap-Up Barry Smith. Principles of Ontology Development.
Department of computer science and engineering Two Layer Mapping from Database to RDF Martin Švihla Research Group Webing Department.
Pour mieux affirmer ses missions, le Cemagref devient Irstea Catherine ROUSSEY (Irstea), Jean-Pierre CHANET (irstea), Vincent CELLIER (INRA),
WDO-It! 101 Workshop: Creating an abstraction of a process UTEP’s Trust Laboratory NDR HP MP.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
Ricardo Pereira Software Engineer TDWG Infrastructure Project (TIP)
Semantic Technologies and Application to Climate Data M. Benno Blumenthal IRI/Columbia University CDW /04-01.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
TDWG Life Sciences Identifiers Applicability Statement Ben Richardson Review Manager, LSID Applicability Statement Western Australian Herbarium Department.
CSE 428 Semantic Web Topics Introduction Jeff Heflin Lehigh University.
SPINNING THE SEMANTIC WEB APPLICATIONS FOR THE MODERN ERA LIBRARIES
Creating a Semantic Web with Linked Data Todd King.
Acronym Soup GBIF, TDWG & GUIDs Jerry Cooper. Global Biodiversity Information Facility (GBIF) Established in 2000 through non-binding MOU (25 countries.
1cs The Need “Most of the Web's content today is designed for humans to read, not for computer programs to manipulate meaningfully.” Berners-Lee,
Presented by: Yuhana 12/17/2007 Context Aware Group - Intelligent Agent Laboratory Computer Science and Information Engineering National Taiwan University.
CS621 : Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 12 RDF, OWL, Minimax.
CSE 428 Semantic Web Topics Introduction Jeff Heflin Lehigh University.
Copy right 2004 Adam Pease permission to copy granted so long as slides and this notice are not altered Ontology Overview Introduction.
Review of Core Dave Reynolds. XML syntax [i1] Section 2.1. The example XML syntax lacks any namespace. Should indicate that the final XML syntax will.
© The ATHENA Consortium. Susan Thomas SAP AG, Research Department How do you do semantics? Semantic Web Drawings by Sebastian Cremers Unit 3:
IPT + Darwin Core OBIS XML Schema OBIS Database Schema Explained Mike Flavell OBIS Data Manager OBIS Nodes Training Course, Oostende, Belgium, 6 May 2014.
TDWG Core Ontology J Kennedy R Gales, R Hyam, R Kukla, J Wieczorek, G Hagedorn, M Döering D Vieglais, S Perry, D Hobern.
OWL imports Nick Drummond or “How to make life hard for tool developers”
GB22 TRAINING EVENT FOR NODES – 4 OCTOBER 2015 Session 02: 2015 Data Publishing Landscape Laura Russell.
Introduction to Persistent Identifiers
Training course on biodiversity data publishing and fitness-for-use in the GBIF Network, 2011 edition How Darwin Core Archives have changed the landscape.
Ontology.
OBI – Standard Semantic
Ontology.
HOW (and why?) DO WE DESCRIBE ?
Presentation transcript:

Using Darwin Core as a Model: An Ontologically Minimalist Approach to Publishing Occurrence Data in RDF Joel Sachs Formal Models track of the Semantics for Biodiversity Symposium TDWG 2013

The first thing I want to communicate: Semantics != Ontologies

Semantics = Ontologies ? Semantics – Semiotics – Linguistics – Psychology Ontology – Philosophy – Computer Science

Ontologies as a vehicle for semantics Ontologies were the first choice for putting the “semantic” in semantic web. But ontologies aren’t the only way to supply semantics. Furthermore, ontologies can be a barrier to shared semantics, in a number of ways.

What’s green? Def 1:

What’s green? Def 2: Green is the portion of the electromagnetic spectrum with a wavelength between 520 – 570 nm. What’s electromagnetic? What’s a spectrum? What’s a wavelength? What’s a nanomemter?

Occurrence_ID Location_ID URI DateTime IndividualOrganism_ID URI Location_ID URI Latitude float Longitude float Datum URI Identification_ID Individual_ID URI Taxon URI Identified_by URI Occurrence_ID Latitude Longitude Scientific Name Vernacular Name Taxon_ID Scientific Name Vernacular Name Authorship Year etc. Occurrence Location Identification Taxon Occurrence

There are many ways to think about biodiversity data.

Thing #2 that I want to communicate Darwin Core (as it is) can be used as a light weight “ontology”.

Don’t try this at home

Thing #3 How to minimize the amount of ontology in the Core.

Example: Material Sample dwctype:MaterialSample (roughly?) corresponds to OBI:Specimen.

(forall (x) (if (MaterialEntity x) (IndependentContinuant x))) // axiom label in BFO2 CLIF: [ ] material MaterialEntity (forall (x) (if (and (Entity x) (exists (y t) (and (MaterialEntity y) (continuantPartOfAt x y t)))) (MaterialEntity x))) // axiom label in BFO2 CLIF: [ ] (forall (x) (if (and (Entity x) (exists (y t) (and (MaterialEntity y) (continuantPartOfAt y x t)))) (MaterialEntity x))) // axiom label in BFO2 CLIF: [ ]

curl -L -H "Accept: application/rdf+xml" | grep OBI MaterialSample A resource describing the physical results of a sampling (or subsampling) event. In biological collections, the material sample is typically collected, and either preserved or destructively processed recommended DataSets/DataSet/Units/Unit

curl -L -H "Accept: application/rdf+xml" | grep OBI

On the one hand Nobody forces consuming application to ingest the OBI and BFO ontologies when they ingest Darwin Core. So what’s the big deal?

On the other hand Many semantic web clients automatically fetch and load referenced documents. – Especially if the documents are referenced with important properties like rdfs:subClassOf It’s bad form (and slightly dangerous) to clutter a semantic web document with terms from unnecessary namespaces.

My suggestion? Assertions that tie Core terms to upper ontologies should be asserted in a separate document. E.g. should be asserted in obi.owl, or dwc_obi.owl That way, those doing integration that depends on OBI axioms can ingest the appropriate descriptions. Those that don’t need the OBI axioms don’t have to worry about incorrect inference. – Keep in mind: There is no preferred upper ontology for science on the semantic web. BFO, Dolce, SUMO, UMBEL, NULO, etc.

Thank you for paying attention! Question, comments, and criticism