Controlled Vocabularies: What, Why, How?

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Resource description and access for the digital world Gordon Dunsire Centre for Digital Library Research University of Strathclyde Scotland.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Progress Update Semantic Web, Ontology Integration, and Web Query Seminar Department of Computing David George.
Marine Metadata Interoperability Initiative Congreso Colombiano de Computación - CCC 2007 Abril 18 al 20 de 2007 Pontificia Universidad Javeriana, D.C.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Ontology Notes are from:
Vocabulary management: a foundation for semantic interoperability through ontology development Roy Lowry British Oceanographic Data Centre GO-ESSP, Paris,
SKOS and Other W3C Vocabulary Related Activities Gail Hodge Information International Assoc. NKOS Workshop Denver, CO June 10, 2005.
Introduction to Protégé AmphibiaTree 2006 Workshop Sunday 8:45–9:15 J. Leopold & A. Maglia.
Intelligent Systems Semantic Web. Aims of the session To introduce the basic concepts of semantic web ontologies.
Semantic Web Presented by: Edward Cheng Wayne Choi Tony Deng Peter Kuc-Pittet Anita Yong.
Demonstration of adding content to an ICAN Semantic Resource Roy Lowry, Adam Leadbetter, Olly Clements (NETMAR - BODC) Tanya Haddad (ICAN - OCA)
Ontology Semantic Mediation in the Big Picture MMI Workshop - August 2005.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
Plaintext to governed vocabularies: restoring order to anarchic metadata Roy Lowry British Oceanographic Data Centre Building a Global Data Network Workshop,
The NERC DataGrid Vocabulary Server Roy Lowry British Oceanographic Data Centre Ontology Registry Meeting.
The NERC DataGrid Vocabulary Server: an operational system with distributed ontology potential Roy Lowry British Oceanographic Data Centre GO-ESSP 2008,
RDF: Concepts and Abstract Syntax W3C Recommendation 10 February Michael Felderer Digital Enterprise.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
9/10/20151 SKOS. 9/10/20152 SKOS Describes thesauruses and taxonomies Properties: broader, narrower, subject, related Classes: Concept, Collection
Practical RDF Chapter 1. RDF: An Introduction
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
Logics for Data and Knowledge Representation
NERC DataGrid Vocabulary Server Access Vocabulary Workshop, RAL, February 25, 2009.
The MMI Tools Carlos Rueda Monterey Bay Aquarium Research Institute OOS Semantic Interoperability Workshop Marine Metadata Interoperability Project Boulder,
By: Dan Johnson & Jena Block. RDF definition What is Semantic web? Search Engine Example What is RDF? Triples Vocabularies RDF/XML Why RDF?
D4: SKOS and HIVE—Enhancing the Creation, Design and Flow of Information Speakers: Hollie White Jane Greenberg Coordinator: Alan Keely.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
DDI-RDF Leveraging the DDI Model for the Linked Data Web.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
NERC DataGrid NERC DataGrid Vocabulary Server Use Cases Vocabulary Workshop, RAL, February 25, 2009.
Coastal Atlas Interoperability - Ontologies (Advanced topics that we did not get to in detail) Luis Bermudez Stephanie Watson Marine Metadata Interoperability.
Semantic Web - an introduction By Daniel Wu (danielwujr)
1 What is an Ontology? n No exact definition n A tool to help organize knowledge n Or a way to convey a theory on how to represent a class of things n.
The Semantic Web and expert metadata: pull apart then bring together Presented at 12.seminar Arhivi, Knjižnice, Muzeji Nov 2008, Pore č, Croatia.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
PHS / Department of General Practice Royal College of Surgeons in Ireland Coláiste Ríoga na Máinleá in Éirinn Knowledge representation in TRANSFoRm AMIA.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Metadata, Resources, and the RDF 김민수 Chapter 1. Creating the Sementic Web with RDF2 Overview Knowledge Representation Library Metadata RDFRDF.
Creating a Semantic Web with Linked Data Todd King.
JISC/NSF PI Meeting, June Archon - A Digital Library that Federates Physics Collections with Varying Degrees of Metadata Richness Department of Computer.
1cs The Need “Most of the Web's content today is designed for humans to read, not for computer programs to manipulate meaningfully.” Berners-Lee,
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lotzi Bölöni.
Trait ontology approach Marie-Angélique LAPORTE NCEAS June 7 th 2010.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
26/02/ WSMO – UDDI Semantics Review Taxonomies and Value Sets Discussion Paper Max Voskob – February 2004 UDDI Spec TC V4 Requirements.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Usage of BODC parameter vocabularies
Service-Oriented Computing: Semantics, Processes, Agents
Introduction to Persistent Identifiers
Sharing lessons through effective modelling
Service-Oriented Computing: Semantics, Processes, Agents
Resource Description Framework
Service-Oriented Computing: Semantics, Processes, Agents
Web Ontology Language for Service (OWL-S)
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
Ontology.
Semantic Web: Core Concepts and Mechanisms
PREMIS Tools and Services
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
Semantic Statistics DDI Lifecycle: Moving Forward Outcome of the Recent Workshops in Dagstuhl Joachim Wackerow.
Taxonomy of public services
Taxonomy of public services
Presentation transcript:

Controlled Vocabularies: What, Why, How? Vocabulary Workshop, RAL, February 25, 2009 Controlled Vocabularies: What, Why, How?

Metadata Love it or hate it without metadata automated data handling isn’t possible For automated data handling to be possible across distributed data sources metadata standards are required Standardised metadata comprises fields that represent real world entities such as location, time, phenomena, etc.

Metadata These fields need to be populated Plaintext may be used. Makes population easy, but it’s next to useless. Some real examples: A wide variety of chemical and biological parameters Amplitude de l'echo retrodiffuse Cu, Zn, Fe, Pb, Cd, Cr, Ni in biota MACR0-MEIOFAUNA,SED BIOCHEMISTRY,ZOOPLANKTON, CILIATES,BACT CELLS,BACT BIOMASS,LEUCINE UPT,PRIM. PROD,METABOL, COCCOLITH Plaintext should be confined to abstracts

Controlled Vocabularies Much better to use concepts labelled using universally agreed terms that have universally agreed meanings A collection of concepts designed to populate a given metadata field may be called a controlled vocabulary Controlled vocabularies Ensure consistent spellings Ensure consistent syntax Well-managed controlled vocabularies Prevent metadata misunderstandings Maintain a static relationship between metadata fields and the real world

Thesuari Concepts within a controlled vocabulary may be semantically connected using simple relationships: Blue broader colour Colour narrower blue Colour related pigmentation Concepts from different controlled vocabularies describing the same type of thing may be semantically connected using simple mapping relationships: Bacillariophycaea exactMatch diatoms IPTS68 temperature closeMatch ITS90 temperature Nutrients in rivers relatedMatch nitrate in water bodies Salinity broadMatch physical oceanography Physical oceanography narrowMatch salinity The results may termed thesauri

Ontologies But what if the controlled vocabularies describe different types of thing? We can relate them by increasing the semantic richness of the relationships For example: We could have a controlled vocabulary of instruments We could also have a controlled vocabulary of parameters

Ontologies We can link these up using relationships such as: Themosalinograph measures salinity Fluorometer measures chlorophyll Air temperature measuredBy psychrometer The result may be termed an ontology

Ontologies Ontology relationships are: Semantically rich Potentially abundant Software agents need to have some relationship understanding to exploit the knowledge encoded in the ontology This is achieved through relationships describing relationships called rules

Knowledge Representation Relationships between concepts may be expressed using Resource Description Framework (RDF) W3C standard XML encoding having ‘triples’ as its basic building block Each triple has a subject, a predicate and an object. For example: Colour related pigmentation Thermosalinograph measures salinity Familiar?

Knowledge Representation Controlled vocabularies (concept collections) and thesauri may be represented using the Simple Knowledge Organization System (SKOS) W3C standard XML schema based on RDF Jointly developed by STFC and Manchester University Computer Science 2008 version is the one to use

Knowledge Representation <?xml version="1.0" ?> - <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:skos="http://www.w3.org/2004/02/skos/core#" xmlns:dc="http://purl.org/dc/elements/1.1/"> - <skos:Concept rdf:about="http://vocab.ndg.nerc.ac.uk/term/P011/116/TEMPS901">   <skos:externalID>SDN:P011:116:TEMPS901</skos:externalID>   <skos:prefLabel>Temperature (ITS-90) of the water column by CTD or STD</skos:prefLabel>   <skos:altLabel>CTDTmp90</skos:altLabel>   <skos:definition>Unavailable</skos:definition>   <dc:date>2009-02-09T10:45:32.262+0000</dc:date>   <skos:broadMatch rdf:resource="http://vocab.ndg.nerc.ac.uk/term/P021/37/TEMP" />   </skos:Concept>   </rdf:RDF>

Knowledge Representation Ontologies may be represented using Web Ontology Language (OWL) W3C standard XML schema based on RDF Example OWL document http://mida.ucc.ie/ont/20080124/theme.owl Alternative simple text encodings are available such as Open Biomedical Ontologies (OBO) OBO used for NERC-related EnvO ontology

Knowledge Management Tools RDF Tools abound – see for example http://planetrdf.com/guide/ Jena is one of the better known SKOS See the SKOS Tool Shed http://esw.w3.org/topic/SkosDev/ToolShed Note this includes a Protégé plugin

Knowledge Management Tools Protégé with appropriate plugin is the most widely used There are commercial alternatives such as TopBraid Composer MMI (http://marinemetadata.org) has developed a vocabulary to OWL converter (voc2OWL) OBO Text so text tools work OWL and SKOS converters available

Knowledge Management Tools Mapping MMI have developed a mapping tool (VINE) to build maps from two OWL files Visualisation Concept maps are useful Cmap tools is very good FreeMind (open source)