Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.

Presentation transcript:

Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University

Dealing with data Science 11 February 2011:

Examples of Big Data Genomics and proteomics data (e.g., next generation sequencing and mass spectrometry) Earth science data (e.g., satellite images) Electronic health records Social network data (e.g., Facebook, YouTube, …)

Big Data in Genome Sciences

Can Google answer every question?

Problem with Keyword Search Find the most recent image of the person “Kei Hoi Cheung” (image search results, as labeled on the slide: Kei (Hoi) Cheung (>20 years ago); Kei (Hoi) Cheung (more recent); Kei (Hui) Cheung: Not me!; I’m NOT a company!)

Data Science Extraction of knowledge from data (and metadata) Machine learning Natural language processing High performance computing

Knowledge Bases Artificial Intelligence Machine-readable (reasonable) knowledge representation Ontologies Semantic web

Data Science & Knowledge Base (figure labels: Data Science, Knowledge Base)

What is an ontology? An ontology is a specification of a conceptualization. It is a description of the concepts and their relationships that exist for a particular domain.

Knowledge Web

Knowledge Web Data Integration

Semantic Web: Web 3.0 The Semantic Web provides a common machine-readable ontology framework that allows data to be represented, shared and reused across application, enterprise, and community boundaries –The Semantic Web is a knowledge web of data The Semantic Web is about two things –It is about common formats for identification, representation, and integration of data drawn from diverse sources –It is also about languages for describing how the data relates to real world objects

Layers of the Semantic Web

Web 3.0: Semantic Web (Cont’d) Global identifying scheme (URI) Standard data modeling languages (RDF, RDFS, OWL) Standard query languages (SPARQL) Enabling tools/technologies (e.g., Protégé, Jena, triplestores)

Resource Description Framework (RDF) It is a standard data model (directed labeled graph) for representing information (metadata) about resources in the World Wide Web. In general, it can be used to represent information about “things” or “resources” that can be identified (using URIs) on the Web. It is intended to provide a simple way to make statements (descriptions) about Web resources.

Uniform Resource Identifiers (URIs) A URI is a string of characters used to identify or name a resource on the Internet. URLs (Uniform Resource Locators) are a particular type of URI, used for resources that can be accessed on the WWW (e.g., web pages) In RDF, URIs typically look like “normal” URLs, often with fragment identifiers to point at specific parts of a document: – (id for “core cell cycle protein” in Cell Cycle Ontology)
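
To make the fragment-identifier point concrete, here is a minimal sketch using Python's standard urllib.parse module. The URI below is a made-up placeholder, not the Cell Cycle Ontology identifier from the slide (which is missing from this transcript).

```python
from urllib.parse import urldefrag, urlparse

# Hypothetical URI, made up for illustration; it does not come from the slides.
uri = "http://example.org/ontology/cell-cycle#core_cell_cycle_protein"

# urldefrag separates the document part from the fragment identifier.
document, fragment = urldefrag(uri)
parts = urlparse(uri)

print("scheme:  ", parts.scheme)
print("host:    ", parts.netloc)
print("document:", document)
print("fragment:", fragment)
```

The part before "#" names the document (here, an imagined ontology file) and the fragment names one term inside it.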

RDF Triple/Graph The basic information unit in RDF is an RDF statement in the form of –(subject, property, object) Each RDF statement can be modeled as a graph comprising two nodes connected by a directed arc A triple example A set of such triples can jointly form a directed labeled graph (DLG) that can in theory model a significant part of domain knowledge. An RDF graph can be represented in different formats (XML, Turtle, N3…)
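
As a rough illustration of the triple model described above, the following sketch stores (subject, property, object) statements as plain Python tuples, groups them into a directed labeled graph, and prints them in an N-Triples-like form. The namespace and the resource names (ProteinA, ProteinB, and so on) are hypothetical; a real application would more likely use an RDF library and real URIs.

```python
from collections import defaultdict

EX = "http://example.org/"  # hypothetical namespace, not from the slides

# Each RDF statement is one (subject, property, object) tuple.
triples = {
    (EX + "ProteinA", EX + "located_in", EX + "Nucleus"),
    (EX + "ProteinA", EX + "interacts_with", EX + "ProteinB"),
    (EX + "ProteinB", EX + "located_in", EX + "Cytoplasm"),
}

def outgoing_arcs(triples):
    """Group triples by subject: node -> list of (arc label, target node)."""
    graph = defaultdict(list)
    for s, p, o in sorted(triples):
        graph[s].append((p, o))
    return dict(graph)

def to_ntriples(triples):
    """Serialize the triples one per line, in an N-Triples-like style."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in sorted(triples))

graph = outgoing_arcs(triples)
print(to_ntriples(triples))
```

Because ProteinA appears as the subject of two triples, the two statements share a node, which is how separate triples join into one graph.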

Linking data of the same type from multiple sources is a

Linking data across different types

Named Graph (figure labels: statements about biordf:P05067 with arcs “located in” and “Interacts with”; Meta Statement: “Created by” foaf:kei_cheung)
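
One common way to think about a named graph is as a set of quads: each triple carries the name of the graph it belongs to, and a meta statement is an ordinary triple whose subject is the graph name. The sketch below assumes that reading of the slide. The identifiers biordf:P05067 and foaf:kei_cheung appear on the slide; the graph name, the ex: properties, and the object values are hypothetical.

```python
# Each quad is (subject, property, object, graph name).
# None as the graph name stands for the default (unnamed) graph.
quads = [
    ("biordf:P05067", "ex:located_in", "ex:SomeLocation", "ex:graph1"),
    ("biordf:P05067", "ex:interacts_with", "ex:SomeProtein", "ex:graph1"),
    # Meta statement: a statement about the named graph itself.
    ("ex:graph1", "ex:created_by", "foaf:kei_cheung", None),
]

def triples_in(quads, graph_name):
    """Return the triples that belong to one named graph."""
    return [(s, p, o) for s, p, o, g in quads if g == graph_name]

def statements_about(quads, graph_name):
    """Return (property, object) pairs whose subject is the graph itself."""
    return [(p, o) for s, p, o, g in quads if s == graph_name]

print(triples_in(quads, "ex:graph1"))
print(statements_about(quads, "ex:graph1"))
```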

Cell Cycle Ontology (CCO) (Antezana et al., 2009, Genome Biology)

RDF Graph Match (SPARQL) BASE PREFIX rdfs: PREFIX ssb: SELECT ?protein_label WHERE { GRAPH { ?protein ssb:is_a ssb:CCO_B ?protein rdfs:label ?protein_label } core cell cycle protein
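
The query above is a graph match: two triple patterns share the variable ?protein, and every consistent binding of the variables is a result. The following is a toy sketch of that idea in plain Python, not a SPARQL engine. The data, the ex: identifiers, and the function names are hypothetical, and the class name stands in for the CCO identifier that is truncated in this transcript.

```python
# Toy data; all identifiers and labels are made up for illustration.
triples = [
    ("ex:p1", "ssb:is_a", "ex:CoreCellCycleProtein"),
    ("ex:p1", "rdfs:label", "protein one"),
    ("ex:p2", "ssb:is_a", "ex:OtherProtein"),
    ("ex:p2", "rdfs:label", "protein two"),
    ("ex:p3", "ssb:is_a", "ex:CoreCellCycleProtein"),
    ("ex:p3", "rdfs:label", "protein three"),
]

def extend(pattern, triple, binding):
    """Return a binding extended to make pattern match triple, or None."""
    new_binding = dict(binding)
    for term, value in zip(pattern, triple):
        if term.startswith("?"):          # variable: bind it or check consistency
            if new_binding.setdefault(term, value) != value:
                return None
        elif term != value:               # constant: must match exactly
            return None
    return new_binding

def select(triples, patterns):
    """Join the triple patterns one by one, keeping all consistent bindings."""
    bindings = [{}]
    for pattern in patterns:
        next_bindings = []
        for binding in bindings:
            for triple in triples:
                extended = extend(pattern, triple, binding)
                if extended is not None:
                    next_bindings.append(extended)
        bindings = next_bindings
    return bindings

patterns = [
    ("?protein", "ssb:is_a", "ex:CoreCellCycleProtein"),
    ("?protein", "rdfs:label", "?protein_label"),
]
labels = sorted(b["?protein_label"] for b in select(triples, patterns))
print(labels)
```

Only ex:p1 and ex:p3 satisfy both patterns, so only their labels are returned.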

Linked Data Cloud

RDF Schema (RDFS) RDF Schema terms: –Class –Property –type –subClassOf –range –domain Example:
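
The RDFS terms listed above let software derive statements that were never written down. The sketch below applies a small subset of the standard RDFS entailment patterns (subClassOf transitivity, type inheritance, domain, and range) until nothing new appears. The class, property, and instance names are hypothetical, and this is a simplified illustration rather than a complete RDFS reasoner.

```python
def rdfs_closure(triples):
    """Repeatedly apply a few RDFS rules until no new triples are produced."""
    inferred = set(triples)
    while True:
        new = set()
        for s, p, o in inferred:
            if p == "rdfs:subClassOf":
                for s2, p2, o2 in inferred:
                    if p2 == "rdfs:subClassOf" and s2 == o:
                        new.add((s, "rdfs:subClassOf", o2))   # transitivity
                    if p2 == "rdf:type" and o2 == s:
                        new.add((s2, "rdf:type", o))          # type inheritance
            elif p == "rdfs:domain":
                for s2, p2, o2 in inferred:
                    if p2 == s:
                        new.add((s2, "rdf:type", o))          # subject gets the domain class
            elif p == "rdfs:range":
                for s2, p2, o2 in inferred:
                    if p2 == s:
                        new.add((o2, "rdf:type", o))          # object gets the range class
        if new <= inferred:
            return inferred
        inferred |= new

# Hypothetical schema and data, for illustration only.
schema_and_data = {
    ("ex:Kinase", "rdfs:subClassOf", "ex:Enzyme"),
    ("ex:Enzyme", "rdfs:subClassOf", "ex:Protein"),
    ("ex:encodes", "rdfs:domain", "ex:Gene"),
    ("ex:encodes", "rdfs:range", "ex:Protein"),
    ("ex:kinase1", "rdf:type", "ex:Kinase"),
    ("ex:gene1", "ex:encodes", "ex:kinase1"),
}

closure = rdfs_closure(schema_and_data)
print(sorted(closure - schema_and_data))
```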

Relational table -> RDF -> RDFS ontology
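
One simple way to picture the table-to-RDF step is: each row becomes a subject, each column becomes a property, and each cell becomes an object. The sketch below assumes that mapping. The table contents, the base namespace, and the function name are hypothetical, and real mappings (for example, ones that turn foreign keys into links) are more involved.

```python
# A hypothetical relational table, represented as a list of rows.
rows = [
    {"id": "P1", "name": "protein one", "organism": "human"},
    {"id": "P2", "name": "protein two", "organism": "mouse"},
]

def row_to_triples(row, base="http://example.org/", key="id", class_name="Protein"):
    """Map one row to triples: the key column names the subject, other columns become properties."""
    subject = base + row[key]
    triples = [(subject, "rdf:type", base + class_name)]
    for column, value in row.items():
        if column != key:
            triples.append((subject, base + column, value))
    return triples

table_triples = [t for row in rows for t in row_to_triples(row)]
for triple in table_triples:
    print(triple)
```

The rdf:type triple is what connects the converted rows to a class in an RDFS ontology.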

Web Ontology Language (OWL) It is more semantically expressive than RDF and RDFS, but it uses the same RDF syntax –Additional constructs such as cardinality constraints, sameAs, etc. It has three species: OWL Lite, OWL DL, OWL Full

OWL DL Representation (Subsumption) :Nucleus a owl:Class ; rdfs:subClassOf [ a owl:Restriction ; owl:onProperty :part_of ; owl:someValuesFrom :Cell ] Necessary but not sufficient condition: every nucleus is part of some cell, but not everything that is part of a cell is a nucleus
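
The difference between a necessary and a sufficient condition can be checked on a few explicit facts. The sketch below tests whether an individual satisfies the restriction "part_of some Cell" and shows that a non-nucleus can satisfy it too. All individuals and the Mitochondrion class are hypothetical, and the check is a closed-world lookup over listed facts, which is a simplification: an actual OWL reasoner works under open-world semantics and would infer, rather than look up, that a nucleus is part of some cell.

```python
# Hypothetical facts, for illustration only.
facts = {
    ("ex:nucleus1", "rdf:type", "ex:Nucleus"),
    ("ex:nucleus1", "ex:part_of", "ex:cell1"),
    ("ex:mito1", "rdf:type", "ex:Mitochondrion"),
    ("ex:mito1", "ex:part_of", "ex:cell1"),
    ("ex:cell1", "rdf:type", "ex:Cell"),
}

def has_type(facts, individual, cls):
    """True if the individual is explicitly asserted to belong to cls."""
    return (individual, "rdf:type", cls) in facts

def some_values_from(facts, individual, prop, cls):
    """True if the individual has at least one prop-value that belongs to cls."""
    return any(
        s == individual and p == prop and has_type(facts, o, cls)
        for s, p, o in facts
    )

for individual in ("ex:nucleus1", "ex:mito1"):
    print(
        individual,
        "is a Nucleus:", has_type(facts, individual, "ex:Nucleus"),
        "| part_of some Cell:", some_values_from(facts, individual, "ex:part_of", "ex:Cell"),
    )
```

Both individuals satisfy the restriction, but only one is a nucleus, so satisfying the restriction alone does not make something a nucleus.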

OWL Reasoning Which proteins participate in “mitosis”? :Protein a owl:Class ; rdfs:subClassOf [ a owl:Restriction ; owl:onProperty :participates_in ; owl:someValuesFrom :Mitosis ]

Semantic Web Rule Language (SWRL = OWL + Rules) hasParent(?x1,?x2) ∧ hasBrother(?x2,?x3) ⇒ hasUncle(?x1,?x3)
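
The uncle rule above can be read as a join: wherever the parent in a hasParent fact is also the subject of a hasBrother fact, a hasUncle fact follows. The sketch below applies that single rule to sets of pairs. The individuals are hypothetical, and this is a hand-written illustration of the rule, not a SWRL engine.

```python
def infer_has_uncle(has_parent, has_brother):
    """Apply hasParent(x1, x2) AND hasBrother(x2, x3) => hasUncle(x1, x3)."""
    return {
        (x1, x3)
        for (x1, x2) in has_parent
        for (y2, x3) in has_brother
        if x2 == y2
    }

# Hypothetical individuals, for illustration only.
has_parent = {("alice", "bob"), ("carol", "dave")}
has_brother = {("bob", "ed"), ("bob", "frank")}

has_uncle = infer_has_uncle(has_parent, has_brother)
print(sorted(has_uncle))
```

Here alice gains two uncles because her parent bob has two brothers, while carol gains none because no brother is recorded for dave.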

SW Enabling Technologies Ontology editor (e.g., Protégé) Triple store (e.g., Virtuoso) OWL reasoner (e.g., Pellet) SWRL reasoner (e.g., Protégé plug-in)

Biomedical ontologies available in RDF/OWL format UniProt Gene Ontology NCI Metathesaurus Cell Ontology Sequence Ontology Protein Ontology These and many more ontologies are available in ontology repositories such as the NCBO BioPortal (

Applications of Ontologies

Siri

Google Knowledge Graph

Semantic Medline

Questions to be answered Patient P has a tumor recurrence with new mutations X and Y – which drugs should be used? In estradiol-treated SKBR3 cells, which nuclear protein complexes have the greatest change in phosphorylation? What is the largest number of genes one can knock out of Mycoplasma for it to remain viable?

The End