Using DAML format for representation and integration of complex gene networks: implications in novel drug discovery K. Baclawski Northeastern University.

Slides:



Advertisements
Similar presentations
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Advertisements

Jim Hendler Chief Scientist - Information Systems Office DARPA.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
XML: Extensible Markup Language
Semantic Web Thanks to folks at LAIT lab Sources include :
CS570 Artificial Intelligence Semantic Web & Ontology 2
The Semantic Web. The Web Today Designed for Human to read Cannot express meaning Architecture: URL –Decentralized: Link structure Language: html.
Interoperability of Distributed Component Systems Bryan Bentz, Jason Hayden, Upsorn Praphamontripong, Paul Vandal.
XML and Bioinformatics Rajvi Shah. What is XML ? XML stands for EXtensible Markup Language XML is a markup language much like HTML XML was designed to.
Ontology Notes are from:
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
The Semantic Web: Implications for Future Intelligent Systems Lee McCluskey, Artform Research Group, Department of Computing And Mathematical Sciences,
ModelicaXML A Modelica XML representation with Applications Adrian Pop, Peter Fritzson Programming Environments Laboratory Linköping University.
Semantic Web Tools for Authoring and Using Analysis Results Richard Fikes Robert McCool Deborah McGuinness Sheila McIlraith Jessica Jenkins Knowledge Systems.
More XML namespaces, DTDs CS 431 – February 16, 2005 Carl Lagoze – Cornell University.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
From SHIQ and RDF to OWL: The Making of a Web Ontology Language
1 Technologies and Modelling Frameworks XML ontology RDF taxonomy OWL thesaurus Semantic Web.
1 Advanced Topics XML and Databases. 2 XML u Overview u Structure of XML Data –XML Document Type Definition DTD –Namespaces –XML Schema u Query and Transformation.
BTN323: INTRODUCTION TO BIOLOGICAL DATABASES Day2: Specialized Databases Lecturer: Junaid Gamieldien, PhD
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
MotifML A Novel Ontology-based XML Model for Data- Exchange of Regulatory DNA Motif Profiles Eric Neumann, Beyond Genomics Tian Niu, Harvard University.
Practical RDF Chapter 1. RDF: An Introduction
Extending UML to Support Ontology Engineering Kenneth Baclawski and Mieczylaw K. Kokar Northeastern University Paul A. Kogut, William S. Holmes III and.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
Introduction to XML. XML - Connectivity is Key Need for customized page layout – e.g. filter to display only recent data Downloadable product comparisons.
Cornell CS 502 More XML XHTML, namespaces, DTDs CS 502 – Carl Lagoze – Cornell University.
1 Ontology-based Semantic Annotatoin of Process Template for Reuse Yun Lin, Darijus Strasunskas Depart. Of Computer and Information Science Norwegian Univ.
Of 33 lecture 3: xml and xml schema. of 33 XML, RDF, RDF Schema overview XML – simple introduction and XML Schema RDF – basics, language RDF Schema –
EU Project proposal. Andrei S. Lopatenko 1 EU Project Proposal CERIF-SW Andrei S. Lopatenko Vienna University of Technology
What is an Ontology? An ontology is a specification of a conceptualization that is designed for reuse across multiple applications and implementations.
UNCERTML - DESCRIBING AND COMMUNICATING UNCERTAINTY WITHIN THE (SEMANTIC) WEB Matthew Williams
More XML namespaces, DTDs CS 431 – Carl Lagoze – Cornell University.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Chapter 27 The World Wide Web and XML. Copyright © 2004 Pearson Addison-Wesley. All rights reserved.27-2 Topics in this Chapter The Web and the Internet.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Metadata Schema for CERIF Andrei Lopatenko Vienna University of Technology
Oreste Signore- Quality/1 Amman, December 2006 Standards for quality of cultural websites Ministerial NEtwoRk for Valorising Activities in digitisation.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
Ontologies Working Group Agenda MGED3 1.Goals for working group. 2.Primer on ontologies 3.Working group progress 4.Example sample descriptions from different.
Internet & World Wide Web How to Program, 5/e. © by Pearson Education, Inc. All Rights Reserved.2.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Mining the Biomedical Research Literature Ken Baclawski.
Faculty Faculty Richard Fikes Edward Feigenbaum (Director) (Emeritus) (Director) (Emeritus) Knowledge Systems Laboratory Stanford University “In the knowledge.
Web Technologies for Bioinformatics Ken Baclawski.
PRACTICAL KNOWLEDGE REPRESENTATION FOR THE WEB Frank van Harmelen Dieter Fensel AIFB Kim Kangil Structural Complexity Laboratory.
Representing Data with XML February 26, 2004 Neal Arthorne.
XML CSC1310 Fall HTML (TIM BERNERS-LEE) HyperText Markup Language  HTML (HyperText Markup Language): December  Markup  Markup is a symbol.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
Universal fuzzy system representation with XML Authors : Chris Tseng, Wafa Khamisy, Toan Vu Source : Computer Standards & Interfaces, Volume 28, Issue.
From XML to DAML – giving meaning to the World Wide Web Katia Sycara The Robotics Institute
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
The Semantic Web By: Maulik Parikh.
DOMAIN ONTOLOGY DESIGN
Web Ontology Language for Service (OWL-S)
Eugenia Fernandez IUPUI
Semantic Web Technologies
Multi-agent system for web services
Semantic Web - Ontologies
Extending UML to Support Ontology Engineering
Semantic Markup for Semantic Web Tools:
Presentation transcript:

Using DAML format for representation and integration of complex gene networks: implications in novel drug discovery K. Baclawski Northeastern University E. Neumann Beyond Genomics T. Niu Harvard School of Public Health

Abstract Great strides have been made in understanding the complex molecular networks underlying biological systems. Advances in high-throughput microchip-based assays are providing us with global gene activity profiles characterizing the output of the gene regulatory network. The eXtensible Markup Language (XML) is becoming a key component in data exchange in biology and currently there are over a dozen XML DTDs and schemas for bioinformatic applications, including BioML, CellML, RNAML, DDBJ-ML and BSML. However, there are intrinsic limitations of using XML in representing biopathways because of its hierarchical nature. To address this problem, we introduced a new way of knowledge representation using DARPA Agent Markup Language (DAML) schemas through ontology-based models for integration of complex genetic networks. This new approach has overcome the restriction of the use of hierarchical data structure, and can facilitate automatic retrieval and update of gene network-related data and can be transformed into other XML dialects. Our prototype is expected to shed novel insights on genetic networks that will help our efforts towards better understanding of complex disease etiology, which in turn, can have significant expectations in clinical pharmacogenomics.

Introduction The Biological Knowledge Onslaught n Advances in gene sequences and microchip- based assays are producing large amounts of complex information about biological systems. n The many incompatible formats make it difficult for applications to interoperate. n Complex linkages between information from multiple sources/formats is essential for understanding complex disease etiology.

XML for Data Interchange n Commonly recognized format for data interchange supports interoperation. n Many tools are available including XSLT, a powerful transformation language for XML. n XML supports deep, semantically rich data. n Numerical data, text and sequences can be specified in the same document. n Document Type Definition (DTD) specifies the meaning of tags and content of elements.

Example of BioML The browser will ignore any symbol that cannot be a nucleotide residue, so the numbers can remain in place to aid the author. 1 ctcgaggggc ctagacattg ccctccagag agagcaccca acaccctcca

Limitations of XML n Extending XML DTDs is very difficult. – Existing tools will only recognize extensions already provided by the original design. n Linking information in different documents is difficult even for the same DTD. n Merging information from different markup languages is usually impossible. n XML does not support concept taxonomies. n Limited support for general relationships.

Methods Ontology based models n Formal, declarative semantic models. n Includes vocabulary terms, general relationships, constraints and rules. n Supports concept taxonomies. n Supports merging of information from disparate sources in many formats. n Rules permit reasoning by autonomous agents at run-time.

RDF and DAML n The Resource Description Framework (RDF) and DARPA Agent Markup Language (DAML) are the basis for the Semantic Web. n DAML is an ontology language for semantic interoperability between Web pages, databases, programs and sensors. n DAML is based on RDF, and RDF is based on XML.

Results PathML Ontology and Language n PathML is a DAML ontology for biopathways as well as a markup language for representing specific examples of complex biological pathways. n Facilitates automatic retrieval and update of gene network-related data. n Relevant PathML information can be easily transformed into XML markup languages using the XSLT transformation language.

ReceptorSignallingProtein Deoxynucleotide Species DefenseimmunityProtein DNAStructure TwoComponentSensorMolecule ComplementDNA Transporter LigandBindingOrCarrier ProteinTagging Ribozyme Chromosome CytoskeletalRegulator boundTo hasStrandedness stronglyBinds translatedFrom hasSpecies polymerOf formsPolymer stronglyBoundTo partOf catalysedBy The PathML ontology Examples of Classes Examples of Properties

Part of the PathML taxonomy Chemical SmallMolecule IonMacromolecule DNA RNAProtein Cofactor Nucleotide MetalIon

Discussion and Conclusion n The ontology and markup language PathML has been developed for representing complex biological pathways. n PathML is compatible with the many XML- based markup languages being developed biological data interchange. n PathML supports automatic retrieval and update of gene network-related data from a variety of sources.

MotifML n MotifML is a DAML ontology for gene regulation as well as a markup language for gene regulation motifs. n Facilitates automatic retrieval and update of gene network-related data. n Relevant MotifML information can be easily transformed into XML markup languages.

Genetic Entity Regulatory Entity Gene Species Motif Regulation Protein Domain Protein Regulatory Region contained in regulatory relationship activated by motif species regulates Red arrows denote subclass relationships. Part of the MotifML Ontology