IBM Watson Research © 2004 IBM Corporation BioHaystack: Gateway to the Biological Semantic Web Dennis Quan

Presentation transcript:


Problems in bioinformatics
• A myriad of public databases hold specific facets of information about biological objects of interest (e.g., proteins, genes)
• Databases have their own access protocols, data formats, naming conventions, and means of describing relationships between objects in different databases
• Different software is required to view information from different databases
  – User must be keenly aware of which tool or site to use
  – Relevant information comes in fragments
  – Exploration process is discontinuous

A common naming convention: LSID URNs
• Life Sciences Identifiers (LSIDs) are URNs for biological objects that are backed by RDF metadata:
  – E.g., urn:lsid:ncbi.nlm.nih.gov.lsid.i3c.org:genbank:nm_
• LSID and LSID protocol (SOAP-based) specification sponsored by I3C and undergoing standardization by OMG
• Most of the publicly available bioinformatics databases are available via LSID today
  – PDB LSID authority online; “proxy” LSID authorities for databases such as NIH databases and SwissProt hosted by I3C
• Really easy to set up LSID clients and servers
  – IBM Internet Technology group provides Open Source LSID client and server software for a variety of languages and platforms
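As a rough illustration of the naming convention, the sketch below splits an LSID into its parts, assuming the usual layout urn:lsid:authority:namespace:object with an optional trailing revision. The `Lsid` type and `parse_lsid` function are hypothetical names introduced here, not part of the IBM LSID software, and the identifier in the usage note is made up because the object part of the slide's example is truncated.

```python
from typing import NamedTuple, Optional


class Lsid(NamedTuple):
    """Parts of an LSID (hypothetical helper type, not from any LSID library)."""
    authority: str
    namespace: str
    object_id: str
    revision: Optional[str] = None


def parse_lsid(text: str) -> Lsid:
    """Split 'urn:lsid:authority:namespace:object[:revision]' into its parts."""
    parts = text.split(":")
    if len(parts) not in (5, 6):
        raise ValueError(f"expected 5 or 6 colon-separated parts: {text!r}")
    urn, scheme, authority, namespace, object_id = parts[:5]
    # The 'urn' and 'lsid' prefixes are compared case-insensitively.
    if urn.lower() != "urn" or scheme.lower() != "lsid":
        raise ValueError(f"not an LSID URN: {text!r}")
    if not (authority and namespace and object_id):
        raise ValueError(f"empty authority, namespace, or object: {text!r}")
    # A missing or empty sixth part means no revision was given.
    revision = parts[5] if len(parts) == 6 and parts[5] else None
    return Lsid(authority, namespace, object_id, revision)
```

For example, `parse_lsid("urn:lsid:example.org:proteins:p12345")` yields the authority `example.org`, the namespace `proteins`, and the object `p12345`, with no revision. Applied to the slide's example, the authority would be `ncbi.nlm.nih.gov.lsid.i3c.org` and the namespace `genbank`.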

RDF/XML: on demand data integration
[Figure: three sources each contribute one statement about the same "human hemoglobin LSID" node, and the statements combine into a unified view.
  – GenBank: human hemoglobin LSID, derives from, atagccgta cctgcgagt ctagaagct
  – Gene Ontology: human hemoglobin LSID, is a, oxygen transport protein
  – PDB: human hemoglobin LSID, has 3D structure
  – Unified view: one human hemoglobin LSID node carrying all three relationships]
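The integration idea on this slide can be sketched in a few lines: because every source names the object with the same LSID, merging their RDF statements amounts to a set union of (subject, predicate, object) triples. The code below is a minimal sketch using plain Python tuples rather than an RDF library; the LSID string, the placeholder for the 3D structure, and the function names `merge_graphs` and `describe` are assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical LSID; the slide does not give the real identifier.
HEMOGLOBIN = "urn:lsid:example.org:proteins:human-hemoglobin"

# One statement per source, mirroring the figure on the slide.
genbank = {(HEMOGLOBIN, "derives from", "atagccgta cctgcgagt ctagaagct")}
gene_ontology = {(HEMOGLOBIN, "is a", "oxygen transport protein")}
pdb = {(HEMOGLOBIN, "has 3D structure", "<3D structure record>")}  # placeholder value


def merge_graphs(*graphs):
    """Union the triple sets; a shared subject identifier joins them for free."""
    merged = set()
    for graph in graphs:
        merged |= graph
    return merged


def describe(graph, subject):
    """Collect every predicate and its values for one subject (the unified view)."""
    view = defaultdict(set)
    for s, p, o in graph:
        if s == subject:
            view[p].add(o)
    return dict(view)


unified = merge_graphs(genbank, gene_ontology, pdb)
view = describe(unified, HEMOGLOBIN)
```

Here `view` maps each of the three relationship names to the value contributed by its source, which is the "unified view" the figure depicts. No schema mapping step is needed in this sketch because the sources already agree on the subject's name.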

Haystack: letting users interact with their data
• Haystack is a tool for creating, exploring, and organizing information:
  – Personal information: e-mails, contacts, documents, etc.
  – Bioinformatics: proteins, publications, genes, etc.
• Research project originating from MIT CSAIL
• Uses RDF as an underlying data model
• Built on Java and Eclipse, IBM’s Open Source rich client platform

Browsing highly interconnected information
• Single screen presents multiple facets of a single object originating from separate databases
• Users navigate space like a Web browser: hyperlinking, drag and drop, etc.

Personalization
• People keep track of their information by personalizing their workspaces:
  – Grouping paperwork into folders
  – Highlighting important text in documents
  – Attaching sticky notes as reminders
  – Jotting down lists of related items
• Haystack has pervasive support for annotation and allows users to group related objects together arbitrarily for their own purposes
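Since Haystack uses RDF as its underlying data model, one way to picture annotation and grouping is as extra statements stored alongside the data. The sketch below is an assumption about how that could look, not Haystack's actual implementation: the `Workspace` class, the predicate names "annotation" and "member", and the identifiers in the usage note are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Set, Tuple

Triple = Tuple[str, str, str]


@dataclass
class Workspace:
    """Hypothetical store of a user's personal statements about objects."""
    triples: Set[Triple] = field(default_factory=set)

    def annotate(self, subject: str, note: str) -> None:
        # A sticky note is just one more statement about the object.
        self.triples.add((subject, "annotation", note))

    def add_to_collection(self, collection: str, member: str) -> None:
        # Grouping is a statement linking a collection to a member;
        # the same object may belong to any number of collections.
        self.triples.add((collection, "member", member))

    def notes(self, subject: str) -> List[str]:
        return sorted(o for s, p, o in self.triples
                      if s == subject and p == "annotation")

    def members(self, collection: str) -> List[str]:
        return sorted(o for s, p, o in self.triples
                      if s == collection and p == "member")
```

A user could then call `ws.annotate("urn:lsid:example.org:proteins:p12345", "check binding site")` and `ws.add_to_collection("my-candidates", "urn:lsid:example.org:proteins:p12345")` on a `Workspace` instance `ws`. Keeping personal statements in the same triple form as the source data means they can be browsed and queried in the same way.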

BioHaystack
• BioHaystack: application of Haystack technologies to bioinformatics problems
  – Integrated environment for working with biological data
  – Intended for end users, i.e., non-programmers
  – Builds on LSID, RDF, and Haystack
• Integration offers the promise of lowering barriers to access to different backend systems (e.g., LSID servers, Grids, Web Services, relational databases, annotation servers)
• Just as the Web browser acts as a client for Web content, BioHaystack can act as a client for biological Semantic content and services

Real world collaboration: myGrid
• UK-funded joint project with the University of Manchester and other UK research institutions
• RDF-based platform for supporting e-Science experiments
• Real use cases; developed in collaboration with bioinformaticians
• myGrid creates LSIDs and RDF metadata in the process of enacting experiments for scientists
• Using BioHaystack as a browser for metadata

myGrid Architecture
[Figure: architecture diagram, courtesy of Professor Carole Goble, University of Manchester.
  – Components: Registry, mIR, Discovery View, Haystack Provenance Browser, FreeFluo Enactor, Taverna WF Builder, Pedro Annotation tool, Ontology Store, Soaplab, WSDL, others
  – Labels: Interface Description, Annotation/description, Annotation providers, Query & Retrieve, Workflow Execution, Store data/knowledge, Query & register, Data descriptions, Vocabulary, invoking
  – Actors: Scientists, Bioinformaticians, Service Providers]

BioHaystack + myGrid
[Figure courtesy of Professor Carole Goble, University of Manchester]

Thank you for your attention
• Dennis Quan (IBM Watson Research)
• Haystack project home page (download available May 24)
• IBM LSID home page
• myGrid home page
• See also our session on constructing Haystack applications:
  – Developer’s Day, Saturday, 4:30pm