

Phase II Additions to LSG
Search capability added to Gene Browser
– Through GUI in Gene Browser
BLAST plugin that invokes remote EBI BLAST service
Working set manager
– State retention between sessions
Provenance viewer
– Displays annotated provenance
– Accepts RDF representations

Knowledge Discovery through Provenance Collection, Representation, and Use in the Life Science Grid (LSG)
Phase II Final Report: Architectural and Technical Details
Beth Plale
Director, Center for Data and Search Informatics
Indiana University

Key contribution to LSG proper (provenance aside)
– Introduction of state

[Architecture diagram. Labels, in the order they appear: LSG Space; user; Entrez; Gene Ontology; Gene Browser; Lilly CAB Bus; Lilly to Karma Events Reflector; Karma Framework; Provenance DB; Events Capture; OPM* RDF Interface; S-OGSA Service; Semantic Binding; Annotated Provenance Graph; Karma Events Bus; (WS+LSG); Resources (public + private); Proxy; SAWSDL Registry; Service Annotations; Services and data ontology (myGrid); Karma structure ontology; Karma Services; BLAST; Working Set Manager; RDF Viewer; Provenance Graphs]
*Open Provenance Model v1.01

BLAST support

Working Set: support for user state

Demo 1: Phase II Use Case
Select “gene” from database list; list will show in Gene Browser
Submit gene to NCBI
Open tab of BLAST plugin, download FASTA sequence
Run BLAST
Add results to Working Set
Annotate Working Set

The Working Set Manager listens to the CAB bus. It takes an Entrez ID and/or all or part of a BLAST result as input to a working set, or imports a CSV file into a working set. In the example shown, working set WS2 was generated from working set WS1 by Delete Rows. WS2 can be exported as a CSV file.
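The WS1-to-WS2 derivation above can be sketched as follows. This is a minimal illustration, not the Working Set Manager's actual code: the `WorkingSet` class, its fields, and the sample rows are hypothetical, and the only behavior taken from the slide is that a Delete Rows operation yields a new working set that can be exported as CSV.

```python
import csv
import io
from dataclasses import dataclass
from typing import Optional


@dataclass
class WorkingSet:
    """Hypothetical model: a named list of row dicts plus a record of its origin."""
    name: str
    rows: list
    derived_from: Optional[str] = None
    operation: Optional[str] = None

    def delete_rows(self, new_name, predicate):
        """Return a new working set without the rows for which predicate is true."""
        kept = [row for row in self.rows if not predicate(row)]
        return WorkingSet(new_name, kept, derived_from=self.name,
                          operation="Delete Rows")

    def to_csv(self):
        """Serialize the rows as CSV text; the first row's keys form the header."""
        if not self.rows:
            return ""
        buffer = io.StringIO()
        writer = csv.DictWriter(buffer, fieldnames=list(self.rows[0].keys()))
        writer.writeheader()
        writer.writerows(self.rows)
        return buffer.getvalue()


# Placeholder rows standing in for an Entrez ID plus partial BLAST results.
ws1 = WorkingSet("WS1", [
    {"entrez_id": "1", "hit": "A", "e_value": 0.001},
    {"entrez_id": "1", "hit": "B", "e_value": 5.0},
])
ws2 = ws1.delete_rows("WS2", lambda row: row["e_value"] > 1.0)
print(ws2.to_csv())
```

Returning a new object instead of mutating WS1 keeps the parent set intact, so the link between the two (here `derived_from` and `operation`) stays available for later inspection.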

Demo II: Query provenance database
Query BLAST related data.

Query 1: get the latest Blast_Plugin.

    create or replace view v1 as
      select process_id, service_id, process_initialization_time
      from process
      where service_id like '%Blast_Plugin'
        and process_initialization_time = (
          select max(process_initialization_time)
          from process
          where service_id like '%Blast_Plugin');

    select * from v1;

Query 2: get the service (Blast_Ebi_Web_Service) invoked by Blast_Plugin.

    select invoker_id, invokee_id, p.service_id
    from invocation, process p, v1
    where invoker_id = v1.process_id
      and invokee_id = p.process_id;

Demo II: Query provenance database

Query 3: get input to Blast Ebi Web Service.

    select p.service_id, artifact_id, artifact_value
    from artifact_used au, artifact a, process p, v1
    where au.artifact_no = a.artifact_no
      and au.process_id = p.process_id
      and p.process_id = v1.process_id + 1
      and p.service_id like '%Blast_Ebi_Web_Service';

Query 4: get output from Blast Ebi Web Service.

    select p.service_id, artifact_id, artifact_value
    from artifact_generated ag, artifact a, process p, v1
    where ag.artifact_no = a.artifact_no
      and ag.process_id = p.process_id
      and p.process_id = v1.process_id + 2
      and p.service_id like '%Blast_Ebi_Web_Service';
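To try the query pattern without access to the provenance database, Queries 1 and 2 can be run against an in-memory SQLite database. This is a sketch under stated assumptions: the table and column names come from the slides, but the column types, the `urn:demo:` service identifiers, the timestamps, and the sample rows are invented for illustration. SQLite does not accept `create or replace view`, so the sketch drops and recreates the view instead.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with made-up rows (not real Karma data)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        create table process (
            process_id integer primary key,
            service_id text,
            process_initialization_time text
        );
        create table invocation (invoker_id integer, invokee_id integer);
        insert into process values
            (1,  'urn:demo:Blast_Plugin',          '2009-01-01T10:00:00'),
            (2,  'urn:demo:Blast_Ebi_Web_Service', '2009-01-01T10:00:01'),
            (10, 'urn:demo:Blast_Plugin',          '2009-01-02T09:00:00'),
            (11, 'urn:demo:Blast_Ebi_Web_Service', '2009-01-02T09:00:01');
        insert into invocation values (1, 2), (10, 11);
    """)
    return conn


def latest_plugin_invocations(conn):
    """Query 1 defines view v1 (latest Blast_Plugin run); Query 2 joins against it."""
    conn.executescript("""
        drop view if exists v1;
        create view v1 as
            select process_id, service_id, process_initialization_time
            from process
            where service_id like '%Blast_Plugin'
              and process_initialization_time = (
                  select max(process_initialization_time)
                  from process
                  where service_id like '%Blast_Plugin');
    """)
    return conn.execute("""
        select invoker_id, invokee_id, p.service_id
        from invocation, process p, v1
        where invoker_id = v1.process_id
          and invokee_id = p.process_id
    """).fetchall()


conn = build_demo_db()
print(latest_plugin_invocations(conn))
```

With the sample rows above, only the invocation made by the most recent Blast_Plugin process (process 10) is returned.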

Suggested Future Work
Next steps:
– Engagement of users or,
– Expanded functionality set
Build in development time if we must do this ourselves
Represent to user combined visualization and process provenance
Write research quality paper
– Requires user study, or comparison, or …
Formally integrate provenance collection tools into non-public LSG

Suggested Future Work: Technical
Support BLAST in asynchronous mode, extend NCBI Entrez to work on other NCBI databases, and design rich provenance queries.
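One common way to run a remote job asynchronously is to submit it, keep the returned job identifier, and poll for completion. The sketch below shows that pattern only. It is not the plugin's design and it does not call the EBI service: `run_blast_async`, the three callables, the status strings, and `FakeBlastService` are all hypothetical stand-ins.

```python
import time


def run_blast_async(submit, get_status, get_result, sequence,
                    poll_seconds=1.0, max_polls=60):
    """Submit a job, then poll until it finishes, fails, or the poll budget runs out."""
    job_id = submit(sequence)
    for _ in range(max_polls):
        status = get_status(job_id)
        if status == "FINISHED":
            return get_result(job_id)
        if status != "RUNNING":
            raise RuntimeError(f"job {job_id} ended with status {status}")
        time.sleep(poll_seconds)
    raise TimeoutError(f"job {job_id} did not finish after {max_polls} polls")


class FakeBlastService:
    """In-memory stand-in that reports RUNNING a fixed number of times."""

    def __init__(self, polls_until_done=2):
        self.polls_left = polls_until_done
        self.sequence = ""

    def submit(self, sequence):
        self.sequence = sequence
        return "job-1"

    def status(self, job_id):
        if self.polls_left > 0:
            self.polls_left -= 1
            return "RUNNING"
        return "FINISHED"

    def result(self, job_id):
        return f"hits for {len(self.sequence)} residues"


service = FakeBlastService(polls_until_done=2)
print(run_blast_async(service.submit, service.status, service.result,
                      "MKV", poll_seconds=0))
```

Passing the three operations in as callables keeps the polling loop independent of any particular client library, so the same loop could wrap whichever BLAST client the plugin ends up using.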