Scientific Workflows Systems : In Drug discovery informatics Presented By: Tumbi Muhammad Khaled 3 rd Semester Department of Pharmacoinformatics.

Slides:



Advertisements
Similar presentations
Integrating ChemAxon technology into your End User Applications Java solutions for cheminformatics Ver. Mar., 2005.
Advertisements

JChem Node extension for KNIME Workbench
CICC June meeting IUPUI team: Kelsey Forsythe Malika Mahoui Deepthi Jonnala Usha Cheemakurthi.
Designing Services for Grid-based Knowledge Discovery A. Congiusta, A. Pugliese, Domenico Talia, P. Trunfio DEIS University of Calabria ITALY
Kensington Oracle Edition: Open Discovery Workflow Meets Oracle 10g Professor Yike Guo.
SSRS 2008 Architecture Improvements Scale-out SSRS 2008 Report Engine Scalability Improvements.
Test Case Management and Results Tracking System October 2008 D E L I V E R I N G Q U A L I T Y (Short Version)
Windows XP Photo Workflow Tim Grey Imaging Strategist Microsoft Corporation.
Design and Making of Information System at Dentist Work Place By : Advisor : Samuel Budi GAlexander Setiawan, MT Leo Willyanto Industry Engineering.
Samford University Virtual Supercomputer (SUVS) Brian Toone 4/14/09.
WebRatio BPM: a Tool for Design and Deployment of Business Processes on the Web Stefano Butti, Marco Brambilla, Piero Fraternali Web Models Srl, Italy.
Jennifer A. Dunne Santa Fe Institute Pacific Ecoinformatics & Computational Ecology Lab Rich William, Neo Martinez, et al. Challenges.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
CPSC 695 Future of GIS Marina L. Gavrilova. The future of GIS.
Computational Physics Kepler Dr. Guy Tel-Zur. This presentations follows “The Getting Started with Kepler” guide. A tutorial style manual for scientists.
TRAVEL RESERVATION SYSTEM USING WEB SERVICES COMPOSITION LANGUAGE
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
Business Intelligence System September 2013 BI.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Sharon Burton Product Manager/Product Evangelist MadCap Software
Introduction to BIM BIM Curriculum 01.
Joy Oberoi Grade 12. Introduction THEATRE BOOKING SYSTEM (TBS) A system used to perform tasks that one would manually execute at a theatre It is online.
KÜRT COMPUTER RT. COMPUTER AND AUTOMATION RESEARCH INSTITUTE (MTA SZTAKI) UNIVERSITY OF VESZPRÉM MATHEMATICS AND COMPUTING DEPARTMENT KÜRT COMPUTER RT.
CLARIN tools for workflows Overview. Objective of this document  Determine which are the responsibilities of the different components of CLARIN workflows.
January, 23, 2006 Ilkay Altintas
Scientific Workflows Within the Process Mining Domain Martina Caccavale 17 April 2014.
6/1/2001 Supplementing Aleph Reports Using The Crystal Reports Web Component Server Presented by Bob Gerrity Head.
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik University of Manchester materials by Dr Katy Wolstencroft and Dr Aleksandra.
1 Yolanda Gil Information Sciences InstituteJanuary 10, 2010 Requirements for caBIG Infrastructure to Support Semantic Workflows Yolanda.
1 I n t u i t C o n f i d e n t i a l Construction Estimating Software Jeff Gerardi | President | Solution Introduction.
Taverna and my Grid Basic overview and Introduction Tom Oinn
Software Engineering 2003 Jyrki Nummenmaa 1 CASE Tools CASE = Computer-Aided Software Engineering A set of tools to (optimally) assist in each.
14/11/11 Taverna Roadmap Shoaib Sufi myGrid Project Manager.
1 INTRODUCTION TO DATABASE MANAGEMENT SYSTEM L E C T U R E
Functions and Demo of Astrogrid 1.1 China-VO Haijun Tian.
Introduction to Web Mining Spring What is data mining? Data mining is extraction of useful patterns from data sources, e.g., databases, texts, web,
Supporting High- Performance Data Processing on Flat-Files Xuan Zhang Gagan Agrawal Ohio State University.
1 Peter Allan14-15 Dec 2004AstroGrid Consortium Meeting: Architecture Discussion AstroGrid Architecture – the view from outside Is the description acceptable?
Domain-Specific Languages for Composing Signature Discovery Workflows Ferosh Jacob*, Adam Wynne+, Yan Liu+, Nathan Baker+, and Jeff Gray* *Department of.
Taverna Workflow. A suite of tools for bioinformatics Fully featured, extensible and scalable scientific workflow management system – Workbench, server,
Research Design for Collaborative Computational Approaches and Scientific Workflows Deana Pennington January 8, 2007.
© 2007 IBM Corporation SOA on your terms and our expertise Software WebSphere Process Server and Portal Integration Overview.
ICCS WSES BOF Discussion. Possible Topics Scientific workflows and Grid infrastructure Utilization of computing resources in scientific workflows; Virtual.
Clinical Collaboration Platform Overview ST Electronics (Training & Simulation Systems) 8 September 2009 Research Enablers  Consulting  Open Standards.
Google Refine for Data Quality / Integrity. Context BioVeL Data Refinement Workflow Synonym Expansion / Occurrence Retrieval Data Selection Data Quality.
6/1/2001 Supplementing Aleph Reports Using The Crystal Reports Web Component Server Presented by Bob Gerrity Head.
Application of Design Heuristics in the Designing and Implementation of Object Oriented Informational Systems.
6 February 2009 ©2009 Cesare Pautasso | 1 JOpera and XtremWeb-CH in the Virtual EZ-Grid Cesare Pautasso Faculty of Informatics University.
1 Limitations of BLAST Can only search for a single query (e.g. find all genes similar to TTGGACAGGATCGA) What about more complex queries? “Find all genes.
1 Peter Fox Xinformatics 4400/6400 Week 10, April 9, 2013 Information management, workflow and discovery /check-in for project definitions.
Copyright © 2015, SAS Institute Inc. All rights reserved. Future Drug Applications with No Tables, Listings and Graphs? PhUSE Annual Conference 2015, Vienna.
Data Mining Tools some examples.
“Request For System Change” Sushil Bhatnagar MBA(IT) 4 th Semester Sikkim Manipal University (SMU DE) Roll No. : LC Code. : IICE College (02086)
Toward interactive visualization in a distributed workflow Steven G. Parker Oscar Barney Ayla Khan Thiago Ize Steven G. Parker Oscar Barney Ayla Khan Thiago.
® IBM Software Group © 2007 IBM Corporation Module 1: Getting Started with Rational Software Architect Essentials of Modeling with IBM Rational Software.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
Data and storage services on the NGS.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space The Capabilities of the GridSpace2 Experiment.
Confidencial - TRACASA Automatize test [e- Reporting]
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
RDA 9th Plenary Breakout 3, 5 April :00-17:30
University of Chicago and ANL
MATLAB Distributed, and Other Toolboxes
Simple and intuitive fare conditions
Data Warehousing and Data Mining
Renouncing Hotel’s Data Through Queries Using Hadoop
Jonathan Griffin, Managing Director, IFIS Publishing &
What is UiPATH? For more details visit this link online-training.
Presentation transcript:

Scientific Workflows Systems : In Drug discovery informatics Presented By: Tumbi Muhammad Khaled 3 rd Semester Department of Pharmacoinformatics

Introduction to Scientific Workflows What is a workflow General definition: series of tasks performed to produce a final outcome Scientific workflow – “data analysis pipeline” Automate tedious jobs that scientists traditionally performed by hand for each dataset Process large volumes of data faster than scientists could do by hand 2

What is a Workflow? 3

Background: Business Workflows Example: Planning a trip Need to perform a series of tasks: book a train tickets, reserve a hotel room, arrange for a rental car for sight seeing, etc.. Each task may depend on outcome of previous task –Days you reserve the hotel depend on days of the flight –If hotel has shuttle service, may not need to rent a car –etc.. 4

What about scientific workflows? Perform a set of transformations/ operations on a scientific dataset Examples Process Simulation output Generating images from raw data Identifying areas of interest in a large dataset Classifying set of objects Querying a web service for more information on a set of objects Many others… 5

Is this topic is useful to discuss ????? Yes…. 6

Scientific Workflow Design: Challenges “And that’s why our scientific workflows are much easier to develop, understand and maintain !” 7

Why… Challenges/Requirements Mastering a programming language –Not all Visualizing workflow –User interaction e.g., users may inspect intermediate results –“Smart” re-runs Changing a parameter after intermediate results without executing workflow from scratch 8

Why… Challenges/Requirements Sharing/exchanging workflow – Formatting issues –File type conversion (OpenBabel) Locating datasets, services, or functions –Seamless access to resources and services Web services are simple solution but doesn’t address harder problems, e.g., web service orchestration, third party transfers 9

Industry point Of View: Schrodinger’s maximum workforce is working on KNIME® base workflow development for its products/ modules which may become rival for market leader Accelrys - Pipeline Pilot ® Why… 10

Practical Examples …. There Many Scientific workflows software /Workbenches are available : I.Pipeline Pilot ® Commercially Available from Accelrys® Market leader in scientific workflow II. KNIME Open source software Schrodinger’s target to make it as RIVAL for Pipeline Pilot Include many chemoinformatics NODES were developed to perfome some basic calculation and DATA MINING III.TAVERNA WORKBENCH Open source software Active development form user Applications in BIOINFORMATICS 11

KNIME KNIME (Konstanz Information Miner) is a user-friendly and comprehensive open-source data integration, processing, analysis, and exploration platform. KNIME include plugins for CDK (Chemistry Development Kit) Also have some nodes for Statistical data mining etc.. As already discussed KNIME based workflows for Maestro are also available. Here we see an VERY SMALL example of workflow for extraction of METADATA from.sdf file 12

13 video

It is open source workbench developed by University of Manchester It have many applications only in bioinformatics No commercial Tie-Ups Example:- A simple workflow ( Part of Workflow ) wich will fetch the PDB structure from RCSB database TAVERNA WORKBENCH 14

15 Video

Advantages of Workflow System Can perform routine extensive complicated works which may include Data Transformation Data mining Data Analysis Etc. without any manual interference which may results in less errors. Result reproducibility Reduce data loss Time saving etc 16

Workflow System 17 As Developer

Thank You My software never has bugs. It just develops random features 18