Designing, Executing and Reusing Scientific Workflows Katy Wolstencroft, Paul Fisher, myGrid.

Presentation transcript:

Lots of Scientific Resources
- NAR 2009: over 1170 databases

Interoperability, Integration and Collaboration
- Access to distributed and local resources
- Iteration over data sets
- Interactive
- Automation of data flow
- Agile software development
- Experimental protocols
- Taverna Workflows

Open Source Workflow Environment for Scientists
- Create and run workflows
- Create and manage services as components (API Consumer)
- Share, discover and reuse workflows
- Manage the metadata needed and generated (RDF, OWL)
- Discover and reuse services (Feta)

Taverna GUI and Enactor
- Taverna Remote Execution service (T-REX)
- Graphical Workbench
  - Drag and drop interface
  - Plug-in architecture
  - Nested workflows
- Workflow Enactor
  - Local and remote enactor
  - Implicit iteration over data collections
  - Automation of data flow
  - Logging and data provenance tracking
[Diagram label: Workflow Enactor Engine]
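The idea behind "implicit iteration over data collections" and "automation of data flow" can be illustrated with a small sketch. This is not Taverna's implementation or API. The function names (`implicit_iterate`, `reverse_complement`, `gc_content`) and the sample sequences are assumptions made purely for illustration. The sketch shows a service written for a single item being applied automatically to every item of a (possibly nested) list, with the output of one step passed directly to the next.

```python
from typing import Any, Callable


def depth(value: Any) -> int:
    """Nesting depth: 0 for a single item, 1 for a list of items, and so on."""
    d = 0
    while isinstance(value, list):
        d += 1
        if not value:
            break
        value = value[0]
    return d


def implicit_iterate(service: Callable[[Any], Any], value: Any,
                     expected_depth: int = 0) -> Any:
    """Apply `service` to `value`, iterating when the data is deeper than expected.

    Hypothetical helper: if the service expects a single item but receives a
    list, it is called once per item and the results keep the list shape.
    """
    if depth(value) > expected_depth:
        return [implicit_iterate(service, item, expected_depth) for item in value]
    return service(value)


# Two illustrative single-item "services" (assumed, not from the slides).
def reverse_complement(seq: str) -> str:
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def gc_content(seq: str) -> float:
    return (seq.count("G") + seq.count("C")) / len(seq)


# Data flow: the output of the first step feeds the second, with no manual loop.
sequences = ["ATGC", "GGCC", "AATT"]
reverse_complements = implicit_iterate(reverse_complement, sequences)
gc_values = implicit_iterate(gc_content, reverse_complements)
```

The workflow author writes each step for one item only. The enactor decides when to iterate, which is what lets the same workflow run unchanged on one sequence or on a whole collection.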

Types of Service
- Many different types can be incorporated into Taverna by providing the URL, with no coding
[Diagram: Workflow Enactor Engine with Activity components (WSDL, interaction)]

Finding and Curating Services

What do Scientists use Taverna for?
- Data gathering, annotation and model building
- Data analysis from distributed tools
- Data mining and knowledge management
- Data curation and warehouse population
- Parameter sweeps and simulation
Users from: Systems Biology, Proteomics, Sequence analysis, Protein structure prediction, Gene/protein annotation, Microarray data analysis, QTL studies, Cheminformatics, Medical image analysis, Public health care epidemiology, Heart model simulation, Phenotype studies, Phylogeny, Statistical analysis, Pharmacogenomics, Text mining, Astronomy, Music, Meteorology

Systems Biology: Integration of microarray data onto SBML models
- Data analysis
- Manipulation of SBML models
- libSBML incorporated into Taverna through the Java API Consumer
Peter Li, Doug Kell, University of Manchester

Data Analysis: Pharmacogenomics
Association study of Nevirapine-induced skin rash in Thai population
- A systemic (bodywide) allergic reaction with a characteristic rash
- 100 cases: rash; 100 cases: no rash (controls)
- 10,000 SNP significantly associated with rash
- Pathway analysis and systems biology
- Prioritising SNPs
- Functional studies
- Diagnostic tools

Taverna in caGrid
- caGrid Scavenger with semantic/metadata based caGrid service query
- caGrid workflow for microarray analysis, using caArray, GenePattern and geWorkbench [Ravi Madduri]
Orchestrating caGrid Services in Taverna. Wei Tan, Ravi Madduri, Kiran Keshav, Baris E. Suzek, Scott Oster, Ian Foster. Proc. IEEE Intl Conf on Web Services (ICWS 2008)

Sharing Experiments
- Taverna supports the in silico experimental process for individual scientists.
- How do you share your results, experiments and experiences with your research group, your collaborators and the scientific community?
- How do you compare your results with others produced by e.g. Kepler / Triana / Trident?

Recycling, Reuse, Repurposing
- Paul writes workflows for identifying biological pathways implicated in resistance to trypanosomiasis.
- Paul meets Jo. Jo is investigating mouse whipworm infection.
- Jo reuses one of Paul’s workflows without change.
- Jo identifies the biological pathways involved in sex dependence in the mouse model, believed to be involved in the ability of mice to expel the parasite.
- Previously a manual two-year study by Jo had failed to do this.
Workflows are protocols.

myExperiment Features
- User Profiles
- Groups
- Friends
- Sharing
- Tags
- Developer interface
- Workflows
- Credits and Attributions
- Fine control over privacy
- Packs
- Federation
- Enactment

Just Enough Sharing... And Credit!
- myExperiment allows you to credit others’ contributions
- myExperiment allows you to say
  - who can look at, download or modify your workflow
  - who can run your workflow
- Enactment extends accessibility
  - Taverna is for informaticians
  - myExperiment is for informaticians AND laboratory scientists
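The sharing controls described on this slide (who can look at, download, modify or run a workflow, plus credit for contributors) can be pictured with a minimal sketch. This is not the myExperiment data model or API. The `Permission` flags, the `SharedWorkflow` class, its fields and the workflow title are hypothetical names chosen for illustration. The names Paul and Jo are borrowed from the reuse example earlier in the transcript.

```python
from dataclasses import dataclass, field
from enum import Flag
from typing import Dict, List


class Permission(Flag):
    """Hypothetical actions a user may be allowed to perform on a workflow."""
    NONE = 0
    VIEW = 1
    DOWNLOAD = 2
    MODIFY = 4
    RUN = 8


@dataclass
class SharedWorkflow:
    """Illustrative record of a shared workflow (assumed structure)."""
    title: str
    owner: str
    credits: List[str] = field(default_factory=list)        # people credited
    public: Permission = Permission.NONE                     # rights for everyone
    grants: Dict[str, Permission] = field(default_factory=dict)  # per-user rights

    def allows(self, user: str, action: Permission) -> bool:
        """The owner may do anything; others need a public or personal grant."""
        if user == self.owner:
            return True
        granted = self.public | self.grants.get(user, Permission.NONE)
        return bool(granted & action)


# Paul lets everyone view the workflow, and lets Jo download and run it too.
workflow = SharedWorkflow(
    title="Pathway identification workflow",  # hypothetical title
    owner="Paul",
    credits=["Paul"],
    public=Permission.VIEW,
    grants={"Jo": Permission.VIEW | Permission.DOWNLOAD | Permission.RUN},
)
```

Keeping "run" separate from "download" and "modify" reflects the slide's point that enactment extends accessibility: a laboratory scientist can be allowed to run a workflow without needing to edit or even download it.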

The myGrid Team

More Information
- myGrid
- Taverna
- myExperiment
- BioCatalogue
Thanks to Carole Goble, David De Roure and Jiten Bhagat for slide contributions