Presentation is loading. Please wait.

Presentation is loading. Please wait.

Wrapping analytical services for caBIG Taverna-caGrid technical review meeting Stian Soiland-Reyes, myGrid University of Manchester, UK 2009-01-23

Similar presentations


Presentation on theme: "Wrapping analytical services for caBIG Taverna-caGrid technical review meeting Stian Soiland-Reyes, myGrid University of Manchester, UK 2009-01-23"— Presentation transcript:

1 Wrapping analytical services for caBIG Taverna-caGrid technical review meeting Stian Soiland-Reyes, myGrid University of Manchester, UK 2009-01-23 http://www.mygrid.org.uk/dev/wiki/display/caGrid

2 Agenda Project overview Primary goals Service selection Services identified Architecture Service outputs UML model Template workflow Work so far Implementation plan

3 Project overview Taverna caGrid cooperation Taverna workbench enhancements for caGrid Grid-enabling analytical services caGrid security support for Taverna This presentation deals with the analytical services

4 Primary goals Identify two publicly available analytical web services currently accessible through Taverna caGrid-enable the services; semantically described using caBIG’s infrastructure Demonstrate building of workflows combining the new services with existing caBIG services

5 Service selection Selected services in collaboration with the caGrid Workflow working group, lead by Juli Winners: NCBI Blast hosted by EBI InterProScan hosted by EBI

6 Why these services? Freely available Highly reliable, hosted by EBI Widely used by the scientific community Can be combined with existing caBIG tools in biologically meaningful workflows caBIO, GridPIR, etc.

7 Services identified NCBI Blast A popular similarity search tool using local sequence alignment Supports sequences of proteins, DNA, RNA Searches sequences in a whole range of databases SWISSPROT, UNIPROT, NCBI, EMBL, etc. SOAP web service hosted by EMBL-EBI

8 Services identified InterProScan Integrates various databases of protein domains and functional sites Searches using protein signature recognition methods SOAP web service hosted by EMBL-EBI

9 Architecture

10 Architecture as pseudo code class CaGridClient: def main(): endpointReference = wrappedService.invoke(inputs) endpointReference.subscribe() def resourcePropertyChanged(): outputs = endpointReference.getResourceProperty() print "Result", outputs class WrappedService: def invoke(inputs): convertedInputs = dataConverter.convertFromCaGrid(inputs) jobId = serviceInvoker.invoke(convertedInputs) endpointReference = new EndpointReference(jobId) return endpointReference def outputReturned(jobId, outputs): convertedOutputs = dataConverter.convertToCaGrid(outputs) endpointReference.setResourceProperty(convertedOutputs) class ServiceInvoker: def invoke(convertedInputs): jobId = originalService.invoke(convertedInputs) return jobId

11 Output InterProScan (Untranslated) http://www.ebi.ac.uk/schemachema-instance" xsi:noNamespaceSchemaLoca.. /Header> <protein id="unipro <interpro id="IPR008197" name="Whey acidic protein, 4-disulphide core" type="Domain" parent_id="IPR015874"> Molecular Function protease inhibitor activity <match id="G3DSA:4.10.75.10" name="Whey_acidic_protein_4-diS_core" dbname="GENE3D"> <location start="77" end="128" score="9.899996308397199E-5" status="T" evidence="Gene3D" /> <location start="30" end="72" score="6.30000254573025E-5" status="T" evidence="HMMPfam" /> <location start="79" end="126" score="1.59999889349247E-14" status="T" evidence="HMMPfam" /> <interpro id="IPR008198" name="Proteinase inhibitor I17" type="Domain" parent_id="IPR008197">...

12 Output InterProScan (Untranslated) http://www.ebi.ac.uk/schemachema-instance" xsi:noNamespaceSchemaLoca.. /Header> <protein id="unipro <interpro id="IPR008197" name="Whey acidic protein, 4-disulphide core" type="Domain" parent_id="IPR015874"> Molecular Function protease inhibitor activity <match id="G3DSA:4.10.75.10" name="Whey_acidic_protein_4-diS_core" dbname="GENE3D"> <location start="77" end="128" score="9.899996308397199E-5" status="T" evidence="Gene3D" /> <location start="30" end="72" score="6.30000254573025E-5" status="T" evidence="HMMPfam" /> <location start="79" end="126" score="1.59999889349247E-14" status="T" evidence="HMMPfam" /> <interpro id="IPR008198" name="Proteinase inhibitor I17" type="Domain" parent_id="IPR008197">...

13 UML model: wrapped InterproScan

14 UML model: wrapped NCBIBlast

15 Template workflow http://www.myexperiment.org/workflows/230 EBI_dbfetch_fetchBatch will be replaced with the caBIG service caBIO This workflow uses both NCBIBlast and InterproScan which will be replaced with the wrapped services

16 Work so far Identified services and example workflow Described services (Deliverable 3.2) Modelled service inputs and outputs in UML according to caGrid guidelines Still a few tweaks needed for WS-Resource usage Architecture and implementation plan for wrapping services (Deliverable 3.3) JavaDoc needs updating for WS-Resource

17 Implementation plan Generate Common Data Elements for inputs and outputs and verify Silver compatability Generate semantically annotated XMIs Submit Silver compatability review package Implement and deploy wrapped services Using Introduce and possibly gRavi Implement, test, deploy We’ll start with this before submitting CDEs Build caGrid-based workflow using services

18 Any questions..?


Download ppt "Wrapping analytical services for caBIG Taverna-caGrid technical review meeting Stian Soiland-Reyes, myGrid University of Manchester, UK 2009-01-23"

Similar presentations


Ads by Google