OMII-UK Software Activities Steven Newhouse, Director.



© 2 Our Mission…
OMII-UK aims to provide software and support to enable a sustained future for the UK e-Science community and its international collaborators
  Promote the use of good-quality open-source software
  Reduce the risk of moving to the new e-infrastructure world
  Recognise distinct user communities: by domain and function

© 3 The OMII-UK Partnership
Southampton: 14 FTEs
Manchester: 9 FTEs
Edinburgh: 8 FTEs
Community: ~8 FTEs
[Map labels: Manchester, Southampton, Edinburgh, Cambridge; University of Manchester; Electronics and Computer Science; University of Edinburgh; European Bioinformatics Institute]

© 4 OMII-UK in context Ad hoc e-Infrastructure services e-Science Users EPSRC ESRC STFC NERC BBSRC MRC AHRC e-Infrastructure Services to enable e-Science Globus, gLite, CROWN, NAREGI, Web Services,.. Higher-level services & tools National Grid Services Data Compute Bioinformatics Users Engineering Users Information Retrieval (JISC) Industrial Partners OMII-UK Services Organisation & Composition International Grid Providers

© 5 User Communities
Applied e-Researchers (domain & generic)
Technology Specialists
e-Infrastructure Providers

© 6 Broad Software Activities
Taverna: Composing workflows across distributed resources
OGSA-DAI: Web service for integrating heterogeneous data resources
GridSAM: Web service for job submission and job monitoring
GRIMOIRES: Support for service publishing, discovery & annotation
BPEL: Supporting scientific workflows through web services

Taverna

© 8 Taverna - Background
Emerged from myGrid project (UK EPSRC)
Integration & Interoperability very difficult
  Cut and paste between data sources & services
  Can translate data formats by using other services
  Quickly realised that this was not a viable solution…
Focus on the challenges within the bioinformatics community
  Everything is distributed: Data, Services and Scientists
  Heterogeneous data sources
  Many specifications: I/O, data representation, annotation
  Everything is a string!

© 9 Traditional Bioinformatics acatttctac caacagtgga tgaggttgtt ggtctatgtt ctcaccaaat ttggtgttgt cagtctttta aattttaacc tttagagaag agtcatacag tcaatagcct tttttagctt gaccatccta atagatacac agtggtgtct cactgtgatt ttaatttgca ttttcctgct gactaattat gttgagcttg ttaccattta gacaacttca ttagagaagt gtctaatatt taggtgactt gcctgttttt ttttaattgg gatcttaatt tttttaaatt attgatttgt aggagctatt tatatattct ggatacaagt tctttatcag atacacagtt tgtgactatt ttcttataag tctgtggttt ttatattaat gtttttattg atgactgttt tttacaattg tggttaagta tacatgacat aaaacggatt atcttaacca ttttaaaatg taaaattcga tggcattaag tacatccaca atattgtgca actatcacca ctatcatact ccaaaagggc atccaatacc cattaagctg tcactcccca atctcccatt ttcccacccc tgacaatcaa taacccattt tctgtctcta tggatttgcc tgttctggat attcatatta atagaatcaa Manual workflow by PhD student

© 10 Workflows
Workflow language specifies how processes fit together
High level workflow diagram separated from any lower level coding – you don’t have to be a coder to build workflows
Workflow is the script or protocol that you configure when you run it
Workflow is the integrator of knowledge
Provides automation and repeatability
Easily share & customise workflows
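
The idea that a workflow only specifies how processes fit together can be illustrated with a small sketch. This is not SCUFL or the Taverna enactor; it is a hypothetical, minimal dataflow runner in Python, and the names `run_workflow`, `PROCESSORS` and `LINKS` are assumptions made for illustration. Each processor is a plain function, and the links say which output feeds which input, so the wiring can be changed without touching the processor code.

```python
from graphlib import TopologicalSorter


def run_workflow(processors, links, inputs):
    """Run every processor once the processors it depends on have finished.

    processors: name -> function taking keyword arguments
    links: (source, target, port) triples; source is a processor name
           or a workflow input name
    inputs: workflow input name -> value
    """
    values = dict(inputs)
    # Build the dependency graph from the links alone.
    depends_on = {name: set() for name in processors}
    for source, target, _port in links:
        if source in processors:
            depends_on[target].add(source)
    # static_order() yields each processor after all of its predecessors.
    for name in TopologicalSorter(depends_on).static_order():
        kwargs = {port: values[source]
                  for source, target, port in links if target == name}
        values[name] = processors[name](**kwargs)
    return values


# Hypothetical processors: clean a raw sequence, then measure it.
PROCESSORS = {
    "clean": lambda raw: raw.replace(" ", "").upper(),
    "gc": lambda seq: (seq.count("G") + seq.count("C")) / len(seq),
    "length": lambda seq: len(seq),
    "report": lambda gc, length: f"{length} bases, GC {gc:.2f}",
}

# The "workflow": how the processes fit together.
LINKS = [
    ("raw_sequence", "clean", "raw"),
    ("clean", "gc", "seq"),
    ("clean", "length", "seq"),
    ("gc", "report", "gc"),
    ("length", "report", "length"),
]
```

Calling `run_workflow(PROCESSORS, LINKS, {"raw_sequence": "acat ttct"})` returns a dictionary holding every intermediate value as well as the final report, which is also the raw material a provenance record would need.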

© 11 Taverna Workflow Components
SCUFL: Simple Conceptual Unified Flow Language
Taverna: Writing, running workflows & examining results
SOAPLAB: Makes applications available
[Diagram labels: SOAPLAB Web Service; Any Application; Web Service e.g. DDBJ BLAST]

© 12 Adding your own processes Consume web services SoapLab Expose applications as web services Java API Consumer import Java API of libSBML as workflow components

© 13 Shield the Scientist – Bury the Complexity
[Diagram labels: Taverna Workbench; Scufl Model (Simple Conceptual Unified Flow Language); Workflow Execution; Workflow enactor; Application]
[Processor types shown: Plain Web Service, Soaplab, Local Java App, Enactor, BioMOBY, WSRF, BioMART, Styx client, R package, ...]

© 14 Related Challenges Community Services Services: Web Services, Web Forms, Local processors Need to be annotated and made available for discovery Use Semantic Web Technologies Annotate by function, type input & output using ontology Discover services by function in Feta Experimental workflow needs services to interact with Provenance Record the services run within a workflow Be able to go back and replay Quality of the results based on the quality of input

OGSA-DAI Mario Antonioletti

© 16 What is OGSA-DAI?
Middleware providing data access and integration capabilities
Targeted at application developers
Provides access to data through Web Services
  Uniform access interface
  Data integration capabilities
Supports different types of data
  Relational
  XML
  File system
  (note: it does not virtualize the underlying data model)
Enacts a simple but powerful workflow: Query-Transform-Delivery-etc
  Encapsulates multiple service interactions in one
  Moves computation closer to data

© 17 Why use OGSA-DAI?
Use of Web Services
  Platform independence/Language neutrality
Transparencies
  Location/Product
Additional security layers
  service-level/resource-level
Provides extensive base functionality:
  Query (SQL, XPath, XQuery, XUpdate, …)
  Transformation (XSL, Compression, …)
  Delivery (FTP, GridFTP, SOAP with Attachments, …)
Extensible (add your own!)
Non-SOAP-based delivery mechanisms

© 18 More reasons … Out of the box solution Saves application developer time Extensible and Versatile Framework Can add or customise capabilities Plays Nicely with Other Grid Middleware OMII-UK Distribution Globus Toolkit 4.0.* Tomcat with Axis Soon: UNICORE 6 (OMII-Europe) gLite 3 (OMII-Europe) Good documentation

© 19 Using OGSA-DAI Data Source Client OGSA-DAI Application-specific service Data Source Application-specific service Client Control message Data message Data Source OGSA-DAI Application- specific functionality Client

© 20 Usage Scenarios Data Source Client OGSA-DAI Control message Data message Data Source Client OGSA-DAI FTP Server on Client Data Source 1 OGSA-DAI Data Source 2 Data Source n Client

© 21 Relational Multi-Resources Multiple Data Resource Accessor Multi Resource Data Service Resource MultiOne Data Service Data Service Relational SQL Query Multi Results SQL Query Results x M Data Service Resource Data Service Relational SQL Query Results SQL Query Results SQL Query Results More sophisticated capabilities offered in conjunction with OGSA-DQP

© 22 Service Model Data Resource Accessor Relational XMLDB Data Resource Accessor Data Service Resource Files Data Service Resource Data Service Support different messaging infrastructures Perform Document Response Document Support different data source types Core functionality

© 23 Inside a Perform Document
Query: Produces data in blocks
Pipe: Stores and provides access to data blocks
Delivery: Consumes data blocks
[Diagram labels: DB, block]
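
The block-by-block flow from a query, through a transformation, to a delivery step can be sketched with Python generators. This is only an analogy for the pattern on the slide, not OGSA-DAI code or its perform document format: the function names, the SQLite database and the in-memory sink are all assumptions chosen so the example is self-contained. Each stage pulls blocks from the previous one, so the full result set never has to be held in memory.

```python
import io
import sqlite3
import zlib


def sql_query(connection, sql, block_size=2):
    """Query stage: yield result rows in blocks rather than all at once."""
    cursor = connection.execute(sql)
    while True:
        block = cursor.fetchmany(block_size)
        if not block:
            return
        yield block


def to_csv(blocks):
    """Transformation stage: turn each block of rows into CSV bytes."""
    for block in blocks:
        text = "".join(",".join(str(v) for v in row) + "\n" for row in block)
        yield text.encode("utf-8")


def compress(blocks):
    """Transformation stage: zlib-compress the byte stream incrementally."""
    compressor = zlib.compressobj()
    for block in blocks:
        data = compressor.compress(block)
        if data:
            yield data
    yield compressor.flush()


def deliver(blocks, sink):
    """Delivery stage: consume blocks, write them to the sink, count bytes."""
    total = 0
    for block in blocks:
        sink.write(block)
        total += len(block)
    return total


def demo():
    """Wire the stages together against a throwaway in-memory database."""
    connection = sqlite3.connect(":memory:")
    connection.execute("CREATE TABLE genes (name TEXT, length INTEGER)")
    connection.executemany("INSERT INTO genes VALUES (?, ?)",
                           [("a", 1), ("b", 2), ("c", 3)])
    sink = io.BytesIO()
    query = sql_query(connection, "SELECT name, length FROM genes ORDER BY name")
    written = deliver(compress(to_csv(query)), sink)
    return written, sink.getvalue()
```

In the real service the delivery target would be something like an FTP or GridFTP endpoint; the `BytesIO` sink here only stands in for it.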

© 24 Activity Types Resource-specific Relational XMLDB Files Multi-resources Transformation Delivery Resource creation and destruction Can extend/customise the framework

© 25 Recap
Request
  XML perform document submitted by client
  Contains:
    Connected set of activities to be executed by data resource
    Flow control: sequential/parallel execution of activities
Activity
  An individual data-related operation
  Has 0 or more inputs and 0 or more outputs
Data Resource Service
  Parses requests, executes activities, builds responses
Response
  Status of execution of a request, possibly with result data
  XML response document returned to a client

© 26 Next Release Complete re-write Release scheduled for Q OGSA-DAI 3.0 (OMII-UK) OGSA-DAI 3.0 (GT4.0.3) Offer a number of advantages …

© 27 For the next release …
Request model
  Specify multiple data resources in a single request
  Support more complex scenarios involving multiple data resources
Activity framework
  Improved support for:
    Streaming of data
    Handling BLOBs and binary data
  Simpler activity API and activity input/output model
  Support for iteration of activities in a single request
Resource model
  Data resources, sessions, requests, data sources, data sinks

© 28 …and … Core APIs Drive OGSA-DAI core functionality down from the presentation layer Easier to write application-specific presentation layers Persistence Persist the state of services and resources within files/database Security Pluggable resource and activity authorization framework Call outs to databases or remote delegation services Improved support for: Message-level security Transport-level security Delegation Scalability and robustness Provision for future support to execute parts of a request on different JVMs Provision for future support of clustering and load balancing of requests to different OGSA-DAI servers

© 29 Further Information The OGSA-DAI project site The DAIS-WG site OGSA-DAI users mailing list General discussion on OGSA-DAI, data and the grid Formal support for OGSA-DAI releases

GridSAM

© 31 GridSAM
A Job Submission and Monitoring Web Service
Supports Job Submission Description Language (JSDL)
It is not…
  A scheduling service
    That’s the role of the underlying launching mechanism
    That’s the role of a super-scheduler that brokers jobs to a set of GridSAM services
  A provisioning service
    GridSAM runs what it’s been told to run
    GridSAM does not resolve software dependencies and resource requirements
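
To make the JSDL reference concrete, here is a minimal sketch that builds a small JSDL job description with Python's standard library. It is not part of GridSAM and does not use its client tools or API; `build_jsdl` is a hypothetical helper. The element names and namespace URIs follow the OGF JSDL 1.0 specification and its POSIX application extension as best understood here, and should be checked against the specification before use.

```python
import xml.etree.ElementTree as ET

# Namespace URIs as given in the JSDL 1.0 specification (verify before use).
JSDL = "http://schemas.ggf.org/jsdl/2005/11/jsdl"
POSIX = "http://schemas.ggf.org/jsdl/2005/11/jsdl-posix"


def build_jsdl(executable, arguments=(), output=None):
    """Return a JobDefinition element describing one POSIX application."""
    ET.register_namespace("jsdl", JSDL)
    ET.register_namespace("jsdl-posix", POSIX)
    job = ET.Element(f"{{{JSDL}}}JobDefinition")
    description = ET.SubElement(job, f"{{{JSDL}}}JobDescription")
    application = ET.SubElement(description, f"{{{JSDL}}}Application")
    posix = ET.SubElement(application, f"{{{POSIX}}}POSIXApplication")
    ET.SubElement(posix, f"{{{POSIX}}}Executable").text = executable
    for argument in arguments:
        ET.SubElement(posix, f"{{{POSIX}}}Argument").text = argument
    if output is not None:
        # File that should receive the job's standard output.
        ET.SubElement(posix, f"{{{POSIX}}}Output").text = output
    return job


def to_xml(job):
    """Serialise the job description as a Unicode XML string."""
    return ET.tostring(job, encoding="unicode")
```

For example, `to_xml(build_jsdl("/bin/echo", ["hello"], output="stdout.txt"))` produces a document that only says what to run. Consistent with the slide, it carries no scheduling or provisioning decisions.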

© 32 What is GridSAM to end-users? A set of command line tools and client APIs to: Submit and Start Jobs Monitor Jobs Terminate Jobs File transfer Client-side submission scripting Client-side Java API

© 33 What is GridSAM to resource providers? A Web Service to expose heterogeneous execution resources uniformly Single machine through Forking or SSH Condor Pool Grid Engine 6 through DRMAA Globus exposed resources OR use our plug-in API to implement …

© 34 GridSAM Implementation Virtual File System API (Apache VFS) FTP / GSIFTP / HTTP / WEBDAV / SFTP Event dispatches (OpenSymphony Quartz) Job Persistence (Hibernate - JDBC databases) Runtime Monitoring and Control (JMX)

© 35 Latest Features
JSDL extension to support MPI Applications
Authorisation based on JSDL structure
  Allow / deny submission based on a set of XPath rules and the identity of the submitter (e.g. distinguished name).
Tracking Basic Execution Service (ogsa-bes)
  HPCP Interoperation at SC06
  Will implement future HPCP revisions
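
The allow/deny idea can be sketched as follows. This is a hypothetical illustration, not GridSAM's rule format or configuration: the rule tuples, the first-match-wins policy, the default deny and the use of shell-style wildcards for distinguished names are all assumptions. The XPath expressions use the limited XPath subset supported by Python's `xml.etree.ElementTree`.

```python
import fnmatch
import xml.etree.ElementTree as ET

NAMESPACES = {
    "jsdl": "http://schemas.ggf.org/jsdl/2005/11/jsdl",
    "posix": "http://schemas.ggf.org/jsdl/2005/11/jsdl-posix",
}

# Hypothetical rules: (effect, distinguished-name pattern, XPath over the JSDL).
RULES = [
    ("deny", "*", ".//posix:Executable[.='/bin/rm']"),
    ("allow", "/C=UK/O=eScience/*", ".//posix:POSIXApplication"),
]


def authorise(jsdl_text, submitter_dn, rules=RULES):
    """Return True if the first matching rule allows the job; deny by default."""
    job = ET.fromstring(jsdl_text)
    for effect, dn_pattern, xpath in rules:
        dn_matches = fnmatch.fnmatchcase(submitter_dn, dn_pattern)
        if dn_matches and job.findall(xpath, NAMESPACES):
            return effect == "allow"
    return False


def example_job(executable):
    """Build a tiny JSDL document for the given executable (illustrative)."""
    return (
        f'<jsdl:JobDefinition xmlns:jsdl="{NAMESPACES["jsdl"]}" '
        f'xmlns:posix="{NAMESPACES["posix"]}">'
        "<jsdl:JobDescription><jsdl:Application><posix:POSIXApplication>"
        f"<posix:Executable>{executable}</posix:Executable>"
        "</posix:POSIXApplication></jsdl:Application></jsdl:JobDescription>"
        "</jsdl:JobDefinition>"
    )
```

Placing the deny rule first means a forbidden executable is rejected even for submitters who would otherwise be allowed.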

GRIMOIRES

© 37 GRIMOIRES Service Registry
A Grimoire is a magician's manual for invoking demons
Web Service for a service registry (UDDI)
  Certificate-based authentication and access control
  Lifetime management of entries
Register WSDL and annotate entries with metadata
  Metadata from third party annotation
Search for services
  Through WSDL interface
  Through metadata annotation
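
The registry behaviour listed above (register, annotate, expire, search by metadata) can be pictured with a small in-memory sketch. It does not use the UDDI or GRIMOIRES interfaces; the `Registry` class, its method names and the example URLs are hypothetical, and authentication and access control are left out.

```python
import time
from dataclasses import dataclass, field


@dataclass
class Entry:
    name: str
    wsdl_url: str
    expires_at: float
    metadata: dict = field(default_factory=dict)  # key -> set of values


class Registry:
    """In-memory stand-in for a service registry with entry lifetimes."""

    def __init__(self, clock=time.time):
        self._entries = {}
        self._clock = clock  # injectable so expiry can be tested

    def register(self, name, wsdl_url, lifetime_seconds):
        expires_at = self._clock() + lifetime_seconds
        self._entries[name] = Entry(name, wsdl_url, expires_at)

    def annotate(self, name, key, value):
        """Attach metadata; anyone may annotate, including third parties."""
        self._entries[name].metadata.setdefault(key, set()).add(value)

    def search(self, **criteria):
        """Return names of unexpired entries matching every criterion."""
        now = self._clock()
        return sorted(
            entry.name
            for entry in self._entries.values()
            if entry.expires_at > now
            and all(value in entry.metadata.get(key, ())
                    for key, value in criteria.items())
        )
```

A search such as `registry.search(function="alignment")` then finds services by what they do rather than by name, which is the role metadata annotation plays on the slide.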

© 38 Implementation
An open source implementation in Java
Interoperable with standard UDDI tools, UDDI browser, uddi4j
Deployable in multiple environments:
  Tomcat/Axis and the OMII-UK software environment
  GT4:
    Expose registry entities as WS-Resources
    Support WS-ResourceProperties, WS-ResourceLifetime, and WS-Notification
Use RDF triple store behind the scenes
  Link entries to support reasoning search interface

© 39 Features in Progress XPath and XML capability Publishing XML documents XPath-based query Scalability and performance improvement Collaboration with EGEE Replication support based on WS-Notification Notification and replication support outside WSRF

BPEL

© 41 BPEL: Flexible Orchestration of Scientific Workflows
BPEL: Business Process Execution Language
BPEL is the industry standard for orchestration of web services
  Multiple providers both commercial and open-source
Funded project integrates open-source components for science
  Visual modelling environment (Eclipse project)
    UCL Developer with committer status
  BPEL Enactment engine (ActiveBPEL)

© 42 The OMII-UK BPEL Project Make the benefits of BPEL accessible to application scientists Need to overcome a number of issues: Provide suitable set of abstractions to simplify creation of scientific workflows Provide tool support to hide complexity of technologies inherent to BPEL Provide integration of various middleware technologies UCL Department of Computer Science

© 43 Example Integration UCL Department of Computer Science

© 44 Graphical BPEL Designer Developed in cooperation with IBM and Oracle An Eclipse plug-in & project Features (will) include: Pre-deployment validation Automated deployment to various BPEL engines: ActiveBPEL, JBOSS, jBPM and many more Automated client generation to run deployed workflows Integration with graphical WSDL, XSD editors Project management including cvs Context-sensitive wizards UCL Department of Computer Science

© 45 ActiveBPEL Engine Open source implementation of BPEL4WS 1.1 Support for WS-BPEL 2.0 via extensions Industrial-strength BPEL workflow enactment engine Scalable Persistence Hot-deployment Provides Web-based management & monitoring console UCL Department of Computer Science

© 46 Scientific Workflows Need to combine web/grid services into: Larger services Experiments that connect services & processes Experiments need to be changed frequently to incorporate new insights and ideas Application scientists need ownership of their workflows Challenges: Large data, Many services & invocations, concurrent workflows (parameter sweeps) UCL Department of Computer Science
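
The parameter-sweep challenge (many concurrent runs of the same workflow with different inputs) can be illustrated outside BPEL with a short Python sketch. It is not BPEL and not part of the OMII-UK tooling: `run_experiment` is a hypothetical stand-in for one workflow invocation, and a thread pool stands in for concurrent service calls.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import product


def run_experiment(temperature, pressure):
    """Hypothetical stand-in for invoking one workflow instance."""
    return {"score": temperature * pressure}


def parameter_sweep(experiment, grid, max_workers=4):
    """Run the experiment once for every combination of parameter values.

    grid: parameter name -> list of values to try
    Returns (parameters, result) pairs in a deterministic order.
    """
    names = sorted(grid)
    combinations = [dict(zip(names, values))
                    for values in product(*(grid[name] for name in names))]
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves input order even though runs overlap in time.
        results = list(pool.map(lambda params: experiment(**params),
                                combinations))
    return list(zip(combinations, results))
```

With `grid = {"temperature": [1, 2], "pressure": [10, 20]}` this launches four runs. The number of runs grows multiplicatively with each added parameter, which is why the slide lists many invocations and concurrent workflows as challenges.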

Community Interactions

© 48 Support & Training
Provide confidence in adopting e-Science solutions through software support and training.
Provide collaborative mechanisms to enable the e-Science community to help itself.
Engage with the international community to define, contribute and disseminate best practice and standards.
ISSGC06

© 49 Standards Engagement: Consume & Contribute
Open Grid Forum
  Membership of the Board of Directors, Standards & e-Science
  Active in WGs: OGSA-WG, BES-WG, JSDL-WG, DAIS-WG, Byte-IO
W3C
  Semantic Web/Grid (e.g. OWL, RDF)
  Healthcare and Life Sciences SIG (LSIDs)
  OGF Liaison
OASIS
  WSRF membership
  Track relevant specifications
Tracking European Standards Developments

© 50 Website
Information:
  General pages - news, events & wiki (with RSS feeds)
  About software being used in the community
  Communities & projects using our software
Software:
  Download individual software components
    Work that we have commissioned
    Interim releases directly from individual development teams
    Contributions from the community
  Integrated software release (client & server bundles)
    Wizards to install Tomcat, Axis, Services, Database, …

© 51 e-Science Community Information about software that you have found useful Open Source Development Community Information about software that you are developing that others might use Contributing software into the Repository Open Source Developers funded by OMII-UK in the community Website & Wiki Software Repository Software Catalogue Register a software project or activity Website Interactions

© 52 OMII-UK User Community Download individual software components directly from repository OMII-UK Software Release Software components integrated and tested to form the OMII-UK software release Applied Domain Researchers Technologists Providers Information about the community Advice & consultancy Community forums & feedback Support & Training Partnerships to provide software Website & Wiki Software Repository Software Catalogue

© 53 Commissioned Software Programme Invest in open-source community development activities Approximately 8 FTEs a year Projects of months in length Hardening of software not research Funding mechanism Through specific calls (e.g. portlets, GridAPIs, …) Respond to specific proposals as they come in Deposit outputs in the repository and NeSCForge Increase confidence & accelerate adoption of open-source software

© 54 Funding from recent calls SAGA Implementation (Shantenu Jha) Java & C++ implementation on OMII-UK & GT Shibboleth enabled portal (Richard Sinnott) Portlet to specify resource access control policy Portlets to manage attribute delegation Artefact sharing framework for portlets (Ian Taylor) Controlled sharing of workflows Still under negotiation: Sharing application descriptions based upon JSDL Execution of JSDL documents on resources Community outreach for new GridAPIs

© 55 Very Flexible Funding Model Engagement with large open-source projects Taverna, BPEL components, … Development of community driven projects OGSA-DAI, GridSAM, Grimoires, … Not just UK focussed Fund developers in USA & Amsterdam Community not technology focussed Have own software release around plain WS Fund activity consuming GT: OGSA-DAI, SAGA, Grimoires

© 56 Other projects Knoggle An open architecture for matchmaking and brokering Consume GridSAM, Grimoires, BPEL & Taverna Open Grid Manager Monitoring and reporting of resources Lightweight probes recording data in Grimoires Application Hosting Environment Simplified lightweight interface to running applications Uses WSRF::Lite to provide secure Perl WS

Release 3.2.0

© 58 Perspectives on release Web Service Developer Secure web services hosting environment Infrastructure to help you build services Application Developer Everything a Web Service Developer wants Higher-level services that do ‘something’ Service Provider Easy installs, portability, composability – ‘pick & mix’ End-User Greater focus on the ‘client’ side Workflow composition and execution

© 59 For a Web Service Developer Tomcat, Axis & WS-Security: Integrated and deployed onto your machine Implementation of the WS-Eventing specification Implementation of the WS-ReliableMessaging and WS-Reliability specifications

© 60 For an Application Developer Job Submission and Job monitoring web service that uses the OGF’s Job Submission Description Language (JSDL) to describe jobs UDDI compliant registry web service that can support the addition of extra service meta-data. Using OGSA-DAI as a framework for querying, processing and delivering data from and between heterogeneous sources via a web service interface.

© 61 For an End User Taverna: Graphical workflow composition tool able to integrate different web, data and web service sources. Packaging & contribution to the open-source Oracle/IBM BPEL workflow editor and ActiveBPEL execution engine. A lightweight application hosting environment for running unmodified scientific applications across different grid infrastructures which uses WSRF::Lite - a Perl implementation of the WS-RF specifications Accessing web and grid services from Matlab and Jython environments

© 62 Next Production Release Updates from OGSA-DAI, Taverna, GridSAM, Grimoires Open Grid Manager: A framework for reporting on the status of grid resources into Grimoires and viewing the collected results. AuthZ Service: Integration of SAML based service for container wide authorisation policy. MANGO: Reference application using BPEL and GridSAM. Portlets: Integrating hosting environment and portlets. Timeline Series of development releases (3.3.x) Production release in April 2007 (3.4.0)

© 63 Summary What can you do to get involved? Let others know about your project Contribute a release of your software Join the beta-testing programme Download the complete software release or a component More Information: Web: Contact: Mail: