Workflow and Triana Services Matthew Shields, e-Science Workflow Services, 3-5 December T r a ai n.

Slides:



Advertisements
Similar presentations
LEAD Portal: a TeraGrid Gateway and Application Service Architecture Marcus Christie and Suresh Marru Indiana University LEAD Project (
Advertisements

Pulan Yu School of Informatics Indiana University Bloomington Web service based Varuna.Net.
Abstraction Layers Why do we need them? –Protection against change Where in the hourglass do we put them? –Computer Scientist perspective Expose low-level.
A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
Under the Hood of a Workflow Manager Matthew Shields, BiodiversityWorld GRID workshop, NeSC, 30 June - 1 July T r a ai n.
TSpaces Services Suite: Automating the Development and Management of Web Services Presenter: Kevin McCurley IBM Almaden Research Center Contact: Marcus.
Programming Workflow with Triana Services Matthew Shields, SC4DEVO Workshop, July 2004.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 12 Slide 1 Distributed Systems Design 2.
1 Opentest Architecture Table of Content –The Design Basic Components High-Level Test Architecture Test Flow –Services provided by each Layer Test Mgt.
BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July 2005 Resource wrappers, web services, grid services Jaspreet Singh School of Computer.
ProActive Task Manager Component for SEGL Parameter Sweeping Natalia Currle-Linde and Wasseim Alzouabi High Performance Computing Center Stuttgart (HLRS),
G O B E Y O N D C O N V E N T I O N WORF: Developing DB2 UDB based Web Services on a Websphere Application Server Kris Van Thillo, ABIS Training & Consulting.
Rheeve: A Plug-n-Play Peer- to-Peer Computing Platform Wang-kee Poon and Jiannong Cao Department of Computing, The Hong Kong Polytechnic University ICDCSW.
Workshop on Cyber Infrastructure in Combustion Science April 19-20, 2006 Subrata Bhattacharjee and Christopher Paolini Mechanical.
The new The new MONARC Simulation Framework Iosif Legrand  California Institute of Technology.
Client/Server Architecture
Programming Scientific and Distributed Workflow with Triana Services Matthew Shields, GGF10 Workflow Workshop, 9th March.
Resource Management Reading: “A Resource Management Architecture for Metacomputing Systems”
GMD German National Research Center for Information Technology Innovation through Research Jörg M. Haake Applying Collaborative Open Hypermedia.
SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking.
SOA, BPM, BPEL, jBPM.
Grid Workflow within Triana Ian Wang Cardiff University.
Triana Dr Matthew Shields, School of Computer Science, Cardiff University A Problem Solving Environment and Grid Workflow Tool.
XForms: A case study Rajiv Shivane & Pavitar Singh.
UPC/SHMEM PAT High-level Design v.1.1 Hung-Hsun Su UPC Group, HCS lab 6/21/2005.
DISTRIBUTED COMPUTING
5th GridLab Review, 8 April 2005, Amsterdam WP12: Access for mobile users What's new in mobile user support Piotr Grabowski, Bartek Lewandowski
Flexibility and user-friendliness of grid portals: the PROGRESS approach Michal Kosiedowski
WP9 Resource Management Current status and plans for future Juliusz Pukacki Krzysztof Kurowski Poznan Supercomputing.
Triana: Service-Oriented Examples Ian Taylor Cardiff University, and the Center for Computation and Technology LSU.
DEVS Namespace for Interoperable DEVS/SOA
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Kuali Enterprise Workflow Presented at ITANA October 2009 Eric Westfall – Kuali Rice Project Manager.
© Geodise Project, University of Southampton, Workflow GUI and SMS Fenglian Xu May 2003
Interoperability between Scientific Workflows Ahmed Alqaoud, Ian Taylor, and Andrew Jones Cardiff University 10/09/2008.
Constructing Data Mining Applications based on Web Services Composition Ali Shaikh Ali and Omer Rana
1 Geospatial and Business Intelligence Jean-Sébastien Turcotte Executive VP San Francisco - April 2007 Streamlining web mapping applications.
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Code Applications Tamas Kiss Centre for Parallel.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Grid Computing at Yahoo! Sameer Paranjpye Mahadev Konar Yahoo!
Dr. Ian Wang Cardiff University, U.K.. Ian Wang, Cardiff University Peer-to-Peer and Grids What does the Peer-to-Peer paradigm offer Grids? Scalable Decentralized.
GSFL: A Workflow Framework for Grid Services Sriram Krishnan Patrick Wagstrom Gregor von Laszewski.
Ian Taylor, Cardiff Work-Flow Application Toolkit Eger Meeting Ian Taylor & Ian Wang Cardiff University, UK.
What is Triana?. GAPGAP Triana Distributed Work-flow Network Action Commands Workflow, e.g. BPEL4WS Triana Engine Triana Controlling Service (TCS) Triana.
Ipgdec5-01 Remarks on Web Services PTLIU Laboratory for Community Grids Geoffrey Fox, Marlon Pierce, Shrideep Pallickara, Choonhan Youn Computer Science,
Interactive Workflows Branislav Šimo, Ondrej Habala, Ladislav Hluchý Institute of Informatics, Slovak Academy of Sciences.
Distributed Computing With Triana A Short Course Matthew Shields, Ian Taylor & Ian Wang.
Enabling Grids for E-sciencE Astronomical data processing workflows on a service-oriented Grid architecture Valeria Manna INAF - SI The.
Using Federated Services with Triana Matthew Shields Cardiff University.
Course: COMS-E6125 Professor: Gail E. Kaiser Student: Shanghao Li (sl2967)
A Demonstration of Collaborative Web Services and Peer-to-Peer Grids Minjun Wang Department of Electrical Engineering and Computer Science Syracuse University,
Dr. Rebhi S. Baraka Advanced Topics in Information Technology (SICT 4310) Department of Computer Science Faculty of Information Technology.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
INFSOM-RI The Reporting Engine Paolo Fabriani Engineering Ingegneria Informatica S.p.A. 2 nd ETICS All Hands Meeting Bologna –
LQCD Workflow Project L. Piccoli October 02, 2006.
Data Manipulation with Globus Toolkit Ivan Ivanovski TU München,
Tool Integration with Data and Computation Grid “Grid Wizard 2”
ATLAS-specific functionality in Ganga - Requirements for distributed analysis - ATLAS considerations - DIAL submission from Ganga - Graphical interfaces.
© Geodise Project, University of Southampton, Workflow Application Fenglian Xu 07/05/03.
THE EYESWEB PLATFORM - GDE The EyesWeb XMI multimodal platform GDE 5 March 2015.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Holding slide prior to starting show. Lessons Learned from the GECEM Portal David Walker Cardiff University
© Geodise Project, University of Southampton, Workflow Support for Advanced Grid-Enabled Computing Fenglian Xu *, M.
Grid Execution Management for Legacy Code Architecture Exposing legacy applications as Grid services: the GEMLCA approach Centre.
Holding slide prior to starting show. Processing Scientific Applications in the JINI-Based OGSA-Compliant Grid Yan Huang.
SE 548 Process Modelling WEB SERVICE ORCHESTRATION AND COMPOSITION ÖZLEM BİLGİÇ.
Redmond Protocols Plugfest 2016 Jinghui Zhang Office Interoperability Test Tools (Test Suites and Open Source Projects) Software Engineer Microsoft Corporation.
A General Approach to Real-time Workflow Monitoring
Gordon Erlebacher Florida State University
Presentation transcript:

Workflow and Triana Services Matthew Shields, e-Science Workflow Services, 3-5 December T r a ai n

Matthew Shields, Cardiff University Outline Triana Overview Triana services and their distribution Distribution policies The GAP interface and its relation to the Gridlab GAT Workflow Overview of Triana workflow language Distributed workflow Web Services Web service implementation – discovery, data types, choreography Applications Overview of current Triana applications Coalescing Binary Scenario using the GAP interface

Matthew Shields, Cardiff University What is Triana?

Matthew Shields, Cardiff University Remote Control Triana Triana Engine Command Service Control Workflow, commands A client logged on Persistent Service Triana Controller Service Wap Html/Java Triana GUI and engine are Decoupled: - use standardized task-graphs - allows multiple devices to connect/disconnect - persistent service (TCS )

Matthew Shields, Cardiff University Triana Distributed Work-flow Network Action Commands Workflow, e.g. BPEL4WS Triana Engine Triana Controlling Service (TCS) Triana Service & Engine Triana Service & Engine Triana Service & Engine Other Engine Distributed Triana Work-flow - flexible distribution: based around Triana Groups - HPC and Pipelined distribution Triana Gateway

Matthew Shields, Cardiff University Triana Prototype Distributed Triana Prototype Based around Triana Groups i.e. aggregate tools Each group can be distributed Distribution policies: HTC - high throughput/task farming Peer to Peer - allow node to node communication Each service can be a gateway to finer granularities of distribution: Pipeline Distribution Task-Farming Distribution Triana Service

Matthew Shields, Cardiff University Current Triana Architecture GAP Interface Gridlab Services JXTAServe P2PS WServe JXTA Sockets Web Services Globus Services Distributed Mechanisms

Matthew Shields, Cardiff University Jxtaserve GSI Enabled NS-2 And more.. Java GAT Prototype Jxta GridLab GAT ( Advertising Discovery Communication GAP (Java Prototype) Web Services P2PS Job Submission (GRMS) Generic Job Submission Virtual filename data access Data Management Set of generic Java interfaces high level abstractions to Grid services Factory design – dynamic pluggable services OGSA (planned)

Matthew Shields, Cardiff University Triana Workflow Triana is inherently flow based Data flow - data arriving at component triggers execution Control flow - control commands trigger execution Decentralised execution Data or Control messages sent along communication “pipes” from sender to receiver causes receiver to execute Synchronous or Asynchronous messaging (Implementation dependant) Multiple inputs can block or trigger immediately (Component designer defined)

Matthew Shields, Cardiff University Components and Definitions Component is unit of execution Components are defined in XML files: Naming information Input and output ports Parameter information Why Components? To simplify the application design process and to speed up application development The component model provides an infrastructure for the interaction of components

Matthew Shields, Cardiff University Taskgraph Internal object based workflow graph representation Taskgraph - DAG Tasks Connections External XML representation Simple XML syntax List of participating Task definitions Parent/Child connection Hierarchical (Compound components) Alternative Languages & Syntax e.g. BPEL4WS Available through pluggable readers & writers.

Matthew Shields, Cardiff University Triana Pluggable Architecture Triana TaskGraph Writer Triana Command Writer XML Writer Other Writer TCom Writer Other Writer Triana TaskGraph Reader Triana Command Reader XML Reader WSFL Reader TCom Reader Other Reader Interactive Interactive/OfflineCommunication Channels Triana Engine Command Service Control Application’s Insert Points Plug-in Applications - flexible: apps can use Triana in various ways, as a: - GUI - remote control GUI - or in full inc. GAP/GAT 3 rd Party Application 3 rd Party Application

Matthew Shields, Cardiff University Workflow No explicit language support for control constructs Loops and execution branching handled by components Loop component - controls loop over sub-workflow Logical component - control workflow branching Unlike BPEL4WS or similar Flexibility of control - constraint based loops etc…

Matthew Shields, Cardiff University Example XML Workflow

Matthew Shields, Cardiff University Example XML Workflow

Matthew Shields, Cardiff University Example XML Workflow

Matthew Shields, Cardiff University Example XML Workflow

Matthew Shields, Cardiff University Distributing Triana Taskgraphs Mapping tasks or groups of tasks to resources Two stages: Taskgraph annotation, XML definition for each task or group of tasks extended to specify resources and message channels Data distribution, annotated sub-sections of taskgraph passed to resources

Matthew Shields, Cardiff University Custom Distribution Distribution units are standard Triana tools, enabling users to create their own custom distributions Distribution Unit WaveGrapher Gaussian FFT Gaussian FFT Remote Services Local Triana The workflow is cloned/split/rewired to achieve the required distribution topology Custom distribution units allow sub- workflows to be distributed in parallel or pipelined

Matthew Shields, Cardiff University Remote Deployment User can distribute any task or group of tasks (sub-workflow) Using the GAP Interface, Triana automatically launches a remote service providing that sub-workflow. Input, Output and Control Pipes are connected using the current GAP binding (e.g. JXTA Pipes)

Matthew Shields, Cardiff University Deploying and Connecting To Remote Services Running services are automatically discovered via the GAP Interface, and appear in the tool tree User can drag remote services onto the workspace and connect cables to them like standard tools (except the cables represent actual JXTA/P2PS pipes) Remote Services

Matthew Shields, Cardiff University Web Services Choreographing web services Discovering web services Automatic interface for complex input/output types Deploying Triana services Connecting Triana services

Matthew Shields, Cardiff University Web Service Discovery 1 Triana allows users to query UDDI repositories Alternatively, users can import services directly from WSDL

Matthew Shields, Cardiff University Web Service Discovery 2 Discovered/Imported Web Services are converted into Triana tools (service name = tool name) (input message parts = in nodes) (output message parts = out nodes) etc… Web Service tools are displayed in the user’s Tool Tree (alongside local tools)

Matthew Shields, Cardiff University Connecting Workflows Web Service tools can be dropped onto the user’s workspace and connected like local tools A workflow can contain both local and Web Service tools

Matthew Shields, Cardiff University Complex Data Types Users can build their own interface for creating/mediating between complex types Alternatively, Triana can dynamically generate an interface from the WSDL2Java generated bean class

Matthew Shields, Cardiff University Red Boxes – Web Services Simple but powerful example: read_bible - extracts verses from the bible BabelFish - translates between English and French (and other languages) Result = The Bible translated into French ! Converting the Bible into French

Matthew Shields, Cardiff University Applications User Interface Integration (direct plugin) Used GriPhyN to represent work-flow GEO 600 – interface with GEO++ monitoring system GEMSS: Application Work-flow Galaxy Formation Galaxy formation group at Cardiff for visualization GEO 600 Signal Processing Quick Look data analysis GEO 600 – rigorous unit testing (over 200 signal processing units) Data mining Coalescing Binary Code – will compare against MPI code Uses 50+ Triana units to simulate the search Full search complete …

Matthew Shields, Cardiff University GEMSS: Maxillo-facial Surgery Simulation

Matthew Shields, Cardiff University GEO 600 Inspiral Search Background Compact binary stars orbiting each other in a close orbit among the most powerful sources of gravitational waves As the orbital radius decreases a characteristic chirp waveform is produced - amplitude and frequency increase with time until eventually the two bodies merge together Computing Need 10 Gigaflops to keep up with real time data (modest search..) Data 8kHz in 24-bit resolution (stored in 4 bytes) -> Signal contained within 1 kHz = 2000 samples/second divided into chunks of 15 minutes in duration (i.e. 900 seconds) = 8MB Algorithm Data is transmitted to a node Node initialises i.e. generates its templates (around 10000) fast correlates its templates with data

Matthew Shields, Cardiff University Coalescing Binary Search

Matthew Shields, Cardiff University Triana Prototype GEO 600 Coalescing Binary Search

Matthew Shields, Cardiff University Coalescing Binary Scenario Gridlab Test-bed GW Data Distributed Storage Logical File Name CB Search Controller GAT (GRMS, Adaptive) GW Data GAT (Data Management) Submit Job Optimised Mapping , SMS notification

Matthew Shields, Cardiff University Triana Submits Itself … GAP (using GRMS service) Testbed

Matthew Shields, Cardiff University Thanks ! All this and more … at: