Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Flexible Scientific Workflows Using Dynamic Embedding Anne H.H. Ngu, Nicholas Haasch Terence.

Slides:



Advertisements
Similar presentations
Network II.5 simulator ..
Advertisements

A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
Provenance GGF18 Kepler/COW+RWS, Kepler/COW+RWS, Bowers, McPhiilips et al. Provenance Management in a COllection-oriented Scientific Workflow.
Architecture Representation
Ewa Deelman, Integrating Existing Scientific Workflow Systems: The Kepler/Pegasus Example Nandita Mangal,
ProActive Task Manager Component for SEGL Parameter Sweeping Natalia Currle-Linde and Wasseim Alzouabi High Performance Computing Center Stuttgart (HLRS),
DATAFLOW PROCESS NETWORKS Edward A. Lee Thomas M. Parks.
Nadia Ranaldo - Eugenio Zimeo Department of Engineering University of Sannio – Benevento – Italy 2008 ProActive and GCM User Group Orchestrating.
Component Technologies for Embedded Systems Johan Eker.
Chess Review May 8, 2003 Berkeley, CA Classes and Inheritance in Actor- Oriented Models Stephen Neuendorffer Edward Lee UC Berkeley.
5 th Biennial Ptolemy Miniconference Berkeley, CA, May 9, 2003 Java Code Generation Steve Neuendorffer UC Berkeley.
Type System, March 12, Data Types and Behavioral Types Yuhong Xiong Edward A. Lee Department of Electrical Engineering and Computer Sciences University.
Architecture-driven Modeling and Analysis By David Garlan and Bradley Schmerl Presented by Charita Feldman.
Department of Electrical Engineering and Computer Sciences University of California at Berkeley Behavioral Types for Actor-Oriented Design Edward A. Lee.
6th Biennial Ptolemy Miniconference Berkeley, CA May 12, 2005 Distributed Computing in Kepler Ilkay Altintas Lead, Scientific Workflow Automation Technologies.
February 11, 2010 Center for Hybrid and Embedded Software Systems Ptolemy II - Heterogeneous Concurrent Modeling and Design.
Using Interfaces to Analyze Compositionality Haiyang Zheng and Rachel Zhou EE290N Class Project Presentation Dec. 10, 2004.
February 12, 2009 Center for Hybrid and Embedded Software Systems Encapsulated Model Transformation Rule A transformation.
Review of “Embedded Software” by E.A. Lee Katherine Barrow Vladimir Jakobac.
An Extensible Type System for Component-Based Design
Design of Fault Tolerant Data Flow in Ptolemy II Mark McKelvin EE290 N, Fall 2004 Final Project.
Models of Computation for Embedded System Design Alvise Bonivento.
Chess Review November 21, 2005 Berkeley, CA Edited and presented by Causality Interfaces and Compositional Causality Analysis Rachel Zhou UC Berkeley.
Hierarchical Reconfiguration of Dataflow Graphs Stephen Neuendorffer UC Berkeley Poster Preview May 10, 2004.
Heterochronous Dataflow in Ptolemy II Brian K. Vogel EE249 Project Presentation, Dec. 4, 1999.
SEC PI Meeting Annapolis, May 8-9, 2001 Component-Based Design of Embedded Control Systems Edward A. Lee & Jie Liu UC Berkeley with thanks to the entire.
The new The new MONARC Simulation Framework Iosif Legrand  California Institute of Technology.
Department of Electrical Engineering and Computer Sciences University of California at Berkeley System-Level Types for Component-Based Design Edward A.
Department of Electrical Engineering and Computer Sciences University of California at Berkeley Concurrent Component Patterns, Models of Computation, and.
February 12, 2009 Center for Hybrid and Embedded Software Systems Model Transformation Using ERG Controller Thomas H. Feng.
5 th Biennial Ptolemy Miniconference Berkeley, CA, May 9, 2003 The Component Interaction Domain: Modeling Event-Driven and Demand- Driven Applications.
System-Level Types for Component-Based Design Paper by: Edward A. Lee and Yuhong Xiong Presentation by: Dan Patterson.
Department of Electrical Engineering and Computer Sciences University of California at Berkeley The Ptolemy II Framework for Visual Languages Xiaojun Liu.
BRASS Analysis of QuasiStatic Scheduling Techniques in a Virtualized Reconfigurable Machine Yury Markovskiy, Eylon Caspi, Randy Huang, Joseph Yeh, Michael.
 Scientific workflow management system based on Ptolemy II  Allows scientists to visually design and execute scientific workflows  Actor-oriented.
Biology.sdsc.edu CIPRes in Kepler: An integrative workflow package for streamlining phylogenetic data analyses Zhijie Guan 1, Alex Borchers 1, Timothy.
Windows.Net Programming Series Preview. Course Schedule CourseDate Microsoft.Net Fundamentals 01/13/2014 Microsoft Windows/Web Fundamentals 01/20/2014.
DIANE Overview Germán Carrera, Alfredo Solano (CNB/CSIC) EMBRACE COURSE Monday 19th of February to Friday 23th. CNB-CSIC Madrid.
FALL 2005CSI 4118 – UNIVERSITY OF OTTAWA1 Part 4 Web technologies: HTTP, CGI, PHP,Java applets)
SDM Center A Quick Update on the TSI and PIW workflows SDM All Hands March 2-3, Terence Critchlow, Xiaowen Xin, Bertram.
German National Research Center for Information Technology Research Institute for Computer Architecture and Software Technology German National Research.
Composing Models of Computation in Kepler/Ptolemy II
COMP 410 & Sky.NET May 2 nd, What is COMP 410? Forming an independent company The customer The planning Learning teamwork.
University of California, Davis Daniel Zinn 1 University of California, Davis Daniel Zinn 1 Parallel Virtual Machines in Kepler Daniel Zinn Xuan Li Bertram.
Magnetic Field Measurement System as Part of a Software Family Jerzy M. Nogiec Joe DiMarco Fermilab.
Scientific Workflow Interchanging Through Patterns: Reversals and Lessons Learned Bruno Fernandes Bastos Regina Maria Maciel Braga Antônio Tadeu Azevedo.
Accelerating Scientific Exploration Using Workflow Automation Systems Terence Critchlow (LLNL) Ilkay Altintas (SDSC) Scott Klasky(ORNL) Mladen Vouk (NCSU)
Issues in (Financial) High Performance Computing John Darlington Director Imperial College Internet Centre Fast Financial Algorithms and Computing 4th.
Spring/2002 Distributed Software Engineering C:\unocourses\4350\slides\DefiningThreads 1 RMI.
Unit 4 Object-Oriented Design Patterns NameStudent Number CAI XIANGHT082182A KYAW THU LINHT082238Y LI PENGFEIHT082220L NAUNG NAUNG LATTHT082195L PLATHOTTAM.
Kepler includes contributors from GEON, SEEK, SDM Center and Ptolemy II, supported by NSF ITRs (SEEK), EAR (GEON), DOE DE-FC02-01ER25486.
Interactive Workflows Branislav Šimo, Ondrej Habala, Ladislav Hluchý Institute of Informatics, Slovak Academy of Sciences.
Abstract A Structured Approach for Modular Design: A Plug and Play Middleware for Sensory Modules, Actuation Platforms, Task Descriptions and Implementations.
Gedae, Inc. Gedae: Auto Coding to a Virtual Machine Authors: William I. Lundgren, Kerry B. Barnes, James W. Steed HPEC 2004.
Toward interactive visualization in a distributed workflow Steven G. Parker Oscar Barney Ayla Khan Thiago Ize Steven G. Parker Oscar Barney Ayla Khan Thiago.
© FPT SOFTWARE – TRAINING MATERIAL – Internal use 04e-BM/NS/HDCV/FSOFT v2/3 JSP Application Models.
Satisfying Requirements BPF for DRA shall address: –DAQ Environment (Eclipse RCP): Gumtree ISEE workbench integration; –Design Composing and Configurability,
INFSO-RI Enabling Grids for E-sciencE Using of GANGA interface for Athena applications A. Zalite / PNPI.
ACCESSING DATA IN THE NIS USING THE KEPLER WORKFLOW SYSTEM Corinna Gries.
© Geodise Project, University of Southampton, Workflow Support for Advanced Grid-Enabled Computing Fenglian Xu *, M.
Ptolemy II - Heterogeneous Concurrent Modeling and Design in Java
Event Relation Graphs and Extensions in Ptolemy II
Ptolemy II - Heterogeneous Concurrent Modeling and Design in Java
Retargetable Model-Based Code Generation in Ptolemy II
Ptolemy II - Heterogeneous Concurrent Modeling and Design in Java
Ptolemy II - Heterogeneous Concurrent Modeling and Design in Java
CSP-Style Rendezvous Communication Thomas Feng, Edward Lee, Yang Zhao
A Semantic Type System and Propagation
Overview of Workflows: Why Use Them?
GGF10 Workflow Workshop Summary
Presentation transcript:

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Flexible Scientific Workflows Using Dynamic Embedding Anne H.H. Ngu, Nicholas Haasch Terence Critchlow Shawn Bowers, Timothy McPhilips, Bertram Ludaescher

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Outline Scientific Workflow Problems with static scientific workflow Frame actor Dynamic Embedding Implementation of Dynamic Embedding TSI case study Conclusion

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Scientific Workflows A model of the way a scientist works with their data and tools Mentally coordinate data export, import, analysis and visualization via various software tools Goals: Design Automation Component reuse … make data analysis and management tasks easier for the scientist!

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 SPA/Kepler Scientific Workflow System Scientific Process Automation (SPA) Modeling Workflows using actor-oriented framework Executing and Monitoring Workflows Built on top of Ptolemy II (Berkeley) Graphical User Interface Similar to a charting program (Visio) Drag-and-Drop Components Connect components Execute workflows Monitor Execution Ptolemy II SPA

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Actor-Oriented Modeling Actors single component or task well-defined interface (signature) given input data, produces output data

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Actor-Oriented Modeling Ports each actor has a set of typed input and output ports input ports consume data (tokens) output ports produce data

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Actor-Oriented Modeling Parameter Input that is configured statically User changes at design Produces data

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Actor-Oriented Modeling Dataflow Connections actor “communication” channels connect output ports with input ports data is transmitted with tokens

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Actor-Oriented Modeling Sub-Workflows (aka Composite Actors) composite actors “wrap” sub-workflows like actors, have signatures (i/o ports of sub-workflow) hierarchical workflows (arbitrary nesting levels) for abstraction versus ‘Atomic Actors'

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Actor-Oriented Modeling Directors define the execution semantics of workflow graphs schedule and execute workflow graphs sub-workflows may be governed by different directors Examples: Synchronous Data-Flow (SDF), Process Networks (PN), Discrete Event (DE), Finite State Machine (FSM)

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Data Process Network (PN) Domain Each actor run in its own thread Actors run in parallel Actor communication is asynchronous Commonly used in Scientific Workflows Streaming data Concurrency

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Problems in current SPA/Kepler modeling framework Workflow is static and must be completely specified before orchestration Specific tools used in the workflow must be picked at design time ( It may not be picking the best tool ) All alternative tools must be enumerated exhaustively ( resulted in complex workflow )

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Problems in current SPA/Kepler modeling framework (cont.) Our work aims to provide an abstraction of a tool, a resource, or an algorithm Specific tool or resource resource is selected during execution

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Terascale Supernova Initiative (TSI) Workflow

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Different TSI Workflows Each containing a Submit Job Actor A BC

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 TSI Submit Job Actors …when we look at the sub-workflows Each workflow does job submission in a different way Each containing a remote execution A C B

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Goals There exists actors (atomic and composite) that perform a similar task Submit Job Transfer Files Process Files RemoteExection SSH2Exec SSHWrapper InvokeSubmitJob Can we provide an abstraction for encapsulating different implementations of a specific task that can be reused across different workflows? Can we execute the workflow flexibly? Choosing a specific implementation depending on runtime condition

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Requirements  Select and execute actor based on run time conditions  Execute with data process networks Built in capability for streaming data Built in concurrency  Selected actors instantiated on demand  Reusable actor can be nested  Actors are usable by scientist

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Frames Actors are concrete Correspond to particular implementations Frames are abstract Placeholder for actor / composite actor input, output, and parameter ports An embedding occurs when a actor is placed in a frame A refinement is an actor that can be embedded in a frame F a Embedding F[a] F a RemoteExecutionFrame F SSH Actor a a

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 SSH2 Exec Web Service Web Service SSH2 Exec Static Embedding Frames A refinement to a Frame is embedded during design Frames become concrete and cannot be reconfigured during workflow execution F Design Time Web Service SSH2 Exec Run Time

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Static Embedding Frames (cont.) Developer defines different frame actors defines possible refinement (behavior patterns) User adds and configures frame at design time chooses refinement at design time Frame can be reconfigured by changing parameters before execution Actor 1Actor n … t1 t2 t3 (State) Frame Implementation – Composite Actor (Dataflow) T T F FST F

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Static Embedding Frames (cont.)  Execute with data process networks  Refinement instantiated as needed  Can be nested  Actors are usable by scientist  Select refinements based on run time conditions SSH2 Exec F Run Time Web Service F

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Dynamic Embedding Frames Refinement to frames are embedded during execution Frames are not concrete during workflow execution SSH2 Exec Web Service Web Service SSH2 Exec F Web Service SSH2 Exec Run Time

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Why Dynamic Embedding?  Select refinements based on run time conditions  Execute with data process networks  Refinement instantiated as needed  Can be nested  Actors are usable by scientist

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Implementation Of Dynamic Embedding Construct a new workflow to execute the actor Generates Remote Execution Frame Generated Workflow

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Dynamic Embedding Process 1. Wait for inputs to arrive to the Frame. 2. Select a refinement. 3. Transfer of input tokens from Frame to the refinement. 4. Select mappings Input Port Output Port Parameter 5. Constructs internal workflow. 6. Run internal workflow. 7. Transfer of output tokens from the refinement to the Frame. Model Generated By Dynamic Frame Remote Execution Frame

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 ModelReference Actor A higher order actor that can execute a given model (workflow) through its input port. It fits most of the requirement for dynamic embedding except: User must pre-construct the given model Output tokens from the given model are transferred only after completion of the internal workflow. Our implementation of dynamic embedding leverage the capability of ModelReference actor with two major improvements: The given model is constructed automatically Output tokens are transferred synchronously Frame is thus implemented as a subclass of ModelReference actor with four additional components: SelectActor(), FrameSourceActor(); FrameSinkActor() and PortWiring()

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Implementing a Type of Frame Subclass Frame and Implement Selection Process selectActor() Configure Ports and Parameters getIntputMappings() getOutputMappings() getParameterMappings()

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Selection Process: selectActor() Implements selection policy Returns a refinement Refinement that is returned gets automatically embedded Remote Execution String selectedActor selectActor(){ If(testWebService()) selectedActor = “webservice” return getWebServiceActor() else selectedActor = “ssh2exec” return getSSH2ExecActor() }

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Port Wiring Transfer Token Frame Input Port  Actor Input Port Actor Output Port  Frame Output Port Expressed as list containing pairs of strings {“hostname”,”hostname”} {“command”,”cmd”} SSH2 Exec F errors stdout errors out hostname cmd hostname command Remote Execution String selectedActor getInputPortMapping(){ if(selectedActor==“SSH2Exec”) return {{“hostname”,”hostname”} {“command”,”cmd”}} else if(selectedActor==“webservice”) return {{“hostname”,”url”} {“command”,”method”}} }

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 TSI-A Workflow SubmitJobFrame TSI-B Workflow TSI Case Study

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Remote Execution Frame SubmitJobFrame TSI-B subworkflow TSI-A subworkflow

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Benefits  Select refinements based on run time conditions  Execute with data process networks  Refinement instantiated as needed  Can be nested  Actors are usable by scientist

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Limitations Unable to type check internal workflow before execution There is overhead of creating an additional workflow to execute a refinement Change in selection process requires recoding/recompiling Can not monitor internal workflow (Useful for debugging)

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Future Work Semantic binding of Ports and Parameters Configurable selection criteria Intelligent brokering Ptolemy Expression Language Python Perl … Simplified refinement creation Caching of generated workflows Design time type checking of internal workflow

Ngu, Texas StatePtolemy Miniconference, February 13, 2007 Work performed under the auspices of the U. S. Department of Energy by Lawrence Livermore National Laboratory under Contract W-7405-Eng-48 UCRL-ABS