Toward interactive visualization in a distributed workflow Steven G. Parker Oscar Barney Ayla Khan Thiago Ize Steven G. Parker Oscar Barney Ayla Khan Thiago.

Slides:



Advertisements
Similar presentations
Nimrod/K: Towards Massively Parallel Dynamic Grid Workflows David Abramson, Colin Enticott, Monash Ilkay Altinas, UCSD.
Advertisements

A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
Tableau Software Australia
Executional Architecture
SSRS 2008 Architecture Improvements Scale-out SSRS 2008 Report Engine Scalability Improvements.
A component- and message-based architectural style for GUI software
RPC Robert Grimm New York University Remote Procedure Calls.
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
WSUS Presented by: Nada Abdullah Ahmed.
UCSD SAN DIEGO SUPERCOMPUTER CENTER Ilkay Altintas Scientific Workflow Automation Technologies Provenance Collection Support in the Kepler Scientific Workflow.
1 OBJECTIVES To generate a web-based system enables to assemble model configurations. to submit these configurations on different.
GridRPC Sources / Credits: IRISA/IFSIC IRISA/INRIA Thierry Priol et. al papers.
6th Biennial Ptolemy Miniconference Berkeley, CA May 12, 2005 Distributed Computing in Kepler Ilkay Altintas Lead, Scientific Workflow Automation Technologies.
The new The new MONARC Simulation Framework Iosif Legrand  California Institute of Technology.
Chapter 1: Overview of Workflow Management Dr. Shiyong Lu Department of Computer Science Wayne State University.
.NET Mobile Application Development Introduction to Mobile and Distributed Applications.
© 2001 by Prentice Hall8-1 Local Area Networks, 3rd Edition David A. Stamper Part 3: Software Chapter 8 Client/Server Architecture.
Slide 1 of 9 Presenting 24x7 Scheduler The art of computer automation Press PageDown key or click to advance.
Overview SAP Basis Functions. SAP Technical Overview Learning Objectives What the Basis system is How does SAP handle a transaction request Differentiating.
What is Concurrent Programming? Maram Bani Younes.
Biology.sdsc.edu CIPRes in Kepler: An integrative workflow package for streamlining phylogenetic data analyses Zhijie Guan 1, Alex Borchers 1, Timothy.
Using Provenance to Support Real-Time Collaborative Design of Workflows Tommy Ellkvist 1, Erik Anderson 2, David Koop 2, Juliana Freire 2, and Claudio.
WORKFLOW IN MOBILE ENVIRONMENT. WHAT IS WORKFLOW ?  WORKFLOW IS A COLLECTION OF TASKS ORGANIZED TO ACCOMPLISH SOME BUSINESS PROCESS.  EXAMPLE: Patient.
January, 23, 2006 Ilkay Altintas
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
STRATEGIES INVOLVED IN REMOTE COMPUTATION
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
A Hybrid Decomposition Scheme for Building Scientific Workflows Wei Lu Indiana University.
Martin Berzins (Steve Parker) What are the hard apps problems? How do the solutions get shared? What non-apps work is needed? Thanks to DOE for funding.
Chapter 6 Operating System Support. This chapter describes how middleware is supported by the operating system facilities at the nodes of a distributed.
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
TRƯỜNG ĐẠI HỌC CÔNG NGHỆ Bộ môn Mạng và Truyền Thông Máy Tính.
CCA Common Component Architecture Manoj Krishnan Pacific Northwest National Laboratory MCMD Programming and Implementation Issues.
Architecting Web Services Unit – II – PART - III.
DCE (distributed computing environment) DCE (distributed computing environment)
What are the main differences and commonalities between the IS and DA systems? How information is transferred between tasks: (i) IS it may be often achieved.
SUMA: A Scientific Metacomputer Cardinale, Yudith Figueira, Carlos Hernández, Emilio Baquero, Eduardo Berbín, Luis Bouza, Roberto Gamess, Eric García,
Chapter 1: Overview of Workflow Management Dr. Shiyong Lu Department of Computer Science Wayne State University.
A framework to support collaborative Velo: Knowledge Management for Collaborative (Science | Biology) Projects A framework to support collaborative 1.
Cracow Grid Workshop, October 27 – 29, 2003 Institute of Computer Science AGH Design of Distributed Grid Workflow Composition System Marian Bubak, Tomasz.
Accelerating Scientific Exploration Using Workflow Automation Systems Terence Critchlow (LLNL) Ilkay Altintas (SDSC) Scott Klasky(ORNL) Mladen Vouk (NCSU)
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 3: Operating-System Structures System Components Operating System Services.
The PROGRESS Grid Service Provider Maciej Bogdański Portals & Portlets 2003 Edinburgh, July 14th-17th.
Grid Computing Research Lab SUNY Binghamton 1 XCAT-C++: A High Performance Distributed CCA Framework Madhu Govindaraju.
1 Ilkay ALTINTAS - July 24th, 2007 Ilkay ALTINTAS Director, Scientific Workflow Automation Technologies Laboratory San Diego Supercomputer Center, UCSD.
Center for Component Technology for Terascale Simulation Software CCA is about: Enhancing Programmer Productivity without sacrificing performance. Supporting.
SCIRun and SPA integration status Steven G. Parker Ayla Khan Oscar Barney.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GVis: Grid-enabled Interactive Visualization State Key Laboratory. of CAD&CG Zhejiang University, Hangzhou
ICCS WSES BOF Discussion. Possible Topics Scientific workflows and Grid infrastructure Utilization of computing resources in scientific workflows; Virtual.
Distributed Components for Integrating Large- Scale High Performance Computing Applications Nanbor Wang, Roopa Pundaleeka and Johan Carlsson
CSI 3125, Preliminaries, page 1 SERVLET. CSI 3125, Preliminaries, page 2 SERVLET A servlet is a server-side software program, written in Java code, that.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
Data Communications and Networks Chapter 9 – Distributed Systems ICT-BVF8.1- Data Communications and Network Trainer: Dr. Abbes Sebihi.
Satisfying Requirements BPF for DRA shall address: –DAQ Environment (Eclipse RCP): Gumtree ISEE workbench integration; –Design Composing and Configurability,
PROGRESS: GEW'2003 Using Resources of Multiple Grids with the Grid Service Provider Michał Kosiedowski.
Selenium server By, Kartikeya Rastogi Mayur Sapre Mosheca. R
Ocean Observatories Initiative OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 Scientific Workflows for OOI Ilkay Altintas Charles.
BalticGrid-II Project EGEE UF’09 Conference, , Catania Partner’s logo Framework for Grid Applications Migrating Desktop Framework for Grid.
HNC COMPUTING - Network Concepts 1 Network Concepts Network Concepts Network Operating Systems Network Operating Systems.
Problem On a regular basis we use: –Java applets –JavaScript –ActiveX –Shockwave Notion of ubiquitous computing.
Workflow-Driven Science using Kepler Ilkay Altintas, PhD San Diego Supercomputer Center, UCSD words.sdsc.edu.
- DAG Scheduling with Reliability - - GridSolve - - Fault Tolerance In Open MPI - Asim YarKhan, Zhiao Shi, Jack Dongarra VGrADS Workshop April 2007.
Automatic launch and tracking the computational simulations with LiFlow and Sumatra Evgeniy Kuklin.
University of Technology
SDM workshop Strawman report History and Progress and Goal.
What is Concurrent Programming?
What is Concurrent Programming?
AIMS Equipment & Automation monitoring solution
Scientific Workflows Lecture 15
Presentation transcript:

Toward interactive visualization in a distributed workflow Steven G. Parker Oscar Barney Ayla Khan Thiago Ize Steven G. Parker Oscar Barney Ayla Khan Thiago Ize

Scientific Computing and Imaging Institute, University of Utah Component-Based Architectures Experience with numerous component-based architectures CCA (Parallel, Method Invocation, multi- language) SCIRun (Shared memory, Dataflow, C++) Uintah (Parallel, Method Invocation, C++) Kepler (Single process + web services, Generalized dataflow, Java +) SCIRun2 (Distributed/Parallel, Multi-model, mutli-language)

Scientific Computing and Imaging Institute, University of Utah DOE Common Component Architecture Project A CA for large-scale Scientific Computation Component Characteristics ­May be SPMD or multi-threaded parallel objects Heterogeneity ­Parallel platforms to desktops and any language Local and Remote ­Parallel communication for remote parallel interfaces and 0-copy in-process connection Dynamic Composition and Integration ­Hot-swapable components, shared instances Open forum involving DOE labs, Universities, others

Scientific Computing and Imaging Institute, University of Utah Uintah CCA-ish component architecture (C++ only) Plus components for multiphysics structured AMR simulations Scales to processors Simulation Controller Simulation Controller Problem Specification Problem Specification XML Simulation (One of Arches, ICE, MPM, MPMICE, MPMArches, …) Simulation (One of Arches, ICE, MPM, MPMICE, MPMArches, …) Scheduler Tasks Data Archiver Data Archiver Tasks Callbacks MPI Assignments Load Balancer Load Balancer Configuration

Scientific Computing and Imaging Institute, University of Utah SCIRun

Scientific Computing and Imaging Institute, University of Utah SCIRun PowerApps: BioImage

2/20/2004Building a KEPLER Extension Using Ptolemy II The KEPLER System for Scientific Workflows … A framework for design, execution and deployment of scientific workflows Caters specifically to the domain scientist Builds on Ptolemy II Application pull from various projects Slide thanks to: Ilkay Altintas and Efrat Jeager SDSC UCSD

Scientific Computing and Imaging Institute, University of Utah Kepler Workflow

Scientific Computing and Imaging Institute, University of Utah Component Architecture Design Choices Degree of isolation: processes, threads, single address space? Mechanism for communication: dataflow, process networks, method invocation Synchronization Programming languages: expressiveness tradeoffs Data types explicitly supported Performance requirements Extra tools required? Explicit support for parallelism?  Multiple designs for component architectures +Tailored to application needs -Islands of functionality

Scientific Computing and Imaging Institute, University of Utah SCIRun2 SCIRun2 provides a component model for component models (metacomponents) Plug-ins provide support for: CCA SCIRun Vtk Others Components use “native” communication mechanisms to connect to similar components Bridges connect models SCIRun2

Scientific Computing and Imaging Institute, University of Utah Meta-components example Common Framework Driver Function Integrator Function CORBACCA Integrator Driver Bridge

Scientific Computing and Imaging Institute, University of Utah Application

Scientific Computing and Imaging Institute, University of Utah SDM Requirements Distributed Workflow Repetitive Shared resources Automatically driven Coarse-grained (seconds to minute per operation) Interactive Visualization Exploratory Dedicated resources User-driven Fine-grained (milliseconds to seconds per operation)

Scientific Computing and Imaging Institute, University of Utah Goal What the user wants To get work done Make hard things easy How to do this 1.Combine tools with disparate strengths 2.Make them work efficiently 3.Focus on interfaces 4.Enable consistent user interfaces

Scientific Computing and Imaging Institute, University of Utah Utah's Contibution To the SPA Group SCIRun can now be controlled from SPA/Kepler workflows  Server interface  JNI interface “Smart” Re-run capability Provenance framework

Scientific Computing and Imaging Institute, University of Utah Kepler Workflow

Scientific Computing and Imaging Institute, University of Utah Workflow Requirements and “Wants” We Address Seamless access to resources and services “Smart” re-runs Data provenance Reliability and fault-tolerance Detached execution From: B. Ludäscher, et al. Scientific Workflow Management and the Kepler System. Concurrency and Computation: Practice & Experience, Special Issue on Scientific Workflows, to appear, 2005.

Scientific Computing and Imaging Institute, University of Utah SCIRun With SPA/Kepler Kepler actor sends requests to a SCIRun server Useful for processing batch jobs or iterating through the parameter space of a SCIRun module (actor) Requires existing SCIRun network, which the workflow actor will tell SCIRun to load JNI interface to SCIRun

Scientific Computing and Imaging Institute, University of Utah SCIRun Server Simple TCP/IP server that can be started remotely by Kepler Accepts requests from client actor in the workflow and then sends back location of results when it has finished Allows for the possibility of remote or/and detached execution of SCIRun

SCIRun and Kepler Dataflow Integration Automate SCIRun network execution with a Kepler actor driving execution through a JNI interface or a remote connection to a SCIRun server Incorporate SCIRun computation and visualization with the SPA workflow engine

Scientific Computing and Imaging Institute, University of Utah JNI interface with workflow

Scientific Computing and Imaging Institute, University of Utah What is provenance data? In general: steps taken to get a result Information about computational experiments or runs of scientific workflows that is needed to reproduce results We want to log metadata, steps applied to data, tools used to create data products Useful when you want to share/publish results

Scientific Computing and Imaging Institute, University of Utah The Standalone Provenance Framework

Scientific Computing and Imaging Institute, University of Utah “Smart” re-runs Instead of running a workflow from scratch we only re-run parts of the workflow that have not been done before Example: we change a parameter downstream and dont want to re-run the actors that lead up to the one with the parameter change Especially useful in visualization pipelines and long running workflows

Scientific Computing and Imaging Institute, University of Utah Utah and “Smart” Re-runs Uses VisTrails’ cache manager algorithm* Idea is to re-run as little of the network as possible by combining intermediate results from different workflow runs Recreates input to actors that need to be re- fired * L. Bavoil, et al. VisTrails: Enabling Interactive Multiple- View Visualizations. IEEE Visualization, 2005.

Scientific Computing and Imaging Institute, University of Utah

What is needed for “Smart” Re-runs We need to keep track of what we have done before Specifically we need to know what actors have been given what inputs with what outputs Stored provenance data can give us the information we need

Scientific Computing and Imaging Institute, University of Utah Other uses for provenance data Recreate results Recover from a system failure Checkpoint a workflow Create semantic links

Scientific Computing and Imaging Institute, University of Utah Future work Continue work on “Smart” Re-runs system Help workflow users integrate SCIRun with their workflows Get provenance framework checked into Ptolemy CVS Work on other provenance issues Help SCIRun users take advantage of workflow technology Develop CCA to Kepler bridging mechanisms