Online Performance Analysis and Visualization of Large-Scale Parallel Applications Kai Li, Allen D. Malony, Sameer Shende, Robert Bell Performance Research.

Slides:



Advertisements
Similar presentations
Computer Systems & Architecture Lesson 2 4. Achieving Qualities.
Advertisements

Network II.5 simulator ..
Open repositories: value added services The Socionet example Sergey Parinov, CEMI RAS and euroCRIS.
Copyright © 2012 Pearson Education, Inc. Publishing as Prentice Hall A.1.
Robert Bell, Allen D. Malony, Sameer Shende Department of Computer and Information Science Computational Science.
TAU Parallel Performance System DOD UGC 2004 Tutorial Allen D. Malony, Sameer Shende, Robert Bell Univesity of Oregon.
Subdue Graph Visualizer by Gayathri Sampath, M.S. (CSE) University of Texas at Arlington.
C++ Training Datascope Lawrence D’Antonio Lecture 11 UML.
© Copyright Eliyahu Brutman Programming Techniques Course.
Copyright 2004 Prentice-Hall, Inc. Essentials of Systems Analysis and Design Second Edition Joseph S. Valacich Joey F. George Jeffrey A. Hoffer Appendix.
WebDynpro for ABAP Short introduction.
Programming Logic and Design, Introductory, Fourth Edition1 Understanding Computer Components and Operations (continued) A program must be free of syntax.
About the Presentations The presentations cover the objectives found in the opening of each chapter. All chapter objectives are listed in the beginning.
Allen D. Malony, Sameer Shende, Robert Bell Department of Computer and Information Science Computational Science Institute, NeuroInformatics.
Kai Li, Allen D. Malony, Robert Bell, Sameer Shende Department of Computer and Information Science Computational.
Automated Tests in NICOS Nightly Control System Alexander Undrus Brookhaven National Laboratory, Upton, NY Software testing is a difficult, time-consuming.
SYSTEMS ANALYSIS. Chapter Five Systems Analysis Define systems analysis Describe the preliminary investigation, problem analysis, requirements analysis,
- Chaitanya Krishna Pappala Enterprise Architect- a tool for Business process modelling.
Annual SERC Research Review - Student Presentation, October 5-6, Extending Model Based System Engineering to Utilize 3D Virtual Environments Peter.
Windows.Net Programming Series Preview. Course Schedule CourseDate Microsoft.Net Fundamentals 01/13/2014 Microsoft Windows/Web Fundamentals 01/20/2014.
E-LABORATORY PRACTICAL TEACHING FOR APPLIED ENGINEERING SCIENCES W O R K S H O P University of Oradea, Romania February 6, 2012 G E N E R A L P R E S E.
PROCESS MODELING Chapter 8 - Process Modeling
Systems Analysis – Analyzing Requirements.  Analyzing requirement stage identifies user information needs and new systems requirements  IS dev team.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Team 1 Jesus Weibo Mina Yunming. Client’s Needs  Short term, medium term, long term plans  Simulation of energy system to provide affordable future.
DEVSView: A DEVS Visualization Tool Wilson Venhola.
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
Magnetic Field Measurement System as Part of a Software Family Jerzy M. Nogiec Joe DiMarco Fermilab.
Copyright 2001 Prentice-Hall, Inc. Essentials of Systems Analysis and Design Joseph S. Valacich Joey F. George Jeffrey A. Hoffer Appendix A Object-Oriented.
Copyright 2002 Prentice-Hall, Inc. Modern Systems Analysis and Design Third Edition Jeffrey A. Hoffer Joey F. George Joseph S. Valacich Chapter 20 Object-Oriented.
Introduction To System Analysis and Design
Paper written by Flavio Oquendo Presented by Ernesto Medina.
NIH NCRR Overview The SCIRun and BioPSE Problem Solving Environments Chris Johnson, Rob MacLeod, and David Weinstein Scientific Computing and Imaging Institute.
A Web Laboratory for Visual Interactive Simulation of Epitaxial Growth Feng Liu University of Utah Recently, we have developed a prototype of web laboratory.
Framework for MDO Studies Amitay Isaacs Center for Aerospace System Design and Engineering IIT Bombay.
1. 2 Preface In the time since the 1986 edition of this book, the world of compiler design has changed significantly 3.
Allen D. Malony Department of Computer and Information Science TAU Performance Research Laboratory University of Oregon Discussion:
Connections to Other Packages The Cactus Team Albert Einstein Institute
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
Software Engineering Chapter: Computer Aided Software Engineering 1 Chapter : Computer Aided Software Engineering.
GYTE - Bilgisayar Mühendisliği Bölümü Bilgisayar Mühendisliği Bölümü GYTE - Bilgisayar Mühendisliği Bölümü AN ARCHITECTURE FOR NEXT GENERATION MIDDLEWARE.
Online Monitoring System at KLOE Alessandra Doria INFN - Napoli for the KLOE collaboration CHEP 2000 Padova, 7-11 February 2000 NAPOLI.
Shangkar Mayanglambam, Allen D. Malony, Matthew J. Sottile Computer and Information Science Department Performance.
Parallel Performance Measurement of Heterogeneous Parallel Systems with GPUs Allen D. Malony, Scott Biersdorff, Sameer Shende, Heike Jagode†, Stanimire.
IBM Global Services © 2005 IBM Corporation SAP Legacy System Migration Workbench| March-2005 ALE (Application Link Enabling)
34 Copyright © 2007, Oracle. All rights reserved. Module 34: Siebel Business Services Siebel 8.0 Essentials.
Performane Analyzer Performance Analysis and Visualization of Large-Scale Uintah Simulations Kai Li, Allen D. Malony, Sameer Shende, Robert Bell Performance.
Chapter – 8 Software Tools.
FlowLevel Client, server & elements monitoring and controlling system Message Include End Dial Start.
ECG Simulation NCRR Overview Technology for the ECG Simulation project CardioWave BioPSE project background Tools developed to date Tools for the next.
Marcelo R.N. Mendes. What is FINCoS? A Java-based set of tools for data generation, load submission, and performance measurement of event processing systems;
IceCube DAQ Mtg. 10,28-30 IceCube DAQ: Implementation Plan.
1 © 2014 Atego. All Rights Reserved. Infrastructure Bridge Model Matthew Hause – Atego Chief Consulting Engineer
Computer Architecture Organization and Architecture
Fermilab Scientific Computing Division Fermi National Accelerator Laboratory, Batavia, Illinois, USA. Off-the-Shelf Hardware and Software DAQ Performance.
Kai Li, Allen D. Malony, Sameer Shende, Robert Bell
Course Outcomes of Object Oriented Modeling Design (17630,C604)
Object-Oriented Analysis and Design
Unified Modeling Language
J. Michael, M. Shing M. Miklaski, J. Babbitt Naval Postgraduate School
Allen D. Malony, Sameer Shende
Chapter 20 Object-Oriented Analysis and Design
Appendix A Object-Oriented Analysis and Design
Analysis models and design models
An Introduction to Software Architecture
Allen D. Malony Computer & Information Science Department
Outline Introduction Motivation for performance mapping SEAA model
Appendix A Object-Oriented Analysis and Design
Appendix A Object-Oriented Analysis and Design
Presentation transcript:

Online Performance Analysis and Visualization of Large-Scale Parallel Applications Kai Li, Allen D. Malony, Sameer Shende, Robert Bell Performance Research Laboratory, University of Oregon Parallel performance tools offer the program developer insights into the execution behavior of an application and are a valuable component in the cycle of application development and deployment. However, most tools do not work well with large-scale parallel applications where the performance data generated comes from thousands of processes. Not only can the data be difficult to manage and the analysis complex, but existing performance display tools are mostly restricted to two dimensions and lack the customization and display interaction to support full data investigation. In addition, it is increasingly important that performance tools be able to function online, making it possible to control and adapt long-running applications based on performance feedback. Again, large-scale parallelism complicates the online access and management of performance data, and it may be desirable to integrate performance analysis and visualization in existing computational steering infrastructures. We designed and prototyped a system architecture for online performance access, analysis, and visualization in a large-scale parallel environment. The architecture consists of four components. The performance data integrator component is responsible for interfacing with a performance monitoring system to merge parallel performance samples into a synchronous data stream for analysis. The performance data reader component reads the external performance data into internal data structures of the analysis and visualization system. The performance analyzer component provides the analysis developer a programmable framework for constructing analysis modules that can be linked together for different functionality. The performance visualizer component can also be programmed to create different displays modules. Our prototype is based on the TAU performance system, the Uintah computational framework, and the SCIRun computational steering and visualization system. Uintah Application Performance Steering Performance Visualizer Performance Analyzer Performance Data Reader TAU Performance System Performance Data Integrator SCIRun parallel performance data streams parallel performance data output (Parallel) File system sample sequencing reader synchronization accumulated samples socket Sample 1 Sample 2 Sample k … EPILOG Paraver TAU Performance System n,c,t i,j,k n:node i c:context j t:thread k Sample i n: 1n: 2n: j … c: 1c: 2c: k … c: 1c: 2c: k … t: 1t: 2t: l … t: 1t: 2t: l … Sample 1Sample k …… pppppp attribute and offspring access methods The performance data reader is implemented as a SCIRun module. It inputs the merged profile sample dataset sent by the data integrator and stores the dataset in an internal C++ object structure. A profile sample dataset is organized in a tree-like manner according to TAU ’ s profile hierarchy: node  context  thread  profile data Each object in the profile tree has a set of attribute access methods and a set of offspring access methods. These methods are the interface for the performance analyzer. The performance data integrator reads the performance profile files, generated for each profile sample for each thread, and merges the files into a single, synchronized profile sample dataset. Each profile sample file is assigned a sequence number and the whole dataset is sequenced and timestamped. A socket-based protocol is maintained with the performance data reader to inform it of the availability of new profile samples and to coordinate dataset transfer. Performance Data Integrator Performance Data Reader Performance Analyzer Performance analysis of the profile sample data can take various forms. Using the access methods on the profile tree object, all performance profile data, including cross- sample data, is available for analysis. A library of performance analysis modules can be developed, some simple and others more sophisticated. We have implemented two generic profile analysis modules: Gen2DField and Gen3DField. The modules provide a user control panel that allows them to be customized with respect to the events, data values, number of samples, and filter options. Ultimately, the output of the analysis modules must be in a form that can be visualized. The Gen2DField and Gen3DField modules are so named because they produce 2D and 3D Field data, respectively. SCIRun has different geometric meshes available for Fields. We use an ImageMesh for 2D fields and a PointCloudMesh for 3D fields. The role of the performance visualizer component is to read the Field objects generated from performance analysis and show graphical representations of performance results. We have built three visualization modules to demonstrate the display of 2D and 3D data fields. The Terrain visualizer shows ImageMesh data as a surface. The user can select the resolution of the X and Y dimensions in the Terrain control panel. A TerrainDenotator module was developed to mark interesting points in the visualization. A different display of 2D field data is produced by the KiviatTube visualizer. Here a “ tube ” surface is created where the distance of points from the tube center axis is determine by metric values and the tube length correlates with the sample. The visualization of PointCloudMesh data is accomplished by the PointCloud visualizer module. The SCIRun program graph below shows how the data reader, analyzer, and visualizer modules are connected to process parallel profile samples from a small Uintah testcase. A visualization from a 500 processor run is output from the second SCIRun graph. Performance Visualizer PointCloudMesh ImageMesh