Performance Tools BOF, SC’07 5:30pm – 7pm, Tuesday, A9 Sameer S. Shende Performance Research Laboratory University.

Slides:



Advertisements
Similar presentations
Machine Learning-based Autotuning with TAU and Active Harmony Nicholas Chaimov University of Oregon Paradyn Week 2013 April 29, 2013.
Advertisements

K T A U Kernel Tuning and Analysis Utilities Department of Computer and Information Science Performance Research Laboratory University of Oregon.
Dynamic performance measurement control Dynamic event grouping Multiple configurable counters Selective instrumentation Application-Level Performance Access.
Workload Characterization using the TAU Performance System Sameer Shende, Allen D. Malony, Alan Morris University of Oregon {sameer,
S3D: Performance Impact of Hybrid XT3/XT4 Sameer Shende
Robert Bell, Allen D. Malony, Sameer Shende Department of Computer and Information Science Computational Science.
Scalability Study of S3D using TAU Sameer Shende
Sameer Shende Department of Computer and Information Science Neuro Informatics Center University of Oregon Tool Interoperability.
Profiling S3D on Cray XT3 using TAU Sameer Shende
TAU Parallel Performance System DOD UGC 2004 Tutorial Allen D. Malony, Sameer Shende, Robert Bell Univesity of Oregon.
The TAU Performance Technology for Complex Parallel Systems (Performance Analysis Bring Your Own Code Workshop, NRL Washington D.C.) Sameer Shende, Allen.
Nick Trebon, Alan Morris, Jaideep Ray, Sameer Shende, Allen Malony {ntrebon, amorris, Department of.
TAU Performance System
On the Integration and Use of OpenMP Performance Tools in the SPEC OMP2001 Benchmarks Bernd Mohr 1, Allen D. Malony 2, Rudi Eigenmann 3 1 Forschungszentrum.
Allen D. Malony, Sameer Shende Department of Computer and Information Science Computational Science Institute University.
Workshop on Performance Tools for Petascale Computing 9:30 – 10:30am, Tuesday, July 17, 2007, Snowbird, UT Sameer S. Shende
TAU Performance System Alan Morris, Sameer Shende, Allen D. Malony University of Oregon {amorris, sameer,
Performance Instrumentation and Measurement for Terascale Systems Jack Dongarra, Shirley Moore, Philip Mucci University of Tennessee Sameer Shende, and.
June 2, 2003ICCS Performance Instrumentation and Measurement for Terascale Systems Jack Dongarra, Shirley Moore, Philip Mucci University of Tennessee.
Allen D. Malony Department of Computer and Information Science Performance Research Laboratory NeuroInformatics Center University.
Workshop on Performance Tools for Petascale Computing 9:30 – 10:30am, Tuesday, July 17, 2007, Snowbird, UT Sameer S. Shende
Performance Evaluation of S3D using TAU Sameer Shende
TAU: Performance Regression Testing Harness for FLASH Sameer Shende
Scalability Study of S3D using TAU Sameer Shende
Optimization of Instrumentation in Parallel Performance Evaluation Tools Sameer Shende, Allen D. Malony, Alan Morris University of Oregon {sameer,
S3D: Comparing Performance of XT3+XT4 with XT4 Sameer Shende
The TAU Performance System Sameer Shende, Allen D. Malony, Robert Bell University of Oregon.
Sameer Shende, Allen D. Malony Computer & Information Science Department Computational Science Institute University of Oregon.
Performance Tools for Empirical Autotuning Allen D. Malony, Nick Chaimov, Kevin Huck, Scott Biersdorff, Sameer Shende
Allen D. Malony Performance Research Laboratory (PRL) Neuroinformatics Center (NIC) Department.
1 Score-P – A Joint Performance Measurement Run-Time Infrastructure for Periscope, Scalasca, TAU, and Vampir Markus Geimer 2), Bert Wesarg 1), Brian Wylie.
Chee Wai Lee, Allen D. Malony, Alan Morris Department of Computer and Information Science Performance Research.
Allen D. Malony Performance Research Laboratory (PRL) Neuroinformatics Center (NIC) Department.
Score-P – A Joint Performance Measurement Run-Time Infrastructure for Periscope, Scalasca, TAU, and Vampir Alexandru Calotoiu German Research School for.
Integrated Performance Views in Charm++: Projections meets TAU Scott Biersdorff Allen D. Malony Department Computer and Information Science University.
Using TAU on SiCortex Alan Morris, Aroon Nataraj Sameer Shende, Allen D. Malony University of Oregon {amorris, anataraj, sameer,
Profile Analysis with ParaProf Sameer Shende Performance Reseaerch Lab, University of Oregon
Allen D. Malony 1, Scott Biersdorff 2, Wyatt Spear 2 1 Department of Computer and Information Science 2 Performance Research Laboratory University of Oregon.
Performance Analysis Tool List Hans Sherburne Adam Leko HCS Research Laboratory University of Florida.
Kevin A. Huck Department of Computer and Information Science Performance Research Laboratory University of.
Dynamic performance measurement control Dynamic event grouping Multiple configurable counters Selective instrumentation Application-Level Performance Access.
ASC Tri-Lab Code Development Tools Workshop Thursday, July 29, 2010 Lawrence Livermore National Laboratory, P. O. Box 808, Livermore, CA This work.
Early Experiences with KTAU on the Blue Gene / L A. Nataraj, A. Malony, A. Morris, S. Shende Performance Research Lab University of Oregon.
PerfExplorer Component for Performance Data Analysis Kevin Huck – University of Oregon Boyana Norris – Argonne National Lab Li Li – Argonne National Lab.
Allen D. Malony, Sameer S. Shende, Alan Morris, Robert Bell, Kevin Huck, Nick Trebon, Suravee Suthikulpanit, Kai Li, Li Li
Allen D. Malony Performance Research Laboratory (PRL) Neuroinformatics Center (NIC) Department.
Allen D. Malony, Sameer S. Shende, Robert Bell Kai Li, Li Li, Kevin Huck Department of Computer.
Simplifying the Usage of Performance Evaluation Tools: Experiences with TAU and DyninstAPI Paradyn/Condor Week 2010, Rm 221, Fluno Center, U. of Wisconsin,
Shangkar Mayanglambam, Allen D. Malony, Matthew J. Sottile Computer and Information Science Department Performance.
Allen D. Malony Department of Computer and Information Science Performance Research Laboratory University.
Integrated Performance Views in Charm++: Projections meets TAU Scott Biersdorff Allen D. Malony Department Computer and Information Science University.
Allen D. Malony Department of Computer and Information Science Performance Research Laboratory.
Parallel Performance Measurement of Heterogeneous Parallel Systems with GPUs Allen D. Malony, Scott Biersdorff, Sameer Shende, Heike Jagode†, Stanimire.
TAU Performance System ® TAU is a profiling and tracing toolkit that supports programs written in C, C++, Fortran, Java, Python,
Allen D. Malony Performance Research Laboratory (PRL) Neuroinformatics Center (NIC) Department.
Parallel OpenFOAM CFD Performance Studies Student: Adi Farshteindiker Advisors: Dr. Guy Tel-Zur,Prof. Shlomi Dolev The Department of Computer Science Faculty.
Navigating TAU Visual Display ParaProf and TAU Portal Mahin Mahmoodi Pittsburgh Supercomputing Center 2010.
Performance Tool Integration in Programming Environments for GPU Acceleration: Experiences with TAU and HMPP Allen D. Malony1,2, Shangkar Mayanglambam1.
Kai Li, Allen D. Malony, Sameer Shende, Robert Bell
Introduction to the TAU Performance System®
Performance Technology for Scalable Parallel Systems
Thanks for attending the ParaTools TAU Webex!
TAUmon: Scalable Online Performance Data Analysis in TAU
TAU integration with Score-P
Allen D. Malony, Sameer Shende
TAU Parallel Performance System
TAU Parallel Performance System
TAU: A Framework for Parallel Performance Analysis
Outline Introduction Motivation for performance mapping SEAA model
Parallel Program Analysis Framework for the DOE ACTS Toolkit
Presentation transcript:

Performance Tools BOF, SC’07 5:30pm – 7pm, Tuesday, A9 Sameer S. Shende Performance Research Laboratory University of Oregon TAU Performance System

SC’07: Performance Tools BOF 2 TAU Performance System Acknowledgements  Dr. Allen D. Malony, Professor  Alan Morris, Senior software engineer  Wyatt Spear, Software engineer  Scott Biersdorff, Software engineer  Matt Sottile, Research faculty  Rob Yelle, Research faculty  Kevin Huck, Ph.D. student  Aroon Nataraj, Ph.D. student  Shangkar Myangalam, Ph.D. student  Brad Davidson, Systems administrator

SC’07: Performance Tools BOF 3 TAU Performance System TAU Parallel Performance System   Multi-level performance instrumentation  Multi-language automatic source instrumentation  Flexible and configurable performance measurement  Widely-ported parallel performance profiling system  Computer system architectures and operating systems  Different programming languages and compilers  Support for multiple parallel programming paradigms  Multi-threading, message passing, mixed-mode, hybrid

SC’07: Performance Tools BOF 4 TAU Performance System What is TAU?  Portable, profiling and tracing toolkit  BSD style license  Automatic source level instrumentation (PDT, Opari [FZJ])  Routine, and loop level  Instrumentation optimization (TAU_THROTTLE)  Measurement  Callpath, phase, parameter based profiling  PAPI [UTK] support in profiling and tracing  One or more native, preset events  Analysis tools  Parallel profile analysis (ParaProf)  Performance data management (PerfDMF database)  Performance data mining (PerfExplorer)  Kernel monitoring and KTAU

SC’07: Performance Tools BOF 5 TAU Performance System What is new in TAU and PDT?  TAU v 2.17 and PDT v 3.12 released at SC’07 [tau.uoregon.edu]  Support for new platforms  IBM BG/P (Linux)  SiCortex  Cray XT (Compute Node Linux)  Windows Cluster 2003  Improved support for VampirTrace [TU Dresden] for atomic events, native OTF generation  Automatic wrapper library generation (tau_wrap) for HDF5, I/O…  Enhanced Eclipse/PTP plugin for tool interoperability  PerfExplorer: Custom charts, multiple database support  ParaProf supports multiple profile formats, databases  PERIXML, TAU, perfsuite, cube 2 & 3, mpiP, HPMtoolkit, gprof…  Support for storing metadata in profiles, TAU portal, PerfDMF  PDT now supports type information in PDB files GFortran parser

SC’07: Performance Tools BOF 6 TAU Performance System TAU Demos at SC’07  ASC/NNSA Booth #1617, demo station #2  Wednesday (11/14):12-1pm, 3-5pm  Thursday (11/15):10-11am  Schedule available at tau.uoregon.edu  SiCortex Booth  ANL Booth (KTAU presentation)  Thu. 12-1pm  Paper:  “Ghost in the machine: Observing the Effects of Kernel Operation in Parallel Application Performance” A. Nataraj, A. Morris, A. Malony, M. Sottile, P. Beckman, SC’07 A2/A5 Wed. 10:30am

SC’07: Performance Tools BOF 7 TAU Performance System Future Research Directions  Improving tool interoperability  OTF [TU Dresden]  TotalView [Totalview Tech]  Scalasca/KOJAK instrumentation [FZJ]  Hybrid sampling, instrumentation based measurements  PerfSuite [NCSA]  Kernel measurements for tracking I/O using KTAU and ZeptoOS [ANL]  Binary rewriter integration using DyninstAPI [U. Maryland, U. Wisconsin]  Improvements in SiCortex integrated tool environment  TAU Portal, regression testing

SC’07: Performance Tools BOF 8 TAU Performance System TAU Performance System Architecture

SC’07: Performance Tools BOF 9 TAU Performance System TAU Performance System Architecture

SC’07: Performance Tools BOF 10 TAU Performance System Program Database Toolkit (PDT) Application / Library C / C++ parser Fortran parser F77/90/95 C / C++ IL analyzer Fortran IL analyzer Program Database Files IL DUCTAPE PDBhtml SILOON CHASM TAU_instr Program documentation Application component glue C++ / F90/95 interoperability Automatic source instrumentation

SC’07: Performance Tools BOF 11 TAU Performance System Building Bridges to Other Tools: TAU

SC’07: Performance Tools BOF 12 TAU Performance System Using TAU: A brief Introduction  To instrument source code using PDT  Choose an appropriate TAU stub makefile in /lib: % setenv TAU_MAKEFILE /usr/tau-2.17/x86_64/lib/Makefile.tau-mpi-pdt-pgi % setenv TAU_OPTIONS ‘-optVerbose …’ (see tau_compiler.sh) And use tau_f90.sh, tau_cxx.sh or tau_cc.sh as Fortran, C++ or C compilers: % mpif90 foo.f90 changes to % tau_f90.sh foo.f90  Execute application and analyze performance data: % pprof (for text based profile display) % paraprof (for GUI)

SC’07: Performance Tools BOF 13 TAU Performance System TAU’s ParaProf Profile Browser: Manager Multiple PerfDMF databases Application Metadata

SC’07: Performance Tools BOF 14 TAU Performance System TAU’s ParaProf Scalable Profile Browser S3D: 6400 cores on XT3+XT4 System (Jaguar)  Gap represents XT3 nodes

SC’07: Performance Tools BOF 15 TAU Performance System 6400 cores S3D Scatter Plot: Visualizing Hybrid XT3+XT4  Red nodes are XT4, blue are XT3

SC’07: Performance Tools BOF 16 TAU Performance System PerfDMF Architecture K. Huck, A. Malony, R. Bell, A. Morris, “Design and Implementation of a Parallel Performance Data Management Framework,” ICPP 2005.

SC’07: Performance Tools BOF 17 TAU Performance System TAU Portal: Launch ParaProf/PerfExplorer

SC’07: Performance Tools BOF 18 TAU Performance System PerfExplorer: Regression Testing

SC’07: Performance Tools BOF 19 TAU Performance System PerfExplorer: S3D Total Runtime Breakdown MPI_Wait WRITE_ SAVEFILE 12,000 cores!

SC’07: Performance Tools BOF 20 TAU Performance System TAU in Eclipse/PTP Environment

SC’07: Performance Tools BOF 21 TAU Performance System Support Acknowledgements  US Department of Energy (DOE)  Office of Science  MICS, Argonne National Lab  ASC/NNSA  University of Utah ASC/NNSA Level 1  ASC/NNSA, Lawrence Livermore National Lab  US Department of Defense (DoD)  NSF HEC-RTF and SDCI  Research Centre Juelich  TU Dresden  Los Alamos National Laboratory  ParaTools, Inc.  PSC, NCSA, and U. Oregon