Introduction to the TAU Performance System®

Slides:



Advertisements
Similar presentations
Machine Learning-based Autotuning with TAU and Active Harmony Nicholas Chaimov University of Oregon Paradyn Week 2013 April 29, 2013.
Advertisements

Dynamic performance measurement control Dynamic event grouping Multiple configurable counters Selective instrumentation Application-Level Performance Access.
Workload Characterization using the TAU Performance System Sameer Shende, Allen D. Malony, Alan Morris University of Oregon {sameer,
Sameer Shende, Allen D. Malony, and Alan Morris {sameer, malony, Steven Parker, and J. Davison de St. Germain {sparker,
S3D: Performance Impact of Hybrid XT3/XT4 Sameer Shende
Robert Bell, Allen D. Malony, Sameer Shende Department of Computer and Information Science Computational Science.
Scalability Study of S3D using TAU Sameer Shende
Sameer Shende Department of Computer and Information Science Neuro Informatics Center University of Oregon Tool Interoperability.
Profiling S3D on Cray XT3 using TAU Sameer Shende
TAU Parallel Performance System DOD UGC 2004 Tutorial Allen D. Malony, Sameer Shende, Robert Bell Univesity of Oregon.
The TAU Performance Technology for Complex Parallel Systems (Performance Analysis Bring Your Own Code Workshop, NRL Washington D.C.) Sameer Shende, Allen.
TAU Performance System
On the Integration and Use of OpenMP Performance Tools in the SPEC OMP2001 Benchmarks Bernd Mohr 1, Allen D. Malony 2, Rudi Eigenmann 3 1 Forschungszentrum.
Allen D. Malony, Sameer Shende Department of Computer and Information Science Computational Science Institute University.
TAU Performance System Alan Morris, Sameer Shende, Allen D. Malony University of Oregon {amorris, sameer,
Performance Tools BOF, SC’07 5:30pm – 7pm, Tuesday, A9 Sameer S. Shende Performance Research Laboratory University.
Performance Instrumentation and Measurement for Terascale Systems Jack Dongarra, Shirley Moore, Philip Mucci University of Tennessee Sameer Shende, and.
June 2, 2003ICCS Performance Instrumentation and Measurement for Terascale Systems Jack Dongarra, Shirley Moore, Philip Mucci University of Tennessee.
Workshop on Performance Tools for Petascale Computing 9:30 – 10:30am, Tuesday, July 17, 2007, Snowbird, UT Sameer S. Shende
Performance Evaluation of S3D using TAU Sameer Shende
TAU: Performance Regression Testing Harness for FLASH Sameer Shende
Scalability Study of S3D using TAU Sameer Shende
Optimization of Instrumentation in Parallel Performance Evaluation Tools Sameer Shende, Allen D. Malony, Alan Morris University of Oregon {sameer,
S3D: Comparing Performance of XT3+XT4 with XT4 Sameer Shende
Kai Li, Allen D. Malony, Robert Bell, Sameer Shende Department of Computer and Information Science Computational.
The TAU Performance System Sameer Shende, Allen D. Malony, Robert Bell University of Oregon.
Performance Tools for Empirical Autotuning Allen D. Malony, Nick Chaimov, Kevin Huck, Scott Biersdorff, Sameer Shende
TAU Performance System® OpenSHMEM Tools Tutorial Sameer Shende ParaTools, Inc and University of Oregon. 1pm, Tuesday, March 4 th,
Allen D. Malony Performance Research Laboratory (PRL) Neuroinformatics Center (NIC) Department.
© 2008 Pittsburgh Supercomputing Center Performance Engineering of Parallel Applications Philip Blood, Raghu Reddy Pittsburgh Supercomputing Center.
1 Score-P – A Joint Performance Measurement Run-Time Infrastructure for Periscope, Scalasca, TAU, and Vampir Markus Geimer 2), Bert Wesarg 1), Brian Wylie.
1 Performance Analysis with Vampir DKRZ Tutorial – 7 August, Hamburg Matthias Weber, Frank Winkler, Andreas Knüpfer ZIH, Technische Universität.
Score-P – A Joint Performance Measurement Run-Time Infrastructure for Periscope, Scalasca, TAU, and Vampir Alexandru Calotoiu German Research School for.
Integrated Performance Views in Charm++: Projections meets TAU Scott Biersdorff Allen D. Malony Department Computer and Information Science University.
Using TAU on SiCortex Alan Morris, Aroon Nataraj Sameer Shende, Allen D. Malony University of Oregon {amorris, anataraj, sameer,
Profile Analysis with ParaProf Sameer Shende Performance Reseaerch Lab, University of Oregon
1 Performance Analysis with Vampir ZIH, Technische Universität Dresden.
Dynamic performance measurement control Dynamic event grouping Multiple configurable counters Selective instrumentation Application-Level Performance Access.
ASC Tri-Lab Code Development Tools Workshop Thursday, July 29, 2010 Lawrence Livermore National Laboratory, P. O. Box 808, Livermore, CA This work.
Allen D. Malony, Sameer S. Shende, Alan Morris, Robert Bell, Kevin Huck, Nick Trebon, Suravee Suthikulpanit, Kai Li, Li Li
Tool Visualizations, Metrics, and Profiled Entities Overview [Brief Version] Adam Leko HCS Research Laboratory University of Florida.
Simplifying the Usage of Performance Evaluation Tools: Experiences with TAU and DyninstAPI Paradyn/Condor Week 2010, Rm 221, Fluno Center, U. of Wisconsin,
Integrated Performance Views in Charm++: Projections meets TAU Scott Biersdorff Allen D. Malony Department Computer and Information Science University.
Parallel Performance Measurement of Heterogeneous Parallel Systems with GPUs Allen D. Malony, Scott Biersdorff, Sameer Shende, Heike Jagode†, Stanimire.
TAU Performance System ® TAU is a profiling and tracing toolkit that supports programs written in C, C++, Fortran, Java, Python,
TAU Performance System Sameer Shende Performance Reseaerch Lab, University of Oregon
Presented by Jack Dongarra University of Tennessee and Oak Ridge National Laboratory KOJAK and SCALASCA.
Parallel OpenFOAM CFD Performance Studies Student: Adi Farshteindiker Advisors: Dr. Guy Tel-Zur,Prof. Shlomi Dolev The Department of Computer Science Faculty.
Navigating TAU Visual Display ParaProf and TAU Portal Mahin Mahmoodi Pittsburgh Supercomputing Center 2010.
Performance Tool Integration in Programming Environments for GPU Acceleration: Experiences with TAU and HMPP Allen D. Malony1,2, Shangkar Mayanglambam1.
Kai Li, Allen D. Malony, Sameer Shende, Robert Bell
An Approach for Parallelizing Legacy CFD Applications
TAU Open Source Performance Tool for HPC
Python Performance Evaluation with the TAU Performance System
TAU Open Source Performance Tool for HPC
Performance Technology for Scalable Parallel Systems
Thanks for attending the ParaTools TAU Webex!
Tracing and Performance Analysis Tools for Heterogeneous Multicore System by Soon Thean Siew.
Tutorial Outline Welcome (Malony)
TAU integration with Score-P
TAU: Performance Technology for Productive, High Performance Computing
Improving Application Performance Using TAU
Allen D. Malony, Sameer Shende
TAU Parallel Performance System
Advanced TAU Commander
A configurable binary instrumenter
TAU The 11th DOE ACTS Workshop
TAU Parallel Performance System
Allen D. Malony Computer & Information Science Department
Parallel Program Analysis Framework for the DOE ACTS Toolkit
Presentation transcript:

Introduction to the TAU Performance System® Winter Workshop 2012 at Argonne National Laboratory, ALCF, Bldg. 240,# 1416, Jan 23, 2012, 2pm -3:00pm, Argonne, IL Sameer Shende sameer@paratools.com http://www.paratools.com/anl12

Acknowledgements: U. Oregon, ParaTools, Inc. Dr. Allen D. Malony, Professor, CIS Dept, and Director, NeuroInformatics Center, and CEO, ParaTools, Inc. Dr. Kevin Huck, Computer Scientist, ParaTools, Inc. Dr. John Linford, Computer Scientist, ParaTools, Inc. Dr. Chee Wai Lee, Senior software engineer, UO Wyatt Spear, Software engineer, UO, ParaTools, Inc. Suzanne Millstein, Software engineer, UO Scott Biersdorff, Software engineer, UO Nick Chaimov, Ph.D. student, UO William Voorhees, Ph.D. student, UO Dr. Robert Yelle, Research faculty, UO

What is TAU? TAU is a performance evaluation tool It supports parallel profiling and tracing toolkit Profiling shows you how much (total) time was spent in each routine Tracing shows you when the events take place in each process along a timeline Profiling and tracing can measure time as well as hardware performance counters from your CPU TAU can automatically instrument your source code (routines, loops, I/O, memory, phases, etc.) It supports C++, C, Chapel, UPC, Fortran, Python and Java TAU runs on all HPC platforms and it is free (BSD style license) TAU has instrumentation, measurement and analysis tools To use TAU, you need to set a couple of environment variables and substitute the name of the compiler with a TAU shell script

TAU Performance System® TAU Architecture Integrated toolkit for performance problem solving Instrumentation, measurement, analysis, visualization Portable performance profiling and tracing facility Performance data management and data mining Based on direct performance measurement approach Open source Available on all HPC platforms http://tau.uoregon.edu

Performance Evaluation Profiling Presents summary statistics of performance metrics number of times a routine was invoked exclusive, inclusive time/hpm counts spent executing it number of instrumented child routines invoked, etc. structure of invocations (calltrees/callgraphs) memory, message communication sizes also tracked Tracing Presents when and where events took place along a global timeline timestamped log of events message communication events (sends/receives) are tracked shows when and where messages were sent large volume of performance data generated leads to more perturbation in the program

TAU Performance Profiling Performance with respect to nested event regions Program execution event stack (begin/end events) Profiling measures inclusive and exclusive data Exclusive measurements for region only performance Inclusive measurements includes nested “child” regions Support multiple profiling types Flat, callpath, and phase profiling

TAU Performance System Architecture

TAU Performance System Architecture

Program Database Toolkit (PDT) Application / Library C / C++ parser Fortran parser F77/90/95 Program documentation PDBhtml Application component glue IL IL SILOON C / C++ IL analyzer Fortran IL analyzer C++ / F90/95 interoperability CHASM Program Database Files Automatic source instrumentation DUCTAPE tau_instrumentor

Automatic Source-Level Instrumentation in TAU

Using TAU: A brief Introduction TAU supports several measurement options (profiling, tracing, profiling with hardware counters, etc.) Each measurement configuration of TAU corresponds to a unique stub makefile that is generated when you configure it To instrument source code using PDT Choose an appropriate TAU stub makefile in <arch>/lib: % soft add +tau-latest (on BG/P) % export TAU_MAKEFILE=/soft/apps/tau/tau_latest/bgp/lib/Makefile.tau-bgptimers-mpi-pdt % export TAU_OPTIONS=‘-optVerbose …’ (see tau_compiler.sh -help) And use tau_f90.sh, tau_cxx.sh or tau_cc.sh as Fortran, C++ or C compilers: % mpixlf90_r foo.f90 changes to % tau_f90.sh foo.f90 % qsub –A <…> ./a.out (to submit the job) Execute application and analyze performance data: % pprof (for text based profile display) % paraprof (for GUI)

TAU Measurement Configuration % cd /soft/apps/tau/tau_latest/bgp/lib; ls Makefile.* Makefile.tau-pdt Makefile.tau-mpi-pdt Makefile.tau-bgptimers-mpi-pdt Makefile.tau-opari-openmp-mpi-pdt Makefile.tau-mpi-scalasca-epilog-pdt Makefile.tau-mpi-papi-pdt Makefile.tau-papi-mpi-openmp-opari-pdt Makefile.tau-pthread-pdt… For an MPI+F90 application, you may want to start with: Supports MPI instrumentation & PDT for automatic source instrumentation % export TAU_MAKEFILE= /soft/apps/tau/tau_latest/bgp/lib/Makefile.tau-mpi-bgptimers-pdt % make CC=tau_cc.sh CXX=tau_cxx.sh F90=tau_f90.sh % qsub … ./a.out

Parallel Profile Visualization: ParaProf % soft add +tau-latest % paraprof (Windows -> 3D Visualization)

ParaProf: 3D Communication Matrix % qsub –env TAU_COMM_MATRIX=1 … % paraprof (Windows -> 3D Communication Matrix)

Interval, Atomic and Context Events in TAU Interval Event Context Event % pprof Atomic Event

Jumpshot [ANL]: Trace Visualization % qsub –env TAU_TRACE=1 … % tau_treemerge.pl % tau2slog2 tau.trc tau.edf –o app.slog2 % jumpshot app.slog2

Acknowledgements Department of Energy HPCMP DoD PET Program Office of Science Argonne National Laboratory ORNL NNSA/ASC Trilabs (SNL, LLNL, LANL) HPCMP DoD PET Program National Science Foundation University of Tennessee Shirley Moore Daniel Terpstra University of Oregon Allen D. Malony, Chee Wai Lee W. Spear, S. Biersdorff TU Dresden Holger Brunst, Andreas Knupfer Wolfgang Nagel Research Centre Juelich, Germany Bernd Mohr Felix Wolf