Performance Technology for Scalable Parallel Systems

Allen D. Malony
Department of Computer and Information Science
University of Oregon

Performance Technology
  Tools for performance problem solving
  Empirical-based performance optimization process
    Performance Tuning
      (hypotheses)
    Performance Diagnosis
      (properties)
    Performance Experimentation
      (characterization)
    Performance Observation
  Performance technology concerns
    Data mining, models, expert systems
    Experiment management, performance storage
    Instrumentation, measurement, analysis, visualization

TAU Parallel Performance System Project
  Tuning and Analysis Utilities (15+ year project effort)
  Performance system framework for HPC systems
    Integrated, scalable, and flexible
    Target parallel programming paradigms
  Integrated toolkit for performance problem solving
    Instrumentation, measurement, analysis, and visualization
    Portable performance profiling and tracing facility
    Performance data management and data mining
  Partners
    LLNL, ANL, LANL
    Research Centre Jülich, TU Dresden
  http://tau.uoregon.edu

TAU Parallel Performance System Goals
  Portable (open source) parallel performance system
    Computer system architectures and operating systems
    Different programming languages and compilers
  Multi-level, multi-language performance instrumentation
  Flexible and configurable performance measurement
  Support for multiple parallel programming paradigms
    Multi-threading, message passing, mixed-mode, hybrid, object oriented, component-based
  Support for performance mapping
  Integration of leading performance technology
  Scalable (very large) parallel performance analysis

TAU Performance System Components TAU Architecture Program Analysis Performance Data Mining PDT PerfExplorer Parallel Profile Analysis PerfDMF Performance Monitoring ParaProf TAUoverSupermon

TAU Instrumentation and Measurement
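
The figure for this slide did not survive in the transcript. As a rough illustration of what instrumentation-based profile measurement means in general, the sketch below wraps functions with start/stop timers and accumulates call counts plus inclusive and exclusive time. It is a generic Python example and not TAU's API: the `Profiler` class, its method names, and the `demo` function are hypothetical, and the injected fake clock exists only to make the result deterministic.

```python
import time
from collections import defaultdict
from functools import wraps


class Profiler:
    """Hypothetical minimal profiler (not TAU): per-function call counts,
    inclusive time, and exclusive time (inclusive minus time in callees)."""

    def __init__(self, clock=time.perf_counter):
        self.clock = clock
        self.calls = defaultdict(int)
        self.inclusive = defaultdict(float)
        self.exclusive = defaultdict(float)
        # Each stack entry: [timer name, start time, time spent in callees]
        self._stack = []

    def start(self, name):
        self._stack.append([name, self.clock(), 0.0])

    def stop(self):
        name, started, child_time = self._stack.pop()
        elapsed = self.clock() - started
        self.calls[name] += 1
        self.inclusive[name] += elapsed
        self.exclusive[name] += elapsed - child_time
        if self._stack:
            # Charge this interval to the caller's "time in callees".
            self._stack[-1][2] += elapsed

    def instrument(self, func):
        """Source-level instrumentation: wrap func in a start/stop pair."""
        @wraps(func)
        def wrapper(*args, **kwargs):
            self.start(func.__name__)
            try:
                return func(*args, **kwargs)
            finally:
                self.stop()
        return wrapper


def demo():
    # Fake clock that advances by 1.0 on every reading.
    ticks = iter(range(100))
    prof = Profiler(clock=lambda: float(next(ticks)))

    @prof.instrument
    def inner():
        return 1

    @prof.instrument
    def outer():
        return inner() + inner()

    outer()
    return prof


if __name__ == "__main__":
    prof = demo()
    for name in sorted(prof.calls):
        print(name, prof.calls[name], prof.inclusive[name], prof.exclusive[name])
```

With the fake clock, `outer` is charged 5.0 units inclusive and 3.0 exclusive, and `inner` is called twice for 2.0 units in total. Recursive functions would have their inclusive time counted once per active frame in this sketch; a production tool handles that case separately.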

TAU Analysis

TAU on HPC Platforms with Intel Processors
  ARL (JVN / MJM, x86_64 Linux NetworX): 14.7 TF / 52.8 TF, 2048 / 4400 processors
  AFRL (Hawk, Eagle, ia64 SGI Altix): 59 TF, 9216 processors
  NCSA (Abe, x86_64 Dell): 89.47 TF, 9600 cores
  NASA (Columbia, ia64 SGI Altix): 60.96 TF, 10240 processors
  MHPCC (Jaws, x86_64 Dell): 60 TF, 5120 processors
  TACC (Lonestar, x86_64 Dell): 62 TF, 5840 processors

TAU on Leadership Class Facilities and TeraGrid
  Argonne National Laboratory: IBM BG/P, 111 TF, 32768 processors
  Oak Ridge National Laboratory: Cray XT-4, 119 TF peak, 23416 cores (AMD Opteron)
  Texas Advanced Computing Center: Sun Blade 8000, 504 TF peak, 62976 cores (AMD Opteron)
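
As a quick sanity check on these figures, the per-core rate can be derived by dividing the quoted TF number by the core count. The sketch below does that arithmetic in Python. It assumes that the BG/P "processors" figure counts cores and that all three TF numbers are peak rates, which the slide states only for the Cray and Sun systems; the function and variable names are illustrative.

```python
# (system, quoted TF, quoted cores/processors) as listed on the slide
SYSTEMS = [
    ("ANL IBM BG/P", 111.0, 32768),
    ("ORNL Cray XT-4", 119.0, 23416),
    ("TACC Sun Blade 8000", 504.0, 62976),
]


def gflops_per_core(tflops, cores):
    """Convert a system-wide TF figure into GF per core."""
    return tflops * 1000.0 / cores


if __name__ == "__main__":
    for name, tflops, cores in SYSTEMS:
        print(f"{name}: {gflops_per_core(tflops, cores):.2f} GF per core")
```

Under those assumptions the three systems come out at roughly 3.39, 5.08, and 8.00 GF per core, respectively.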

Recent Funding
  A. Malony, S. Shende, N. Nystrom, S. Moore, R. Kufrin, SDCI HPC Improvement: High-Productivity Performance Engineering (Tools, Methods, Training) for NSF HPC Applications, NSF Software Development for Cyberinfrastructure (SDCI), 11/1/2007-10/31/2010.
  A. Malony, S. Shende, Knowledge-based Parallel Performance Technology, DOE Office of Science, 9/1/2007-8/31/2010.
  P. Beckman, A. Malony, Extreme Performance Scalable Operating Systems, DOE Office of Science, 12/1/04-1/31/08.
  A. Malony, S. Shende, Application-Specific Performance Technology for Productive Parallel Computing, DOE Office of Science, 5/1/05-4/30/08.
  S. McKee, A. Malony, G. Tyson, ST-HEC: Collaborative Research: Scalable, Interoperable Tools to Support Autonomic Optimization of High-End Applications, NSF High-End Computing (HEC), 11/1/04-10/31/07.
  A. Malony, Multi-core Parallel Programming and Performance Tools, Intel equipment grant, 9/15/2006.
  A. Malony, M. Sottile, Multi-core Parallel Programming, Intel equipment grant, 11/1/2007.

Intel Contacts
  Justin Rattner, Intel Senior Fellow, Vice-President
    Director, Corporate Technology Group
    In 1988 Rattner (at Intel Scientific Computers) suggested implementing a performance monitor for the iPSC/2 hypercube
    A. Malony, D. Reed, "A Hardware-Based Performance Monitor for the Intel iPSC/2 Hypercube," ICS 1990.
  David Kuck, Intel Fellow
    Software and Solutions Group
    Director, Parallel and Distributed Solutions Division
    Worked for Kuck at the Center for Supercomputing Research and Development, University of Illinois, Urbana-Champaign
  Tim Mattson, Senior Research Scientist
    Computational Software Laboratory
  Werner Krotz-Vogel
    Technical Marketing Engineer, Intel Cluster Tools

Support Acknowledgements
  Department of Energy (DOE)
    Office of Science
      ASCR, Argonne National Lab
    ASC/NNSA
      University of Utah ASC/NNSA Level 1
      ASC/NNSA, Lawrence Livermore National Lab
  Department of Defense (DoD)
    HPC Modernization Office (HPCMO)
  NSF Software Development for Cyberinfrastructure
  Los Alamos National Laboratory
  Research Centre Jülich, TU Dresden
  ParaTools, Inc.