CSCAPES Mission Research and development Provide load balancing and parallelization toolkits for petascale computation Develop advanced automatic differentiation.

Slides:



Advertisements
Similar presentations
Copyright © 2008 SAS Institute Inc. All rights reserved. SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks.
Advertisements

A NOVEL APPROACH TO SOLVING LARGE-SCALE LINEAR SYSTEMS Ken Habgood, Itamar Arel Department of Electrical Engineering & Computer Science GABRIEL CRAMER.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy under contract.
A Backtracking Correction Heuristic for Graph Coloring Algorithms Sanjukta Bhowmick and Paul Hovland Argonne National Laboratory Funded by DOE.
Lecture # 2 : Process Models
Teaching Courses in Scientific Computing 30 September 2010 Roger Bielefeld Director, Advanced Research Computing.
Parallelizing stencil computations Based on slides from David Culler, Jim Demmel, Bob Lucas, Horst Simon, Kathy Yelick, et al., UCB CS267.
More MR Fingerprinting
COCOMO II - SLIM:A mathematical model. COCOMO II Revised and extended version of the model Allows estimation of object oriented software Provides quantitative.
System Development. Numerical Techniques for Matrix Inversion.
CS 584. Review n Systems of equations and finite element methods are related.
1cs542g-term Sparse matrix data structure  Typically either Compressed Sparse Row (CSR) or Compressed Sparse Column (CSC) Informally “ia-ja” format.
1 Pendahuluan Pertemuan 9 Matakuliah: H0062/Teori Sistem Tahun: 2006.
Scientific Computing on Heterogeneous Clusters using DRUM (Dynamic Resource Utilization Model) Jamal Faik 1, J. D. Teresco 2, J. E. Flaherty 1, K. Devine.
Department of Biomedical Informatics Dynamic Load Balancing (Repartitioning) & Matrix Partitioning Ümit V. Çatalyürek Associate Professor Department of.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear.
Domain decomposition in parallel computing Ashok Srinivasan Florida State University COT 5410 – Spring 2004.
Framework for K-12 Science Education
The sequence of graph transformation (P1)-(P2)-(P4) generating an initial mesh with two finite elements GENERATION OF THE TOPOLOGY OF INITIAL MESH Graph.
© Fujitsu Laboratories of Europe 2009 HPC and Chaste: Towards Real-Time Simulation 24 March
Combinatorial Scientific Computing is concerned with the development, analysis and utilization of discrete algorithms in scientific and engineering applications.
Simulating Quarks and Gluons with Quantum Chromodynamics February 10, CS635 Parallel Computer Architecture. Mahantesh Halappanavar.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear.
Michelle Mills Strout OpenAnalysis: Representation- Independent Program Analysis CCA Meeting January 17, 2008.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear.
CCA Common Component Architecture Manoj Krishnan Pacific Northwest National Laboratory MCMD Programming and Implementation Issues.
Abstract : The quest for more efficient and accurate computational electromagnetics (CEM) techniques has been vital in the design of modern engineering.
Combinatorial Aspects of Automatic Differentiation Paul D. Hovland Mathematics & Computer Science Division Argonne National Laboratory.
Research, Development, Consulting, Training High Fidelity Modeling and Simulation Where we are going… …future plans.
STE 6239 Simulering Friday, Week 1: 5. Scientific computing: basic solvers.
ITR: Collaborative research: software for interpretation of cosmogenic isotope inventories - a combination of geology, modeling, software engineering and.
Parallel Computing Sciences Department MOV’01 Multilevel Combinatorial Methods in Scientific Computing Bruce Hendrickson Sandia National Laboratories Parallel.
Automatic Differentiation: Introduction Automatic differentiation (AD) is a technology for transforming a subprogram that computes some function into a.
Building an Electron Cloud Simulation using Bocca, Synergia2, TxPhysics and Tau Performance Tools Phase I Doe SBIR Stefan Muszala, PI DOE Grant No DE-FG02-08ER85152.
Issues in (Financial) High Performance Computing John Darlington Director Imperial College Internet Centre Fast Financial Algorithms and Computing 4th.
Chapter 10 Analysis and Design Discipline. 2 Purpose The purpose is to translate the requirements into a specification that describes how to implement.
Combinatorial Scientific Computing and Petascale Simulation (CSCAPES) A SciDAC Institute Funded by DOE’s Office of Science Investigators Alex Pothen, Florin.
NIH Resource for Biomolecular Modeling and Bioinformatics Beckman Institute, UIUC NAMD Development Goals L.V. (Sanjay) Kale Professor.
1 1  Capabilities: Dynamic load balancing and static data partitioning -Geometric, graph-based, hypergraph-based -Interfaces to ParMETIS, PT-Scotch, PaToH.
On the Use of Sparse Direct Solver in a Projection Method for Generalized Eigenvalue Problems Using Numerical Integration Takamitsu Watanabe and Yusaku.
Computational Aspects of Multi-scale Modeling Ahmed Sameh, Ananth Grama Computing Research Institute Purdue University.
Parallel and Distributed Computing Research at the Computing Research Institute Ananth Grama Computing Research Institute and Department of Computer Sciences.
Finite Element Analysis
Case Study in Computational Science & Engineering - Lecture 5 1 Iterative Solution of Linear Systems Jacobi Method while not converged do { }
Cracow Grid Workshop, November 5-6, 2001 Concepts for implementing adaptive finite element codes for grid computing Krzysztof Banaś, Joanna Płażek Cracow.
PaGrid: A Mesh Partitioner for Computational Grids Virendra C. Bhavsar Professor and Dean Faculty of Computer Science UNB, Fredericton This.
Computational Science & Engineering meeting national needs Steven F. Ashby SIAG-CSE Chair March 24, 2003.
1 1 Office of Science Jean-Luc Vay Accelerator Technology & Applied Physics Division Lawrence Berkeley National Laboratory HEP Software Foundation Workshop,
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
Linear Algebra Libraries: BLAS, LAPACK, ScaLAPACK, PLASMA, MAGMA
COMPUTER SCIENCE Computer science (CS) is The systematic study of algorithmic.
Domain decomposition in parallel computing Ashok Srinivasan Florida State University.
Data Structures and Algorithms in Parallel Computing Lecture 7.
ComPASS Summary, Budgets & Discussion Panagiotis Spentzouris, Fermilab ComPASS PI.
CONFIDENTIAL © 2007 Maplesoft, a division of Waterloo Maple Inc. Confidential MapleSim Pilot Test Program.
Algebraic Solvers in FASTMath Argonne Training Program on Extreme-Scale Computing August 2015.
University of Texas at Arlington Scheduling and Load Balancing on the NASA Information Power Grid Sajal K. Das, Shailendra Kumar, Manish Arora Department.
1 1 Zoltan: Toolkit of parallel combinatorial algorithms for unstructured, dynamic and/or adaptive computations Unstructured Communication Tools -Communication.
COMP7330/7336 Advanced Parallel and Distributed Computing Task Partitioning Dynamic Mapping Dr. Xiao Qin Auburn University
Fermi National Accelerator Laboratory & Thomas Jefferson National Accelerator Facility SciDAC LQCD Software The Department of Energy (DOE) Office of Science.
Defining the Competencies for Leadership- Class Computing Education and Training Steven I. Gordon and Judith D. Gardiner August 3, 2010.
Auburn University
Parallel Hypergraph Partitioning for Scientific Computing
A Backtracking Correction Heuristic
Auburn University COMP7330/7336 Advanced Parallel and Distributed Computing Mapping Techniques Dr. Xiao Qin Auburn University.
for more information ... Performance Tuning
Estimating Rate Constants in Cell Cycle Models
Computational issues Issues Solutions Large time scale
Presentation transcript:

CSCAPES Mission Research and development Provide load balancing and parallelization toolkits for petascale computation Develop advanced automatic differentiation capabilities for complex applications Advance the state-of-the art in large-scale graph and sparse matrix computations Training and outreach In collaboration with SciDAC SAPs and CETs, support scientific applications with underlying graph and hypergraph algorithms Organize workshops, tutorials and short courses in CSC Train researchers in CSC skills at pre- doctoral and post-doctoral levels Combinatorial Scientific Computing and Petascale Simulations (CSCAPES) SciDAC Applied Math Institute Alex Pothen, Florin Dobrian and Assefaw Gebremedhin, Old Dominion University Erik Boman, Karen Devine and Bruce Hendrickson, Sandia National Laboratories Paul Hovland, Sanjukta Bhowmick, Boyana Norris and Jean Utke, Argonne National Laboratory Umit Catalyurek, Ohio State University Michelle Strout, Colorado State University Load balancing: In parallel computations, data and tasks have to be mapped to processors to balance workload and reduce communication costs. In dynamic or adaptive applications, these goals have to be met as the computations and communications change over time. Zoltan is a toolkit for parallelization and load balancing. Recently a hypergraph based partitioner that has superior performance relative to earlier methods has been included in Zoltan. CSCAPES plans to extend the capabilities of Zoltan in several ways to support petascale applications. C2C2 C4C4 C3C3 C1C1 C5C5 Graph coloring: Assorted variants of vertex colorings have been found to be key tools for reducing the work required to compute sparse derivative matrices using automatic differentiation. Graph coloring is also useful in discovering concurrency in parallel computation. We have developed novel coloring algorithms to make Jacobian and Hessian computations efficient. Preliminary parallel versions have been developed for some of these variants. We plan to extend the parallel coloring software and integrate it with automatic differentiation tools. Performance enhancement: Modern microprocessors are highly sensitive to the spatial and temporal locality of data. Reordering the vertices and elements in a mesh can have significant impact on performance. We have developed several reordering algorithms that use the hypergraph representation of a matrix. These algorithms can improve the performance of a mesh smoothing application by nearly 50%. We plan to further develop and enhance these algorithms. Combinatorial Scientific Computing (CSC) is concerned with the development, analysis and utilization of discrete algorithms in scientific and engineering applications. Graph and hypergraph algorithms are among the fundamental tools of CSC. They play a crucial enabling role in applications requiring parallelization, solution of differential equations, mesh generation, optimization, etc. The CSCAPES Institute is founded on the recognition of this critical role. The diagram above highlights some of the relationships between the research areas in CSCAPES and a typical SciDAC application; an arrow from A to B indicates that A in some sense uses B. Matching: Various kinds of matchings in graphs are critical components in many computational science contexts. Examples include sparse linear systems (numerical preprocessing and block triangular decomposition) and graph partitioning (coarsening phase of multilevel methods). Traditional matching algorithms compute optimal solutions in superlinear time, but current trends are toward algorithms that find approximate solutions faster. CSCAPES plans to develop petascale parallel matching algorithms based on approximation techniques. Automatic Differentiation (AD) computes analytic derivatives of functions specified by programs. Efficient derivative accumulation can be posed as graph transformation (elimination) problems. Checkpointing for irregular problems is another source of combinatorial problems in AD. CSCAPES will develop efficient algorithms for these problems, extend the capabilities of the AD tools ADIC and ADIFOR using the OpenAD framework, and provide users with integrated load balancing and coloring capabilities. Hypergraph representation used by partitioner Application matrix Parallel performance improvement A symmetrically orthogonal partition of a Hessian and its representation as a star coloring of the adjacency graph Performance of parallel distance-1 coloring algorithm OpenAD System Architecture Sensitivity to salinity in the the MITGCM