1 1 What does Performance Across the Software Stack mean?  High level view: Providing performance for physics simulations meaningful to applications 

Slides:



Advertisements
Similar presentations
School of something FACULTY OF OTHER School of Computing An Adaptive Numerical Method for Multi- Scale Problems Arising in Phase-field Modelling Peter.
Advertisements

4 th order Embedded Boundary FDTD algorithm for Maxwell Equations Lingling Wu, Stony Brook University Roman Samulyak, BNL Tianshi Lu, BNL Application collaborators:
Copyright © 2008 SAS Institute Inc. All rights reserved. SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks.
Performance Visualizations using XML Representations Presented by Kristof Beyls Yijun Yu Erik H. D’Hollander.
Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation,
ARCS Data Analysis Software An overview of the ARCS software management plan Michael Aivazis California Institute of Technology ARCS Baseline Review March.
Thermo-fluid Analysis of Helium cooling solutions for the HCCB TBM Presented By: Manmeet Narula Alice Ying, Manmeet Narula, Ryan Hunt and M. Abdou ITER.
Software Version Control SubVersion software version control system WebSVN graphical interface o View version history logs o Browse directory structure.
Steady Aeroelastic Computations to Predict the Flying Shape of Sails Sriram Antony Jameson Dept. of Aeronautics and Astronautics Stanford University First.
CE A Mathematical Approach for Discretization In the previous set of notes we saw how we could numerically solve With boundary conditions Using.
Multi-Scale Finite-Volume (MSFV) method for elliptic problems Subsurface flow simulation Mark van Kraaij, CASA Seminar Wednesday 13 April 2005.
FE Modeling Strategy Decide on details from design Find smallest dimension of interest Pick element types – 1D Beams – 2D Plate or.
© 2011 Autodesk Freely licensed for use by educational institutions. Reuse and changes require a note indicating that content has been modified from the.
Page - 1 Rocketdyne Propulsion & Power Role of EASY5 in Integrated Product Development Frank Gombos Boeing Canoga Park, CA.
1 Parallel Simulations of Underground Flow in Porous and Fractured Media H. Mustapha 1,2, A. Beaudoin 1, J. Erhel 1 and J.R. De Dreuzy IRISA – INRIA.
Infrastructure for Parallel Adaptive Unstructured Mesh Simulations
1 ATPESC 2014 Vijay Mahadevan Tutorial Session for Scalable Interfaces for Geometry and Mesh based Applications (SIGMA) FASTMath SciDAC Institute.
N Tropy: A Framework for Analyzing Massive Astrophysical Datasets Harnessing the Power of Parallel Grid Resources for Astrophysical Data Analysis Jeffrey.
SciDAC Accelerator Simulation project: FNAL Booster modeling, status and plans Robert D. Ryne, P. Spentzouris.
S.S. Yang and J.K. Lee FEMLAB and its applications POSTEC H Plasma Application Modeling Lab. Oct. 25, 2005.
1 Presenters: Cameron W. Smith and Glen Hansen Workflow demonstration using Simmetrix/PUMI/PAALS for parallel adaptive simulations FASTMath SciDAC Institute.
Page 1 Trilinos Software Engineering Technologies and Integration Numerical Algorithm Interoperability and Vertical Integration –Abstract Numerical Algorithms.
 A.C. Bauer, M.S. Shephard, E. Seol and J. Wan,   Scientific Computation Research Center  Rensselaer Polytechnic Institute,
Strategic Goals: To align the many efforts at Sandia involved in developing software for the modeling and simulation of physical systems (mostly PDEs):
ALEGRA is a large, highly capable, option rich, production application solving coupled multi-physics PDEs modeling magnetohydrodynamics, electromechanics,
ParCFD Parallel computation of pollutant dispersion in industrial sites Julien Montagnier Marc Buffat David Guibert.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Discussion – Portable performance Allowed to have two sets of kernels for libraries – make sure the application is portable without too much pain – o.k.
Components for Beam Dynamics Douglas R. Dechow, Tech-X Lois Curfman McInnes, ANL Boyana Norris, ANL With thanks to the Common Component Architecture (CCA)
Center for Component Technology for Terascale Simulation Software CCA is about: Enhancing Programmer Productivity without sacrificing performance. Supporting.
- Modal Analysis for Bi-directional Optical Propagation
ACES WorkshopJun-031 ACcESS Software System & High Level Modelling Languages by
Computational Aspects of Multi-scale Modeling Ahmed Sameh, Ananth Grama Computing Research Institute Purdue University.
CFX-10 Introduction Lecture 1.
Manno, , © by Supercomputing Systems 1 1 COSMO - Dynamical Core Rewrite Approach, Rewrite and Status Tobias Gysi POMPA Workshop, Manno,
Adaptive Meshing Control to Improve Petascale Compass Simulations Xiao-Juan Luo and Mark S Shephard Scientific Computation Research Center (SCOREC) Interoperable.
MESQUITE: Mesh Optimization Toolkit Brian Miller, LLNL
CCA Common Component Architecture CCA Forum Tutorial Working Group CCA Status and Plans.
Introduction What is detector simulation? A detector simulation program must provide the possibility of describing accurately an experimental setup (both.
Cracow Grid Workshop, November 5-6, 2001 Concepts for implementing adaptive finite element codes for grid computing Krzysztof Banaś, Joanna Płażek Cracow.
Computational Science & Engineering meeting national needs Steven F. Ashby SIAG-CSE Chair March 24, 2003.
Java EE Patterns Dan Bugariu.  What is Java EE ?  What is a Pattern ?
Connections to Other Packages The Cactus Team Albert Einstein Institute
Partitioning using Mesh Adjacencies  Graph-based dynamic balancing Parallel construction and balancing of standard partition graph with small cuts takes.
Presented by Adaptive Hybrid Mesh Refinement for Multiphysics Applications Ahmed Khamayseh and Valmor de Almeida Computer Science and Mathematics Division.
COMPASS All-Hands Meeting, FNAL, Sept , 2007 Accelerator Prototyping Through Multi-physics Analysis Volkan Akcelik, Lie-Quan Lee, Ernesto Prudencio,
Algebraic Solvers in FASTMath Argonne Training Program on Extreme-Scale Computing August 2015.
ParMA: Towards Massively Parallel Partitioning of Unstructured Meshes Cameron Smith, Min Zhou, and Mark S. Shephard Rensselaer Polytechnic Institute, USA.
C OMPUTATIONAL R ESEARCH D IVISION 1 Defining Software Requirements for Scientific Computing Phillip Colella Applied Numerical Algorithms Group Lawrence.
Predictive Load Balancing Using Mesh Adjacencies for Mesh Adaptation  Cameron Smith, Onkar Sahni, Mark S. Shephard  Scientific Computation Research Center.
Center for Component Technology for Terascale Simulation Software (CCTTSS) 110 April 2002CCA Forum, Townsend, TN CCA Status, Code Walkthroughs, and Demonstrations.
On the Path to Trinity - Experiences Bringing Codes to the Next Generation ASC Platform Courtenay T. Vaughan and Simon D. Hammond Sandia National Laboratories.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
Unstructured Meshing Tools for Fusion Plasma Simulations
G. Cheng, R. Rimmer, H. Wang (Jefferson Lab, Newport News, VA, USA)
Scalable Interfaces for Geometry and Mesh based Applications (SIGMA)
I. E. Venetis1, N. Nikoloutsakos1, E. Gallopoulos1, John Ekaterinaris2
Data Structures for Efficient and Integrated Simulation of Multi-Physics Processes in Complex Geometries A.Smirnov MulPhys LLC github/mulphys
Paul P.H. Wilson UW-Madison Fusion Technology Institute
Parallel Unstructured Mesh Infrastructure
Construction of Parallel Adaptive Simulation Loops
Unstructured Grids at Sandia National Labs
HPC Modeling of the Power Grid
Trilinos Software Engineering Technologies and Integration
Huanyuan(Wayne) Sheng
GENERAL VIEW OF KRATOS MULTIPHYSICS
Beam Analysis with ANSYS
Ph.D. Thesis Numerical Solution of PDEs and Their Object-oriented Parallel Implementations Xing Cai October 26, 1998.
Dynamic Load Balancing of Unstructured Meshes
Presentation transcript:

1 1 What does Performance Across the Software Stack mean?  High level view: Providing performance for physics simulations meaningful to applications  To FASTMath it means Providing performance for people solving large systems of equations Providing performance for mesh-based simulations Providing performance for multiphysics simulations which can include a combination of mesh and particle based methods  Will focus on performance for mesh-based simulations Even at this level there is a substantial difference between structured and unstructured mesh methods Performance Across the Software Stack

2 2 (A simplified view by an unstructured mesh person – need proper technical description from structured mesh people) Core components of adaptive structured mesh system  Defining the physics on the grids  Management of the grids and their interaction  Dynamic load balancing of the grids  Assembly of algebraic equations  Solving of the systems of algebraic equations over the grids Implementation approach used  Provide an integrated infrastructure for the grids and the solutions of the systems of algebraic equations over the grids  Provide hooks for adding physics through defining the input to stencil operations Structured Mesh Simulations

3 3 Software implementations of structured mesh methods  Provide a parallel framework to support The interactions of the grids Adaptive control of the grids Assemble of the systems of algebraic systems Solution of the algebraic systems Defining stencil operations to account for physics  Key software stack integration points Integration with solver libraries (in reality influenced by both the grid infrastructure and physics being solved) Integration of new physics (note – expect it is not as simple as it “provide a new stencil sounds”) Structured Mesh Simulations

4 4 Historic focus has been on the  Physics equation discretization alternatives  Construction and solution of the algebraic systems  Result is development of unstructured mesh analysis codes Fact that unstructured mesh generation/adaptation can be automated introduced the need for a mesh infrastructure Two approaches to supporting unstructured mesh workflows  Components that can integrate with existing unstructured mesh simulation codes  Provide an unstructured mesh simulation infrastructure for solving new problems  SciDAC applications require both options Unstructured Mesh Simulations

5 5  Mesh generation and adaptation can be automated using design data input Any combinations of CAD and triangulations Voxel (image) to model to mesh capabilities Extensive control of mesh types, orders and layouts – boundary layer, anisotropic, gradation, etc. Curved element meshes Parallel mesh and distributed geometry Mesh generation in parallel Mesh adaptation in parallel Mesh Generation and Adaptation

6 6 Need uniform interface to solve multi-component problems on unstructured meshes  Create a native MOAB implementation that exposes the underlying array-based mesh data structures through the DM (Discretization Manager) object in PETSc (DMMoab)  Discretize the physics PDE described on MOAB mesh while leveraging the scalability of PETSc solvers.  Provide routines to build simple meshes in-memory or load an unstructured grid from file.  Analyze efficient unstructured mesh traversal, FD/FEM-type operator assembly for relevant problems in multi- dimensions. MOAB Discretization Manager

7 7 Albany – Agile Component Architecture Main PDE Assembly Nonlinear Solvers Field Manager Discretization Albany Glue Code Nonlinear Model Nonlinear Transient Optimization UQ Analysis Tools Iterative Linear Solvers Multi-Level Mesh Tools Mesh Adapt PUMI Problem Discretization ManyCore Node Multi-Core Accelerators Application Linear Solve Input Parser Node Kernels Libraries Interfaces PDE Terms Load Balancing

8 8 Components  Unstructured mesh infrastructure Parallel Mesh infrastructures Mesh Generation/Adaptation (includes linkage to geometry) Fields Solution transfer  Dynamic load balancing  Unstructured mesh solver Physics equation discretization System formation System solution Component Based Approach i M 0 j M 1 1 P 0 P 2 P inter-process part boundary intra-process part boundary Proc j Proc i

9 9 Components in Parallel Adaptive Analysis

10 File transfer a serious bottleneck in parallel simulation workflows  All core parallel data and services accessed through APIs  In-memory integration approach uses APIs Migration from file-based components to in-memory  Modify/extend components by wrapping data structures with APIs for:  read/write  memory management  inter-language coupling; typically FORTRAN and C  In-memory has far superior parallel performance In-Memory Coupling of Simulation Components

11 Electromagnetics Analysis

12  Adaptation based on Tracking particles Discretization errors  Full accelerator models Approaching 100 cavities Substantial internal structure Meshes with several hundred million high- order curved elements High-Order EM Coupled with PIC

13 Structural Analysis for Integrated Circuits

14 Must construct 3-D non-manifold solid from input geometry  Input domain defined in terms of 2-D layouts (gdsII/OASIS)  Third dimension based on process knowledge  A component has been developed to construct the model Adaptive loop constructed for thermally loaded case including thin liner Structural Analysis for Integrated Circuits Model of liner film only