HPC Technology Track: Foundations of Computational Science Lecture 1 Dr. Greg Wettstein, Ph.D. Research Support Group Leader Division of Information Technology.

Slides:



Advertisements
Similar presentations
PRAGMA – 9 V.S.S.Sastry School of Physics University of Hyderabad 22 nd October, 2005.
Advertisements

Prepared 7/28/2011 by T. O’Neil for 3460:677, Fall 2011, The University of Akron.
Presented by Dealing with the Scale Problem Innovative Computing Laboratory MPI Team.
Multiprocessors— Large vs. Small Scale Multiprocessors— Large vs. Small Scale.
Information Systems in Business
Introduction: Foundations of Computational Science Dr. Greg Wettstein, Ph.D. Research Support Group Leader Division of Information Technology Adjunct Professor.
Zhao Lixing.  A supercomputer is a computer that is at the frontline of current processing capacity, particularly speed of calculation.  Supercomputers.
Fundamentals of Data Analysis Lecture 12 Methods of parametric estimation.
1 CE 530 Molecular Simulation Lecture 8 Markov Processes David A. Kofke Department of Chemical Engineering SUNY Buffalo
Reference: Message Passing Fundamentals.
Complexity Analysis (Part I)
May 29, Final Presentation Sajib Barua1 Development of a Parallel Fast Fourier Transform Algorithm for Derivative Pricing Using MPI Sajib Barua.
ECIV 301 Programming & Graphics Numerical Methods for Engineers Lecture 2 Mathematical Modeling and Engineering Problem Solving.
CS 282 Simulation Physics Lecture 1: Introduction to Rigid-Body Simulation 1 September 2011 Instructor: Kostas Bekris Computer Science & Engineering, University.
High Performance Communication using MPJ Express 1 Presented by Jawad Manzoor National University of Sciences and Technology, Pakistan 29 June 2015.
CS 240A: Complexity Measures for Parallel Computation.
LLNL-PRES This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.
Modeling and Simulation
1 CE 530 Molecular Simulation Lecture 7 David A. Kofke Department of Chemical Engineering SUNY Buffalo
1 Statistical Mechanics and Multi- Scale Simulation Methods ChBE Prof. C. Heath Turner Lecture 11 Some materials adapted from Prof. Keith E. Gubbins:
Time Complexity Dr. Jicheng Fu Department of Computer Science University of Central Oklahoma.
© Fujitsu Laboratories of Europe 2009 HPC and Chaste: Towards Real-Time Simulation 24 March
HPC Technology Track: Foundations of Computational Science Lecture 2 Dr. Greg Wettstein, Ph.D. Research Support Group Leader Division of Information Technology.
Parallel Programming Models Jihad El-Sana These slides are based on the book: Introduction to Parallel Computing, Blaise Barney, Lawrence Livermore National.
Parallel Processing CS453 Lecture 2.  The role of parallelism in accelerating computing speeds has been recognized for several decades.  Its role in.
Bulk Synchronous Parallel Processing Model Jamie Perkins.
Computational issues in Carbon nanotube simulation Ashok Srinivasan Department of Computer Science Florida State University.
Energy saving in multicore architectures Assoc. Prof. Adrian FLOREA, PhD Prof. Lucian VINTAN, PhD – Research.
1 ECE-517: Reinforcement Learning in Artificial Intelligence Lecture 6: Optimality Criterion in MDPs Dr. Itamar Arel College of Engineering Department.
RESOURCES, TRADE-OFFS, AND LIMITATIONS Group 5 8/27/2014.
Chapter 6 Multiprocessor System. Introduction  Each processor in a multiprocessor system can be executing a different instruction at any time.  The.
Boltzmann Machine (BM) (§6.4) Hopfield model + hidden nodes + simulated annealing BM Architecture –a set of visible nodes: nodes can be accessed from outside.
Sept COMP60611 Fundamentals of Concurrency Lab Exercise 2 Notes Notes on the finite difference performance model example – for the lab… Graham Riley,
NIH Resource for Biomolecular Modeling and Bioinformatics Beckman Institute, UIUC NAMD Development Goals L.V. (Sanjay) Kale Professor.
CprE / ComS 583 Reconfigurable Computing Prof. Joseph Zambreno Department of Electrical and Computer Engineering Iowa State University Lecture #4 – FPGA.
Interactive Computational Sciences Laboratory Clarence O. E. Burg Assistant Professor of Mathematics University of Central Arkansas Science Museum of Minnesota.
NCN nanoHUB.org Wagner The basics of quantum Monte Carlo Lucas K. Wagner Computational Nanosciences Group University of California, Berkeley In collaboration.
Brent Gorda LBNL – SOS7 3/5/03 1 Planned Machines: BluePlanet SOS7 March 5, 2003 Brent Gorda Future Technologies Group Lawrence Berkeley.
Introduction to Research 2011 Introduction to Research 2011 Ashok Srinivasan Florida State University Images from ORNL, IBM, NVIDIA.
© 2009 IBM Corporation Motivation for HPC Innovation in the Coming Decade Dave Turek VP Deep Computing, IBM.
October 2008 Integrated Predictive Simulation System for Earthquake and Tsunami Disaster CREST/Japan Science and Technology Agency (JST)
Foundations of Information Systems in Business. System ® System  A system is an interrelated set of business procedures used within one business unit.
Scalable and Topology-Aware Load Balancers in Charm++ Amit Sharma Parallel Programming Lab, UIUC.
MSc in High Performance Computing Computational Chemistry Module Parallel Molecular Dynamics (i) Bill Smith CCLRC Daresbury Laboratory
IMS 4212: Database Implementation 1 Dr. Lawrence West, Management Dept., University of Central Florida Physical Database Implementation—Topics.
3/12/2013Computer Engg, IIT(BHU)1 PARALLEL COMPUTERS- 2.
Onlinedeeneislam.blogspot.com1 Design and Analysis of Algorithms Slide # 1 Download From
Complexity Measures for Parallel Computation. Problem parameters: nindex of problem size pnumber of processors Algorithm parameters: t p running time.
Network Systems Lab. Korea Advanced Institute of Science and Technology No.1 Ch. 1 Introduction EE692 Parallel and Distribution Computation | Prof. Song.
Kriging for Estimation of Mineral Resources GISELA/EPIKH School Exequiel Sepúlveda Department of Mining Engineering, University of Chile, Chile ALGES Laboratory,
Introduction to Performance Tuning Chia-heng Tu PAS Lab Summer Workshop 2009 June 30,
OPERATING SYSTEMS CS 3502 Fall 2017
Introduction to Parallel Computing: MPI, OpenMP and Hybrid Programming
Overview of Molecular Dynamics Simulation Theory
Boundary Element Analysis of Systems Using Interval Methods
High Performance Computing and Monte Carlo Methods
Auburn University COMP8330/7330/7336 Advanced Parallel and Distributed Computing Communication Costs (cont.) Dr. Xiao.
Software Architecture in Practice
CSE543T: Algorithms for Nonlinear Optimization Yixin Chen, PhD Professor Department of Computer Science & Engineering Washington University in St Louis.
Complexity Measures for Parallel Computation
Objective of This Course
A Domain Decomposition Parallel Implementation of an Elasto-viscoplasticCoupled elasto-plastic Fast Fourier Transform Micromechanical Solver with Spectral.
Hybrid Programming with OpenMP and MPI
COMP60621 Designing for Parallelism
October 6, 2011 Dr. Itamar Arel College of Engineering
Presented By: Darlene Banta
Complexity Measures for Parallel Computation
Design and Analysis of Algorithms
CprE / ComS 583 Reconfigurable Computing
Computational issues Issues Solutions Large time scale
Presentation transcript:

HPC Technology Track: Foundations of Computational Science Lecture 1 Dr. Greg Wettstein, Ph.D. Research Support Group Leader Division of Information Technology Adjunct Professor Department of Computer Science North Dakota State University

What is Computational Science? Application of computational methods to fundamental problems in science and engineering. Concerned with application of computational methods to one of three objectives.  Modeling  Simulation  Prediction

Modeling Involves development of a mathematical model capable of predicting physical phenomenon.  weather prediction  molecular energy and force fields  properties of materials Typically involves solving systems of differential equations expressed in linear form.

Challenges in Modeling Currently only order 0 systems can be solved directly. Computational approaches are frequently implemented with methods involving truncated series.  Garbage In/Garbage out (GIGO) Limitations constantly demand attention to model parameterization and viability. The field considers better differential equation solvers to be essential.

Emerging Opportunities in Modeling Model parameter evaluation.  Considered important by the national laboratories  Involves computation of first and second derivatives of the model with respect to finalized model parameters Important focus is to determine quality or stability of model. ** Model evaluation is critical.

Multi-Scale Studies Defn: Calculation of system behavior or properties on one level using information from subordinate levels. Continuum of levels (physical systems):  quantum mechanical  molecular dynamics  meso or nanoscale levels  level of continuum  level of device

Simulation Focuses on simulating the behavior of physical systems. Usually involves Monte Carlo methods to solve stochastic systems. Most commonly employed in computational physics.

Simulation – con't. Central to the 'birth' of computational science.  Metropolis, Rosenbluth, Rosenbluth, Teller and Teller  “Equation of State Calculations by Fast Computing Machines” Goal is to develop 'ensembles' or collections of parameters. Typically implemented as 'coarse grained' parallelism.

Prediction The analysis or 'mining' of large sets of data for the purpose of predicting future phenomenon. Centrally important to marketing and e-commerce. Represents a type of computational problem referred to as 'embarassingly' parallel. Most famous example is NetFlix competition.

Challenges in Prediction Data locality  More processors does not equal more speed.  NetFlix competition demonstrated inadequacy of improperly 'balanced' computational architectures. Primary concern of national labs involved in security based computation. Current HPC architectures exacerbate data locality problems.

'The Wettstein Rule of Computational Reality' “If filling a cache line is too slow you will be really unhappy doing a cross-node lookup to a machine 200 racks away.”

RoadRunner Configuration 1.72 petaflops peak / petaflops demonstrated 296 racks covering 6,000 square feet Massively parallel – hybrid architecture  6,480 Opteron (x86) processors  12,960 IBM PowerXCell processors  122,400 cores terrabytes of memory But when is it fast?

A Tradeoff Compelling speeds when each node can work on a discrete element of the problem.  Strictly orthogonal decomposition Embarrassingly parallel problems.  MIMD Less efficient when.  Boundary condition dependent problem.  Access to entire memory space is required.

Latency The Enemy of Prediction Latency definition:  The amount of time required to retrieve the next relevant item of data required in a computational or predictive sequence. 'Achilles Heel' of modern massively parallel systems such as RoadRunner. Common problem since the design of the Cray-1.  Wiring optimized to place time critical connections on the inner portion of the computer.

Reducing Latencies through PTree's Current area of research interest. Addition of second order PTree's to optimize data selection decisions. Minimizes:  cache line flushing  cross-node data lookups

Exercise Log into cluster1.chpc.ndsu.nodak.edu. Use sinfo command to locate an available node.  e.g. node64-49 Use ping command to measure message latency over standard TCP/IP network.  ping -c 5 node64-49 Use ping command to measure message latency over Myrinet:  ping -c 5 node64M-49

Exercise – con't. Bottom of ping command details min, average and maximum communication latencies. Compute expected performance change if a computation is constrained by the length of time required to pass a message from one node to another.

goto technology_lecture1_2;