Department of Computer Science University of the West Indies Part II.

Slides:



Advertisements
Similar presentations
Building a CFD Grid Over ThaiGrid Infrastructure Putchong Uthayopas, Ph.D Department of Computer Engineering, Faculty of Engineering, Kasetsart University,
Advertisements

IT253: Computer Organization
Parallel Programming and Algorithms : A Primer Kishore Kothapalli IIIT-H Workshop on Multi-core Technologies International Institute.
1 Parallel Scientific Computing: Algorithms and Tools Lecture #2 APMA 2821A, Spring 2008 Instructors: George Em Karniadakis Leopold Grinberg.
CA 714CA Midterm Review. C5 Cache Optimization Reduce miss penalty –Hardware and software Reduce miss rate –Hardware and software Reduce hit time –Hardware.
Beowulf Supercomputer System Lee, Jung won CS843.
Parallel Programming Languages Andrew Rau-Chaplin.
History of Distributed Systems Joseph Cordina
Chris Madill Molecular Structure and Function, Hospital for Sick Children Department of Biochemistry, University of Toronto Supervised by Dr. Paul Chow.
Digital Systems Emphasis for Electrical Engineering Students Digital Systems skills are very valuable for electrical engineers Digital systems are the.
1 Distributed Computing Algorithms CSCI Distributed Computing: everything not centralized many processors.
Henry Hexmoor1 Chapter 7 Henry Hexmoor Registers and RTL.
Introduction CS 524 – High-Performance Computing.
Computational Astrophysics: Methodology 1.Identify astrophysical problem 2.Write down corresponding equations 3.Identify numerical algorithm 4.Find a computer.
1  1998 Morgan Kaufmann Publishers Chapter 9 Multiprocessors.
Introduction What is Parallel Algorithms? Why Parallel Algorithms? Evolution and Convergence of Parallel Algorithms Fundamental Design Issues.
1 Interfacing Processors and Peripherals I/O Design affected by many factors (expandability, resilience) Performance: — access latency — throughput — connection.
OPERATING SYSTEM OVERVIEW
Chapter 1 and 2 Computer System and Operating System Overview
1 Lecture 1 Introduction to Parallel Computing Parallel Computing Fall 2008.
CS 282 Simulation Physics Lecture 1: Introduction to Rigid-Body Simulation 1 September 2011 Instructor: Kostas Bekris Computer Science & Engineering, University.
1 CSE SUNY New Paltz Chapter Nine Multiprocessors.
Fall 2008Introduction to Parallel Processing1 Introduction to Parallel Processing.
Lecture 1: Introduction to High Performance Computing.
Computer Science 502 Objectives and Main Areas of Coverage.
High Performance Computing (HPC) at Center for Information Communication and Technology in UTM.
CMSC 611: Advanced Computer Architecture Parallel Computation Most slides adapted from David Patterson. Some from Mohomed Younis.
CSE 260 Parallel Computation Allan Snavely, Henri Casanova
08/21/2012CS4230 CS4230 Parallel Programming Lecture 1: Introduction Mary Hall August 21,
School of Aeronautical Engineering, Queen’s University Belfast Turbulent Wind Flow over a High Speed Train R K Cooper School of Aeronautical Engineering.
3- System modelling An architectural model presents an abstract view of the sub-systems making up a system May include major information flows between.
LOGO OPERATING SYSTEM Dalia AL-Dabbagh
 What is an operating system? What is an operating system?  Where does the OS fit in? Where does the OS fit in?  Services provided by an OS Services.
Operating System Review September 10, 2012Introduction to Computer Security ©2004 Matt Bishop Slide #1-1.
Overview of Computing. Computer Science What is computer science? The systematic study of computing systems and computation. Contains theories for understanding.
Seaborg Cerise Wuthrich CMPS Seaborg  Manufactured by IBM  Distributed Memory Parallel Supercomputer  Based on IBM’s SP RS/6000 Architecture.
Parallel and Distributed Systems Instructor: Xin Yuan Department of Computer Science Florida State University.
Computational Fluid Dynamics - Fall 2003 The syllabus Term project CFD references (Text books and papers) Course Tools Course Web Site:
High-Performance Computing An Applications Perspective REACH-IIT Kanpur 10 th Oct
If Exascale by 2018, Really? Yes, if we want it, and here is how Laxmikant Kale.
Application / User Viewpoint Computer Science Section Head Computational and Information Systems Laboratory National Center for Atmospheric.
April 26, CSE8380 Parallel and Distributed Processing Presentation Hong Yue Department of Computer Science & Engineering Southern Methodist University.
COMPUTER ORGANIZATIONS CSNB123. COMPUTER ORGANIZATIONS CSNB123 Why do you need to study computer organization and architecture? Computer science and IT.
NIH Resource for Biomolecular Modeling and Bioinformatics Beckman Institute, UIUC NAMD Development Goals L.V. (Sanjay) Kale Professor.
CSCI-455/522 Introduction to High Performance Computing Lecture 1.
CS 240A Applied Parallel Computing John R. Gilbert Thanks to Kathy Yelick and Jim Demmel at UCB for.
The Cosmic Cube Charles L. Seitz Presented By: Jason D. Robey 2 APR 03.
CS4414 Issues in Scientific Computing Alexey Onufriev Departments of CS and Physics VT.
Neural Networks in Computer Science n CS/PY 231 Lab Presentation # 1 n January 14, 2005 n Mount Union College.
Workshop BigSim Large Parallel Machine Simulation Presented by Eric Bohm PPL Charm Workshop 2004.
Distributed Programming CA107 Topics in Computing Series Martin Crane Karl Podesta.
Introduction Application of parallel programming to the KAMM model
INEL6067 Technology ---> Limitations & Opportunities Wires -Area -Propagation speed Clock Power VLSI -I/O pin limitations -Chip area -Chip crossing delay.
LCSE – NCSA Partnership Accomplishments, FY01 Paul R. Woodward Laboratory for Computational Science & Engineering University of Minnesota October 17, 2001.
(Superficial!) Review of Uniprocessor Architecture Parallel Architectures and Related concepts CS 433 Laxmikant Kale University of Illinois at Urbana-Champaign.
1 Lecture 1: Parallel Architecture Intro Course organization:  ~18 parallel architecture lectures (based on text)  ~10 (recent) paper presentations 
… begin …. Parallel Computing: What is it good for? William M. Jones, Ph.D. Assistant Professor Computer Science Department Coastal Carolina University.
Computational Fluid Dynamics - Fall 2007 The syllabus CFD references (Text books and papers) Course Tools Course Web Site:
Towards a Digital Human Kathy Yelick EECS Department U.C. Berkeley.
Computer Science and Engineering Parallel and Distributed Processing CSE 8380 April 28, 2005 Session 29.
1  2004 Morgan Kaufmann Publishers Fallacies and Pitfalls Fallacy: the rated mean time to failure of disks is 1,200,000 hours, so disks practically never.
Feeding Parallel Machines – Any Silver Bullets? Novica Nosović ETF Sarajevo 8th Workshop “Software Engineering Education and Reverse Engineering” Durres,
Introduction to Parallel Computing: MPI, OpenMP and Hybrid Programming
Web: Parallel Computing Rabie A. Ramadan , PhD Web:
CMSC 611: Advanced Computer Architecture
Distributed Shared Memory
Lecture 1: Parallel Architecture Intro
Programming Languages
CSE8380 Parallel and Distributed Processing Presentation
Distributed Computing:
Presentation transcript:

Department of Computer Science University of the West Indies Part II

Parallel Programming? ENIAC, University of Pennsylvania 1946 (

The Need For Power

Computational Science qTraditional scientific and engineering paradigm q Do theory or paper design q Perform experiments or build system qReplacing both by numerical experiments q Real phenomena are too complicated to model by hand q Real experiments are: q too hard, e.g., build large wind tunnels q too expensive, e.g., build a throw-away passenger jet q too slow, e.g., wait for climate or galactic evolution q too dangerous, e.g., weapons, drug design

Computational Science Examples qAstrophysical thermonuclear flashes qNuclear weapons qWeather prediction qClimate and atmospheric modeling qDrug design qBlood flow qFluid dynamics (CFD)

Fluid Dynamics qForced convective heat transfer qBuoyant convection qHairpin vortex generation qRayleigh-Taylor instability

Hairpin Vortices - Transition to Turbulence qBoundary layer flow past a hemispherical roughness element qRe= based on hemisphere height qK= spectral elements of polynomial degree N=7-15

Simulation Cost qCost is O(Re 3 ) qRe=1K simulation ~ 1 week on 512 processors of ASCI Red q 50GF, 64 GB qRe=10K ~ 1 year on all 8192 processors of ASCI Red q 800GF, 1TB qWe’re really interested in Re=1M … qCan’t even think of doing the Re=1K problem on a uniprocessor machine let alone the 10K or 1M problems!

The Necessity of Parallel Computing

How fast can a serial computer be? qConsider the 1 Tflop sequential machine q data must travel some distance, r, to get from memory to CPU q to get 1 data element per cycle, this means times per second at the speed of light, c = 3e8 m/s q r < c/10 12 = 0.3 mm qNow put 1 TB of storage in a.3 mm 2 area q each word occupies about 3 Angstroms 2, the size of a small atom r =.3 mm 1 Tflop 1 TB sequential machine

Even if we could make it... q... it’d be too expensive qMarket forces are dictating use of COTS

The Solution ? qAdd more workers! qUse a collection of processors and memory modules to work together to solve our problems qSupercomputers, MPPs, Clusters, Beowulfs

Bad News

Still Lots of Work qDecide on and implement an interconnection network for the processors and memory modules qDesign and implement system software for the hardware qDevise algorithms and data structures for solving our problems qDivide the algorithms and data structures up into subproblems qIdentify the communication that will be needed between the subproblems qAssign subproblems to processors and memory modules

Modern Layered Framework CAD MultiprogrammingShared address Message passing Data parallel DatabaseScientific modeling Parallel applications Programming models Communication abstraction User/system boundary Compilation or library Operating systems support Communication hardware Physical communication medium Hardware/software boundary