Overview of Earth Simulator.



Outline
  History & Development
  Hardware Architecture
    Processor Node
    Interconnection Network
  Performance
  Application
  Summary

History & Development
  Customers: 3 Japanese agencies (NASDA, JAERI, JAMSTEC)
  Manufacturer: NEC
  Development milestones:
    Conceptual design: 1997
    Detailed design: 1999
    Manufacture & installation: 2000-2002
    Operation: March 2002 - now


Hardware Architecture Overview
  Architecture model: NEC SX-6 architecture
  Distributed memory system: 640 processor nodes (PNs)
  Interconnection network: single-stage crossbar
  700 terabytes of disk storage


Arithmetic Processor
  8-layer copper interconnect
  20.79 mm x 20.79 mm
  60 million transistors
  Clock frequency: 500 MHz (1 GHz)

Processor Node
  8 arithmetic processors
  16 GB shared memory, divided into 32 units
  Remote access control unit (RCU)
  I/O processor
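The per-node figures can be scaled up to the whole machine. The sketch below is only an illustration: it combines the 640 nodes, 8 arithmetic processors and 16 GB per node quoted in these slides with the rounded 40 Tflop/s theoretical peak from the Performance slide. The constant and function names are made up for this example.

```python
# Figures quoted in the slides; the names are illustrative, not official.
NODES = 640               # processor nodes (PNs)
APS_PER_NODE = 8          # arithmetic processors per node
MEMORY_PER_NODE_GB = 16   # shared memory per node
PEAK_TFLOPS = 40          # rounded theoretical peak from the Performance slide


def system_totals():
    """Return (total processors, total memory in GB, implied Gflop/s per processor)."""
    total_aps = NODES * APS_PER_NODE
    total_memory_gb = NODES * MEMORY_PER_NODE_GB
    # Per-processor peak implied by the rounded system figure.
    peak_per_ap_gflops = PEAK_TFLOPS * 1000 / total_aps
    return total_aps, total_memory_gb, peak_per_ap_gflops


if __name__ == "__main__":
    aps, mem_gb, per_ap = system_totals()
    print(f"Arithmetic processors: {aps}")
    print(f"Total memory: {mem_gb} GB")
    print(f"Implied peak per processor: {per_ap:.2f} Gflop/s")
```

Because the 40 Tflop/s input is rounded, the per-processor value is an estimate derived from the slide figures rather than a vendor specification.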


Interconnection Network
  Each node's RCU is directly connected to the switches and control units
  128 switches and 2 control units
  Number of cables = 640 x 130 = 83,200
  Total cable length: 2,400 km (about 1,500 miles)
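The cable arithmetic on this slide can be checked with a short sketch. It assumes, as the slide's 640 x 130 product implies, one cable from each node to each of the 128 switches and 2 control units. The function names and the derived average cable length are illustrative additions, not figures from the presentation.

```python
# Figures quoted in the slide; the names are illustrative.
NODES = 640
SWITCHES = 128
CONTROL_UNITS = 2
TOTAL_CABLE_KM = 2400
KM_PER_MILE = 1.609344  # international mile


def cable_count():
    """Cables needed if every node links to every switch and control unit."""
    return NODES * (SWITCHES + CONTROL_UNITS)


def km_to_miles(km):
    """Convert kilometres to miles."""
    return km / KM_PER_MILE


def average_cable_length_m():
    """Mean cable length in metres implied by the total length and cable count."""
    return TOTAL_CABLE_KM * 1000 / cable_count()


if __name__ == "__main__":
    print(f"Cables: {cable_count():,}")
    print(f"Total length: {km_to_miles(TOTAL_CABLE_KM):,.0f} miles")
    print(f"Average cable: {average_cable_length_m():.1f} m")
```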


Performance
  Theoretical performance: 40 Tflop/s
  Sustained performance ratio: 87% of peak
  Some comparisons:

  System            Theoretical (Tflop/s)   Sustained (Tflop/s)
  BlueGene          91                      70
  Columbia          60                      51
  Earth Simulator   40                      35
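The sustained-to-peak ratio is a simple division, and the same calculation can be applied to each system in the comparison. The sketch below uses only the rounded figures from the slide; the dictionary layout and function names are assumptions made for illustration.

```python
# (theoretical, sustained) performance in Tflop/s, as quoted in the slide.
SYSTEMS = {
    "BlueGene": (91, 70),
    "Columbia": (60, 51),
    "Earth Simulator": (40, 35),
}


def efficiency(theoretical, sustained):
    """Fraction of theoretical peak that is actually sustained."""
    return sustained / theoretical


def ranked_by_efficiency(systems):
    """System names ordered from highest to lowest sustained/peak ratio."""
    return sorted(
        systems,
        key=lambda name: efficiency(*systems[name]),
        reverse=True,
    )


if __name__ == "__main__":
    for name in ranked_by_efficiency(SYSTEMS):
        peak, sustained = SYSTEMS[name]
        print(f"{name:16s} {efficiency(peak, sustained):.1%}")
```

With these rounded inputs the Earth Simulator has the lowest absolute performance of the three but the highest ratio of sustained to theoretical performance.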

Application
  Forecast:
    Weather forecasting
    Prediction of natural disasters
  Medical:
    Speeding the discovery of new drugs
    Plotting the course of a pandemic
  Other:
    Simulation of nuclear power plants
    Research on ecological engines and nanotechnology

Summary
  Distributed memory parallel computer
  640 processor nodes
  Memory: 16 GB per node
  Interconnection network: single-stage crossbar
  Performance: 35 Tflop/s sustained

References
  The Earth Simulator: http://www.es.jamstec.go.jp
  Top 500: http://www.top500.org

Questions