Presentation is loading. Please wait.

Presentation is loading. Please wait.

Parallel Computers Today Oak Ridge / Cray Jaguar > 1.75 PFLOPS Two Nvidia 8800 GPUs > 1 TFLOPS Intel 80- core chip > 1 TFLOPS  TFLOPS = 10 12 floating.

Similar presentations


Presentation on theme: "Parallel Computers Today Oak Ridge / Cray Jaguar > 1.75 PFLOPS Two Nvidia 8800 GPUs > 1 TFLOPS Intel 80- core chip > 1 TFLOPS  TFLOPS = 10 12 floating."— Presentation transcript:

1 Parallel Computers Today Oak Ridge / Cray Jaguar > 1.75 PFLOPS Two Nvidia 8800 GPUs > 1 TFLOPS Intel 80- core chip > 1 TFLOPS  TFLOPS = 10 12 floating point ops/sec  PFLOPS = 1,000,000,000,000,000 / sec (10 15 )

2 Supercomputers 1976:Cray-1, 133 MFLOPS (10 6 ) Supercomputers 1976: Cray-1, 133 MFLOPS (10 6 )

3 Trends in processor clock speed

4 AMD Opteron 12-core chip

5 AMD Opteron 6-core layout detail

6 The nVidia G80 GPU 128 streaming floating point processors @1.5Ghz 1.5 Gb Shared RAM with 86Gb/s bandwidth 500 Gflop on one chip (single precision)

7 More Detail on GPU Architecture

8 Cray XMT (highly multithreaded shared memory)

9 Top 500 List http://www.top500.org/list/2010/11/100 Graph 500 List http://www.graph500.org/Results.html

10 Generic Parallel Machine Architecture Key architecture question: Where is the interconnect, and how fast? Key algorithm question: Where is the data? Proc Cache L2 Cache L3 Cache Memory Storage Hierarchy Proc Cache L2 Cache L3 Cache Memory Proc Cache L2 Cache L3 Cache Memory potential interconnects

11 4-core Intel Nehalem chip (2 per Triton node):

12 Triton memory hierarchy Node Memory Proc Cache L2 Cache L3 Cache Proc Cache L2 Cache Proc Cache L2 Cache Proc Cache L2 Cache Proc Cache L2 Cache L3 Cache Proc Cache L2 Cache Proc Cache L2 Cache Proc Cache L2 Cache Chip Node


Download ppt "Parallel Computers Today Oak Ridge / Cray Jaguar > 1.75 PFLOPS Two Nvidia 8800 GPUs > 1 TFLOPS Intel 80- core chip > 1 TFLOPS  TFLOPS = 10 12 floating."

Similar presentations


Ads by Google