Presentation is loading. Please wait.

Presentation is loading. Please wait.

IBM Cell Processor Ryan Carlson, Yannick Lanner-Cusin, & Cyrus Stoller CS87: Parallel and Distributed Computing.

Similar presentations


Presentation on theme: "IBM Cell Processor Ryan Carlson, Yannick Lanner-Cusin, & Cyrus Stoller CS87: Parallel and Distributed Computing."— Presentation transcript:

1 IBM Cell Processor Ryan Carlson, Yannick Lanner-Cusin, & Cyrus Stoller CS87: Parallel and Distributed Computing

2 Outline Architectural overview Definition of Cell Processor Shared vs Private Memory Design System Scalability

3 Architectural Features Power Processor Element (PPE) –64 bit Synergistic Processing Elements (SPE) –32 bit –256 kb of on chip RAM Element Interconnect Bus (EIB) I/O Interface RAM

4

5 Synergistic Processing Elements Vector Instructions PPE delegates anything that is parallelizable Local Store

6 Stream Processing

7 Vector Instructions

8

9 Advantages of Vector Instructions Fewer instructions Fewer branch instructions -- fewer mispredictions Access memory block at a time Less memory access = faster processing time Example: convert an image to grayscale

10 Disadvantages of Vector Processor More expensive to produce Increased code complexity May be difficult to port between systems Increased power consumption Wasted resources if using scalar instructions

11 Definition of Cell Processor Microprocessor designed to optimize cooperation between ordinary desktop processors and more specialized high- performance processors (like a GPU) Performance and hardware simplicity prioritized over programming convenience

12 Shared Memory RAM Private Memory Local Stores on SPE Cache on PPE Pretty Simple

13 System Design Vector instruction optimization takes planning (ie shopping list) Gaming (PS3) Cryptography, graphics transform and lighting, physics, fast-Fourier transforms (FFT), matrix operations

14 Scalability Harder to optimize programs Easier to optimize hardware

15

16 Why use a cell processor? General purpose processor Designed not to have any slow components Even though you cannot vectorize every instruction, the SPE’s are still useful Worse case - just as fast as an ordinary desktop processor


Download ppt "IBM Cell Processor Ryan Carlson, Yannick Lanner-Cusin, & Cyrus Stoller CS87: Parallel and Distributed Computing."

Similar presentations


Ads by Google