Presentation is loading. Please wait.

Presentation is loading. Please wait.

SUPERSCALAR ARCHITECTURE

Similar presentations


Presentation on theme: "SUPERSCALAR ARCHITECTURE"— Presentation transcript:

1 SUPERSCALAR ARCHITECTURE
Ahmed Faraz Fall 2008 ELEC

2 Definition and Characteristics
Superscalar processing is the ability to initiate multiple instructions during the same clock cycle. A typical Superscalar processor fetches and decodes the incoming instruction stream several instructions at a time. Superscalar architecture exploit the potential of ILP(Instruction Level Parallelism). Fall 2008 ELEC

3 Fetching and dispatching two instructions per cycle
Fall 2008 ELEC

4 Uninterrupted stream of instructions
The outcomes of conditional branch instructions are usually predicted in advance to ensure uninterrupted stream of instructions Instructions are initiated for execution in parallel based on the availability of operand data, rather than their original program sequence. This is referred to as dynamic instruction scheduling. Upon completion instruction results are resequenced in the original order. Fall 2008 ELEC

5 Superscalar Execution Example
- With Register Renaming for WAR and WAW dependencies. Fall 2008 ELEC

6 . Register Renaming Example
WAR dependency exist between LD r7,(r3) and SUB r3, r12,r11 instructions With Register Renaming, the first write to r3 maps to hw3,while the second write maps to hw20.This converts four instruction dependency chain into 2 two instructions chains, which can then be executed in parallel if the processor allows out of order execution. Fall 2008 ELEC

7 Hardware Organization of a superscalar processor
Multiple paths connecting units are used to illustrate a typical level of parallelism. For example, four instructions can be fetched in parallel, two integer/address instruction scan issue in parallel, two floating point instructions can complete in parellel etc. Fall 2008 ELEC

8 CONCLUSION It thereby allows faster CPU throughput than would otherwise be possible at the same clock rate. All general-purpose CPUs developed since about 1998 are superscalar. The major problem of executing multiple instructions in a scalar program is the handling of data dependencies. If data dependencies are not effectively handled, it is difficult to achieve an execution rate of more than one instruction per clock cycle. Fall 2008 ELEC

9 References THE MICRO ARCHITECTURE OF SUPERSCALAR PROCESSORS BY
JAMES E. SMITH, MEMBER, IEEE, AND GURINDAR S. SOHI, SENIOR MEMBER, IEEE LIMITATION OF SUPERSCALAR MICROPROCESSOR PERFORMANCE THANG TRAN ,ADVANCED MICRO DEVICES, INC. AUSTIN, TEXAS AND CHUAN-LIN WU,DEPARTMENT OF ELECTRICAL AND COMPUTER ENGINEERING Fall 2008 ELEC


Download ppt "SUPERSCALAR ARCHITECTURE"

Similar presentations


Ads by Google