Scalable Processor Design

Slides:



Advertisements
Similar presentations
Computer Organization and Architecture
Advertisements

Multiprocessors— Large vs. Small Scale Multiprocessors— Large vs. Small Scale.
Chapter 8: Central Processing Unit
POLITECNICO DI MILANO Parallelism in wonderland: are you ready to see how deep the rabbit hole goes? ILP: VLIW Architectures Marco D. Santambrogio:
Computer Architecture Abhinav Agarwal Veeramani V.
Tuan Tran. What is CISC? CISC stands for Complex Instruction Set Computer. CISC are chips that are easy to program and which make efficient use of memory.
Processor Technology and Architecture
Chapter XI Reduced Instruction Set Computing (RISC) CS 147 Li-Chuan Fang.
ELEC Fall 05 1 Very- Long Instruction Word (VLIW) Computer Architecture Fan Wang Department of Electrical and Computer Engineering Auburn.
Chapter 4 Processor Technology and Architecture. Chapter goals Describe CPU instruction and execution cycles Explain how primitive CPU instructions are.
State Machines Timing Computer Bus Computer Performance Instruction Set Architectures RISC / CISC Machines.
RISC. Rational Behind RISC Few of the complex instructions were used –data movement – 45% –ALU ops – 25% –branching – 30% Cheaper memory VLSI technology.
11/11/05ELEC CISC (Complex Instruction Set Computer) Veeraraghavan Ramamurthy ELEC 6200 Computer Architecture and Design Fall 2005.
Seqeuential Logic State Machines Memory
Pipelining By Toan Nguyen.
Processor Organization and Architecture
RISC and CISC. Dec. 2008/Dec. and RISC versus CISC The world of microprocessors and CPUs can be divided into two parts:
Basic Microcomputer Design. Inside the CPU Registers – storage locations Control Unit (CU) – coordinates the sequencing of steps involved in executing.
Guide to Operating Systems, 4th ed.
Basics and Architectures
RISC:Reduced Instruction Set Computing. Overview What is RISC architecture? How did RISC evolve? How does RISC use instruction pipelining? How does RISC.
Computers organization & Assembly Language Chapter 0 INTRODUCTION TO COMPUTING Basic Concepts.
RISC By Ryan Aldana. Agenda Brief Overview of RISC and CISC Features of RISC Instruction Pipeline Register Windowing and renaming Data Conflicts Branch.
Ramesh.B ELEC 6200 Computer Architecture & Design Fall /29/20081Computer Architecture & Design.
Chapter 8 CPU and Memory: Design, Implementation, and Enhancement The Architecture of Computer Hardware and Systems Software: An Information Technology.
Today’s topics Architecture overview Architecture overview Machine instructions Machine instructions Instruction Execution Cycle Instruction Execution.
CPS 4150 Computer Organization Fall 2006 Ching-Song Don Wei.
E X C E E D I N G E X P E C T A T I O N S VLIW-RISC CSIS Parallel Architectures and Algorithms Dr. Hoganson Kennesaw State University Instruction.
MICROOCESSORS AND MICROCONTROLLER:
DSP Architectures Additional Slides Professor S. Srinivasan Electrical Engineering Department I.I.T.-Madras, Chennai –
CISC and RISC 12/25/ What is CISC? acronym for Complex Instruction Set Computer Chips that are easy to program and which make efficient use of memory.
EECS 322 March 18, 2000 RISC - Reduced Instruction Set Computer Reduced Instruction Set Computer  By reducing the number of instructions that a processor.
Sun Microsystems’ UltraSPARC-IIi a Stunt-Free Presentation by Christine Munson Amanda Peters Carl Sadler.
RISC / CISC Architecture by Derek Ng. Overview CISC Architecture RISC Architecture  Pipelining RISC vs CISC.
The Processor & its components. The CPU The brain. Performs all major calculations. Controls and manages the operations of other components of the computer.
CISC. What is it?  CISC - Complex Instruction Set Computer  CISC is a design philosophy that:  1) uses microcode instruction sets  2) uses larger.
1  2004 Morgan Kaufmann Publishers No encoding: –1 bit for each datapath operation –faster, requires more memory (logic) –used for Vax 780 — an astonishing.
Addressing modes, memory architecture, interrupt and exception handling, and external I/O. An ISA includes a specification of the set of opcodes (machine.
Chapter Overview General Concepts IA-32 Processor Architecture
GCSE Computing - The CPU
Dr.Faisal Alzyoud 5/9/2018 Datapath and Control.
Advanced Architectures
ECE354 Embedded Systems Introduction C Andras Moritz.
Visit for more Learning Resources
Assembly Language for Intel-Based Computers, 5th Edition
Guide to Operating Systems, 5th Edition
Architecture & Organization 1
Overview Introduction General Register Organization Stack Organization
Chapter 14 Instruction Level Parallelism and Superscalar Processors
Components of Computer
CISC (Complex Instruction Set Computer)
Embedded Computer Architecture 5SAI0 Chip Multi-Processors (ch 8)
Pipelining: Advanced ILP
Computer Architecture
Central Processing Unit
Superscalar Processors & VLIW Processors
Architecture & Organization 1
Central Processing Unit
CISC AND RISC SYSTEM Based on instruction set, we broadly classify Computer/microprocessor/microcontroller into CISC and RISC. CISC SYSTEM: COMPLEX INSTRUCTION.
Morgan Kaufmann Publishers Computer Organization and Assembly Language
Control Unit Introduction Types Comparison Control Memory
CHAPTER 8: CPU and Memory Design, Enhancement, and Implementation
Chapter 1 Introduction.
Introduction SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software and Applications Miodrag Bolic.
Computer Evolution and Performance
Embedded Computer Architecture 5SAI0 Chip Multi-Processors (ch 8)
Introduction to Microprocessor Programming
Superscalar and VLIW Architectures
Computer Architecture
GCSE Computing - The CPU
Presentation transcript:

Scalable Processor Design Kshitij Bantupalli Peter Ding Teddy Mopewou

The Processor Electronic circuit External data source Memory or data stream Central processing unit (CPU) Specialized Processors Graphics processing unit (GPU) Neural processing unit (NPU)

CPU Fetch Decode Execute Retrieves an instruction from memory Program counter Decode Instruction decoder Instruction set architecture (ISA) Convert the instruction into signals that control other parts of CPU Execute Perform the instruction

CPU Components Control unit Datapath Memory Clock Arithmetic logic unit and pipelines Memory Register files, caches, memory management unit Clock Synchronous circuits

ISA Interface between software and hardware Specifies the instruction the computer can perform and the formart of the instruction Complex instruction set computer (CISC) Reduced instruction set computer (RISC) Very long instruction word (VLIW) Explicitly parallel instruction computing (EPIC)

CISC Attempts to minimize the number of instructions per program Sacrifices number of cycles per instruction Multiple operations are embedded in one instruction Complex instructions of varying lengths Large number of instructions Transistors for storing complex instructions

Advantages of CISC Microprogramming is easy assembly language to implement, and less expensive than hard wiring a control unit. As each instruction became more accomplished, fewer instructions could be used to implement a given task.

Disadvantages of CISC The performance of the machine slows down due to the amount of clock time taken by different instructions will be dissimilar Only 20% of the existing instructions is used in a typical programming event, even though there are various specialized instructions in reality which are not even used frequently.

RISC Attempts to reduce the cycles per instruction Pipelining Sacrificing number of instructions per program Pipelining Overlapping the execution of several instructions in a pipeline One cycle execution time Fixed length instructions

Advantages of RISC over CISC Many RISC processors use the registers for passing arguments and holding the local variables Reduced instructions require less transistors of hardware space Leaves more room for general purpose registers Use a fixed length instruction which is easy to pipeline The speed of the operation can be maximized and the execution time can be minimized More power efficient Mobile devices

CISC vs. RISC Implementation MULT 2:3, 5:2 RISC LOAD A, 2:3 LOAD B, 5:2 PROD A,B STORE 2:3, A

Register Windows Register window overflow and underflow Register window management Register window incremental compilation

Introduction to SPARC Short for scalable processor architecture. It is a reduced instruction set computing(RISC) originally developed by Sun Microsystems. It could have from 72 to 640 general purpose 64-bit registers. It was scalable as it could scale from embedded processors to large server processors all sharing the same instruction set. The number of implemented register windows changes in scaling.

VLIW Processors that execute one instruction after another may be using resources inefficiently Poor performance Josh Fisher of Yale University in early 1980s Exploits instruction level parallelism Execute more than one instruction at a time (superscalar) No instruction interdependencies Moves the complexity from the hardware to the software The complier handles the rest

VLIW Disadvantages VLIW instruction sets are not backwards compatible between implementations Load responses from memory do not have a deterministic delay Very difficult static scheduling of load instructions for the complier

EPIC Hewlett-Packard (HP) Multiple software instructions (bundles) has a stop bit Dependency information Software prefetch instruction Speculative load instruction Check load instruction

Multithreading A single CPU core executes multiple processes or threads concurrently Better utilization of CPU resources Multiple threads contenting with shared resources can degrade performance

Multithreading Types Coarse-grained multithreading Interleaved multithreading Simultaneous multithreading

Multi-core Processors Implements multiprocessing with one physical processor Allows cache coherency circuity to operate at a significantly higher clock rate Signals travel shorter distances, leading to less degradation Uses less power than multiple processors equivalent

Multi-core Processors Disadvantages Maximizing the potential of multi-core processors require adjustments to operating system and software More difficult to handle thermally Lower chip production yields

Asynchronous CPUs No central clock “Pipeline controls” or “FIFO sequencers” Starts the next stage of logic after the existing stage is completed Lower power consumption and electromagnetic interference Speed only limited by propagation delays of logic gates Components can run at different speeds Clocked CPU components are synchronized with the central clock Biggest disadvantage is most CPU design and testing tools are made for clocked CPUs

Caltech Asynchronous Microprocessor World’s first asynchronous microprocessor in 1988 When hot coffee was placed on the chip, the pulse rate slowed down When liquid nitrogen was poured on the chip, the pulse rate increased Ran on a potato

Optical Processors Use light instead of electricity for digital logic Up to 30% faster Uses less power However, current electric computing elements are far cheaper, faster, and more reliable Economically unfeasible for the foreseeable future

Processor Design Instructions per second Floating point operations per second (FLOPS) Performance per watt Parallel computing Low power consumption Small size or low weight Portable embedded systems

References http://www.edgefxkits.com/blog/what-is-risc-and-cisc- architecture/ https://cs.stanford.edu/people/eroberts/courses/soco/projects/risc /risccisc/ http://www.async.caltech.edu/cam.html http://www.async.caltech.edu/~mika/potato/potato.html https://dl.acm.org/citation.cfm?doid=800046.801649 https://courses.cs.washington.edu/courses/cse471/01au/epic_cgi.p df