Microprocessors. Von Neumann architecture Data and instructions in single read/write memory Contents of memory addressable by location, independent of.

Presentation transcript:

Microprocessors

Von Neumann architecture
Data and instructions in single read/write memory
Contents of memory addressable by location, independent of content
Execution occurs in sequential fashion

High-Level View of CPU

CPU Internal Registers
–Memory address register
–Memory buffer register
–I/O buffer register

ALU – Arithmetic Logic Unit
Function units
Floating-point unit (FPU)
–Stack oriented
Communication
–Control and status buses
–RISC has several
–Input: from registers; output: via storage to registers
Registers connected via signal paths.

ALU – continued

Control Unit

Control Unit – Function
Fetch instruction, put it into IR, increment PC
Decode and execute instruction
Micro-orders
Micro-instructions
Micro-program
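
The fetch, increment, decode, execute sequence on this slide can be sketched as a toy stored-program machine. This is only an illustration: the four opcodes, the tuple encoding, and the single accumulator are assumptions made for the sketch, not something the slides specify. Instructions and data share one memory list, as in the Von Neumann slide.

```python
# Toy stored-program machine: instructions and data share one memory list.
# The opcodes and the (opcode, operand) encoding are hypothetical.
LOAD, ADD, STORE, HALT = "LOAD", "ADD", "STORE", "HALT"


def run(memory):
    pc = 0    # program counter: address of the next instruction
    acc = 0   # accumulator
    while True:
        ir = memory[pc]        # fetch: copy the instruction into IR
        pc += 1                # increment PC
        opcode, operand = ir   # decode
        if opcode == LOAD:     # execute
            acc = memory[operand]
        elif opcode == ADD:
            acc += memory[operand]
        elif opcode == STORE:
            memory[operand] = acc
        elif opcode == HALT:
            return memory
        else:
            raise ValueError(f"unknown opcode {opcode!r}")


program = [
    (LOAD, 4),   # address 0: acc = A
    (ADD, 5),    # address 1: acc = acc + B
    (STORE, 6),  # address 2: C = acc
    (HALT, 0),   # address 3: stop
    2,           # address 4: data A
    3,           # address 5: data B
    0,           # address 6: result C
]

result = run(list(program))
print(result[6])  # prints 5
```

In a microprogrammed control unit each branch of the `if` chain would itself expand into a sequence of micro-orders; the sketch collapses that into one Python statement per instruction.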

Control Units
Microprogrammed
–programmed
Conventional
–Hard-wired
RISC vs CISC

Operations and Instruction Set (IS)
Each instruction corresponds to hundreds, thousands, or even more logic operations.
One instruction triggers a cascade of logic operations

CPU Hardwired design
Instruction activates circuits
–PLUS -> Provides for fast execution
–MINUS -> no flexibility
Changes in hardware of machine require changes in code which means changes in hardwiring

Microcode
Popularized by IBM (the idea of microprogramming goes back to Maurice Wilkes)
Chip executes program – on nanoprocessor
Plus -> easier to make complex processor
Minus -> slower operation
To compensate, microcode allows very complex instructions to be used; so fewer instructions are necessary

Register Set
User-visible
–General Purpose Registers
–Data Registers
–Address Registers
–Condition code registers: negative, zero, positive, overflow
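
As a rough illustration of how condition codes get set, the sketch below adds two values on an assumed 8-bit machine and derives the four flags named on this slide. The word size, the function name `add8`, and the dictionary layout are assumptions made for the example. Real processors differ in which flags they provide and how they name them.

```python
def add8(a, b):
    """Add two 8-bit values; return (result, condition-code flags)."""
    a &= 0xFF
    b &= 0xFF
    result = (a + b) & 0xFF          # keep only the low 8 bits
    negative = bool(result & 0x80)   # sign bit of the result
    zero = result == 0
    # Signed overflow: both operands share a sign and the result's sign differs.
    overflow = bool(~(a ^ b) & (a ^ result) & 0x80)
    positive = not negative and not zero
    return result, {
        "negative": negative,
        "zero": zero,
        "positive": positive,
        "overflow": overflow,
    }


print(add8(0x7F, 0x01))  # 127 + 1 wraps to 0x80: negative and overflow are set
print(add8(0xFF, 0x01))  # -1 + 1 gives 0: zero is set, no signed overflow
```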

Register Set cont.
Control and status registers
–Program counter (PC)
–Instruction register (IR)
–Memory address register (MAR)
–Memory buffer register (MBR)
–Program status word (PSW)

Register Set cont.
May have others
–Process control block registers (PCBRs)
On some systems both sets of registers are visible (so PC could be seen by user on some systems)

I/O System
Consists of I/O devices and interface devices
I/O interfaces stand between CPU and the I/O devices.

I/O Unit
Matches timing and signal levels of CPU to devices.
Since the CPU has lower signal capacity, its signals go through signal buffers that strengthen them.

I/O handling
CPU-controlled
Memory-mapped
Direct memory access

CPU-controlled I/O
“Write A to Device N”
Challenge to keep CPU utilization high
–Multiprogrammed Operating Systems
–Multi-ported Memory Systems
–I/O processors: DMA channels, peripheral processing units (PPUs)

Memory-mapped I/O
Memory addresses reserved for interface devices
Each interface has several port addresses (control ports, status ports, input ports, output ports)
Requires no special I/O instructions
CPU can’t distinguish I/O from normal address operation
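
A minimal sketch of the idea, assuming a 16-bit address space in which everything from `0xFF00` upward is reserved for one interface device. The `Device` and `Bus` classes, the port numbers, and the address split are all hypothetical. The point is that the same `load` and `store` operations serve both RAM and the device, so no special I/O instructions are needed.

```python
class Device:
    """Hypothetical interface device: port 0 is status, port 1 is output."""

    def __init__(self):
        self.written = []

    def read_port(self, port):
        return 1 if port == 0 else 0   # status port always reports "ready"

    def write_port(self, port, value):
        if port == 1:                  # output port records what it is sent
            self.written.append(value)


class Bus:
    IO_BASE = 0xFF00  # assumed: addresses from here up belong to the device

    def __init__(self, device):
        self.ram = [0] * self.IO_BASE
        self.device = device

    def load(self, addr):
        if addr >= self.IO_BASE:       # address decoding picks the target
            return self.device.read_port(addr - self.IO_BASE)
        return self.ram[addr]

    def store(self, addr, value):
        if addr >= self.IO_BASE:
            self.device.write_port(addr - self.IO_BASE, value)
        else:
            self.ram[addr] = value


dev = Device()
bus = Bus(dev)
bus.store(0x0010, 65)            # ordinary memory write
bus.store(Bus.IO_BASE + 1, 65)   # same operation, but it reaches the device
```

From the caller's side the two `store` calls look identical, which is the sense in which the CPU cannot distinguish I/O from a normal address operation.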

DMA I/O
Hardware devices that directly control transfer of data
No CPU intervention
Interrupts CPU when finished
Can control bus during transfer
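
The behaviour listed here can be mimicked in a few lines. This is a software analogy only, under the assumption that a callback stands in for the interrupt line. The class name `DMAController` and its methods are invented for the sketch. In hardware the copy would proceed while the CPU does other work.

```python
class DMAController:
    """Hypothetical DMA controller: copies a block, then signals completion."""

    def __init__(self, memory, on_complete):
        self.memory = memory
        self.on_complete = on_complete   # stands in for the interrupt line

    def transfer(self, device_data, dest_addr):
        # The controller, not the CPU, moves each word into memory.
        for offset, word in enumerate(device_data):
            self.memory[dest_addr + offset] = word
        # "Interrupt" the CPU once the whole block has been moved.
        self.on_complete(len(device_data))


memory = [0] * 16
interrupts = []                          # records each completion signal
dma = DMAController(memory, interrupts.append)
dma.transfer([7, 8, 9], dest_addr=4)
```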

DMA examples
IBM – DMA Channels
–Selector channels – multiple devices, one at a time
–Multiplexor channels – multiple devices simultaneously
Peripheral-processing units (PPUs) – Mainframes – Control Data Corp.

Closer Look – Pentium III and II
P3
–L1 cache - operating at speed of CPU
–L2 cache 2M (in XEON 2M - 2GB)
–28.1 x 10^6 transistors
P2
–7.5 x 10^6 transistors
–16K L1 instruction cache, and 16K data cache

Closer look continued
P3 – 2 ALUs, 2 FPUs
–MMX unit

Clocked Logic
Instructions are not carried out immediately as code signals reach the pins – there is a wait time
Early processors did not execute 1 instruction per clock cycle; many instructions required as many as 100 cycles

Clocked Logic
Using current RISC techniques, many instructions take less than 1 clock cycle on average (multiple ALUs, pipelined ALUs, SIMD, …)
Clock multipliers allow CPU to run faster than system clock
Basically, clock speed is not a good metric for comparing different processors; it is only a good indicator for identical processors running at different speeds.

Clocked Logic
Example: Suppose that processor P1 requires an average of six clock cycles per instruction and its system clock runs at 400 MHz. Processor P2 requires an average of two clock cycles per instruction and its system clock runs at 200 MHz.
For P1: 400 MHz / 6 cycles per instruction ≈ 67 million instructions per second
For P2: 200 MHz / 2 cycles per instruction = 100 million instructions per second
Thus, (67 − 100)/67 ≈ −50%: P1 falls short of P2 by about half of its own instruction rate (P2 executes about 50% more instructions per second), even though P1's clock speed is twice as fast as P2's.
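
The arithmetic in this example can be checked with a short script. The function name `instructions_per_second` is just a label chosen here. The inputs are the clock rates and cycles-per-instruction figures from the slide.

```python
def instructions_per_second(clock_hz, cycles_per_instruction):
    """Average instruction rate = clock rate / average cycles per instruction."""
    return clock_hz / cycles_per_instruction


p1 = instructions_per_second(400e6, 6)   # about 66.7 million per second
p2 = instructions_per_second(200e6, 2)   # 100 million per second

shortfall_vs_p1 = (p1 - p2) / p1         # the slide's formula, relative to P1
shortfall_vs_p2 = (p1 - p2) / p2         # the same gap, relative to P2

print(f"P1: {p1 / 1e6:.1f} million instructions/s")
print(f"P2: {p2 / 1e6:.1f} million instructions/s")
print(f"relative to P1: {shortfall_vs_p1:.0%}")
print(f"relative to P2: {shortfall_vs_p2:.0%}")
```

The two percentages differ because they use different baselines: the gap is 50% of P1's rate but only about 33% of P2's rate.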

Modern CPUs
Need faster processing
–Reduce number of steps microprocessor must take
Make processor complex (so can combine steps)
Make instructions simpler
Operating on more than 1 instruction at a time (pipelining and superscalar)

Pentium II
L2 cache – 512 K, not part of CPU (MCM)
–Speed – ½ of CPU
L1 cache doubled in size from the previous generation to deal with the bus bottleneck
Bus Interface Unit
–Data is duplicated to L1 and L2
While Fetch/Decode is pulling instructions, the Branch Target Buffer (BTB) looks for already completed instructions.
The BTB also looks for branches and predicts their outcome correctly about 90% of the time
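
The slide does not say how the prediction is made. As a hedged illustration of how a predictor can be right most of the time, here is a 2-bit saturating counter per branch address, a common textbook scheme that is not claimed to be the Pentium II's actual mechanism. The class name, the starting counter value, and the branch trace are all made up for the sketch.

```python
class TwoBitPredictor:
    """Textbook 2-bit saturating counter per branch address (illustrative only)."""

    def __init__(self):
        self.counters = {}   # branch address -> counter in 0..3

    def predict(self, addr):
        # Counter values 2 and 3 mean "predict taken"; unseen branches start at 1.
        return self.counters.get(addr, 1) >= 2

    def update(self, addr, taken):
        c = self.counters.get(addr, 1)
        self.counters[addr] = min(c + 1, 3) if taken else max(c - 1, 0)


def accuracy(outcomes, addr=0x400):
    """Fraction of branch outcomes predicted correctly for a single branch."""
    predictor = TwoBitPredictor()
    hits = 0
    for taken in outcomes:
        hits += predictor.predict(addr) == taken
        predictor.update(addr, taken)
    return hits / len(outcomes)


# Made-up trace: a loop branch taken 9 times and then not taken, run 10 times.
trace = ([True] * 9 + [False]) * 10
print(accuracy(trace))  # 89 of 100 predictions are correct on this trace
```

The misses come from the loop exits plus one warm-up miss, which is why regular loop branches are easy to predict well.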

Pentium II