Presentation is loading. Please wait.

Presentation is loading. Please wait.

System Integration and Performance

Similar presentations


Presentation on theme: "System Integration and Performance"— Presentation transcript:

1 System Integration and Performance
Chapter 6 System Integration and Performance

2 Chapter goals Describe the implementation of the system bus and bus protocol. Describe how the CPU and bus interact with peripheral devices. Describe the purpose and function of device controllers.

3 Chapter goals cont. Describe how interrupts coordinate actions of the CPU with secondary storage and I/O devices Describe how buffers, caches, and data compression improve computer system performance

4 Role of the system bus Bus is mechanism that allows computer components to work together Is made up of parallel communication lines connecting computer components CPU, hard drive, parallel port, modem, etc. Can connect two or more devices Information can travel in both directions

5 System bus (cont.) Can connect both internal (hard drive) and external (printer) devices System bus has three parts Data bus – carries data Address bus – used if RAM is involved Control bus – commands and status information Each bus line carries 1 bit of information

6 System bus

7 Bus Clock Like the CPU, the bus has a clock that acts as a timing device For CPU, each tick is trigger to execute an instruction For system bus, each tick is an opportunity to transmit data or a control message

8 Bus clock cont. Bus clock is MUCH slower than CPU clock
Think of CPU as the highway and the system bus as the local streets

9 Why slower? Data on bus must travel a longer physical distance than data in CPU Even though data is traveling at the speed of light it still needs more time to travel over a greater distance Need to allows time to factor out noise, interference Also allows time to operate controller logic in peripheral devices

10 Bus data transfer rate Called the bus data capacity
Expresses how much data can travel across bus over time Is a combination of Bus clock speed Data transfer unit (usually a word) Is used to calculate things like Time required to load large files (i.e. video)

11 Bus protocol Data transportation rules that ensure the smooth transfer of information without error Dictates the format, content, and timing of data, memory addresses, and messages Every peripheral device (no matter the manufacturer) must follow the bus protocol rules

12 Bus protocol cont. Protocol can impact (reduce) data transfer rates
Protocols often require exchanges of control signals Control signals consume bus cycles that could otherwise send data

13 Sample protocol Example: if a disk drive transfers data to RAM as the result of an explicit CPU instruction, the following steps are followed: CPU sends command to the drive Drive send acknowledgement to CPU Drive carries out transfer Drive sends confirmation to CPU that transfer is complete

14 Why use protocols? Protocol regulates bus access
Stops devices from interfering with each other I/O data transfer is the largest cause of errors in computers I/O commands need to be acknowledged and confirmed

15 What if two devices need the bus?
When two (or more) peripheral devices need access to the bus at the same time that is called a collision Three solutions are in place to deal with this Master-slave Multiple master Peer to peer

16 Master-slave CPU is bus master Traditional computer architecture
No device can access the bus unless in response to explicit command from CPU Allows a very simple protocol No collision is possible as long as CPU waits for response from device before proceeding to the next bus request

17 Master-slave cont. Overall system performance is severely degraded
If devices can only communicate through the CPU, then transfers between devices, i.e. memory to disk, must pass through the CPU Every transfer takes at least 2 bus cycles CPU cannot execute software while it is managing the bus

18 A better solution System performance is improved if storage and I/O devices can transmit data among themselves without explicit CPU involvement Direct Memory Access (DMA) controller is attached to the bus and main memory DMA assumes the role of bus master for all transfers between memory and other storage or I/O devices CPU is free to do whatever

19 Multiple master bus Any device can assume control of the bus, or act as bus master for transfer to any other device (not just memory) Still only a single device can be master at one time Bus arbitration unit is a simple processor attached to a multiple master bus It decides which devices must wait when multiple devices want to become a bus master

20 Logical vs. Physical Access
I/O port is a communication pathway from the CPU to a peripheral device I/O port is often implemented as a memory address that can be accessed (read or written to) by The CPU Or a single peripheral device

21 Logical and Physical Access

22 I/O Ports Each peripheral device may have several I/O ports and use them for different purposes Dedicated bus hardware controls data movement between I/O ports and peripheral devices CPU reads and writes to I/O ports using ordinary data movement instructions or dedicated I/O instructions

23 The CPU and I/O Ports I/O port is more than a memory address, it is a data conduit It is a logical abstraction used by the CPU and the bus to interact with each peripheral device in a similar way

24 Logical access CPU and the bus both interact with each peripheral device as if it was a storage device containing one or more bytes of contiguous memory CPU and the bus deals with each device the same way, but devices are different Storage capacity Internal data coding methods If storage or I/O device

25 Linear address space A read/write operation to/from this hypothetical device is called a logical access The set of sequentially numbered storage locations is called a linear address space

26 How logical becomes physical
Logical access assumes device is similar to memory (RAM) Bus address lines carry the position within the linear address space being read or written Device controller makes the conversion via a conversion table or a simple algorithm

27 Conversion table for disk

28 Device controllers Storage devices have intermediaries that connect them to the system bus Translate logical access to physical access Handles bus protocol (receiving and acknowledging commands) Permits several devices to share a bus connection

29 Device controllers

30 Device controllers cont.
Device controllers monitor the bus control lines for signals to peripheral devices Translates those signals into appropriate commands for its device

31 Interrupts Secondary storage and I/O device transfer rates are much slower than the CPU Why? Slower bus clock Peripheral devices have mechanical elements (access arm, spin mechanism) that are slower than speed of electricity

32 Interrupts cont. When the CPU issues a read/write instruction it ALWAYS has to wait This waiting time can translate into thousands, millions, or even billions of CPU cycles To allow CPU to be used more efficiently, interrupts are used

33 How interrupts work When a program (task, process, thread) needs I/O, CPU makes I/O request over the system bus Then puts your task aside (asleep) Does something else for the time being

34 Interrupts cont. When I/O is complete, interrupt signal is sent to the CPU CPU can now restart your task with I/O task being complete

35 CPU and Interrupts Portion of the CPU (separate from the fetch execute cycle) continuously monitors the bus for interrupt signals The signal is an interrupt code that indicates the bus port number of the device sending the interrupt CPU copies any interrupt signals it encounters into an interrupt register

36 The CPU and Interrupts cont.
As an extra step in the fetch execute cycle, the CPU checks the interrupt register after completing an instruction but before fetching another one If interrupt register has a non-zero value CPU must respond to the interrupt

37 CPU and Interrupts If CPU is to process an interrupt it does the following: Puts aside (suspends) current task Resets interrupt register to 0 (zero) Processes interrupt by calling interrupt handler After interrupt processing is complete, resumes suspended program

38 Interrupt handlers Interrupts are a mechanism for calling (invoking) system software processes and programs Operating system (OS) provides low-level processing routines (service calls) Examples: reading data in from the keyboard Writing to a file

39 Interrupt handlers cont.
There is a unique individual interrupt handler (i.e. program) to process each possible interrupt Each handler is a separate program stored in a separate part of main memory

40 Interrupt table A conversion table in main memory that has a list of all interrupt codes Interrupt code is used as an index into interrupt table For each interrupt code, interrupt table has the memory address of each interrupt handler

41 Interrupt handlers Supervisor (OS) examines the interrupt code, uses it as an index into the interrupt table Looks up memory location of needed interrupt handler Loads that memory location into the PC (program counter) Interrupt handler begins executing

42 Multiple interrupts It is possible (even likely) that interrupts will interrupt each other OS has an algorithm to determine what goes first Assigns priorities to different interrupts based on Error conditions Critical hardware failures

43 Suspending a process Whenever a process is suspended or interrupted the system must save whatever information is necessary to allow the process to restart again Typically that involved saving PC and IR Any other specialized or general purpose registers that were in use

44 Saving a process The collection of information needed to restart a process is called the “machine state” It is saved in a special storage location called the stack

45 The Stack The stack is a specialize storage location in RAM
It is a data structure where you add and delete information from the same end Therefore the last process saved by the CPU is the first one it will pick up

46 Interrupt process

47 Buffers Buffers are a mechanism that uses RAM to overcome slow data transfer rate to peripheral devices Small storage area (in RAM) used to hold data in transit from one device to another

48 Buffers and printing Printed version of document with formatting information is copied to RAM When full page is ready it is released from the buffer Document is written from RAM to printer Also have input buffers – keyboard, modem, etc.

49 Buffers

50 Cache Pronounced “cash”
Separate high speed storage area specifically managed to improve overall system performance Idea is most often needed data is kept in the cache Must be managed intelligently

51 Cache cont. Data content is not automatically removed (unlike buffer)
Used for bi-directional data transfer Used only for storage device access Larger than buffer

52 Cache cont. Basic idea is that access to high speed cache is faster than hard drive During a write operation cache acts as a buffer Data written to cache then to drive

53 Cache controller Manages the content of the cache
It must “guess” which files should be in the cache, i.e. what data the CPU will ask for Cache hit – when data is found in the cache Cache miss – when data is not found Cache swap – old data removed and new data inserted

54 Cache cont. Even a small cache can significantly improve performance
Ratio of primary storage to cache of 10,000 to 1 can result in cache hit rate of 90%

55 Processing Parallelism
Increases computer system computational capacity; breaks problems into pieces and solves each piece in parallel with separate CPUs Techniques Multicore processors Multi-CPU architecture Clustering

56 Multicore Processors Include multiple CPUs and shared memory cache in a single microchip Typically share memory cache, memory interface, and off-chip I/O circuitry among the cores Reduce total transistor count and cost and provide synergistic benefits

57

58 Multi-CPU Architecture
Employs multiple single or multicore processors sharing main memory and the system bus within a single motherboard or computer system Common in midrange computers, mainframe computers, and supercomputers Cost-effective for Single system that executes many different application programs and services Workstations

59 Scaling Up Increasing processing by using larger and more powerful computers Used to be most cost-effective Still cost-effective when maximal computer power is required and flexibility is not as important

60 Scaling Out Partitioning processing among multiple systems
Speed of communication networks; diminished relative performance penalty Economies of scale have lowered costs Distributed organizational structures emphasize flexibility Improved software for managing multiprocessor configurations

61 High-Performance Clustering
Connects separate computer systems with high-speed interconnections Used for the largest computational problems (e.g., modeling three-dimensional physical phenomena)

62 Partitioning the problem to match the cluster architecture ensures that most data exchange traverses high-speed paths.

63 Compression Technique to reduce the number of bits used to encode a set of related data items, i.e. a file or stream of video Some formats (MP3, GIF) are intentionally compressed data formats

64 Compression cont. Compression is accomplished by the application of a compression algorithm (specific mathematical technique) Also need corresponding de-compression algorithms to restore data to its original state

65 Compression cont. Compression algorithms vary
What types of data are appropriate Whether any data is lost Amount by which data is compressed Lossless compression (zip) – no loss of data Lossy – some loss of data (audio or video)

66 Compression cont. Compression rate – size before and after compression
Used to reduce secondary storage requirements Transmit over Internet Package files together

67 Data compression

68 MP3 encoding elements

69 MP3 How MP3 works

70 Summary The system bus is the communication pathway that connects the CPU with memory and other devices The CPU communicates with peripheral devices through I/O ports

71 Summary cont. Application programs use interrupt processing to coordinate data transfers to or from peripheral devices, notify the CPU of errors, and call operating system service programs A buffer is a region of memory that holds a single unit of data for transfer to or from a device Compression reduces the number of bits required to encode a data set or stream, effectively increasing the capacity of a communication channel or storage device


Download ppt "System Integration and Performance"

Similar presentations


Ads by Google