CMSC 611: Advanced Computer Architecture I/O & Storage Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted.

Slides:



Advertisements
Similar presentations
IT253: Computer Organization
Advertisements

I/O Chapter 8. Outline Introduction Disk Storage and Dependability – 8.2 Buses and other connectors – 8.4 I/O performance measures – 8.6.
CS224 Spring 2011 Computer Organization CS224 Chapter 6A: Disk Systems With thanks to M.J. Irwin, D. Patterson, and J. Hennessy for some lecture slide.
I/O Management and Disk Scheduling
Faculty of Information Technology Department of Computer Science Computer Organization Chapter 7 External Memory Mohammad Sharaf.
CMSC 611: Advanced Computer Architecture
CSE431 Chapter 6A.1Irwin, PSU, 2008 Chapter 6A: Disk Systems Mary Jane Irwin ( ) [Adapted from Computer Organization.
CSCE430/830 Computer Architecture
Operating Systems ECE344 Ashvin Goel ECE University of Toronto Disks and RAID.
CPE 442 io.1 Introduction To Computer Architecture CpE 442 I/O Systems.
CSE521: Introduction to Computer Architecture Mazin Yousif I/O Subsystem RAID (Redundant Array of Independent Disks)
CSCE 212 Chapter 8 Storage, Networks, and Other Peripherals Instructor: Jason D. Bakos.
RAID Technology. Use Arrays of Small Disks? 14” 10”5.25”3.5” Disk Array: 1 disk design Conventional: 4 disk designs Low End High End Katz and Patterson.
CS 430 – Computer Architecture Disks
EECC551 - Shaaban #1 Lec # 13 Winter Magnetic Disk CharacteristicsMagnetic Disk Characteristics I/O Connection StructureI/O Connection Structure.
Computer ArchitectureFall 2007 © November 28, 2007 Karem A. Sakallah Lecture 24 Disk IO and RAID CS : Computer Architecture.
CS61C L16 Disks © UC Regents 1 CS61C - Machine Structures Lecture 16 - Disks October 20, 2000 David Patterson
1 Lecture 26: Storage Systems Topics: Storage Systems (Chapter 6), other innovations Final exam stats:  Highest: 95  Mean: 70, Median: 73  Toughest.
First magnetic disks, the IBM 305 RAMAC (2 units shown) introduced in One platter shown top right. A RAMAC stored 5 million characters on inch.
Prof. John Nestor ECE Department Lafayette College Easton, Pennsylvania ECE Computer Organization Lecture 22 - Input/Output.
Computer ArchitectureFall 2008 © November 12, 2007 Nael Abu-Ghazaleh Lecture 24 Disk IO.
Disk Technologies. Magnetic Disks Purpose: – Long-term, nonvolatile, inexpensive storage for files – Large, inexpensive, slow level in the memory hierarchy.
S.1 Review: Major Components of a Computer Processor Control Datapath Memory Devices Input Output Cache Main Memory Secondary Memory (Disk)
1 CS222: Principles of Database Management Fall 2010 Professor Chen Li Department of Computer Science University of California, Irvine Notes 01.
I/0 devices.
Introduction to Database Systems 1 The Storage Hierarchy and Magnetic Disks Storage Technology: Topic 1.
Redundant Array of Inexpensive Disks (RAID). Redundant Arrays of Disks Files are "striped" across multiple spindles Redundancy yields high data availability.
Memory/Storage Architecture Lab Computer Architecture Lecture Storage and Other I/O Topics.
Storage & Peripherals Disks, Networks, and Other Devices.
Lecture 4 1 Reliability vs Availability Reliability: Is anything broken? Availability: Is the system still available to the user?
CS 352 : Computer Organization and Design University of Wisconsin-Eau Claire Dan Ernst Storage Systems.
RAID Storage EEL4768, Fall 2009 Dr. Jun Wang Slides Prepared based on D&P Computer Architecture textbook, etc.
CSE431 Chapter 6A.1Irwin, PSU, 2008 Chapter 6A: Disk Systems Mary Jane Irwin ( ) [Adapted from Computer Organization.
I/O – Chapter 8 Introduction Disk Storage and Dependability – 8.2 Buses and other connectors – 8.4 I/O performance measures – 8.6.
1 Chapter 7: Storage Systems Introduction Magnetic disks Buses RAID: Redundant Arrays of Inexpensive Disks.
Disk Access. DISK STRUCTURE Sector: Smallest unit of data transfer from/to disk; 512B 2/4/8 adjacent sectors transferred together: Blocks Read/write heads.
Lecture 9 of Advanced Databases Storage and File Structure (Part II) Instructor: Mr.Ahmed Al Astal.
1 (Based on text: David A. Patterson & John L. Hennessy, Computer Organization and Design: The Hardware/Software Interface, 3 rd Ed., Morgan Kaufmann,
CS2100 Computer Organisation Input/Output (AY2010/2011) Semester 2 Adapted from.
August 1, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 9: I/O Devices and Communication Buses * Jeremy R. Johnson Wednesday,
Αρχιτεκτονική Υπολογιστών Ενότητα # 6: RAID Διδάσκων: Γεώργιος Κ. Πολύζος Τμήμα: Πληροφορικής.
Csci 136 Computer Architecture II – IO and Storage Systems Xiuzhen Cheng
Adapted from Computer Organization and Design, Patterson & Hennessy ECE232: Hardware Organization and Design Part 17: Input/Output Chapter 6
Processor Memory Processor-memory bus I/O Device Bus Adapter I/O Device I/O Device Bus Adapter I/O Device I/O Device Expansion bus I/O Bus.
Chapter 8 Secondary Memory. Topics Types of External Memory Magnetic Disk Optical Magnetic Tape.
1 Lecture 27: Disks Today’s topics:  Disk basics  RAID  Research topics.
1 Lecture 23: Storage Systems Topics: disk access, bus design, evaluation metrics, RAID (Sections )
Disk Average Seek Time. Multi-platter Disk platter Disk read/write arm read/write head.
CPSC 231 Secondary storage (D.H.)1 Learning Objectives Understanding disk organization. Sectors, clusters and extents. Fragmentation. Disk access time.
COSC 6340: Disks 1 Disks and Files DBMS stores information on (“hard”) disks. This has major implications for DBMS design! » READ: transfer data from disk.
Mohamed Younis CMCS 411, Computer Architecture 1 CMCS Computer Architecture Lecture 25 I/O Systems May 2,
LECTURE 13 I/O. I/O CANNOT BE IGNORED Assume a program requires 100 seconds, 90 seconds for main memory, 10 seconds for I/O. Assume main memory access.
1 Components of the Virtual Memory System  Arrows indicate what happens on a lw virtual address data physical address TLB page table memory cache disk.
Silberschatz, Galvin and Gagne ©2013 Operating System Concepts – 9 th Edition Chapter 10: Mass-Storage Systems.
Advanced Computer Architecture CS 704 Advanced Computer Architecture Lecture 40 Input Output Systems (RAID and I/O System Design) Prof. Dr. M. Ashraf Chughtai.
Storage HDD, SSD and RAID.
Multiple Platters.
Module: Storage Systems
I/O System Chapter 5 Designed by .VAS.
Vladimir Stojanovic & Nicholas Weaver
IT 251 Computer Organization and Architecture
Disks and Files DBMS stores information on (“hard”) disks.
Lecture 13 I/O.
CMSC 611: Advanced Computer Architecture
Input-output I/O is very much architecture/system dependent
Lecture 28: Reliability Today’s topics: GPU wrap-up Disk basics RAID
UNIT IV RAID.
Mass-Storage Systems.
Presentation transcript:

CMSC 611: Advanced Computer Architecture I/O & Storage Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted from Hennessy & Patterson / © 2003 Elsevier Science

Input/Output I/O Interface – Device drivers – Device controller – Service queues – Interrupt handling Design Issues – Performance – Expandability – Standardization – Resilience to failure Impact on Tasks – Blocking conditions – Priority inversion – Access ordering Processor Computer Control Datapath MemoryDevices Input Output Processor Computer Control Datapath MemoryDevices Input Output Network 2

Suppose we have a benchmark that executes in 100 seconds of elapsed time, where 90 seconds is CPU time and the rest is I/O time. If the CPU time improves by 1.5x per year for the next five years but I/O time does not improve, how much faster will our program run at the end of the five years? Answer: Elapsed Time = CPU time + I/O time Over five years: CPU improvement = 90/12 = 7. BUT System improvement = 100/22 = 4.5 Impact of I/O on System Performance 3

Processor Cache Memory - I/O Bus Main Memory I/O Controller Disk I/O Controller I/O Controller Graphics Network interrupts Typical I/O System The connection between the I/O devices, processor, and memory are usually called (local or internal) bus Communication among the devices and the processor use both protocols on the bus and interrupts 4

I/O Device Examples DeviceBehaviorPartnerData Rate KeyboardInputHuman200 B/s MouseInputHuman500 B/s PrinterOutputHuman20 KB/s Optical DiskStorageMachine10 MB/s Magnetic DiskStorageMachine1 GB/s Network-LANInput or OutputMachine 1–125 MB/s Graphics Display OutputHuman 300 MB/s 5

Disk History Data density in Mbit/square inch Capacity of Unit Shown in Megabytes source: New York Times, 2/23/98, page C3 6

Organization of a Hard Magnetic Disk Typical numbers (depending on the disk size): –500 to 2,000 tracks per surface –32 to 128 sectors per track A sector is the smallest unit that can be read or written to Traditionally all tracks have the same number of sectors: –Constant bit density: record more sectors on the outer tracks –Recently relaxed: constant bit size, speed varies with track location Platters Track Sector 7

Cylinder Sector Track Head Platter Magnetic Disk Operation Cylinder: all the tracks under the head at a given point on all surface Read/write is a three-stage process: –Seek time position the arm over proper track –Rotational latency wait for the sector to rotate under the read/write head –Transfer time transfer a block of bits (sector) under the read-write head Average seek time –(∑ time for all possible seeks) / (# seeks) –Typically in the range of 8 ms to 12 ms –Due to locality of disk reference, actual average seek time may only be 25% to 33% of the advertised number 8

Magnetic Disk Characteristic Rotational Latency: –Most disks rotate at 5,400 to 10,000 RPM –Approximately 11 ms to 6 ms per revolution, respectively –An average latency to the desired information is halfway around the disk: 5.5 ms at 5400 RPM, 3 ms at RPM Transfer Time is a function of : –Transfer size (usually a sector): 1 KB / sector –Rotation speed: 5400 RPM to RPM –Recording density: bits per inch on a track –Diameter: typical diameter ranges from 2.5 to 5.25” –Typical values ~500MB per second 9

Example Calculate the access time for a disk with 512 byte/sector and 12 ms advertised seek time. The disk rotates at 5400 RPM and transfers data at a rate of 4MB/sec. The controller overhead is 1 ms. Assume that the queue is idle (so no service time) Answer: Disk Time= Seek+ Rotation+ Transfer+ Controller+ Queue = 12 ms+.5/5400 RPM+ 0.5 KiB / 4 MiB/s+ 1 ms+ 0 = 12 ms+.5/90 RPs+ (0.125 / 1024) s+ 1 ms+ 0 = 12 ms+ 5.5 ms+ 0.1 ms+ 1 ms+ 0 = 18.6 ms If real seeks are 1/3 the advertised seeks, disk access time would be 10.6 ms, with rotation delay contributing 50% of the access time! If real seeks are 1/3 the advertised seeks, disk access time would be 10.6 ms, with rotation delay contributing 50% of the access time! 10

Characteristics IBM 3090 IBM UltraStar Integral 1820 Disk diameter (inches) Formatted data capacity (MB)22,700 4, MTTF (hours)50,000 1,000, ,000 Number of arms/box Rotation speed (RPM) 3,600 7,200 3,800 Transfer rate (MB/sec) Power/box (watts) 2, MB/watt Volume (cubic feet) MB/cubic feet Historical Trend 11

Reliability and Availability Two terms that are often confused: –Reliability: Is anything broken? –Availability: Is the system still available to the user? Availability can be improved by adding hardware: –Example: adding ECC on memory Reliability can only be improved by: –Enhancing environmental conditions –Building more reliable components –Building with fewer components Improve availability may come at the cost of lower reliability 12

Disk Arrays Increase potential throughput by having many disk drives: –Data is spread over multiple disk –Multiple accesses are made to several disks Reliability is lower than a single disk: –Reliability of N disks = Reliability of 1 Disk ÷ N (50,000 Hours ÷ 70 disks = 700 hours) Disk system MTTF: Drops from 6 years to 1 month –Arrays (without redundancy) too unreliable to be useful! –But availability can be improved by adding redundant disks (RAID): Lost information can be reconstructed from redundant information 13

Redundant Arrays of Disks Redundant Array of Inexpensive Disks (RAID) –Widely available and used in today’s market –Files are "striped" across multiple spindles –Redundancy yields high data availability despite low reliability –Contents of a failed disk is reconstructed from data redundantly stored in the disk array –Drawbacks include capacity penalty to store redundant data and bandwidth penalty to update a disk block –Different levels based on replication level and recovery techniques 14

Targeted for high I/O rate, high availability environments recovery group RAID 1: Disk Mirroring/Shadowing Each disk is fully duplicated onto its "shadow“ Very high availability can be achieved Bandwidth sacrifice on write: Logical write = two physical writes Reads may be optimized Most expensive solution: 100% capacity overhead 15

RAID 3: Parity Disk P logical record Striped physical records  Parity computed across recovery group to protect against hard disk failures  33% capacity cost for parity in this configuration: wider arrays reduce capacity costs, decrease expected availability, increase reconstruction time  Arms logically synchronized, spindles rotationally synchronized (logically a single high capacity, high transfer rate disk) Targeted for high bandwidth applications: Scientific, Image Processing 16

Block-Based Parity  Block-based parity leads to more efficient read access compared to RAID 3  Designating a parity disk allows recovery but will keep it idle in the absence of a disk failure  RAID 5 distribute the parity block to allow the use of all disk and enhance parallelism of disk access RAID 4RAID 5 17

RAID 5+: High I/O Rate Parity A logical write becomes four physical I/Os Independent writes possible because of interleaved parity Reed-Solomon Codes ("Q") for protection during reconstruction A logical write becomes four physical I/Os Independent writes possible because of interleaved parity Reed-Solomon Codes ("Q") for protection during reconstruction D0D1D2 D3 P D4D5D6 P D7 D8D9P D10 D11 D12PD13 D14 D15 PD16D17 D18 D19 D20D21D22 D23 P Disk Columns Increasing Logical Disk Addresses Stripe Unit 18

Problems of Small Writes D0D1D2 D3 P D0' + + D1D2 D3 P' new data old data old parity XOR (1. Read) (2. Read) (3. Write) (4. Write) RAID-5: Small Write Algorithm 1 Logical Write = 2 Physical Reads + 2 Physical Writes 19

Subsystem Organization host array controller single board disk controller single board disk controller single board disk controller single board disk controller host adapter manages interface to host, DMA control, buffering, parity logic physical device control often piggy-backed in small format devices striping software off-loaded from host to array controller no applications modifications no reduction of host performance 20

Array Controller String Controller String Controller String Controller String Controller String Controller String Controller... System Availability: Orthogonal RAIDs Data Recovery Group: unit of data redundancy Redundant Support Components: fans, power supplies, controller, cables End to End Data Integrity: internal parity protected data paths 21