RAID Redundant Arrays of Independent Disks Courtesy of Satya, 15-412 Fall 99.

Slides:



Advertisements
Similar presentations
A Case for Redundant Arrays Of Inexpensive Disks Paper By David A Patterson Garth Gibson Randy H Katz University of California Berkeley.
Advertisements

Disk Arrays COEN 180. Large Storage Systems Collection of disks to store large amount of data. Performance advantage: Each drive can satisfy only so many.
Redundant Array of Independent Disks (RAID) Striping of data across multiple media for expansion, performance and reliability.
A CASE FOR REDUNDANT ARRAYS OF INEXPENSIVE DISKS (RAID) D. A. Patterson, G. A. Gibson, R. H. Katz University of California, Berkeley.
I/O Chapter 8. Outline Introduction Disk Storage and Dependability – 8.2 Buses and other connectors – 8.4 I/O performance measures – 8.6.
CS224 Spring 2011 Computer Organization CS224 Chapter 6A: Disk Systems With thanks to M.J. Irwin, D. Patterson, and J. Hennessy for some lecture slide.
RAID Redundant Array of Inexpensive Disks Presented by Greg Briggs.
Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved.
What is RAID Redundant Array of Independent Disks.
IT 344: Operating Systems Winter 2007 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Chia-Chi Teng CTB 265.
1 COMP 206: Computer Architecture and Implementation Montek Singh Mon., Nov. 25, 2002 Topic: Storage Systems (Disk Technology) – contd.
CSE 451: Operating Systems Spring 2012 Module 20 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570 ©
RAID (Redundant Arrays of Independent Disks). Disk organization technique that manages a large number of disks, providing a view of a single disk of High.
RAID Redundant Array of Independent Disks
CSCE430/830 Computer Architecture
RAID- Redundant Array of Inexpensive Drives. Purpose Provide faster data access and larger storage Provide data redundancy.
RAID Redundant Arrays of Inexpensive Disks –Using lots of disk drives improves: Performance Reliability –Alternative: Specialized, high-performance hardware.
CSE521: Introduction to Computer Architecture Mazin Yousif I/O Subsystem RAID (Redundant Array of Independent Disks)
Lecture 36: Chapter 6 Today’s topic –RAID 1. RAID Redundant Array of Inexpensive (Independent) Disks –Use multiple smaller disks (c.f. one large disk)
REDUNDANT ARRAY OF INEXPENSIVE DISCS RAID. What is RAID ? RAID is an acronym for Redundant Array of Independent Drives (or Disks), also known as Redundant.
Reliability of Disk Systems. Reliability So far, we looked at ways to improve the performance of disk systems. Next, we will look at ways to improve the.
EECC551 - Shaaban #1 Lec # 13 Winter Magnetic Disk CharacteristicsMagnetic Disk Characteristics I/O Connection StructureI/O Connection Structure.
1 Recap (RAID and Storage Architectures). 2 RAID To increase the availability and the performance (bandwidth) of a storage system, instead of a single.
Computer ArchitectureFall 2007 © November 28, 2007 Karem A. Sakallah Lecture 24 Disk IO and RAID CS : Computer Architecture.
Other Disk Details. 2 Disk Formatting After manufacturing disk has no information –Is stack of platters coated with magnetizable metal oxide Before use,
Lecture 17 I/O Optimization. Disk Organization Tracks: concentric rings around disk surface Sectors: arc of track, minimum unit of transfer Cylinder:
1 Lecture 26: Storage Systems Topics: Storage Systems (Chapter 6), other innovations Final exam stats:  Highest: 95  Mean: 70, Median: 73  Toughest.
Lecture 3: A Case for RAID (Part 1) Prof. Shahram Ghandeharizadeh Computer Science Department University of Southern California.
I/O Systems and Storage Systems May 22, 2000 Instructor: Gary Kimura.
RAID Systems CS Introduction to Operating Systems.
CSE 451: Operating Systems Winter 2010 Module 13 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura.
Servers Redundant Array of Inexpensive Disks (RAID) –A group of hard disks is called a disk array FIGURE Server with redundant NICs.
Redundant Array of Inexpensive Disks (RAID). Redundant Arrays of Disks Files are "striped" across multiple spindles Redundancy yields high data availability.
ICOM 6005 – Database Management Systems Design Dr. Manuel Rodríguez-Martínez Electrical and Computer Engineering Department Lecture 6 – RAID ©Manuel Rodriguez.
Chapter 6 RAID. Chapter 6 — Storage and Other I/O Topics — 2 RAID Redundant Array of Inexpensive (Independent) Disks Use multiple smaller disks (c.f.
Lecture 4 1 Reliability vs Availability Reliability: Is anything broken? Availability: Is the system still available to the user?
CS 352 : Computer Organization and Design University of Wisconsin-Eau Claire Dan Ernst Storage Systems.
Redundant Array of Independent Disks
RAID: High-Performance, Reliable Secondary Storage Mei Qing & Chaoxia Liao Nov. 20, 2003.
Two or more disks Capacity is the same as the total capacity of the drives in the array No fault tolerance-risk of data loss is proportional to the number.
I/O – Chapter 8 Introduction Disk Storage and Dependability – 8.2 Buses and other connectors – 8.4 I/O performance measures – 8.6.
1 Chapter 7: Storage Systems Introduction Magnetic disks Buses RAID: Redundant Arrays of Inexpensive Disks.
RAID COP 5611 Advanced Operating Systems Adapted from Andy Wang’s slides at FSU.
Lecture 9 of Advanced Databases Storage and File Structure (Part II) Instructor: Mr.Ahmed Al Astal.
Redundant Array of Inexpensive Disks aka Redundant Array of Independent Disks (RAID) Modified from CCT slides.
RAID REDUNDANT ARRAY OF INEXPENSIVE DISKS. Why RAID?
1/14/2005Yan Huang - CSCI5330 Database Implementation – Storage and File Structure Storage and File Structure.
Redundant Array of Independent Disks.  Many systems today need to store many terabytes of data.  Don’t want to use single, large disk  too expensive.
RAID Disk Arrays Hank Levy. 212/5/2015 Basic Problems Disks are improving, but much less fast than CPUs We can use multiple disks for improving performance.
1 Lecture 27: Disks Today’s topics:  Disk basics  RAID  Research topics.
Network-Attached Storage. Network-attached storage devices Attached to a local area network, generally an Ethernet-based network environment.
I/O Errors 1 Computer Organization II © McQuain RAID Redundant Array of Inexpensive (Independent) Disks – Use multiple smaller disks (c.f.
RAID Redundant Arrays of Independent Disks
RAID Non-Redundant (RAID Level 0) has the lowest cost of any RAID
CSE 451: Operating Systems Spring 2006 Module 18 Redundant Arrays of Inexpensive Disks (RAID) John Zahorjan Allen Center.
RAID RAID Mukesh N Tekwani
ICOM 6005 – Database Management Systems Design
CSE 451: Operating Systems Spring 2005 Module 17 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
CSE 451: Operating Systems Autumn 2010 Module 19 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
CSE 451: Operating Systems Winter 2009 Module 13 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.
RAID Redundant Array of Inexpensive (Independent) Disks
Mark Zbikowski and Gary Kimura
CSE 451: Operating Systems Autumn 2004 Redundant Arrays of Inexpensive Disks (RAID) Hank Levy 1.
CSE 451: Operating Systems Winter 2012 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.
CSE 451: Operating Systems Winter 2007 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
CSE 451: Operating Systems Autumn 2009 Module 19 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
RAID RAID Mukesh N Tekwani April 23, 2019
CSE 451: Operating Systems Winter 2004 Module 17 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
IT 344: Operating Systems Winter 2007 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Chia-Chi Teng CTB
CSE 451: Operating Systems Winter 2006 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
Presentation transcript:

RAID Redundant Arrays of Independent Disks Courtesy of Satya, Fall 99

Motivation for Disk Arrays Typical memory bandwidths 150 MB/sec Typical disk bandwidths 10 MB/sec Result: I/O-bound applications limited by disk bandwidth (not just by disk latency!) Two common disk bandwidth-limited scenarios Scientific applications: one huge I/O-hungry app Servers: many concurrent requests with modest I/O demands

S 2,0 S 1,0 Solution: Exploit Parallelism Stripe the data across an array of disks many alternative striping strategies possible Example: consider a big file striped across N disks stripe width is S bytes hence each stripe unit is S/N bytes sequential read of S bytes at a time s 0,0 s 0,1 s 0,2 s 0,N-1 s 1,0 s 1,1 s 1,2 s 1,N-1 s 2,0 s 2,1 s 2,2 s 2,N-1 SSS S 0,0 S 2,1 S 1,1 S 0,1 S 2,2 S 1,2 S 0,2 S 2,N-1 S 1,N-1 S 0,N-1 Disk 0Disk 1Disk 2Disk N-1

Performance Benefit Sequential read or write of large file application (or I/O buffer cache) reads in multiples of S bytes controller performs parallel access of N disks aggregate bandwidth is N times individual disk bandwidth (assumes that disk is the bottleneck) Disk 0Disk 1Disk 2 Disk N-1 Array Controller x MB/s Nx MB/s S k,N-1 S k,2 S k,1 S k,0 SkSk

N concurrent small read or write requests randomly distributed across N drives (we hope!) common in database and Web server environments Disk 0Disk 1Disk 2 Disk N-1 Array Controller x MB/s Nx MB/s N independent requests req 0 req 1 req 2 req N-1

Reliability of Disk Arrays As number of disks grows, chances of at least one failing increases Reliability of N disks = (reliability of 1 disk) / N suppose each disk has MTTF of 50,000 hours (roughly 6 years before any given disk fails) then some disk in a 70-disk array will fail in (50,000 / 70) hours (roughly once a month!) Large arrays without redundancy too unreliable to be useful Redundant Arrays of Independent Disks (RAID)

RAID Approaches Many alternative approaches to achieving this redundancy RAID levels 1 through 5 hot sparing allows reconstruction concurrently with accesses Key metrics to evaluate alternatives wasted space due to redundancy likelihood of hot spots during heavy loads degradation of performance during repair

RAID Level 1 Also known as mirroring To read a block: read from either data disk or backup To write a block: write both data and backup disks failure model determines whether writes can occur in parallel Backups can be located far way: safeguard against site failure D1D1 D2D2 B2B2 B1B1 D3D3 B3B3

RAID Levels 2 & 3 These are bit-interleaved schemes In Raid Level 2, P contains memory-style ECC In Rail Level 3, P contains simple parity Rarely used today D1D1 D2D2 D3D3 P

RAID Level 4 Block-interleaved parity Wasted storage is small: one parity block for N data blocks Key problem: parity disk becomes a hot spot write access to parity disk on every write to any block D 0,0 D 1,0 D 2,0 D 3,0 D 0,1 D 1,1 D 2,1 D 3,1 D 0,2 D 1,2 D 2,2 D 3,2 P0P0 P1P1 P2P2 P3P3 D 0,0 D 0,1 D 0,2 = P 0

RAID Level 5 Rotated parity Wastage is small: same as in Raid 4 Parity update traffic is distributed across disks D 0,0 D 1,0 D 2,0 P3P3 D 0,1 D 1,1 P2P2 D 3,1 D 0,2 P1P1 D 2,2 D 3,2 P0P0 D 1,3 D 2,3 D 3,3 D 0,0 D 0,1 D 0,2 = P 0

RAID 5 Actions D D D P Fault-free Read D D D P Fault-free Write D D D P Degraded Read D D D P Degraded Write