Copyright © Curt Hill, 2003-2010 RAID What every server wants!

Slides:



Advertisements
Similar presentations
A Case for Redundant Arrays Of Inexpensive Disks Paper By David A Patterson Garth Gibson Randy H Katz University of California Berkeley.
Advertisements

A CASE FOR REDUNDANT ARRAYS OF INEXPENSIVE DISKS (RAID) D. A. Patterson, G. A. Gibson, R. H. Katz University of California, Berkeley.
What is RAID Redundant Array of Independent Disks.
1 Lecture 18: RAID n I/O bottleneck n JBOD and SLED n striping and mirroring n classic RAID levels: 1 – 5 n additional RAID levels: 6, 0+1, 10 n RAID usage.
1 Jason Drown Mark Rodden (Redundant Array of Inexpensive Disks) RAID.
Faculty of Information Technology Department of Computer Science Computer Organization Chapter 7 External Memory Mohammad Sharaf.
RAID (Redundant Arrays of Independent Disks). Disk organization technique that manages a large number of disks, providing a view of a single disk of High.
RAID A RRAYS Redundant Array of Inexpensive Discs.
RAID Oh yes Whats RAID? Redundant Array (of) Independent Disks. A scheme involving multiple disks which replicates data across multiple drives. Methods.
RAID Redundant Array of Independent Disks
 RAID stands for Redundant Array of Independent Disks  A system of arranging multiple disks for redundancy (or performance)  Term first coined in 1987.
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
RAID- Redundant Array of Inexpensive Drives. Purpose Provide faster data access and larger storage Provide data redundancy.
RAID Redundant Arrays of Inexpensive Disks –Using lots of disk drives improves: Performance Reliability –Alternative: Specialized, high-performance hardware.
R.A.I.D. Copyright © 2005 by James Hug Redundant Array of Independent (or Inexpensive) Disks.
Lecture 36: Chapter 6 Today’s topic –RAID 1. RAID Redundant Array of Inexpensive (Independent) Disks –Use multiple smaller disks (c.f. one large disk)
Sean Traber CS-147 Fall  7.9 RAID  RAID Level 0  RAID Level 1  RAID Level 2  RAID Level 3  RAID Level 4 
REDUNDANT ARRAY OF INEXPENSIVE DISCS RAID. What is RAID ? RAID is an acronym for Redundant Array of Independent Drives (or Disks), also known as Redundant.
RAID Technology CS350 Computer Organization Section 2 Larkin Young Rob Deaderick Amos Painter Josh Ellis.
Other Disk Details. 2 Disk Formatting After manufacturing disk has no information –Is stack of platters coated with magnetizable metal oxide Before use,
RAID Systems CS Introduction to Operating Systems.
CSE 451: Operating Systems Winter 2010 Module 13 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura.
ICOM 6005 – Database Management Systems Design Dr. Manuel Rodríguez-Martínez Electrical and Computer Engineering Department Lecture 6 – RAID ©Manuel Rodriguez.
Chapter 6 RAID. Chapter 6 — Storage and Other I/O Topics — 2 RAID Redundant Array of Inexpensive (Independent) Disks Use multiple smaller disks (c.f.
RAID Shuli Han COSC 573 Presentation.
CS 352 : Computer Organization and Design University of Wisconsin-Eau Claire Dan Ernst Storage Systems.
Two or more disks Capacity is the same as the total capacity of the drives in the array No fault tolerance-risk of data loss is proportional to the number.
Day 10 Hardware Fault Tolerance RAID. High availability All servers should be on UPSs –2 Types Smart UPS –Serial cable connects from UPS to computer.
N-Tier Client/Server Architectures Chapter 4 Server - RAID Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept RAID – Redundant Array.
RAID COP 5611 Advanced Operating Systems Adapted from Andy Wang’s slides at FSU.
Lecture 9 of Advanced Databases Storage and File Structure (Part II) Instructor: Mr.Ahmed Al Astal.
Redundant Array of Inexpensive Disks aka Redundant Array of Independent Disks (RAID) Modified from CCT slides.
CSI-09 COMMUNICATION TECHNOLOGY FAULT TOLERANCE AUTHOR: V.V. SUBRAHMANYAM.
RAID REDUNDANT ARRAY OF INEXPENSIVE DISKS. Why RAID?
RAID SECTION (2.3.5) ASHLEY BAILEY SEYEDFARAZ YASROBI GOKUL SHANKAR.
Redundant Array of Independent Disks.  Many systems today need to store many terabytes of data.  Don’t want to use single, large disk  too expensive.
Copyright © Curt Hill Error detection and correction Techniques to Increase the Reliability.
The concept of RAID in Databases By Junaid Ali Siddiqui.
RAID Disk Arrays Hank Levy. 212/5/2015 Basic Problems Disks are improving, but much less fast than CPUs We can use multiple disks for improving performance.
RAID Systems Ver.2.0 Jan 09, 2005 Syam. RAID Primer Redundant Array of Inexpensive Disks random, real-time, redundant, array, assembly, interconnected,
Part IV I/O System Chapter 12: Mass Storage Structure.
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
RAID Technology By: Adarsha A,S 1BY08A03. Overview What is RAID Technology? What is RAID Technology? History of RAID History of RAID Techniques/Methods.
RAID TECHNOLOGY RASHMI ACHARYA CSE(A) RG NO
Network-Attached Storage. Network-attached storage devices Attached to a local area network, generally an Ethernet-based network environment.
CSE 451: Operating Systems Spring 2010 Module 18 Redundant Arrays of Inexpensive Disks (RAID) John Zahorjan Allen Center 534.
CS Introduction to Operating Systems
RAID.
Multiple Platters.
What every server wants!
RAID Non-Redundant (RAID Level 0) has the lowest cost of any RAID
CSE 451: Operating Systems Spring 2006 Module 18 Redundant Arrays of Inexpensive Disks (RAID) John Zahorjan Allen Center.
RAID Disk Arrays Hank Levy 1.
RAID RAID Mukesh N Tekwani
ICOM 6005 – Database Management Systems Design
RAID Disk Arrays Hank Levy 1.
CSE 451: Operating Systems Spring 2005 Module 17 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
CSE 451: Operating Systems Winter 2009 Module 13 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.
TECHNICAL SEMINAR PRESENTATION
UNIT IV RAID.
Mark Zbikowski and Gary Kimura
CSE 451: Operating Systems Autumn 2004 Redundant Arrays of Inexpensive Disks (RAID) Hank Levy 1.
CSE 451: Operating Systems Winter 2012 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.
CSE 451: Operating Systems Winter 2007 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
CSE 451: Operating Systems Autumn 2009 Module 19 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
RAID Disk Arrays Hank Levy 1.
RAID RAID Mukesh N Tekwani April 23, 2019
CSE 451: Operating Systems Winter 2004 Module 17 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
IT 344: Operating Systems Winter 2007 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Chia-Chi Teng CTB
CSE 451: Operating Systems Winter 2006 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
Presentation transcript:

Copyright © Curt Hill, RAID What every server wants!

Copyright © Curt Hill, History I In the late part of the 1970s and there were two types of disks: –Hard –Floppy The hard drives were only used on mainframes or minis –Quite expensive The floppys were mostly used on personal computers In about 1984 hard drives started appearing on IBM personal computers

Copyright © Curt Hill, History II Now there were two types of hard drives: –Professional Large M Expensive –Amateur Small: 5-50 M Inexpensive

Copyright © Curt Hill, History III At this point economics takes charge The large disks were a small market, typically 10s of thousands The small disks were mass market, typically 10s of millions What happens to the prices?

Copyright © Curt Hill, History IV Someone gets a bright idea: –Should I buy one 500 M professional disk for $50,000 or 10 50M amateur disks at $100 each? RAID is born: Redundant Array Inexpensive Disks –Later Redundant Array Independent Disks There might even be a performance improvement if I can access the array independently However, we need some type of controller to treat the 10 disks as if it were one

Copyright © Curt Hill, Issues Redundancy –If the small is less reliable or if the data is so important we need multiple copies for safety Speed –Increase speed by writing part of the data to one drive and another part to another at the same time Versions –There have been many, termed Levels –Currently level 0 through level 6

Copyright © Curt Hill, Redundancy Multiple copies allows one disk to crash without losing data Two forms: –Mirroring AKA Shadowing, Duplexing –Error Correction Codes

Copyright © Curt Hill, Mirroring Write all the data twice –Once to a disk and its mirror –This is done simultaneously, so no extra delay A read only has to wait for the faster of the two If a disk crashes, no rebuild is needed since the other disk may just be copied High disk overhead, needs two disks to store one disk worth of data Can read different parts of the two at same time to increase speed

Copyright © Curt Hill, Error Correction Code Even more history and background is needed here See the ECC.ppt presentationECC.ppt Many of the EC codes were studied by Hamming of Bell Labs –Thus known as Hamming codes

Copyright © Curt Hill, ECC Instead of mirroring which requires double disk space use an ECC Conceptually: –The eight bit data placed on eight separate drives and the four bit ECC on another –Any one disk that fails may be recreated from the rest

Copyright © Curt Hill, Speed Speed requires parallelism –Do two things at once With a mirrored disk read the front half from one and the back half from the other Transfer time cut in half This leads to a more general approach: stripes

Copyright © Curt Hill, Stripes Cut the data into stripes If you have N disks –Partition into N pieces –Read or write N pieces at a time Best if each piece goes to a separate controller

Copyright © Curt Hill, Controllers Controllers may be hardware or software or both Some levels make a software controller a problem

Copyright © Curt Hill, RAID Levels Originally specified with six levels –0-5 Some of these are seldom used and new ones have been added

Copyright © Curt Hill, Level 0 Striped disk array Minimum of two disks No redundancy or error checking –If a drive is lost all the data is lost –Only RAID that does not protect data Real RAID or not?

Copyright © Curt Hill, Level 0 - Striping A1A2A3A4 B1B2B3B4 C1C2C3C4 Four stripes Each block of data is written in four pieces No redundancy

Copyright © Curt Hill, Level 1 Mirroring Minimum of two disks 100% redundancy –A rebuilt disk is just a copy, not computed Simple controller Cannot be expanded on the fly

Copyright © Curt Hill, Level 1 - Mirroring AAEE BBFF CCGG Four disks, each is a mirror of another Each block of data is written twice Need twice as much space

Copyright © Curt Hill, Level 2 Striped disk at bit level ECC provides the redundancy Minimum of seven disks for storing 4 bit word Not commercially viable Number of parity disks is proportional to log of number of data disks Not very flexible

Copyright © Curt Hill, Level 2 – Striping with ECC A1A2A3A4 B1B2B3B4 C1C2C3C4 Four stripes of data protected by three of ECC ECC is computed on the fly EAx EBx EAzEAy EByEBz ECxECyECz

Copyright © Curt Hill, Level 3 Striped disk array with one ECC disk Each block is striped across disks One disk may fail without diminishing throughput Minimum of three disks High data rates Controller should be in hardware, not just software

Copyright © Curt Hill, Level 3 – Striping with ECC A1A2A3A4 B1B2B3B4 C1C2C3C4 Four stripes of data protected by three of ECC ECC is computed on the fly ECCA ECCB ECCC

Copyright © Curt Hill, Level 4 Similar to level 3 except –Blocks are not subdivided –Different block are written to different disks Minimum of three disks Controller is complex –Should be hardware Not easy to rebuild in case of failure

Copyright © Curt Hill, Level 5 Striped disk array with distributed ECC blocks Each disk stores both data and ECC –A disk stores ECC data for other disks Minimum of three disks Any single disk failure will result in no loss of data Most complex controller design

Copyright © Curt Hill, Level 5 – Striping with Distributed ECC A1 A2 A3 A4 B1 B2 B3 B5 C1 C2 C5 C4 Four stripes of data Each ECC protects other four No disks are only data or only ECC ECC5 ECC4 ECC3 ECC2 ECC1D1 D5 D4 D3 E2 E3 E4 E5

Copyright © Curt Hill, Level 6 Striped disk array –Similar to five except two independent ECC schemes –ECC blocks distributed among data disks Minimum of four disks May have multiple disk failures without loss of data

Copyright © Curt Hill, Level 6 – Striping with Two Distributed ECCs A1 A2 A3 B1 B2 B3 C1 C3 C2 Four stripes of data Two types of ECC each of which protects a different group No disks are only data or only ECC ECC4 ECC3 ECC2 ECC1 D3 D2 D1 ECCb ECCc ECCdECCa

Copyright © Curt Hill, Others Several combinations of these exists –Mirroring a striped disk 10 –Striping a mirrored disk 53 –Level 0 and 3 –Level 0 whose stripes are level 3 arrays

What is coming? As disk arrays get larger the likelihood of a double failure becomes significant Although no product yet, it seems likely that a triple parity scheme is inevitable Copyright © Curt Hill,