CSE 451: Operating Systems Spring 2006 Module 18 Redundant Arrays of Inexpensive Disks (RAID) John Zahorjan zahorjan@cs.washington.edu Allen Center.

Slides:

Advertisements

Similar presentations

Redundant Array of Independent Disks (RAID) Striping of data across multiple media for expansion, performance and reliability.

Advertisements

1 Lecture 18: RAID n I/O bottleneck n JBOD and SLED n striping and mirroring n classic RAID levels: 1 – 5 n additional RAID levels: 6, 0+1, 10 n RAID usage.

IT 344: Operating Systems Winter 2007 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Chia-Chi Teng CTB 265.

CSE 451: Operating Systems Spring 2012 Module 20 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570 ©

RAID (Redundant Arrays of Independent Disks). Disk organization technique that manages a large number of disks, providing a view of a single disk of High.

RAID Oh yes Whats RAID? Redundant Array (of) Independent Disks. A scheme involving multiple disks which replicates data across multiple drives. Methods.

RAID Redundant Array of Independent Disks

RAID- Redundant Array of Inexpensive Drives. Purpose Provide faster data access and larger storage Provide data redundancy.

RAID Redundant Arrays of Inexpensive Disks –Using lots of disk drives improves: Performance Reliability –Alternative: Specialized, high-performance hardware.

R.A.I.D. Copyright © 2005 by James Hug Redundant Array of Independent (or Inexpensive) Disks.

CSE 486/586 CSE 486/586 Distributed Systems Case Study: Facebook f4 Steve Ko Computer Sciences and Engineering University at Buffalo.

Lecture 36: Chapter 6 Today’s topic –RAID 1. RAID Redundant Array of Inexpensive (Independent) Disks –Use multiple smaller disks (c.f. one large disk)

Other Disk Details. 2 Disk Formatting After manufacturing disk has no information –Is stack of platters coated with magnetizable metal oxide Before use,

I/O Systems and Storage Systems May 22, 2000 Instructor: Gary Kimura.

RAID Systems CS Introduction to Operating Systems.

CSE 451: Operating Systems Winter 2010 Module 13 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura.

Servers Redundant Array of Inexpensive Disks (RAID) –A group of hard disks is called a disk array FIGURE Server with redundant NICs.

Storage System: RAID Questions answered in this lecture: What is RAID? How does one trade-off between: performance, capacity, and reliability? What is.

ICOM 6005 – Database Management Systems Design Dr. Manuel Rodríguez-Martínez Electrical and Computer Engineering Department Lecture 6 – RAID ©Manuel Rodriguez.

Storage Systems CSE 598d, Spring 2007 Lecture 5: Redundant Arrays of Inexpensive Disks Feb 8, 2007.

Chapter 6 RAID. Chapter 6 — Storage and Other I/O Topics — 2 RAID Redundant Array of Inexpensive (Independent) Disks Use multiple smaller disks (c.f.

CS 352 : Computer Organization and Design University of Wisconsin-Eau Claire Dan Ernst Storage Systems.

CSE 321b Computer Organization (2) تنظيم الحاسب (2) 3 rd year, Computer Engineering Winter 2015 Lecture #4 Dr. Hazem Ibrahim Shehata Dept. of Computer.

Redundant Array of Independent Disks

Two or more disks Capacity is the same as the total capacity of the drives in the array No fault tolerance-risk of data loss is proportional to the number.

N-Tier Client/Server Architectures Chapter 4 Server - RAID Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept RAID – Redundant Array.

I/O – Chapter 8 Introduction Disk Storage and Dependability – 8.2 Buses and other connectors – 8.4 I/O performance measures – 8.6.

Lecture 9 of Advanced Databases Storage and File Structure (Part II) Instructor: Mr.Ahmed Al Astal.

Redundant Array of Inexpensive Disks aka Redundant Array of Independent Disks (RAID) Modified from CCT slides.

Copyright © Curt Hill, RAID What every server wants!

Redundant Array of Independent Disks.  Many systems today need to store many terabytes of data.  Don’t want to use single, large disk  too expensive.

The concept of RAID in Databases By Junaid Ali Siddiqui.

RAID Disk Arrays Hank Levy. 212/5/2015 Basic Problems Disks are improving, but much less fast than CPUs We can use multiple disks for improving performance.

RAID Systems Ver.2.0 Jan 09, 2005 Syam. RAID Primer Redundant Array of Inexpensive Disks random, real-time, redundant, array, assembly, interconnected,

Network-Attached Storage. Network-attached storage devices Attached to a local area network, generally an Ethernet-based network environment.

I/O Errors 1 Computer Organization II © McQuain RAID Redundant Array of Inexpensive (Independent) Disks – Use multiple smaller disks (c.f.

CSE 451: Operating Systems Spring 2010 Module 18 Redundant Arrays of Inexpensive Disks (RAID) John Zahorjan Allen Center 534.

CS Introduction to Operating Systems

CMPE Database Systems Workshop June 16 Class Meeting

Multiple Platters.

RAID Redundant Arrays of Independent Disks

What every server wants!

Steve Ko Computer Sciences and Engineering University at Buffalo

Steve Ko Computer Sciences and Engineering University at Buffalo

RAID Non-Redundant (RAID Level 0) has the lowest cost of any RAID

Dave Eckhardt Disk Arrays Dave Eckhardt

Vladimir Stojanovic & Nicholas Weaver

CSE451 I/O Systems and the Full I/O Path Autumn 2002

RAID Disk Arrays Hank Levy 1.

RAID RAID Mukesh N Tekwani

ICOM 6005 – Database Management Systems Design

RAID Disk Arrays Hank Levy 1.

CSE 451: Operating Systems Spring 2005 Module 17 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.

CSE 451: Operating Systems Autumn 2010 Module 19 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.

CSE 451: Operating Systems Winter 2009 Module 13 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.

TECHNICAL SEMINAR PRESENTATION

RAID Redundant Array of Inexpensive (Independent) Disks

CSE 451: Operating Systems Spring 2006 Module 13 Secondary Storage

Mark Zbikowski and Gary Kimura

CSE 451: Operating Systems Autumn 2004 Redundant Arrays of Inexpensive Disks (RAID) Hank Levy 1.

CSE 451: Operating Systems Winter 2012 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.

CSE 451: Operating Systems Winter 2007 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.

CSE 451: Operating Systems Autumn 2009 Module 19 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.

RAID Disk Arrays Hank Levy 1.

RAID RAID Mukesh N Tekwani April 23, 2019

CSE 451: Operating Systems Winter 2004 Module 17 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.

IT 344: Operating Systems Winter 2007 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Chia-Chi Teng CTB

CSE 451: Operating Systems Winter 2006 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.

Presentation transcript:

CSE 451: Operating Systems Spring 2006 Module 18 Redundant Arrays of Inexpensive Disks (RAID) John Zahorjan zahorjan@cs.washington.edu Allen Center 534 1

Managing Physical Resources A single disk can be broken up into smaller pieces (partitions): Limit the damage of failures Unit of backup/migration Run different kinds of file systems What about making use of multiple disks? None of the file systems we’ve seen can span multiple disks Why? What about mounting multiple devices? More storage? Better performance? 11/12/2018 © 2006 Gribble, Lazowska, Levy, Zahorjan

© 2006 Gribble, Lazowska, Levy, Zahorjan Performance Disk transfer rates are improving, but much less fast than CPU performance We can use multiple disks to improve performance by striping files across multiple disks (placing parts of each file on a different disk), we can use parallel I/O to improve access time Striping reduces reliability 10 disks have about 1/10th the MTBF (mean time between failures) of one disk 11/12/2018 © 2006 Gribble, Lazowska, Levy, Zahorjan

© 2006 Gribble, Lazowska, Levy, Zahorjan Reliability It’s typically enough to be resilient to a single disk failure In theory, the odds that another disk fails while you’re replacing the first one are low The first time CSE ran a RAID it happened to us… To improve reliability, add redundant data to the disks We’ll see how in a moment So: Performance from striping Resilience from redundancy, which steals back some of the performance gain 11/12/2018 © 2006 Gribble, Lazowska, Levy, Zahorjan

© 2006 Gribble, Lazowska, Levy, Zahorjan RAID A RAID is a Redundant Array of Inexpensive Disks Disks are small and cheap, so it’s easy to put lots of disks (10s to 100s) in one box for increased storage, performance, and availability Data plus some redundant information is striped across the disks in some way How striping is done is key to performance and reliability 11/12/2018 © 2006 Gribble, Lazowska, Levy, Zahorjan

© 2006 Gribble, Lazowska, Levy, Zahorjan Some RAID tradeoffs Granularity fine-grained: stripe each file over all disks high throughput for the file limits transfer to 1 file at a time course-grained: stripe each file over only a few disks limits throughput for 1 file allows concurrent access to multiple files Redundancy uniformly distribute redundancy information on disks avoids load-balancing problems concentrate redundancy information on a small number of disks partition the disks into data disks and redundancy disks 11/12/2018 © 2006 Gribble, Lazowska, Levy, Zahorjan

RAID Level 0: Non-Redundant Striping RAID Level 0 is a non-redundant disk array Files are striped across disks, no redundant info High (single file) read throughput Best write throughput (no redundant info to write) Any disk failure results in data loss What is lost? 11/12/2018 © 2006 Gribble, Lazowska, Levy, Zahorjan

RAID Level 1: Mirrored Disks Files are striped across half the disks, and mirrored to the other half 2x space expansion Reads: Read from either copy Writes: Write both copies On failure, just use the surviving disk What is the effect on performance? How many simultaneous disk failures can be tolerated? 11/12/2018 © 2006 Gribble, Lazowska, Levy, Zahorjan

RAID Levels 2, 3, and 4: Striping + Parity Disk RAID levels 2, 3, and 4 use ECC (error correcting code) or parity disks E.g., each byte on the parity disk is a parity function of the corresponding bytes on all the other disks A large read accesses all the data disks A single block read accesses only one disk (RAID 4) A write updates one or more data disks plus the parity disk Resilient to single disk failures (How?) Better ECC  higher failure resilience  more parity disks 11/12/2018 © 2006 Gribble, Lazowska, Levy, Zahorjan

Refresher: What’s parity? To each byte, add a bit set so that the total number of 1’s is even Any single missing bit can be reconstructed (Why does memory parity not work quite this way?) Think of ECC as just being similar but fancier (more capable) 1 1 1 1 1 1 11/12/2018 © 2006 Gribble, Lazowska, Levy, Zahorjan

© 2006 Gribble, Lazowska, Levy, Zahorjan RAID Level 5 RAID Level 5 uses block interleaved distributed parity Like parity scheme, but distribute the parity info (as well as data) over all disks for each block, one disk holds the parity, and the other disks hold the data Significantly better performance parity disk is not a hot spot 11/12/2018 © 2006 Gribble, Lazowska, Levy, Zahorjan

Typical Implementation Controller Disks OS 11/12/2018 © 2006 Gribble, Lazowska, Levy, Zahorjan

JBOD (Just a Bunch Of Disks) Not really a RAID – no striping, no redundancy Blocks from disks are concatenated to form a single, logical device Allows use of disks of differing sizes 11/12/2018 © 2006 Gribble, Lazowska, Levy, Zahorjan

© 2006 Gribble, Lazowska, Levy, Zahorjan 11/12/2018 © 2006 Gribble, Lazowska, Levy, Zahorjan

© 2006 Gribble, Lazowska, Levy, Zahorjan 11/12/2018 © 2006 Gribble, Lazowska, Levy, Zahorjan

© 2006 Gribble, Lazowska, Levy, Zahorjan Final Issues If you’re running a RAID level with sufficient redundancy, do you need backup? What’s the difference between RAID and backup? Does RAID provide “sufficient” reliability? If you’re Amazon? Tier I Single path for power and cooling distribution, no redundant components, 99.671% availability. Tier II Single path for power and cooling distribution, redundant components, 99.741% availability. Tier III Multiple power and cooling distribution paths, but only one path active, redundant components, concurrently maintainable, 99.982% availability. Tier IV Multiple active power and cooling distribution paths, redundant components, fault tolerant, 99.995% availability. 11/12/2018 © 2006 Gribble, Lazowska, Levy, Zahorjan