High Performance Computing Course Notes 2007-2008 High Performance Storage.

Slides:



Advertisements
Similar presentations
Redundant Array of Independent Disks (RAID) Striping of data across multiple media for expansion, performance and reliability.
Advertisements

RAID A RRAYS Redundant Array of Inexpensive Discs.
RAID Oh yes Whats RAID? Redundant Array (of) Independent Disks. A scheme involving multiple disks which replicates data across multiple drives. Methods.
RAID: Redundant Array of Inexpensive Disks Supplemental Material not in book.
NAS vs. SAN 10/2010 Palestinian Land Authority IT Department By Nahreen Ameen 1.
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
RAID Redundant Arrays of Inexpensive Disks –Using lots of disk drives improves: Performance Reliability –Alternative: Specialized, high-performance hardware.
1 CSC 486/586 Network Storage. 2 Objectives Familiarization with network data storage technologies Understanding of RAID concepts and RAID levels Discuss.
R.A.I.D. Copyright © 2005 by James Hug Redundant Array of Independent (or Inexpensive) Disks.
Chapter 3 Presented by: Anupam Mittal.  Data protection: Concept of RAID and its Components Data Protection: RAID - 2.
2P13 Week 11. A+ Guide to Managing and Maintaining your PC, 6e2 RAID Controllers Redundant Array of Independent (or Inexpensive) Disks Level 0 -- Striped.
Chapter 5: Server Hardware and Availability. Hardware Reliability and LAN The more reliable a component, the more expensive it is. Server hardware is.
CSCE 212 Chapter 8 Storage, Networks, and Other Peripherals Instructor: Jason D. Bakos.
REDUNDANT ARRAY OF INEXPENSIVE DISCS RAID. What is RAID ? RAID is an acronym for Redundant Array of Independent Drives (or Disks), also known as Redundant.
1 Recap (RAID and Storage Architectures). 2 RAID To increase the availability and the performance (bandwidth) of a storage system, instead of a single.
1 Lecture 23: Multiprocessors Today’s topics:  RAID  Multiprocessor taxonomy  Snooping-based cache coherence protocol.
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition Mass-Storage Systems Revised Tao Yang.
CSE 451: Operating Systems Winter 2010 Module 13 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura.
Module – 7 network-attached storage (NAS)
© 2009 IBM Corporation Statements of IBM future plans and directions are provided for information purposes only. Plans and direction are subject to change.
Servers Redundant Array of Inexpensive Disks (RAID) –A group of hard disks is called a disk array FIGURE Server with redundant NICs.
Storage Networking. Storage Trends Storage growth Need for storage flexibility Simplify and automate management Continuous availability is required.
Session 3 Windows Platform Dina Alkhoudari. Learning Objectives Understanding Server Storage Technologies Direct Attached Storage DAS Network-Attached.
Data Storage Willis Kim 14 May Types of storages Direct Attached Storage – storage hardware that connects to a single server Direct Attached Storage.
Secondary Storage Unit 013: Systems Architecture Workbook: Secondary Storage 1G.
Storage System: RAID Questions answered in this lecture: What is RAID? How does one trade-off between: performance, capacity, and reliability? What is.
Chapter 6 RAID. Chapter 6 — Storage and Other I/O Topics — 2 RAID Redundant Array of Inexpensive (Independent) Disks Use multiple smaller disks (c.f.
CS 352 : Computer Organization and Design University of Wisconsin-Eau Claire Dan Ernst Storage Systems.
CSE 321b Computer Organization (2) تنظيم الحاسب (2) 3 rd year, Computer Engineering Winter 2015 Lecture #4 Dr. Hazem Ibrahim Shehata Dept. of Computer.
Redundant Array of Independent Disks
N-Tier Client/Server Architectures Chapter 4 Server - RAID Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept RAID – Redundant Array.
I/O – Chapter 8 Introduction Disk Storage and Dependability – 8.2 Buses and other connectors – 8.4 I/O performance measures – 8.6.
1 Chapter 7: Storage Systems Introduction Magnetic disks Buses RAID: Redundant Arrays of Inexpensive Disks.
Disk Access. DISK STRUCTURE Sector: Smallest unit of data transfer from/to disk; 512B 2/4/8 adjacent sectors transferred together: Blocks Read/write heads.
Redundant Array of Inexpensive Disks aka Redundant Array of Independent Disks (RAID) Modified from CCT slides.
Page 110/12/2015 CSE 30341: Operating Systems Principles Network-Attached Storage  Network-attached storage (NAS) is storage made available over a network.
Mark A. Magumba Storage Management. What is storage An electronic place where computer may store data and instructions for retrieval The objective of.
"1"1 Introduction to Managing Data " Describe problems associated with managing large numbers of disks " List requirements for easily managing large amounts.
The concept of RAID in Databases By Junaid Ali Siddiqui.
RAID Disk Arrays Hank Levy. 212/5/2015 Basic Problems Disks are improving, but much less fast than CPUs We can use multiple disks for improving performance.
RAID Systems Ver.2.0 Jan 09, 2005 Syam. RAID Primer Redundant Array of Inexpensive Disks random, real-time, redundant, array, assembly, interconnected,
RAID and RAS CIS 485 SAU SUJA SOLOMON TIFFANY STACIE.
Install, configure and test ICT Networks
1 Lecture 27: Disks Today’s topics:  Disk basics  RAID  Research topics.
Storage Networking. Storage Trends Storage grows %/year, gets more complicated It’s necessary to pool storage for flexibility Intelligent storage.
1 CEG 2400 Fall 2012 Network Servers. 2 Network Servers Critical Network servers – Contain redundant components Power supplies Fans Memory CPU Hard Drives.
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
Network-Attached Storage. Network-attached storage devices Attached to a local area network, generally an Ethernet-based network environment.
Silberschatz, Galvin and Gagne ©2013 Operating System Concepts – 9 th Edition Chapter 10: Mass-Storage Systems.
CSE 451: Operating Systems Spring 2010 Module 18 Redundant Arrays of Inexpensive Disks (RAID) John Zahorjan Allen Center 534.
CMPE Database Systems Workshop June 16 Class Meeting
CS 554: Advanced Database System Notes 02: Hardware
CSE 451: Operating Systems Spring 2006 Module 18 Redundant Arrays of Inexpensive Disks (RAID) John Zahorjan Allen Center.
RAID Disk Arrays Hank Levy 1.
RAID RAID Mukesh N Tekwani
RAID Disk Arrays Hank Levy 1.
CSE 451: Operating Systems Autumn 2010 Module 19 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
CSE 451: Operating Systems Winter 2009 Module 13 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.
UNIT IV RAID.
Mark Zbikowski and Gary Kimura
CSE 451: Operating Systems Autumn 2004 Redundant Arrays of Inexpensive Disks (RAID) Hank Levy 1.
CSE 451: Operating Systems Winter 2012 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.
CSE 451: Operating Systems Winter 2007 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
CSE 451: Operating Systems Autumn 2009 Module 19 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
RAID Disk Arrays Hank Levy 1.
RAID RAID Mukesh N Tekwani April 23, 2019
IT 344: Operating Systems Winter 2007 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Chia-Chi Teng CTB
Storage Management Lecture 7.
CSE 451: Operating Systems Winter 2006 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
CS 295: Modern Systems Organizing Storage Devices
Presentation transcript:

High Performance Computing Course Notes High Performance Storage

2 Computer Science, University of Warwick Storage devices  Primary storage:  register (1 CPU cycle, a few ns)  Cache ( cycles, us)  Main memory Local main memory (0.2-4us) NUMA (2-10xlocal memory)  Secondary storage:  Magnetic disk (2-20ms)  Solid state disk ( ms)  Cache in storage controller ( ms)  Tertiary storage  Removable media: tapes, floppies, CDs (ms-minutes)  Tape library (few seconds – few minutes)

3 Computer Science, University of Warwick Hard disk vs. solid state drive a) 2.5-inch hard disk b) solid state drive

4 Computer Science, University of Warwick Tape library

5 Computer Science, University of Warwick Disks

6 Disk failure and metrics  mean time between failures (MTBF): Mean time between failures (MTBF) is the average time between failures of a disk MTBF=  (downtime-uptime)/number-of-failures  Annual failure rate (AFR): number of failures per year AFR=running-hours-per-year/MTBF AFR disks =N disks *AFR disk

7 Computer Science, University of Warwick Solutions for disk failures  Redundancy  Replication (mirroring)  Partial Redundancy  Parity information

8 Computer Science, University of Warwick RAID  RAID: Redundant Arrays of Inexpensive Disks  Goals: increased data reliability and increased I/O performance  Main concepts in RAID  Mirroring  stripping  parity  Advantages:  High capability  High performance: data stripe  Graceful degrading  One disk fails, only that disk needs to be replaced

9 Computer Science, University of Warwick RAID  Disadvantage: failures AFR disks =N disks *AFR disk  Solution  Redundancy: 1) replication/mirroring: need more space 2) parity: recover from single disk failure; need more operations to maintain parity info and recover

10 Computer Science, University of Warwick Parity  Parity calculation is performed using “XOR”.  XOR operator is "true" if and only if one of its operands is true  Property of XOR: If D p =D 1 XOR …D k … XOR D n, then D k = D p XOR D 1 … D k-1 XOR D k+1 …XOR D n  Therefore, if any data is lost, we can recover the data from parity and the remaining data  Advantages: only one of the "N+1" drives contains redundancy information  Disadvantages: parity information has to be computed every time the data is updated

11 Computer Science, University of Warwick Disk arrays taxonomy  RAID levels  0: stripping without redundancy  1: full copy mirroring  2: Hamming-code  3: separate disk for parity  4: data of a file are put in a single disk  5: rotated distributed parity  6: double parity They are just classifications rather than a ordered list

12 Computer Science, University of Warwick RAID levels  RAID0  Stripped without redundancy  Data can be read off in parallel  Any disk failure destroys the entire array  RAID1  Mirrored  Array continues to operate so long as at least one drive is functioning

13 Computer Science, University of Warwick  RAID3  Striped set with dedicated parity  single parity disk is a bottleneck for writing  Byte-level striping (typically under 1k)  RAID4  Identical to RAID 3 but does block-level striping instead of byte-level striping  The block can be of any size

14 Computer Science, University of Warwick  RAID5  Striped set with distributed parity  the array is not destroyed by a single drive failure  Upon drive failure, any subsequent reads can be calculated from the distributed parity  The array will have data loss in the event of a second drive failure

15 Computer Science, University of Warwick  RAID6  Striped set with dual parity.  Provides fault tolerance from two drive failures

16 Computer Science, University of Warwick

17 Computer Science, University of Warwick

18 Computer Science, University of Warwick Network Attached Storage (NAS)  Follows a client/server design  A NAS head acts as the interface between the NAS and network clients  The NAS appears on the network as a single "node" that is the IP address of the head device  Clients access a NAS over an Ethernet connection  The NAS devices require no monitor, keyboard or mouse and run an embedded os  NAS uses file-based application protocols such as NFS (Network File System) and CIFS (Common Internet File System)

19 Computer Science, University of Warwick Storage Area Networks (SANs)  An architecture to attach remote computer storage devices to servers in such a way that the devices appear as locally attached to the OS  The data is accessed in blocks  Use FibreChannel protocol to access data

20 Computer Science, University of Warwick NAS vs. SAN