CSE 451: Operating Systems Winter 2015 Module 14 Secondary Storage Mark Zbikowski Allen Center 476 © 2013 Gribble, Lazowska, Levy,

Slides:



Advertisements
Similar presentations
CS 6560: Operating Systems Design
Advertisements

Operating Systems ECE344 Ashvin Goel ECE University of Toronto Disks and RAID.
OPERATING SYSTEMS CS3530 Summer 2014 OPERATING SYSTEMS CS3530 Summer 2014 Input/Output System Chapter 9.
Disk Drivers May 10, 2000 Instructor: Gary Kimura.
Disks.
1 File System Implementations. 2 Overview The Low Level –disks –caching –RAM disks –RAIDs –disk head scheduling The higher level –file systems –directories.
Disks CS 416: Operating Systems Design, Spring 2001 Department of Computer Science Rutgers University
Avishai Wool lecture Introduction to Systems Programming Lecture 9 Input-Output Devices.
Secondary Storage CSCI 444/544 Operating Systems Fall 2008.
Secondary Storage Management Hank Levy. 8/7/20152 Secondary Storage • Secondary Storage is usually: –anything outside of “primary memory” –storage that.
Introduction to Database Systems 1 The Storage Hierarchy and Magnetic Disks Storage Technology: Topic 1.
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition, Chapter 12: Mass-Storage Systems.
12.1 Silberschatz, Galvin and Gagne ©2009 Operating System Concepts with Java – 8 th Edition Chapter 12: Mass-Storage Systems.
Disk and I/O Management
Device Management. So far… We have covered CPU and memory management Computing is not interesting without I/Os Device management: the OS component that.
CS 153 Design of Operating Systems Spring 2015 Lecture 20: File Systems.
Silberschatz, Galvin and Gagne ©2013 Operating System Concepts – 9 th Edition Chapter 10: Mass-Storage Systems.
Physical Storage and File Organization COMSATS INSTITUTE OF INFORMATION TECHNOLOGY, VEHARI.
IT 344: Operating Systems Winter 2010 Module 13 Secondary Storage Chia-Chi Teng CTB 265.
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition, Chapter 12: Mass-Storage Systems.
Disk Structure Disk drives are addressed as large one- dimensional arrays of logical blocks, where the logical block is the smallest unit of transfer.
Chapter 111 Chapter 11: Hardware (Slides by Hector Garcia-Molina,
Disks and Storage Systems
I/O Management and Disk Structure Introduction to Operating Systems: Module 14.
Silberschatz, Galvin, and Gagne  Applied Operating System Concepts Ref: Chap 12 Secondary Storage and I/O Systems.
Lecture 3 Page 1 CS 111 Online Disk Drives An especially important and complex form of I/O device Still the primary method of providing stable storage.
OPERATING SYSTEMS IO SYSTEMS. Categories of I/O Devices Human readable –Used to communicate with the user –Printers –Video display terminals Display Keyboard.
Disk Basics CS Introduction to Operating Systems.
1.  Disk Structure Disk Structure  Disk Scheduling Disk Scheduling  FCFS FCFS  SSTF SSTF  SCAN SCAN  C-SCAN C-SCAN  C-LOOK C-LOOK  Selecting a.
Operating Systems (CS 340 D) Princess Nora University Faculty of Computer & Information Systems Computer science Department.
Device Management Mark Stanovich Operating Systems COP 4610.
Chapter 9 I/O System. 2 Input/Output System I/O Major objectives are: Take an application I/O request and send it to the physical device. Then, take whatever.
CSE 451: Operating Systems Winter 2012 Secondary Storage Mark Zbikowski Gary Kimura.
COSC 6340: Disks 1 Disks and Files DBMS stores information on (“hard”) disks. This has major implications for DBMS design! » READ: transfer data from disk.
Device Management Andy Wang Operating Systems COP 4610 / CGS 5765.
CSE 451: Operating Systems Spring 2006 Module 14 From Physical to Logical: File Systems John Zahorjan Allen Center 534.
CSE 451: Operating Systems Spring 2010 Module 12.5 Secondary Storage John Zahorjan Allen Center 534.
Chapter 10: Mass-Storage Systems
Operating System (013022) Dr. H. Iwidat
Sarah Diesburg Operating Systems CS 3430
Disks and RAID.
Database Management Systems (CS 564)
Operating Systems Disk Scheduling A. Frank - P. Weisberg.
Secondary Storage Secondary storage typically: Characteristics:
CS703 - Advanced Operating Systems
Disk Scheduling Algorithms
IT 344: Operating Systems Winter 2008 Module 13 Secondary Storage
CSE 153 Design of Operating Systems Winter 2018
CSE 451: Operating Systems Winter 2006 Module 13 Secondary Storage
CSE 451: Operating Systems Autumn 2009 Module 13 Secondary Storage
CSE 451: Operating Systems Autumn 2003 Lecture 12 Secondary Storage
CSE 451: Operating Systems Winter 2007 Module 13 Secondary Storage
CSE 451: Operating Systems Spring 2006 Module 13 Secondary Storage
Secondary Storage Management Brian Bershad
Persistence: hard disk drive
Mass-Storage Systems.
CSE 451: Operating Systems Secondary Storage
CSE 451: Operating Systems Winter 2003 Lecture 12 Secondary Storage
Disks and Storage Systems
CSE 451: Operating Systems Winter 2009 Module 12 Secondary Storage
CSE 451: Operating Systems Spring 2005 Module 13 Secondary Storage
Secondary Storage Management Hank Levy
CSE451 File System Introduction and Disk Drivers Autumn 2002
CSE 451: Operating Systems Autumn 2004 Secondary Storage
CSE 451: Operating Systems Winter 2004 Module 13 Secondary Storage
CSE 451: Operating Systems Spring 2006 Module 14 From Physical to Logical: File Systems John Zahorjan Allen Center
Lecture 10: Magnetic Disks
Operating Systems 2019 Spring by Euiseong Seo
CSE 451: Operating Systems Spring 2007 Module 11 Secondary Storage
Andy Wang Operating Systems COP 4610 / CGS 5765
Presentation transcript:

CSE 451: Operating Systems Winter 2015 Module 14 Secondary Storage Mark Zbikowski Allen Center 476 © 2013 Gribble, Lazowska, Levy, Zahorjan

2 Secondary storage Secondary storage typically: –is anything that is outside of “primary memory” –does not permit direct execution of instructions or data retrieval via machine load/store instructions Characteristics: –it’s large: GB –it’s cheap: $0.05-$0.10/GB for hard drives from Dell (at 2TB size) –it’s persistent: data survives power loss –it’s slow: milliseconds to access why is this slow?? –it does fail, if rarely big failures (drive dies; MTBF ~3 years) –if you have 100K drives and MTBF is 3 years, that’s 1 “big failure” every 15 minutes! little failures (read/write errors, one byte in )

© 2013 Gribble, Lazowska, Levy, Zahorjan 3 Another trip down memory lane … IBM 2314 About the size of 6 refrigerators 8 x 29MB (M!) Required similar- sized air condx!.01% (not 1% –.01%!) the capacity of this $100 4”x6”x1” item

© 2013 Gribble, Lazowska, Levy, Zahorjan 4 Disk capacity, –doubled every 3+ years –25% improvement each year –factor of 10 every decade –Still exponential, but far less rapid than processor performance Disk capacity, 1990-recently –doubling every 12 months –100% improvement each year –factor of 1000 every decade –Capacity growth 10x as fast as processor performance! Disk trends

© 2013 Gribble, Lazowska, Levy, Zahorjan 5 Only a few years ago, we purchased disks by the megabyte (and it hurt!) Today, 1 GB (a billion bytes) costs $1 $0.50 $0.10- $0.05 from Dell (except you have to buy in increments of GB) –=> 1 TB costs $1K $500 $100, 1 PB costs $1M $500K $100K Technology is amazing –Flying a 747 6” above the ground –Reading/writing a strip of postage stamps But … –Jets do crash …

© 2013 Gribble, Lazowska, Levy, Zahorjan 6 Memory hierarchy Each level acts as a cache of lower levels CPU registers L1 cache L2 cache Primary Memory Secondary Storage Tertiary Storage 100 bytes 32KB 256KB 1GB 1TB 1PB 10 ms 1s-1hr < 1 ns 1 ns 4 ns 60 ns

© 2013 Gribble, Lazowska, Levy, Zahorjan 7 Memory hierarchy: distance analogy CPU registers L1 cache L2 cache Primary Memory Secondary Storage Tertiary Storage 1 minute 10 minutes 1.5 hours 2 years 2,000 years Pluto Andromeda “My head” “This room” “This building” Olympia seconds

© 2012 Gribble, Lazowska, Levy, Zahorjan 8 © 2004 Jim Gray, Microsoft Corporation

© 2013 Gribble, Lazowska, Levy, Zahorjan 9 Disks and the OS Disks are messy, messy devices –errors, bad blocks, missed seeks, etc. Job of OS is to hide this mess from higher-level software (disk hardware increasingly helps with this) –low-level device drivers (initiate a disk read, etc.) –higher-level abstractions (files, databases, etc.) –(note that modern disk drives do some of this masking for the OS) OS may provide different levels of disk access to different clients –physical disk block (surface, cylinder, sector) –disk logical block (disk block #) –file logical (filename, block or record or byte #)

© 2013 Gribble, Lazowska, Levy, Zahorjan 10 Physical disk structure Disk components –platters –surfaces –tracks –sectors –cylinders –arm –heads platter surface track sector cylinder arm head

© 2013 Gribble, Lazowska, Levy, Zahorjan 11 Disk performance Performance depends on a number of steps –seek: moving the disk arm to the correct cylinder depends on how fast disk arm can move –seek times aren’t diminishing very quickly (why?) –rotation (latency): waiting for the sector to rotate under head depends on rotation rate of disk –rates are increasing, but slowly (why?) –transfer: transferring data from surface into disk controller, and from there sending it back to host depends on density of bytes on disk –increasing, relatively quickly When the OS uses the disk, it tries to minimize the cost of all of these steps –particularly seeks and rotation

© 2013 Gribble, Lazowska, Levy, Zahorjan 12 Performance via disk layout OS may increase file block size in order to reduce seeking OS may seek to co-locate “related” items in order to reduce seeking –blocks of the same file –data and metadata for a file

© 2013 Gribble, Lazowska, Levy, Zahorjan 13 Performance via caching, pre-fetching Keep data or metadata in memory to reduce physical disk access –problem? If file access is sequential, fetch blocks into memory before requested

© 2013 Gribble, Lazowska, Levy, Zahorjan 14 Performance via disk scheduling Seeks are very expensive, so the OS attempts to schedule disk requests that are queued waiting for the disk –FCFS (do nothing) reasonable when load is low long waiting time for long request queues –SSTF (shortest seek time first) minimize arm movement (seek time), maximize request rate unfairly favors middle blocks –SCAN (elevator algorithm) service requests in one direction until done, then reverse skews wait times non-uniformly (why?) –C-SCAN like scan, but only go in one direction (typewriter) uniform wait times

© 2013 Gribble, Lazowska, Levy, Zahorjan 15 Interacting with disks In the old days… –OS would have to specify cylinder #, sector #, surface #, transfer size i.e., OS needs to know all of the disk parameters Modern disks are even more complicated –not all sectors are the same size, sectors are remapped, … –disk provides a higher-level interface, e.g., SCSI exports data as a logical array of blocks [0 … N] maps logical blocks to cylinder/surface/sector OS only needs to name logical block #, disk maps this to cylinder/surface/sector on-board cache as a result, physical parameters are hidden from OS –both good and bad

© 2013 Gribble, Lazowska, Levy, Zahorjan 16 Seagate Barracuda 3.5” disk drive 1Terabyte of storage (1000 GB) $100 4 platters, 8 disk heads 63 sectors (512 bytes) per track 16,383 cylinders (tracks) 164 Gbits / inch-squared (!) 7200 RPM 300 MB/second transfer 9 ms avg. seek, 4.5 ms avg. rotational latency 1 ms track-to-track seek 32 MB cache

© 2013 Gribble, Lazowska, Levy, Zahorjan 17 Solid state drives: disruption Hard drives are based on spinning magnetic platters –mechanics of drives determine performance characteristics sector addressable, not byte addressable capacity improving exponentially sequential bandwidth improving reasonably random access latency improving very slowly –cost dictated by massive economies of scale, and many decades of commercial development and optimization 17

© 2013 Gribble, Lazowska, Levy, Zahorjan 18 Solid state drives are based on NAND flash memory –no moving parts; performance characteristics driven by electronics and physics – more like RAM than spinning disk –relative technological newcomer, so costs are still quite high in comparison to hard drives, but dropping fast 18

© 2013 Gribble, Lazowska, Levy, Zahorjan 19 SSD performance: reads Reads –unit of read is a page, typically 4KB large –today’s SSD can typically handle 10,000 – 100,000 reads/s 0.01 – 0.1 ms read latency ( x better than disk seeks) MB/s read throughput (1-3x better than disk seq. thpt) 19

© 2013 Gribble, Lazowska, Levy, Zahorjan 20 SSD performance: writes Writes –flash media must be erased before it can be written to –unit of erase is a block, typically pages long usually takes 1-2ms to erase a block blocks can only be erased a certain number of times before they become unusable – typically 10,000 – 1,000,000 times –unit of write is a page writing a page can be 2-10x slower than reading a page Writing to an SSD is complicated –random write to existing block: read block, erase block, write back modified block leads to hard-drive like performance (300 random writes / s) –sequential writes to erased blocks: fast! SSD-read like performance ( MB/s) 20

© 2013 Gribble, Lazowska, Levy, Zahorjan 21 SSDs: dealing with erases, writes Lots of higher-level strategies can help hide the warts of an SSD –many of these work by virtualizing pages and blocks on the drive (i.e., exposing logical pages, not physical pages, to the rest of the computer) –wear-leveling: when writing, try to spread erases out evenly across physical blocks of of the SSD Intel promises 100GB/day x 5 years for its SSD drives –log-structured filesystems: convert random writes within a filesystem to log appends on the SSD (more later) –build drives out of arrays of SSDs, add lots of cache 21

© 2013 Gribble, Lazowska, Levy, Zahorjan 22 SSD cost Capacity –today, flash SSD costs ~$1.00/GB (down from $250 a year ago) 1TB drive costs around $1000 –1TB hard drive costs around $100 –Data on cost trends is a little sketchy and preliminary Energy –SSD is typically more energy efficient than a hard drive 1-2 watts to power an SSD ~10 watts to power a high performance hard drive –(can also buy a 1 watt lower-performance drive) 22