IT 344: Operating Systems Winter 2008 Module 13 Secondary Storage

Slides:



Advertisements
Similar presentations
Disk Storage SystemsCSCE430/830 Disk Storage Systems CSCE430/830 Computer Architecture Lecturer: Prof. Hong Jiang Courtesy of Yifeng Zhu (U. Maine) Fall,
Advertisements

Operating Systems ECE344 Ashvin Goel ECE University of Toronto Disks and RAID.
Disk Drivers May 10, 2000 Instructor: Gary Kimura.
Disks.
1 File System Implementations. 2 Overview The Low Level –disks –caching –RAM disks –RAIDs –disk head scheduling The higher level –file systems –directories.
Disks CS 416: Operating Systems Design, Spring 2001 Department of Computer Science Rutgers University
Secondary Storage CSCI 444/544 Operating Systems Fall 2008.
Secondary Storage Management Hank Levy. 8/7/20152 Secondary Storage • Secondary Storage is usually: –anything outside of “primary memory” –storage that.
Introduction to Database Systems 1 The Storage Hierarchy and Magnetic Disks Storage Technology: Topic 1.
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition, Chapter 12: Mass-Storage Systems.
12.1 Silberschatz, Galvin and Gagne ©2009 Operating System Concepts with Java – 8 th Edition Chapter 12: Mass-Storage Systems.
Disk and I/O Management
Device Management. So far… We have covered CPU and memory management Computing is not interesting without I/Os Device management: the OS component that.
CS 153 Design of Operating Systems Spring 2015 Lecture 20: File Systems.
Silberschatz, Galvin and Gagne ©2013 Operating System Concepts – 9 th Edition Chapter 10: Mass-Storage Systems.
IT 344: Operating Systems Winter 2010 Module 13 Secondary Storage Chia-Chi Teng CTB 265.
Disks and Storage Systems
Silberschatz, Galvin, and Gagne  Applied Operating System Concepts Ref: Chap 12 Secondary Storage and I/O Systems.
UNIVERSITY of WISCONSIN-MADISON Computer Sciences Department
OPERATING SYSTEMS IO SYSTEMS. Categories of I/O Devices Human readable –Used to communicate with the user –Printers –Video display terminals Display Keyboard.
CSE 451: Operating Systems Winter 2015 Module 14 Secondary Storage Mark Zbikowski Allen Center 476 © 2013 Gribble, Lazowska, Levy,
1.  Disk Structure Disk Structure  Disk Scheduling Disk Scheduling  FCFS FCFS  SSTF SSTF  SCAN SCAN  C-SCAN C-SCAN  C-LOOK C-LOOK  Selecting a.
Operating Systems (CS 340 D) Princess Nora University Faculty of Computer & Information Systems Computer science Department.
Device Management Mark Stanovich Operating Systems COP 4610.
CSE 451: Operating Systems Winter 2012 Secondary Storage Mark Zbikowski Gary Kimura.
CSE 451: Operating Systems Spring 2006 Module 14 From Physical to Logical: File Systems John Zahorjan Allen Center 534.
CSE 451: Operating Systems Spring 2010 Module 12.5 Secondary Storage John Zahorjan Allen Center 534.
Chapter 10: Mass-Storage Systems
Operating System (013022) Dr. H. Iwidat
Sarah Diesburg Operating Systems CS 3430
Disks and RAID.
Database Management Systems (CS 564)
Operating Systems Disk Scheduling A. Frank - P. Weisberg.
OPERATING SYSTEMS CS 3502 Fall 2017
Secondary Storage Secondary storage typically: Characteristics:
CS703 - Advanced Operating Systems
Operating System I/O System Monday, August 11, 2008.
Disk Scheduling Algorithms
Mass-Storage Structure
Disks and Files DBMS stores information on (“hard”) disks.
Chapter 14 Based on the slides supporting the text
CSE 153 Design of Operating Systems Winter 2018
Chapter 12: Mass-Storage Systems
Operating Systems (CS 340 D)
Disk Scheduling The operating system is responsible for using hardware efficiently — for the disk drives, this means having a fast access time and disk.
CSE 451: Operating Systems Winter 2006 Module 13 Secondary Storage
CSE 451: Operating Systems Autumn 2009 Module 13 Secondary Storage
CSE 451: Operating Systems Autumn 2003 Lecture 12 Secondary Storage
CSE 451: Operating Systems Winter 2007 Module 13 Secondary Storage
CSE 451: Operating Systems Spring 2006 Module 13 Secondary Storage
Disks and scheduling algorithms
Secondary Storage Management Brian Bershad
Persistence: hard disk drive
Mass-Storage Systems.
CSE 451: Operating Systems Secondary Storage
CSE 451: Operating Systems Winter 2003 Lecture 12 Secondary Storage
Disks and Storage Systems
CSE 451: Operating Systems Winter 2009 Module 12 Secondary Storage
CSE 451: Operating Systems Spring 2005 Module 13 Secondary Storage
Secondary Storage Management Hank Levy
CSE451 File System Introduction and Disk Drivers Autumn 2002
CSE 451: Operating Systems Autumn 2004 Secondary Storage
CSE 451: Operating Systems Winter 2004 Module 13 Secondary Storage
CSE 451: Operating Systems Spring 2006 Module 14 From Physical to Logical: File Systems John Zahorjan Allen Center
Lecture 10: Magnetic Disks
Operating Systems 2019 Spring by Euiseong Seo
CSE 451: Operating Systems Spring 2007 Module 11 Secondary Storage
Andy Wang Operating Systems COP 4610 / CGS 5765
CS 245: Database System Principles Notes 02: Hardware
Operating Systems Disk Scheduling A. Frank - P. Weisberg.
Presentation transcript:

IT 344: Operating Systems Winter 2008 Module 13 Secondary Storage Chia-Chi Teng ccteng@byu.edu CTB 265 1

© 2007 Gribble, Lazowska, Levy, Zahorjan Secondary storage Secondary storage typically: is anything that is outside of “primary memory” does not permit direct execution of instructions or data retrieval via machine load/store instructions Characteristics: it’s large: 50-1000GB it’s cheap: $0.25/GB it’s persistent: data survives power loss it’s slow: milliseconds to access why is this slow?? it does fail, if rarely 9/13/2018 © 2007 Gribble, Lazowska, Levy, Zahorjan

Another trip down memory lane … IBM 2314 About the size of 6 refrigerators 8 x 29MB (M!) 9/13/2018 © 2007 Gribble, Lazowska, Levy, Zahorjan

© 2007 Gribble, Lazowska, Levy, Zahorjan Disk trends Disk capacity, 1975-1989 doubled every 3+ years 25% improvement each year factor of 10 every decade Still exponential, but far less rapid than processor performance Disk capacity since 1990 doubling every 12 months 100% improvement each year factor of 1000 every decade 10x as fast as processor performance! 9/13/2018 © 2007 Gribble, Lazowska, Levy, Zahorjan

© 2007 Gribble, Lazowska, Levy, Zahorjan Only a few years ago, we purchased disks by the megabyte (and it hurt!) Today, 1 GB (a billion bytes) costs $1 $0.50 $0.25 from Dell (except you have to buy in increments of 40 80 250 GB) => 1 TB costs $1K $500 $250, 1 PB costs $1M $500K $250K 9/13/2018 © 2007 Gribble, Lazowska, Levy, Zahorjan

© 2007 Gribble, Lazowska, Levy, Zahorjan Memory hierarchy 100 bytes CPU registers < 1 ns 32KB L1 cache 1 ns 256KB L2 cache 4 ns 1GB Primary Memory 60 ns 100GB Secondary Storage 10+ ms Tertiary Storage 1s-1hr 1-1000TB Each level acts as a cache of lower levels 9/13/2018 © 2007 Gribble, Lazowska, Levy, Zahorjan

Memory hierarchy: distance analogy seconds CPU registers “My head” 1 minute L1 cache “This room” 10 minutes L2 cache “This building” 1.5 hours Primary Memory Olympia 2 years Secondary Storage Pluto Tertiary Storage 2,000 years Andromeda 9/13/2018 © 2007 Gribble, Lazowska, Levy, Zahorjan

This chart makes the point that processor speed is now bus and memory latency. We have an ongoing effort to measure and improve this latency. 9/13/2018 © 2007 Gribble, Lazowska, Levy, Zahorjan © 2004 Jim Gray, Microsoft Corporation

© 2007 Gribble, Lazowska, Levy, Zahorjan Disks and the OS Disks are messy, messy devices errors, bad blocks, missed seeks, etc. Job of OS is to hide this mess from higher-level software low-level device drivers (initiate a disk read, etc.) higher-level abstractions (files, databases, etc.) OS may provide different levels of disk access to different clients physical disk block (surface, cylinder, sector) disk logical block (disk block #) file logical (filename, block or record or byte #) 9/13/2018 © 2007 Gribble, Lazowska, Levy, Zahorjan

Physical disk structure Disk components platters surfaces tracks sectors cylinders arm heads sector track surface cylinder platter arm head 9/13/2018 © 2007 Gribble, Lazowska, Levy, Zahorjan

© 2007 Gribble, Lazowska, Levy, Zahorjan Disk performance Performance depends on a number of steps seek: moving the disk arm to the correct cylinder depends on how fast disk arm can move seek times aren’t diminishing very quickly rotation (latency): waiting for the sector to rotate under head depends on rotation rate of disk rates are increasing, but slowly transfer: transferring data from surface into disk controller, and from there sending it back to host depends on density of bytes on disk increasing, relatively quickly When the OS uses the disk, it tries to minimize the cost of all of these steps particularly seeks and rotation 9/13/2018 © 2007 Gribble, Lazowska, Levy, Zahorjan

Performance via disk layout OS may increase file block size in order to reduce seeking OS may seek to co-locate “related” items in order to reduce seeking blocks of the same file data and metadata for a file 9/13/2018 © 2007 Gribble, Lazowska, Levy, Zahorjan

Performance via caching, pre-fetching Keep data or metadata in memory to reduce physical disk access problem? If file access is sequential, fetch blocks into memory before requested 9/13/2018 © 2007 Gribble, Lazowska, Levy, Zahorjan

Performance via disk scheduling Seeks are very expensive, so the OS attempts to schedule disk requests that are queued waiting for the disk FCFS (do nothing) reasonable when load is low long waiting time for long request queues SSTF (shortest seek time first) minimize arm movement (seek time), maximize request rate unfairly favors middle blocks SCAN (elevator algorithm) service requests in one direction until done, then reverse skews wait times non-uniformly (why?) C-SCAN like scan, but only go in one direction (typewriter) uniform wait times 9/13/2018 © 2007 Gribble, Lazowska, Levy, Zahorjan

Interacting with disks In the old days… OS would have to specify cylinder #, sector #, surface #, transfer size i.e., OS needs to know all of the disk parameters Modern disks are even more complicated not all sectors are the same size, sectors are remapped, … disk provides a higher-level interface, e.g., SCSI exports data as a logical array of blocks [0 … N] maps logical blocks to cylinder/surface/sector OS only needs to name logical block #, disk maps this to cylinder/surface/sector on-board cache as a result, physical parameters are hidden from OS both good and bad 9/13/2018 © 2007 Gribble, Lazowska, Levy, Zahorjan

Example disk characteristics IBM Ultrastar 36XP drive form factor: 3.5” capacity: 36.4 GB (150x those 6 fridges!) rotation rate: 7,200 RPM (120 RPS) platters: 10 surfaces: 20 sector size: 512-732 bytes (why?) cylinders: 11,494 cache: 4MB transfer rate: 17.9 MB/s (inner) – 28.9 MB/s (outer) (why?) full seek: 14.5 ms head switch: 0.3 ms 9/13/2018 © 2007 Gribble, Lazowska, Levy, Zahorjan