4/5/20001 Windows 2000 IO Performance Leonard Chung & Jim Gray.

Slides:



Advertisements
Similar presentations
IT253: Computer Organization
Advertisements

Hard Disks Low-level format- organizes both sides of each platter into tracks and sectors to define where items will be stored on the disk. Partitioning:
I/O Chapter 8. Outline Introduction Disk Storage and Dependability – 8.2 Buses and other connectors – 8.4 I/O performance measures – 8.6.
1 Parallel Scientific Computing: Algorithms and Tools Lecture #2 APMA 2821A, Spring 2008 Instructors: George Em Karniadakis Leopold Grinberg.
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
RAID Redundant Arrays of Inexpensive Disks –Using lots of disk drives improves: Performance Reliability –Alternative: Specialized, high-performance hardware.
1 Magnetic Disks 1956: IBM (RAMAC) first disk drive 5 Mb – Mb/in $/year 9 Kb/sec 1980: SEAGATE first 5.25’’ disk drive 5 Mb – 1.96 Mb/in2 625.
1 Advanced Database Technology February 12, 2004 DATA STORAGE (Lecture based on [GUW ], [Sanders03, ], and [MaheshwariZeh03, ])
G Robert Grimm New York University SGI’s XFS or Cool Pet Tricks with B+ Trees.
Disk Drivers May 10, 2000 Instructor: Gary Kimura.
©Brooks/Cole, 2003 Chapter 5 Computer Organization.
Lecture 3: A Case for RAID (Part 1) Prof. Shahram Ghandeharizadeh Computer Science Department University of Southern California.
IT Systems Memory EN230-1 Justin Champion C208 –
IT Systems In and Out EN230-1 Justin Champion C208 –
Secondary Storage CSCI 444/544 Operating Systems Fall 2008.
1 CS222: Principles of Database Management Fall 2010 Professor Chen Li Department of Computer Science University of California, Irvine Notes 01.
Introduction to Database Systems 1 The Storage Hierarchy and Magnetic Disks Storage Technology: Topic 1.
Mass Storage System EMELIZA R. YABUT MSIT. Overview of Mass Storage Structure Traditional magnetic disks structure ◦Platter- composed of one or more.
* Definition of -RAM (random access memory) :- -RAM is the place in a computer where the operating system, application programs & data in current use.
Secondary Storage Unit 013: Systems Architecture Workbook: Secondary Storage 1G.
Prepared by Careene McCallum-Rodney Hardware specification of a computer system.
Cache Memory By Sean Hunter.
“Five minute rule ten years later and other computer storage rules of thumb” Authors: Jim Gray, Goetz Graefe Reviewed by: Nagapramod Mandagere Biplob Debnath.
COMPUTER CONCEPTS.
Memory. Random Access Memory Defined What is memory? operating system and other system software that control the usage of the computer equipment application.
Computer Architecture Part III-A: Memory. A Quote on Memory “With 1 MB RAM, we had a memory capacity which will NEVER be fully utilized” - Bill Gates.
CENG334 Introduction to Operating Systems Erol Sahin Dept of Computer Eng. Middle East Technical University Ankara, TURKEY URL:
Day 10 Hardware Fault Tolerance RAID. High availability All servers should be on UPSs –2 Types Smart UPS –Serial cable connects from UPS to computer.
Hardware (The part you can kick). Overview  Selection Process  Equipment Categories  Processors  Memory  Storage  Support.
Performance Tradeoffs for Static Allocation of Zero-Copy Buffers Pål Halvorsen, Espen Jorde, Karl-André Skevik, Vera Goebel, and Thomas Plagemann Institute.
Hardware Case that houses the computer Monitor Keyboard and Mouse Disk Drives – floppy disk, hard disk, CD Motherboard Power Supply (PSU) Speakers Ports.
… when you will open a computer We hope you will not look like …
5 Feb 2002Alternative Ideas for the CALICE Backend System 1 Alternative Ideas for the CALICE Back-End System Matthew Warren and Gordon Crone University.
SCSI Richard Goldman April 2000
1 Understanding and Assessing Hardware To buy or upgrade? Evaluating your system: –CPU –RAM –Storage devices –Video output –Sound systems –Computer ports.
Computing Hardware Starter.
I/O – Chapter 8 Introduction Disk Storage and Dependability – 8.2 Buses and other connectors – 8.4 I/O performance measures – 8.6.
Computers Central Processor Unit. Basic Computer System MAIN MEMORY ALUCNTL..... BUS CONTROLLER Processor I/O moduleInterconnections BUS Memory.
The Computer Systems By : Prabir Nandi Computer Instructor KV Lumding.
Random access memory.
CPU (CENTRAL PROCESSING UNIT): processor chip (computer’s brain) found on the motherboard.
1 Inside the Computer Chapter 6 Copyright Prentice-Hall, Inc
On Windows File Access Modes : A Performance Study Jalil Boukhobza & Claude Timsit laboratory Versailles Saint Quentin University.
Types of Computers Mainframe/Server Two Dual-Core Intel ® Xeon ® Processors 5140 Multi user access Large amount of RAM ( 48GB) and Backing Storage Desktop.
Implementing Hyper-V®
Lecture No 11 Storage Devices
1 Computer and Network Bottlenecks Author: Rodger Burgess 27th October 2008 © Copyright reserved.
CSCI-100 Introduction to Computing Hardware Part I.
Computer Organization. This module surveys the physical resources of a computer system.  Basic components  CPU  Memory  Bus  I/O devices  CPU structure.
Computer Hardware. Lally School of M&T- Microcomputing and Info Systems Lecture Topics 1. Data Representation 2. Data Metrics 3. Central processing Unit.
Sep. 17, 2002BESIII Review Meeting BESIII DAQ System BESIII Review Meeting IHEP · Beijing · China Sep , 2002.
© GCSE Computing Computing Hardware Starter. Creating a spreadsheet to demonstrate the size of memory. 1 byte = 1 character or about 1 pixel of information.
Introduction: Memory Management 2 Ideally programmers want memory that is large fast non volatile Memory hierarchy small amount of fast, expensive memory.
FAT File Allocation Table
Memory Hierarchy: Terminology Hit: data appears in some block in the upper level (example: Block X)  Hit Rate : the fraction of memory access found in.
COMP381 by M. Hamdi 1 Clusters: Networks of WS/PC.
1 COMPUTER ARCHITECTURE (for Erasmus students) Assoc.Prof. Stasys Maciulevičius Computer Dept.
BMTS 242: Computer and Systems Lecture 2: Memory, and Software Yousef Alharbi Website
Jeffrey Ellak CS 147. Topics What is memory hierarchy? What are the different types of memory? What is in charge of accessing memory?
Adapted from Computer Organization and Design, Patterson & Hennessy, UCB ECE232: Hardware Organization and Design Part 14: Memory Hierarchy Chapter 5 (4.
Computer Performance. Hard Drive - HDD Stores your files, programs, and information. If it gets full, you can’t save any more. Measured in bytes (KB,
Getting the Most out of Scientific Computing Resources
Getting the Most out of Scientific Computing Resources
Hardware specifications
Local Area Networks, 3rd Edition David A. Stamper
Computer Hard Drive.
2.C Memory GCSE Computing Langley Park School for Boys.
Hard disk basics Prof:R.CHARLES SILVESTER JOE Departmet of Electronics St.Joseph’s College,Trichy.
Introduction to Operating Systems
Presentation transcript:

4/5/20001 Windows 2000 IO Performance Leonard Chung & Jim Gray

4/5/20002 Study Goals Repeat and Extend the Riedel, et. al paper. Many things have changed: –Software: Windows 2000 instead of NT4SP3 –Hardware: New, faster drives and standards 3 main testing scenarios: –old-old: “old” machine with NT4SP6 –old-new: “old” machine with Win2000 –new-new: “new” machine with Win2000

4/5/20003 Hardware Configurations “old” hardware: –333 MHz PII –4 x 7200 RPM UW SCSI drives –128 MB SDRAM “new” hardware: –2 x 733 MHz PIII –4 x 10,000 RPM Ultra160 SCSI drives –256 MB RDRAM –4 x 5400 RPM UltraATA/66 IDE drives on a 3ware card

4/5/20004 Primary Test Tools SQLIO – the primary test tool CacheFlush – buffered sequential DiskCache – PCI/host adapter throughput Memspeed – memory subsystem

4/5/20005 Testing Methodology Before each test: –Drive formatted –Test files copied in same order –Test run Sequential test files made to live on outer edge of disk, giving disk’s max performance and consistent results.

4/5/20006 Media Banding Modern disks are zoned –More bits stored on outer tracks + constant angular velocity = fast outer tracks –We’ve measured inner tracks on some drives being up to 40% slower than the outer tracks –A “normal” disk map…

4/5/20007 Media Banding

4/5/20008 Overall Findings Changes in throughput performance are incremental rather than radical –Trendlines have the same general shape –Most of Riedel’s model still holds

4/5/ Hardware Bandwidth (RAP) System Bandwidth: What Riedel Saw in megabytes per second (not to scale!) 30 Hard Disk | SCSI | PCI | Memory | Processor 9 per disk

4/5/ Hardware Bandwidth (PAP) System Bandwidth Yesterday in megabytes per second (not to scale!) 40 Hard Disk | SCSI | PCI | Memory | Processor 15 per disk

4/5/ Hardware Bandwidth (PAP) System Bandwidth Yesterday in megabytes per second (not to scale!) 40 Hard Disk | SCSI | PCI | Memory | Processor 15 per disk The familiar bandwidth pyramid: The farther from the CPU, the less the bandwidth.

4/5/ ,600 Hardware Bandwidth (PAP) System Bandwidth Today in megabytes per second (not to scale!) Hard Disk | SCSI | PCI | Memory | Processor The familiar pyramid is gone! PCI is now the bottleneck! In practice, 3 disks can reach saturation using sequential IO

4/5/ , Hardware Bandwidth (PAP) System Bandwidth Today in megabytes per second (not to scale!) Hard Disk | SCSI | PCI | Memory | Processor Possible solutions: A fatter, 64bit 66MHz PCI bus or…

4/5/ , Hardware Bandwidth (PAP) System Bandwidth Today in megabytes per second (not to scale!) Hard Disk | SCSI | PCI | Memory | Processor Possible solutions: A fatter, 64bit 66MHz PCI bus or… multiple PCI busses

4/5/ Hardware Bandwidth (RAP) System Bandwidth Today (reads) Numbers we’ve seen in megabytes per second (not to scale!) each Hard Disk | SCSI | PCI | Memory | Processor

4/5/ old-old: NT4SP3 vs. NT4SP6 Unbuffered read and WCE writes no longer show decrease in throughput Buffered read bug is gone Overheads are different NT4SP3 Buffered Overhead Request Size (K Bytes) Overhead (cpu ms/MB) Read Write Write + WCE Read Write NT4SP3NT4SP6

4/5/ old-new: Windows 2000 Software: Major changes, minor differences –Dmio: The volume manager for Win2K More fixed overhead than ftdisk due to longer code paths More features than ftdisk (dynamically size volumes, etc.) –In the end, performance is the same. Processors are fast enough that there are more than enough cycles to spare.

4/5/ new-new: Windows 2000 Hardware: The American Way –Faster, bigger, cheaper Disks are now 4 times bigger and 3 times faster. SCSI bus bandwidth has surpassed the PC- standard 32bit, 33MHz PCI bus bandwidth. Random IO is unaffected by the PCI bottleneck. Additional SMP processor provided no additional throughput gains.

4/5/ new-new: Windows 2000 Scalability PCI Bottleneck

4/5/ new-new: Windows 2000 & IDE The real IO revolution: RAID priced for the masses! The good news: –IDE disks are cheap We bought 5400 RPM IDE 27GB drives for $209 ($7.75/GB) while our 10,000 RPM 18GB SCSI drive cost $534 ($30/GB) IDE costs $3.17 per Kaps while SCSI costs $5.09 per Kaps. Today, IDE is $6,500 per TB while SCSI costs $16,000

4/5/ new-new: Windows 2000 & IDE IDE Performance: –Single disk random IO performance on a 5400 RPM IDE is much slower than a 10,000 SCSI. However, multiple IDE disks can provide up to 60% more Kaps for the same price as a single SCSI disk.

4/5/ new-new: Windows 2000 & IDE IDE Performance: –Single disk sequential IO throughput on a 5400 RPM IDE drive is 80% of the more expensive 10,000 RPM SCSI drive.

4/5/ new-new: Windows 2000 & IDE Price/Performance for IDE is hard to beat –Performance For sequential and random IO, IDE is price/performance leader Overhead for SCSI and 3ware/DMA IDE is the same. –Capacity 69GB (~2.5 disks worth) of Quantum Fireball lct08s costs the same as one Quantum Atlas 10K 18GB disk.

4/5/ new-new: Windows 2000 & IDE The bad news about IDE – The quality of IDE controllers varies Revolutions are being missed due to slow controller

4/5/ new-new: Windows 2000 & IDE The bad news about IDE High controller overhead is causing the disk to miss revolutions at small request sizes Missing every other revolution Missing multiple revolutions

4/5/ new-new: Windows 2000 & IDE (3ware) The bad news about IDE –IDE RAID isn’t as mature as SCSI Driver bugs and incompatibilities Problems with multiple IDE drives –IDE spec gives 18” as the max cable length: getting cables to drives can be a chore –Avoid master/slave: reliability and possibly performance is lost –No hot swap

4/5/ new-new: Windows 2000 & IDE (3ware) The bad news about IDE –RAID isn’t as mature as SCSI 3ware’s card peaks out at 55MBps for reads and 40MBps for writes; 3 disks for reads and 2 for writes.

4/5/ Where do we go from here? Network IO over Gigabit –OOB performance and slight tuning Sqlio2: a complete rewrite of SQLIO

4/5/ And in conclusion… NT4SP6 –Unbuffered requests at 2KB, 4KB request sizes no longer have dip –Buffered read request bug gone –Buffered overhead appears to be lower Windows 2000 –Despite dmio replacing ftdisk, throughput remains unaffected

4/5/ And in conclusion… new-new SCSI performance –PCI is now the bottleneck with 3 drives able to reach saturation new-new IDE –IDE shows a lot of promise: cheap storage and good performance –Difficulty lies with multiple disks IDE RAID cards not quite ready for prime time Physically wiring the drives