Improving Disk Latency and Throughput with VMware Presented by Raxco Software, Inc. March 11, 2011.

3 Improving Disk Latency and Throughput with VMware Presented by Raxco Software, Inc. March 11, 2011

4 Today’s Agenda
Provide technical information on how NTFS impacts VMware I/O performance
Examine ESX I/O test results
Economic impact of Windows guests
Solutions

5 Virtualization Benefits
Server consolidation
Less physical space for data centers
Lower energy costs
Easier management
Eco-friendly alternative

6 Identifying and Correcting Problems
Latency is your best indicator of a performance problem
  – Device latency is vSphere’s report of the physical storage response time
  – Kernel latency is vSphere’s report of ESX’s ability to manage IO
Experts disagree on specifics, but most agree that…
Device latency in excess of 15ms is worth inspection
Device latency in excess of 30ms is likely a problem
Kernel latency in excess of 2ms means ESX queues are overflowing
High device latency can result in ESX queuing
  – So, correct slow hardware first!
  – Then, consider reducing VMDKs on a VMFS volume
  – Only then consider changing queue depths
© Copyright 2010 EMC Corporation. All rights reserved.
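The rule-of-thumb thresholds on this slide can be expressed as a small triage check. The following is a minimal sketch in Python: the function name and the wording of the findings are hypothetical, and only the 15ms, 30ms, and 2ms cutoffs come from the slide.

```python
def triage_latency(device_ms: float, kernel_ms: float) -> list[str]:
    """Apply the slide's rule-of-thumb latency thresholds (values in ms)."""
    findings = []
    # Device latency: physical storage response time as reported by vSphere.
    if device_ms > 30:
        findings.append("device latency is likely a problem")
    elif device_ms > 15:
        findings.append("device latency is worth inspection")
    # Kernel latency: time spent in the ESX storage stack and its queues.
    if kernel_ms > 2:
        findings.append("kernel latency suggests ESX queues are overflowing")
    return findings


# Hypothetical sample readings, not measurements from the presentation.
print(triage_latency(device_ms=22.0, kernel_ms=0.5))
print(triage_latency(device_ms=41.0, kernel_ms=3.1))
```

Following the slide's ordering, a device-latency finding would be addressed first (slow hardware), before looking at VMDK counts or queue depths.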

7 Storage Contention Solution: Storage IO Control
SIOC calculates data store latency to identify storage contention
  – Latency is a normalized average across virtual machines
  – IO size and IOPS included
SIOC enforces fairness when data store latency crosses threshold
  – Default of 30ms
  – Fairness enforced by limiting VMs’ access to queue slots
Net effect: trade throughput for latency
With Storage IO Control, actual disk resources utilized by each VM are in the correct ratio, even across ESX hosts
© Copyright 2010 EMC Corporation. All rights reserved.

8 NTFS I/O Storms

9 NTFS Behavior
NTFS fragments files and free space
Increases logical I/O to storage controller
More logical I/O = More physical I/O
Multiple instances of Windows on host can lead to I/O contention

10 What is Fragmentation?

11 Logical v. Physical
Logical Level
  – NTFS needs disk and cluster size, enumerates LCNs
  – Creates $MFT and $Bitmap metadata
  – $Bitmap is how NTFS “sees” the disk
  – Has no idea about physical/virtual disk types

12 Anatomy of an MFT Record (vcn, lcn, run length): (8a85, 9189a, 7)
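To make the (vcn, lcn, run length) triple concrete, the sketch below models a file's run list in Python and resolves a virtual cluster number to a logical cluster number. The first run uses the hexadecimal values from the slide; the second run, the `Run` type, and the `vcn_to_lcn` helper are hypothetical and only illustrate how a fragmented file ends up with more than one run.

```python
from typing import NamedTuple, Optional


class Run(NamedTuple):
    vcn: int     # first virtual cluster number (offset within the file)
    lcn: int     # logical cluster number where this run starts on the volume
    length: int  # number of contiguous clusters in the run


def vcn_to_lcn(runs: list[Run], vcn: int) -> Optional[int]:
    """Return the LCN that holds the given VCN, or None if it is unmapped."""
    for run in runs:
        if run.vcn <= vcn < run.vcn + run.length:
            return run.lcn + (vcn - run.vcn)
    return None


runs = [
    Run(vcn=0x8A85, lcn=0x9189A, length=7),  # values from the slide
    Run(vcn=0x8A8C, lcn=0x12000, length=4),  # hypothetical second fragment
]

# Each run is one contiguous extent, so the run count is the fragment count.
print("fragments:", len(runs))
print(hex(vcn_to_lcn(runs, 0x8A87)))
```

Each additional run is another starting LCN and run length that has to be handed to the disk controller, which is why the number of logical fragments influences the number of physical seeks.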

13 File Allocation
Create $MFT record (one or more)
$Bitmap accessed to locate free space
$MFT record is updated with content
Diagram labels: Create, Bitmap Access, MFT Update

14 File Access
Load portion of MFT with correct record via directory
Locate file in the MFT
Pass starting LCNs and run lengths to disk controller
Number of logical fragments influences number of physical seeks
Diagram labels: Load, Locate File, # LCNs, # Physical Seeks

15 Logical v. Physical
Physical Level
  – Disk controller maps LCNs to PCNs
  – Writes data to disk

16 Wasted Seeks
Partition State | Total Number of I/O Requests Sent to the File System | Total Number of Resulting Disk Accesses/Seeks | Net Wasted Seeks When Running SYSmark | Percent Net Wasted Seeks When Running SYSmark
Fragmented | 1,320,686 | 2,090,649 | 769,963 | 58.30%
After PerfectDisk | 1,434,454 | 1,616,847 | 182,393 | 12.72%
After Built-In | 1,411,613 | 1,931,395 | 519,782 | 36.82%
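The last two columns of this table follow from the first two: net wasted seeks are the disk accesses minus the file system requests, and the percentage is that difference relative to the requests. A minimal Python sketch of the arithmetic, using the figures from the slide (the function and variable names are illustrative):

```python
def wasted_seeks(requests: int, disk_accesses: int) -> tuple[int, float]:
    """Return (net wasted seeks, percent wasted relative to requests)."""
    net = disk_accesses - requests
    percent = round(100 * net / requests, 2)
    return net, percent


# (I/O requests sent to the file system, resulting disk accesses/seeks)
partition_states = {
    "Fragmented": (1_320_686, 2_090_649),
    "After PerfectDisk": (1_434_454, 1_616_847),
    "After Built-In": (1_411_613, 1_931_395),
}

for state, (requests, accesses) in partition_states.items():
    net, percent = wasted_seeks(requests, accesses)
    print(f"{state}: {net:,} wasted seeks ({percent:.2f}%)")
```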

17 How This Affects A Virtual Environment P2V Conversion Extra Hypervisor Overhead Disk Latency Degradation Overall Performance System Throughput Wasted Space Costly

18 P2V Conversion
Physical Drive
No Optimization: 24GB
Optimization: 22GB (2GB Smaller)

19 ESX Cluster Testing
Identical disks - 40% free space
Optimized one set, the other “as is”
Installed MS Office and MS SQL
Captured metrics with VMware’s vscsiStats utility

20 Total I/O Count
Metric | Fragmented | PerfectDisk | % Improvement
Total IO Count | 37191 | 29238 | 21.3
Read IO Count | 3066 | 2799 | 8.7
Write IO Count | 34125 | 26439 | 22.5

21
Latency | 30ms | 50ms | 100ms | >100ms | Total
Fragmented I/O | 12749 | 9877 | 8700 | 9116 | 40,442
PerfectDisk I/O | 6707 | 4923 | 4081 | 5053 | 20,764
49% Reduction in Latency!
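The totals and the 49% figure on this slide can be reproduced from the per-bucket counts. The sketch below assumes the percentage is the drop in the number of I/Os that landed in these four latency buckets; that reading is an inference from the numbers, not something the slide states.

```python
# I/O counts per latency bucket, copied from the slide.
fragmented = {"30ms": 12749, "50ms": 9877, "100ms": 8700, ">100ms": 9116}
perfectdisk = {"30ms": 6707, "50ms": 4923, "100ms": 4081, ">100ms": 5053}


def percent_reduction(before: int, after: int) -> float:
    """Percentage drop from `before` to `after`."""
    return 100 * (before - after) / before


total_before = sum(fragmented.values())
total_after = sum(perfectdisk.values())
print("totals:", total_before, total_after)
print("overall reduction (%):", round(percent_reduction(total_before, total_after)))

# Per-bucket view of the same comparison.
for bucket in fragmented:
    drop = percent_reduction(fragmented[bucket], perfectdisk[bucket])
    print(f"{bucket}: {drop:.1f}% fewer I/Os")
```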

22 Disk Latency

23
Metric | Fragmented Disk | PerfectDisk Disk
Total IO Equal to 524K | 2512848
Total IO > 524K | 247 | 2959
Read IO Equal to 524K | 337
Read IO > 524K | 125 | 65
Write IO Equal to 524K | 2480841
Write IO > 524K | 122 | 2894
12X More Large I/O

24 Large I/O
12 times more of the largest IO

25 Improved Sequential I/O
Metric | Fragmented | PerfectDisk | Improvement
Percent Sequential | 17% | 27% | 58%
Total IO | 127703 | 90526 | 25%
Sequential IO | 22126 | 24340 | 33%
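The Percent Sequential row appears to be the Sequential IO count divided by the Total IO count for each disk. A short Python check of that reading, using the counts from the slide (the function name is illustrative, and the derivation is an assumption since the slide does not state it):

```python
def percent_sequential(sequential_io: int, total_io: int) -> float:
    """Share of all I/Os that were sequential, as a percentage."""
    return 100 * sequential_io / total_io


fragmented_pct = percent_sequential(22126, 127703)
perfectdisk_pct = percent_sequential(24340, 90526)

print(f"Fragmented:  {fragmented_pct:.1f}% sequential")
print(f"PerfectDisk: {perfectdisk_pct:.1f}% sequential")
```

Rounded to whole numbers, the two results match the 17% and 27% shown on the slide.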

26 Improved Sequential I/O

27 Installation Time Comparison
Task | Fragmented | PerfectDisk | % Improvement
MS Office Install | 20 min | 15 min | 25
MS SQL Install | 76 min | 51 min | 33

28 The Cost of Fragmentation
EXAMPLE:
20 files x 6 seconds = 2 minutes
300 users x 2 min = 10 hours/day
10 hrs x $25/hr = $250/day
Annual cost = $62,500
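The example above can be written as a small calculation so the inputs are easy to vary. This is a sketch under two assumptions that the slide does not state outright: the 20 files are per user per day, and the annual figure uses 250 working days (the value implied by $62,500 / $250 per day). All parameter names are illustrative.

```python
def annual_fragmentation_cost(
    files_per_user_per_day: int,
    delay_seconds_per_file: float,
    users: int,
    hourly_rate: float,
    workdays_per_year: int = 250,  # assumption: implied by $62,500 / $250
) -> float:
    """Estimate the yearly cost of time lost to slow file access."""
    minutes_per_user = files_per_user_per_day * delay_seconds_per_file / 60
    hours_lost_per_day = users * minutes_per_user / 60
    daily_cost = hours_lost_per_day * hourly_rate
    return daily_cost * workdays_per_year


cost = annual_fragmentation_cost(
    files_per_user_per_day=20,
    delay_seconds_per_file=6,
    users=300,
    hourly_rate=25,
)
print(f"${cost:,.0f} per year")
```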

29 Virtual Guest Fragmentation
Windows guests have all the same NTFS behavior
Fragmentation produces more IOPS
Fragmentation reduces ESX throughput
Fragmentation increases ESX disk latency
Fragmentation creates resource contention between host & guests

30 Solutions
Expensive
  – More disks and faster disks
  – Upgrade Fibre Channel
  – Troubleshooting
Inexpensive
  – Optimize the Windows guest systems

31 PerfectDisk 12 vSphere
Virtualization Awareness/host & client
OptiWrite Fragmentation Avoidance
“Zero-fill” free space
NEW

32 PerfectDisk 12 vSphere
“Short stroking” for thin provisioned disks
Schedule guest compaction
Snapshot & Linked Clone recognition
NEW

33 PerfectDisk Benefits on ESX
Saves $$$ in productivity and admin
Reduces resource contention for VMs
Reduces total IO workload
Improves throughput
Reduces disk latency
Delivers optimal performance

34 Contact Raxco
Free Evaluation Software
Excellent Support to Get You Started
White Papers
Great ROI
www.raxco.com
Toll Free: 1.800.546.9728
