Copyright DataDirect Networks - All Rights Reserved - Not reproducible without express written permission Adventures Installing Infiniband Storage Randy.

Presentation transcript:

Adventures Installing Infiniband Storage
Randy Kreiser, Chief Architect
Sonoma OpenFabrics Workshop, 1 May 2007

Meet the Players (Hardware)
Host Channel Adapters & Switches
  – Mellanox
  – QLogic
  – Voltaire
  – Cisco
Storage
  – DataDirect Networks
  – Engenio
  – Texas Memory (SSD)
  – Others?

Meet the Players (Software)
Infiniband Drivers
  – OFED
  – Mellanox IBGLD
  – QLogic
  – Voltaire
  – Cisco
Subnet Manager
  – OpenSM
  – QLogic
  – Voltaire
  – Cisco

Decisions, Decisions, Decisions
What operating system am I using?
  – SUSE
  – Red Hat
  – Other?
What HCA should I use?
  – PCI-X
  – PCIe
What switch should I use?
  – Port count?
What initiator driver should I use?
  – Performance ???
  – Compatibility
  – Failover
What storage should I use?
  – Performance ???
      IOPS
      Bandwidth

Decisions, Decisions, Decisions
SRP or iSER drivers?
Which subnet manager should I use?
Where should the subnet manager run?
  – Switch
  – Host
Troubleshooting
  – I can't see any LUNs
Benchmarking
  – 600 MB/s
  – 800 MB/s
  – 1000 MB/s
  – 2000 MB/s

Direct Connect
[Diagram: S2A Controller 1, S2A Controller 2]

Benchmarking
O_DIRECT I/O vs non-O_DIRECT I/O
  – Large sequential I/O
  – Small random I/O
Software striping
  – Chunk size
Block device max sectors
  – MAX SECT
  – SG_TABLE_SIZE
Block device read ahead
  – hdparm
  – blockdev
Queue depth
  – Setting
RAID controller settings
  – Cache size
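The O_DIRECT versus buffered comparison can be sketched in a few lines of Python. This is not the tool used for the measurements in this talk; the function name `read_throughput` and the 1 MiB default block size are assumptions made for illustration. O_DIRECT needs aligned buffers, so the sketch reads into an anonymous `mmap`, which is page-aligned.

```python
import mmap
import os
import time


def read_throughput(path, block_size=1 << 20, direct=False):
    """Sequentially read `path`; return (bytes_read, megabytes_per_second)."""
    flags = os.O_RDONLY
    if direct:
        # Linux-specific flag: bypass the page cache.
        flags |= os.O_DIRECT
    fd = os.open(path, flags)
    # Anonymous mmap memory is page-aligned, which O_DIRECT requires.
    buf = mmap.mmap(-1, block_size)
    total = 0
    start = time.perf_counter()
    try:
        while True:
            n = os.readv(fd, [buf])  # read up to block_size bytes into buf
            if n == 0:               # end of file or device
                break
            total += n
    finally:
        elapsed = time.perf_counter() - start
        os.close(fd)
        buf.close()
    return total, total / max(elapsed, 1e-9) / 1e6
```

Running it twice on the same device, once with `direct=True` and once without, gives a first impression of how much the page cache and read ahead contribute. Not every file system accepts O_DIRECT, and reading a raw block device usually needs root.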

Benchmarking
[Tables: write performance and read performance by block size (256 MB through KB-range sizes), with columns for /dev/sdc and for c+d+e+f; the measured values did not survive extraction]
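The "c+d+e+f" column presumably refers to a software stripe across four block devices; that reading, the device names other than /dev/sdc, and the 64 KiB chunk size below are assumptions. Under those assumptions, a minimal sketch of how a RAID-0 style stripe maps a logical byte offset to a device shows why the chunk size matters for large sequential I/O:

```python
def stripe_location(offset, chunk_size, devices):
    """Map a logical byte offset to (device, offset_on_device) in a simple stripe."""
    chunk_index, within_chunk = divmod(offset, chunk_size)
    stripe_row, device_index = divmod(chunk_index, len(devices))
    return devices[device_index], stripe_row * chunk_size + within_chunk


# Hypothetical four-device stripe set.
devices = ["/dev/sdc", "/dev/sdd", "/dev/sde", "/dev/sdf"]
chunk = 64 * 1024

# A 256 KiB request starting at offset 0 touches each device exactly once.
touched = {stripe_location(off, chunk, devices)[0] for off in range(0, 256 * 1024, chunk)}
```

A request smaller than one chunk lands on a single device, so only requests of at least `chunk_size * len(devices)` bytes keep all members busy at once.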

S2A 9900 Hardware Specifications (What's Next)

Specification               S2A9900 Couplet                   S2A9550 Couplet
Supported Disk Technology   SAS & SATA                        FibreChannel & SATA
RAID Parity Protection      RAID6 8+2 only                    RAID3 (8+1+1), RAID6 8+2
Sustained Throughput        5.6 GB/s – 6.0 GB/s               2.4 GB/s – 2.8 GB/s
Maximum Cache               5.0 GB ECC protected              2.5 GB RAID protected
Minimum Cache               2.5 GB ECC protected              2.5 GB RAID protected
Disk Side Ports             20 x SAS 4 lane                   20 x FC-2
Host Side FC Ports          8 x IB 4x DDR or 8 x FC-8         8 x FC-4 or 8 x IB 4x
Dimensions                  7 x 19 x 28 in. (4U)              7 x 19 x 25 in. (4U)
Certifications              UL, CE, CUL, C-Tick, FCC
Release Date                1Q/2008                           September 2005
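For context on the "IB 4x DDR" host ports, the theoretical payload rate of an InfiniBand link follows from the per-lane signaling rate (2.5 Gb/s for SDR, 5 Gb/s for DDR) and the 8b/10b line encoding. The helper below is a small worked calculation, not vendor data; the function name is made up for this example, and real throughput is lower once protocol overhead is included.

```python
# Per-lane signaling rates in Gb/s for the two speeds current in 2007.
LANE_SIGNALING_GBPS = {"SDR": 2.5, "DDR": 5.0}


def ib_data_rate_mb_s(width, speed):
    """Theoretical payload rate of an IB link in MB/s (8b/10b encoding)."""
    signaling_gbps = width * LANE_SIGNALING_GBPS[speed]
    data_gbps = signaling_gbps * 8 / 10   # 8 payload bits per 10 line bits
    return data_gbps * 1000 / 8           # Gb/s -> MB/s


sdr_4x = ib_data_rate_mb_s(4, "SDR")   # 1000.0 MB/s
ddr_4x = ib_data_rate_mb_s(4, "DDR")   # 2000.0 MB/s
eight_ports = 8 * ddr_4x               # 16000.0 MB/s across 8 host ports
```

By this arithmetic the eight 4x DDR host ports offer more link capacity than the 5.6 to 6.0 GB/s sustained throughput listed for the S2A9900 couplet.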

SRP

SRP (SCSI RDMA Protocol)
Advantages
  – Infiniband native protocol
  – No new hardware required
  – Requests carry buffer information
  – All data transfer through Infiniband RDMA
  – No need for multiple packets
  – No flow control for data packets necessary

Direct Connect Example
IB ports with direct connections
Data distribution through servers
Asymmetrical file systems (Lustre, etc.)

SRP General
SCSI RDMA Protocol
  – SCSI over IB
  – Similar to FCP (SCSI over Fibre Channel), except that the CMD Information Unit includes the addresses to get/place data
  – Initiator drivers are available from IB software vendors and in OFED

SRP Command Request
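The idea that an SRP request carries the buffer information can be illustrated with the direct data buffer descriptor, which as far as I know is 16 bytes: a 64-bit virtual address, a 32-bit memory handle (the R_Key) and a 32-bit length, all big-endian. The sketch below packs and unpacks only that descriptor. It is a simplification, not a full SRP_CMD information unit, and the helper names and sample values are invented for this example.

```python
import struct

# Assumed layout: 64-bit address, 32-bit memory handle, 32-bit length, big-endian.
DIRECT_DESCRIPTOR = struct.Struct(">QII")


def pack_direct_descriptor(addr, rkey, length):
    """Describe the initiator buffer the target may RDMA into or out of."""
    return DIRECT_DESCRIPTOR.pack(addr, rkey, length)


def unpack_direct_descriptor(raw):
    """Target side: recover where to get/place the data."""
    addr, rkey, length = DIRECT_DESCRIPTOR.unpack(raw)
    return {"addr": addr, "rkey": rkey, "length": length}


# Hypothetical values: a 1 MiB buffer registered at some address.
raw = pack_direct_descriptor(0x7F0000001000, 0x1234, 1 << 20)
desc = unpack_direct_descriptor(raw)
```

Because the command itself names the remote buffer, the target can move the data with RDMA without a separate data-phase exchange.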

iSER

iSER (iSCSI Extensions for RDMA)
iSER leverages iSCSI management and discovery
  – Zero-configuration, global storage naming (SLP, iSNS)
  – Change notifications and active monitoring of devices and initiators
  – High availability, and 3 levels of automated recovery
  – Multipathing and storage aggregation
  – Industry-standard management interfaces (MIB)
  – 3rd-party storage managers
  – Security (partitioning, authentication, central login control, ...)
Working with iSER over IB doesn't require changes!
  – Enables investment protection (software, education, training, ...)
  – Reduces the fear factor of IB

iSCSI Mapping to iSER / RDMA Transport
iSER eliminates the traditional iSCSI/TCP bottlenecks:
  – Zero copy using RDMA
  – CRC calculated by hardware
  – Works with message boundaries instead of streams
  – Transport protocol implemented in hardware (minimal CPU cycles per I/O)
[Diagram: iSCSI Mapping to iSER. The iSCSI PDU (BHS, AHS, HD, Data, DD) maps to protocol frames (RDMA) carried by RC Send and RC RDMA Read/Write; two fields are crossed out and marked "In HW".]
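To make the "CRC calculated by hardware" point concrete: iSCSI header and data digests use CRC32C, and a host running iSCSI over TCP without offload has to compute something like the following over every PDU. The bitwise routine below is deliberately simple and slow; it is a sketch for illustration, not what any production initiator uses.

```python
def crc32c(data, crc=0):
    """Bitwise CRC32C (Castagnoli), reflected polynomial 0x82F63B78."""
    crc ^= 0xFFFFFFFF
    for byte in data:
        crc ^= byte
        for _ in range(8):
            # Shift out one bit; fold in the polynomial when that bit was set.
            crc = (crc >> 1) ^ (0x82F63B78 if crc & 1 else 0)
    return crc ^ 0xFFFFFFFF


# Standard check value for the ASCII string "123456789".
check = crc32c(b"123456789")
```

Eight inner iterations per payload byte is the kind of per-I/O CPU cost that disappears when the transport checks integrity in hardware.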

iSER Protocol (Read)
SCSI Reads
  – Initiator sends Command PDU (protocol data unit) to target
  – Target returns data using RDMA Write
  – Target sends Response PDU back when the transaction is completed
  – Initiator receives Response and completes the SCSI operation
[Diagram: message sequence between iSCSI Initiator, iSER, HCA, iSER Target, and Target Storage. Labels: Send_Control (SCSI Read Cmd); RDMA Write for Data; Send_Control + Buffer advertisement; Control_Notify; Data_Put (Data-In PDU) for Read; Control_Notify; Send_Control (SCSI Response)]
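The four steps of the read flow can be mimicked with a toy in-memory model. Nothing here touches real RDMA hardware: the class names, the log strings and the "GOOD" status string are assumptions for illustration, and the RDMA Write is imitated by the target copying directly into the initiator's advertised buffer.

```python
from dataclasses import dataclass, field


@dataclass
class Target:
    storage: bytes

    def handle_read(self, initiator, offset, length):
        data = self.storage[offset:offset + length]
        # Step 2: "RDMA Write" places data straight into the advertised buffer.
        initiator.buffer[:len(data)] = data
        initiator.log.append("target: RDMA Write of read data")
        # Step 3: Response PDU is sent once the transfer has completed.
        initiator.log.append("target: send SCSI Response")
        return "GOOD"


@dataclass
class Initiator:
    buffer: bytearray
    log: list = field(default_factory=list)

    def scsi_read(self, target, offset, length):
        # Step 1: Command PDU, including the buffer advertisement.
        self.log.append("initiator: send SCSI Read command + buffer advertisement")
        status = target.handle_read(self, offset, length)
        # Step 4: Response received, SCSI operation completes.
        self.log.append(f"initiator: response {status}, read complete")
        return bytes(self.buffer[:length])


target = Target(storage=b"0123456789")
initiator = Initiator(buffer=bytearray(4))
result = initiator.scsi_read(target, offset=2, length=4)
```

The initiator never copies the payload itself; it only sends the command and consumes the response, which is the zero-copy property the slides attribute to iSER.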

iSCSI Discovery-Direct SLP
  – Client broadcast: "I'm xx, where is my storage?"
  – FC routers discover the FC SAN
  – Relevant iSCSI targets & FC gateways respond
  – Client may record multiple possible targets & portals
Portal: a network end-point (IP + port), indicating a path
[Diagram: iSCSI Client, GbE Switch, FC Switch, IB to IP Router, Native IB RAID, IB to FC Routers]

iSCSI Discovery-iSNS
  – FC routers discover the FC SAN
  – iSCSI targets & FC gateways report to the iSNS server
  – Client asks the iSNS server: "I'm xx, where is my storage?"
  – iSNS responds with targets and portals
  – Resources may be divided into domains
  – Changes are notified immediately (SCNs)
iSNS or SLP run over IPoIB or GbE, and can span both networks
[Diagram: iSCSI Client, iSNS Server, GbE Switch, FC Switch, IB to IP Router, Native IB RAID, IB to FC Routers]
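The register-then-query pattern behind iSNS discovery can be sketched as a toy name server. This does not speak the iSNS protocol; the class, its methods, and the IQN names and addresses are all hypothetical, chosen only to show targets reporting their portals and a client seeing just the targets in its own domain.

```python
from collections import defaultdict


class ToyNameServer:
    """In-memory stand-in for an iSNS server (illustration only)."""

    def __init__(self):
        self._targets = defaultdict(dict)     # domain -> {target name: [portals]}
        self._initiators = defaultdict(set)   # domain -> initiator names

    def register_target(self, domain, target, portals):
        """Targets and gateways report themselves and their portals."""
        self._targets[domain][target] = list(portals)

    def add_initiator(self, domain, initiator):
        self._initiators[domain].add(initiator)

    def query(self, initiator):
        """Client asks: where is my storage? Only its own domains answer."""
        found = {}
        for domain, members in self._initiators.items():
            if initiator in members:
                found.update(self._targets.get(domain, {}))
        return found


ns = ToyNameServer()
# A portal is an (IP, port) pair; 3260 is the usual iSCSI port.
ns.register_target("hpc", "iqn.2007-05.example:raid1", [("192.0.2.10", 3260)])
ns.register_target("backup", "iqn.2007-05.example:tape1", [("192.0.2.20", 3260)])
ns.add_initiator("hpc", "iqn.2007-05.example:host1")
visible = ns.query("iqn.2007-05.example:host1")
```

A real iSNS server would additionally push state change notifications (SCNs) to registered clients when a target appears or disappears.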

Conclusion
Both SRP and iSER support RDMA
  – Source and destination addresses in the SCSI transfer
  – Zero memory copy
SRP uses
  – Direct server connections
  – Small controlled environments
iSER uses
  – Large switch-connected networks
  – Discovery fully supported
