OFED 1.2 Status and Contents

Slides:



Advertisements
Similar presentations
System Area Network Abhiram Shandilya 12/06/01. Overview Introduction to System Area Networks SAN Design and Examples SAN Applications.
Advertisements

The Development of Mellanox - NVIDIA GPUDirect over InfiniBand A New Model for GPU to GPU Communications Gilad Shainer.
OFED TCP Port Mapper Proposal June 15, Overview Current NE020 Linux OFED driver uses host TCP/IP stack MAC and IP address for RDMA connections Hardware.
Uncovering Performance and Interoperability Issues in the OFED Stack March 2008 Dennis Tolstenko Sonoma Workshop Presentation.
RDS and Oracle 10g RAC Update Paul Tsien, Oracle.
An overview of Infiniband Reykjavik, June 24th 2008 R E Y K J A V I K U N I V E R S I T Y Dept. Computer Science Center for Analysis and Design of Intelligent.
2006 Sonoma Workshop January 2006 Pre-Plugfest Interop Session Tuan Phamdo – Intel – Co-Chair IBTA CIWG Sujal Das - Director, SW Product Mgmt, Mellanox.
Improving the OFED Development Process.
IB ACM InfiniBand Communication Management Assistant (for Scaling) Sean Hefty.
New Direction Proposal: An OpenFabrics Framework for high-performance I/O apps OFA TAC, Key drivers: Sean Hefty, Paul Grun.
OFA-IWG - March 2010 OFA Interoperability Working Group Update Authors: Mikkel Hagen, Rupert Dance Date: 3/15/2010.
SRP Update Bart Van Assche,.
Discussing an I/O Framework SC13 - Denver. #OFADevWorkshop 2 The OpenFabrics Alliance has recently undertaken an effort to review the dominant paradigm.
IDC HPC Users Forum Spring 2009 Gilad Shainer, Mellanox Marketing Working Group On behalf of Bill Boas, Executive Director
OFED 1.x Roadmap & Release Process November 06 Jeff Squyres, Woodruff, Robert J, Betsy Zeller, Tziporet Koren,
Page 1 Overview of the OpenFabrics Alliance and OpenFabrics Enterprise Distribution (OFED™) Open Source, High Performance and High Efficiency Software.
Roland Dreier Technical Lead – Cisco Systems, Inc. OpenIB Maintainer Sean Hefty Software Engineer – Intel Corporation OpenIB Maintainer Yaron Haviv CTO.
OFA Interoperability Logo Program Sujal Das, April 30, 2007 Sonoma Workshop Presentation.
OFA-IWG Interop Event March 2008 Rupert Dance, Arkady Kanevsky, Tuan Phamdo, Mikkel Hagen Sonoma Workshop Presentation.
Copyright DataDirect Networks - All Rights Reserved - Not reproducible without express written permission Adventures Installing Infiniband Storage Randy.
1 March 2010 A Study of Hardware Assisted IP over InfiniBand and its Impact on Enterprise Data Center Performance Ryan E. Grant 1, Pavan Balaji 2, Ahmad.
Open Fabrics BOF Supercomputing 2008 Tziporet Koren, Gilad Shainer, Yiftah Shahar, Bob Woodruff, Betsy Zeller.
OFED for Linux: Status and Next Steps 1 Betsy Zeller (Qlogic), Tziporet Koren (Mellanox) 3/16/2010.
OFED 1.2 Lessons, 1.3 Planning and Field Support May 07 Tziporet Koren.
Boosting Event Building Performance Using Infiniband FDR for CMS Upgrade Andrew Forrest – CERN (PH/CMD) Technology and Instrumentation in Particle Physics.
2006 Sonoma Workshop February 2006Page 1 of (#) General Windows Update Gilad Shainer Mellanox Technologies Inc.
InfiniSwitch Company Confidential. 2 InfiniSwitch Agenda InfiniBand Overview Company Overview Product Strategy Q&A.
OpenFabrics 2.0 or libibverbs 1.0 Sean Hefty Intel Corporation.
Management Scalability Author: Todd Rimmer Date: April 2014.
Update on Scalable SA Project #OFADevWorkshop Hal Rosenstock Mellanox Technologies.
ISER Update OpenIB Workshop, Feb 2006 Yaron Haviv, Voltaire John Hufferd, Brocade
OpenFabrics Windows Development and Microsoft Windows CCS 2003 Part1
High Availability through the Linux bonding driver
Open Fabrics BOF Supercomputing 2008 Tziporet Koren, Gilad Shainer, Yiftah Shahar, Bob Woodruff, Betsy Zeller Rev. 0.9.
Remote Direct Memory Access (RDMA) over IP PFLDNet 2003, Geneva Stephen Bailey, Sandburst Corp., Allyn Romanow, Cisco Systems,
OFED - Status and Process November 2007 Tziporet Koren.
Scalable RDMA Software Solution Sean Hefty Intel Corporation.
OFED Usage in VMware Virtual Infrastructure Anne Marie Merritt, VMware Tziporet Koren, Mellanox May 1, 2007 Sonoma Workshop Presentation.
1 Public DAFS Storage for High Performance Computing using MPI-I/O: Design and Experience Arkady Kanevsky & Peter Corbett Network Appliance Vijay Velusamy.
OpenFabrics Enterprise Distribution (OFED) Update
Windows OpenFabrics (WinOF) Update Gilad Shainer, Mellanox Technologies November 2007.
Open Fabrics BOF Supercomputing 2009 Tziporet Koren, Gilad Shainer, Yiftah Shahar, Stan Smith Hal Rosenstock, Jeff Squyres, DK Panda,
Intel Research & Development ETA: Experience with an IA processor as a Packet Processing Engine HP Labs Computer Systems Colloquium August 2003 Greg Regnier.
OFED 1.3 InfiniBand Management Update Hal Rosenstock.
Open Fabrics BOF Tziporet Koren, Gilad Shainer, Yiftah Shahar, Bob Woodruff, Betsy Zeller.
InfiniBand at Sun Carl Hensler Distinguished Engineer Solaris Engineering Sun Microsystems.
iSER update 2014 OFA Developer Workshop Eyal Salomon
OpenFabrics Interface WG A brief introduction Paul Grun – co chair OFI WG Cray, Inc.
Shawn Hansen Director of Marketing. Windows Compute Cluster Server 2003 Enable scientist and researcher to focus on Science, not IT. Mission: Enable scientist.
Open MPI OpenFabrics Update April 2008 Jeff Squyres.
OFED 1.2 Management Update Hal Rosenstock.
Barriers to IB adoption (Storage Perspective) Ashish Batwara Software Solution Architect May 01, 2007.
OFA-IWG Interop Event April 2007 Rupert Dance Lamprey Networks Sonoma Workshop Presentation.
InfiniBand Routing in OFA Jason Gunthorpe – Obsidian Sean Hefty – Intel Hal Rosenstock – Voltaire.
Progress in Standardization of RDMA technology Arkady Kanevsky, Ph.D Chair of DAT Collaborative.
SC’13 BoF Discussion Sean Hefty Intel Corporation.
Advisor: Hung Shi-Hao Presenter: Chen Yu-Jen
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
The Holmes Platform and Applications
Enhancements for Voltaire’s InfiniBand simulator
Introduction to Networks
Introduction to Networks
Joint Techs Workshop InfiniBand Now and Tomorrow
QNX Technology Overview
OpenFabrics Alliance An Update for SSSI
Web Application Server 2001/3/27 Kang, Seungwoo. Web Application Server A class of middleware Speeding application development Strategic platform for.
Application taxonomy & characterization
Microsoft Core Storage Update
Presentation transcript:

OFED 1.2 Status and Contents April 07 Tziporet Koren 1

Agenda What is OFED? OFED Architecture and Components OFED 1.2 New Features OFED 1.2 Status What’s next?

OFED - Open Fabrics Enterprise Distribution Enterprise Working Group (EWG) within Open Fabrics Alliance (OFA) Collaborative effort to test & release OFA software Broader test participation Multi-vendor interoperability Ready for OS vendor adoption Support for many distributions Components – Kernel & User Space Add-on components for vendors to differentiate above OFA Reduces deployment complexity and cost

Agenda What is OFED? OFED Architecture and Components OFED 1.2 New Features OFED 1.2 Status What’s next?

Apps & Access Methods for using OF Stack OFA Linux Software Stack SA Subnet Administrator MAD Management Datagram SMA Subnet Manager Agent PMA Performance Manager Agent IPoIB IP over InfiniBand SDP Sockets Direct Protocol SRP SCSI RDMA Protocol (Initiator) iSER iSCSI RDMA Protocol (Initiator) RDS Reliable Datagram Service VNIC Virtual NIC UDAPL User Direct Access Programming Lib HCA Host Channel Adapter R-NIC RDMA NIC Application Level IP Based App Access Sockets Based Access Various MPIs Block Storage Access Clustered DB Access Access to File Systems Diag Tools Open SM User Level MAD API UDAPL User APIs OpenFabrics User Level Verbs & CMA / API User Space SDP Lib Kernel Space Upper Layer Protocol VNIC IPoIB SDP SRP iSER RDS NFS-RDMA RPC Cluster File Sys Connection Manager Abstraction (CMA) Mid-Layer SA Client MAD SMA Connection Manager Connection Manager Kernel bypass Kernel bypass OpenFabrics Kernel Level Verbs / API Provider Hardware Specific Driver Hardware Specific Driver Common Apps & Access Methods for using OF Stack Key Hardware InfiniBand HCA iWARP R-NIC InfiniBand iWARP

OFED 1.2 Components OFA development Add on HCA/NIC Drivers Bonding module MPI Components: MVAPICH Open MPI MVAPICH2 MPI tests: OSU benchmarks, Intel MPI benchmarks, Presta HCA/NIC Drivers Mellanox, QLogic, IBM, Chelsio Core: verbs, MAD, SMA, CMA, SA cache IPoIB SDP SRP, iSER RDS VNIC UDAPL OSM Diagnostic tools New in 1.2

Agenda What is OFED? OFED Architecture and Components OFED 1.2 New Features Kernel High Availability User Level Management iWARP MPI OFED 1.2 Status What’s next?

Main New Features - Kernel GA level for the EDC market: SDP, RDS, High Availability, Storage (iSER and SRP) Stability improvements Performance improvements: New - IPoIB Connected Mode (~1000 MB/Sec) New - RDS for Oracle SDP message BW: 10X for small messages 5X for medium messages Scalability: SDP memory consumption limit

High Availability - IPoIB Fails over from one interface to another on carrier off Two solutions: User space: A script that detects carrier off/on events, reconfigures interfaces, sends rarp to notify remote side Status: GA Kernel module: Bonding Covered in a separate talk Status: Beta

High Availability - SRP Failover between two ports/HCAs SRP HA is built of two parts: srp_daemon – discovery and sets up for all possible paths to SRP targets on the fabric Multipath tool – switches to a different path when a path fails. Current version uses Device Mapper multi-path Device Mapper (DM) driver from Linux kernel The persistent binding and HA are provided by user space apps (dm-multipath & dm-multipathd) Solution works for RedHat EL4 and SLES10 Status: Beta

High Availability – RDS & DSP RDS implementation fully support HA Requires IPoIB HA to work SDP: Does not support HA Can be implemented for same HCA using APM For multi-HCA support requires protocol change

Main New Features - User Level libibverbs 1.1: Fork support (requires apps change) Better low-level driver handling, including multiple drivers linked in statically Documentation: man pages librdmacm (uCMA) 1.0: Multicast joining from user space UD support

Main New Features - Management OpenSM: Routing improvements Performance improvement to min hop and up/down of over an order of magnitude New fat-tree and LASH algorithms SA optional record support “virtually” complete IB router enablement SA database dump/restore Many diagnostic improvements since OFED 1.1 Covered in DoE tools talk ibdiagui GUI for ibdiagnet Used at SC06

OFED 1.2 - iWARP Support Chelsio NIC supported Verbs and CMA APIs are the same as Infiniband ULPs that are supported: MPI (mvapich2 tested) uDAPL Basic Testing: mvapich2 NFS-RDMA Status: Beta

Main New Features - MPI MPI implementations: MVAPICH: version 0.9.9 Open MPI: version 1.2.1 MVAPICH2: version 0.9.8 (New) Common MPI setup sourcing: Simple menu-driven interface to choose which MPI implementation to set as the default on a per-user and/or system-wide basis

Main New Features - MVAPICH MVAPICH – Version 0.9.9 Improved message coalescing: Reduction of per QP send queues for reduction in memory requirement Increases the small message messaging rate significantly Multi-core optimizations: Optimized scalable shared memory design Optimized, high-performance shared memory aware collective operations Multi-port support for enabling user processes to bind to different IB ports for balanced communication performance On-demand connection management using native IB UD support Multi-path support for hot-spot avoidance in large scale clusters using LMC Memory Hook Support provided by integration with ptmalloc2 library

Main New Features - Open MPI Improvements to scalability of launching applications on large numbers of nodes "Installdirs" functionality (install OMPI into one place and then be able to move it elsewhere; good for ISV's) Support for fork() when using the OF libibverbs Support for setting fixed limits on registered memory Fixes for heterogeneous network environments (e.g., different number of IB ports on different hosts)

MPI - MVAPICH2 Includes most of the features of MVAPICH Performance and scalability comparable to MVAPICH for two-sided communication Added MPI-2 features (one-sided communication, collectives and datatype) Integrated Multi-rail support Multi-threading support (MPI_Thread_Multiple) RDMACM support for InfiniBand and iWARP Checkpoint/Restart support for application transparent systems-level fault tolerance

Agenda What is OFED? OFED Architecture and Components OFED 1.2 New Features OFED 1.2 Status OFED 1.2 Release Status OFED 1.2 System Matrix Third Party Components Testing What’s next?

OFED 1.2 Release Status Feature freeze: Feb 2 Alpha: Feb 14 Beta: Mar 14 RC1: Apr 4 RC2: Apr 18 RC3: May 3 Release: May 16

OFED 1.2 System Matrix CPU Arch: kernel.org: kernel 2.6.20 and 2.6.19 X86, x86_64, PPC64, ia64 (IB only) kernel.org: kernel 2.6.20 and 2.6.19 Novell: SLES 9 SP3 SLES 10 (SP1) Redhat: RHEL 4 (up3 and up4) RHEL 5 Free distros (Fedora, SuSE Pro, Ubuntu) Basic testing only

Third Party Components Testing Proprietary MPIs: Intel HP (over uDAPL) Proprietary SMs: Cisco, Voltaire, Qlogic Storage Targets: iSER: IP Store (FalconStore), Voltaire FC GW SRP: Engenio, MTD2000, Areca-1220, DDN, Cisco GW

Agenda What is OFED? OFED Architecture and Components OFED 1.2 New Features OFED 1.2 Status What’s next?

What’s Next? OFED 1.3 – Oct/Nov 07 Features that didn’t make it in 1.2: Minimize integration effort into OS distribution Definition immediately after 1.2 is out QoS - collaborate with IBTA to align schedule of software delivery IPoIB: NAPI NFS over RDMA integration Mellanox ConnectX IB HCA support Including new features Other features to be agreed upon by OFA and EWG at the conference Effort into OS distribution, QoS - collaborate with IBTA to align schedule of software delivery with OFED 1.3 (make sure all end users understand what is the hold up here), Mellanox ConnectX IB HCA support, including new features (subject to discussion and approval by OFA and EWG - to be discussed and prelim plan put in place at this conference)

Summary OFED becomes the industry standard OFED 1.2 for the EDC market: Stability Performance High Availability Scalability OFED 1.2 for the HPC market: Scalable for large clusters Multi-core support Multi-rail Performance improvements Successful collaboration between all participants

Thank You