InfiniBand support for Socket- based connection model by CM Arkady Kanevsky November 16, 2005 version 4.

Slides:



Advertisements
Similar presentations
Transport Layer3-1 Transport Overview and UDP. Transport Layer3-2 Goals r Understand transport services m Multiplexing and Demultiplexing m Reliable data.
Advertisements

Introduction 1-1 Chapter 3 Transport Layer Intro and Multiplexing Computer Networking: A Top Down Approach 6 th edition Jim Kurose, Keith Ross Addison-Wesley.
1 Ports and IPv6. 2 Ports Transmission Control Protocol (TCP) or the User Datagram Protocol (UDP), used for communication Generally speaking, a computer.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Addressing the Network – IPv4 Network Fundamentals – Chapter 6.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public ITE PC v4.0 Chapter 1 1 Addressing the Network – IPv4 Network Fundamentals – Chapter 6.
Umut Girit  One of the core members of the Internet Protocol Suite, the set of network protocols used for the Internet. With UDP, computer.
Media Access Control (MAC) addresses in the network access layer ▫ Associated w/ network interface card (NIC) ▫ 48 bits or 64 bits IP addresses for the.
OFED TCP Port Mapper Proposal June 15, Overview Current NE020 Linux OFED driver uses host TCP/IP stack MAC and IP address for RDMA connections Hardware.
CCNA – Network Fundamentals
BZUPAGES.COM 1 User Datagram Protocol - UDP RFC 768, Protocol 17 Provides unreliable, connectionless on top of IP Minimal overhead, high performance –No.
Natting NATTING. Private vs Public IP Addresses Whatever connects directly into Internet must have public (globally unique) IP address There is a shortage.
© 2002 IBM Corporation IPoIB IETF-60 May 16, 2015 IPoIB Vivek Kashyap
Understanding Networks. Objectives Compare client and network operating systems Learn about local area network technologies, including Ethernet, Token.
WXES2106 Network Technology Semester /2005 Chapter 8 Intermediate TCP CCNA2: Module 10.
IP-UDP-RTP Computer Networking (In Chap 3, 4, 7) 건국대학교 인터넷미디어공학부 임 창 훈.
Gursharan Singh Tatla Transport Layer 16-May
OFED (iWarp) Enhancements Felix Marti, Open Fabrics Alliance Workshop Sonoma, April 2008 Chelsio Communications.
IB ACM InfiniBand Communication Management Assistant (for Scaling) Sean Hefty.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Addressing the Network – IPv4 Network Fundamentals – Chapter 6.
New Direction Proposal: An OpenFabrics Framework for high-performance I/O apps OFA TAC, Key drivers: Sean Hefty, Paul Grun.
1 Transport Layer Computer Networks. 2 Where are we?
Chapter 17 Networking Dave Bremer Otago Polytechnic, N.Z. ©2008, Prentice Hall Operating Systems: Internals and Design Principles, 6/E William Stallings.
Enabling Embedded Systems to access Internet Resources.
© 2009 Pearson Education Inc., Upper Saddle River, NJ. All rights reserved. © The McGraw-Hill Companies, Inc. Transport Layer Protocols – UDP and TCP Asst.
Jozef Goetz, Application Layer PART VI Jozef Goetz, Position of application layer The application layer enables the user, whether human.
InfiniBand Routing Solution Approach Yaron Haviv, CTO, Voltaire
CS 1652 The slides are adapted from the publisher’s material All material copyright J.F Kurose and K.W. Ross, All Rights Reserved Jack Lange.
Chapter 3 Transport Layer
2006 Sonoma Workshop February 2006Page 1 Sockets Direct Protocol (SDP) for Windows - Motivation and Plans Gilad Shainer Mellanox Technologies Inc.
ISER on SCTP & IB draft-hufferd-ips-iser-sctp-ib-00.txt Generalizations to iSER specification John Hufferd Mike Ko Yaron Haviv.
Scalable RDMA Software Solution Sean Hefty Intel Corporation.
RDMA IP CM Service Annex Arkady Kanevsky, Ph.D. IBTA SWG San Francisco September 25, 2006.
Transport Layer3-1 Chapter 3 Transport Layer Computer Networking: A Top Down Approach 4 th edition. Jim Kurose, Keith Ross Addison-Wesley, July A.
Layer 3: Internet Protocol.  Content IP Address within the IP Header. IP Address Classes. Subnetting and Creating a Subnet. Network Layer and Path Determination.
IWARP Status Tom Tucker. 2 iWARP Branch Status  OpenFabrics SVN  iWARP in separate branch in SVN  Current with trunk as of SVN 7626  Support for two.
ISCSI Extensions for RDMA (iSER) draft-ko-iwarp-iser-02 Mike Ko IBM August 2, 2004.
ISER on InfiniBand (and SCTP). Problem Statement Currently defined IB Storage I/O protocol –SRP (SCSI RDMA Protocol) –SRP does not have a discovery or.
TURN Jonathan Rosenberg Cisco Systems. Changes since last version Moved to behave terminology Many things moved into STUN –Basic request/response formation.
MULTIPLEXING/DEMULTIPLEXING, CONNECTIONLESS TRANSPORT.
August 04, 2004John Carrier, Adaptec1 One-Shot STags John Carrier Adaptec.
ISER Support Annex Arkady Kanevsky, Ph.D. IBTA SWG San Francisco September 25, 2006.
IP Protocol CSE TCP/IP Concepts Connectionless Operation Internetworking involves connectionless operation at the level of the Internet Protocol.
1 Minneapolis‘ IETF IPFIX Aggregation draft-dressler-ipfix-aggregation-00.txt.
1 Network Communications A Brief Introduction. 2 Network Communications.
1 K. Salah Module 5.1: Internet Protocol TCP/IP Suite IP Addressing ARP RARP DHCP.
Progress in Standardization of RDMA technology Arkady Kanevsky, Ph.D Chair of DAT Collaborative.
CSEN 404 Transport Layer I Amr El Mougy Lamia Al Badrawy.
SC’13 BoF Discussion Sean Hefty Intel Corporation.
Chapter 3 Transport Layer
Introduction to Networks
Chapter 3 Transport Layer
The Transport Layer Implementation Services Functions Protocols
Transport Layer Slides are originally from instructor: Carey Williamson at University of Calgary Very minor modification are made Notes derived from “Computer.
Multiplexing.
Chapter 3 outline 3.1 Transport-layer services
Transport Layer.
06- Transport Layer Transport Layer.
Simple Connectivity Between InfiniBand Subnets
Understand the OSI Model Part 2
Introduction to Networking
CS 1652 Jack Lange University of Pittsburgh
Introduction to Networks
Subject Name: Computer Communication Networks Subject Code: 10EC71
Transport Layer Our goals:
September 19th, 2013 CS1652 Jack Lange University of Pittsburgh
Process-to-Process Delivery:
Process-to-Process Delivery: UDP, TCP
Chapter 3 Transport Layer
Transport Layer Our goals:
Presentation transcript:

InfiniBand support for Socket- based connection model by CM Arkady Kanevsky November 16, 2005 version 4

2 Work Items for InfiniBand CM extension Maximize Consumer-usable private data Standard IBTA CM REQ with private data format extensions –CM REQ provides IP address and port (analog) of the requestor, and IP address of the destination TCP Port of the destination is available from receiver Service ID –Use SID range to identify that private data is formatted Define mapping between TCP port space and IB Service IDs for RDMA aware ULPs.

3 CM REQ Message Private Data Format Source IP Address (31 – 0) Source IP Address (63 – 32) Source IP Address (95 – 64) Source IP Address (127 – 96) Source Port IP Version Consumer Private Data Destination IP Address (31 – 0) Destination IP Address (63 – 32) Destination IP Address (95 – 64) Destination IP Address (127 – 96) reserved

4 CM REQ Message Private Data Fields Version (4 bits) – protocol version IP Version (4 bits) –0100b: IPv4 –0110b: IPv6 –other values reserved reserved (8 bits) –set to 0 byte aligned Source Port (2 bytes) 16-byte aligned Source IP Address (16 bytes) –IP address of initiator 16-byte aligned Destination IP Address (16 bytes) –IP address of destination 16-byte aligned Consumer Private Data (56 bytes)

5 CM REQ Message Private Data formatting indicator SID range indicates that private data is formatted API or verbs that use IETF addressing indicate that IETF protocol # and port need to be mapped into IB SID

6 Mapping between TCP ports and IB Service IDs Use simple mapping –predefine upper 48 bits to form “RDMA aware ULPs service IDs” Use IBTA Service ID range –byte 1 – 0x00 –byte 2-4 – 0x00s –byte 5 – 0x01 – indicator that SID is in RDMA aware SID range –byte 6 - IETF Protocol number (8 bits) »TCP=0x06 (6), UDP=0x11 (17), SCTP= 0x83 (132) –byte 7-8 – protocol ports –1-to-1 map protocol ports to lower 16 bits of SID ensures that all well known ports are mapped the same way for each IETF protocol

Rationale and Discussions

8 Mapping Rationale - I Support for a ULP running over raw IB, IPoIB, and intermediate protocols (SDP) –SDP defines its own Service ID space and port mappings. Proposal shall not alter existing SDP protocol or its implementations. SDP ULPs are not RDMA aware. –IPoIB does not expose IB ports and all IP traffic goes over a UD IB over a single IB QP. All port multiplexing is done by IPoIB. ULP is not RDMA aware. –RDMA aware ULPs need port mapping.

9 Mapping Rationale - II What is RDMA aware ULP? –ULP that explicitly use RDMA semantic: memory registration, preposting recv buffers, RDMA operations and so on. RDMA transport independent –run on IB and iWARP without transport dependent code SDP ULPs are not RDMA aware –SDP provides port mapping and startup protocol SDP is transport dependent –ULPs do not use RDMA semantic

10 Alternatives Port mapping Use port mapper –designated Service IDs for SDP, RDMA aware ULPs. in-line port mapper for RDMA aware ULPs –Service on top of CM will redirect connection requests based on 5-tuple: IETF protocol, DST IP address and Port, SRC IP address and Port –designated Service ID for port mapper

11 Discussions - I Do we need include 0-bazed VA (ZB) into this proposal? iSER protocol does not advertise VA and relies implicitly on 0-based VA –This is ULP specific protocol/assumption and should be handled by the ULP –Outside the scope of this proposal

12 Discussions - II Do we need include support for Remote Send with Invalidate (SI) into this proposal? Rationale why it was remove for v3: –QP does not have an attribute for turning on or off SI support Hence CM can not convey this info on its own, nor take any action based on this field in CM messages –Remote Invalidate usage support request and response must come from ULP Different ULPs can take difference actions based on its ability to adapt to remote side usage of SI as well as IB support for it: –example 1: Responder refuses the connection if requestor ask for SI and responder either will not do it or its IB Provider does not support it. –example 2: Responder accepts the connection if requestor ask for SI but convey to the requestor whether or not it will use it –example 3: Responder refuses the connection if requestor does not ask for SI –example 4: Responder accepts the connection if requestor do not ask for SI but convey to the requestor whether or not it will use it Thus, it is ULP specific connection management and outside the scope of this proposal IBTA iSER protocol should define the protocol to use Consumer Private Data for SI. IETF iSER does not expose it Current IETF NFS-RDMA protocol suite does not use SI

13 Discussions - III Do we need to support for IETF Protocol number? IETF iWARP is only defined for TCP and SCTP Since IETF post spaces are independently managed based on protocol number it is better to support it to avoid issues in the future when RDMA will be defined/used over non-TCP protocol Protocol # is in SID – so no bandwidth savings by not supporting it Clean and complete IETF 5-tuple support

14 Discussions - IV Do we need to pass Destination Port? CM can do the mapping between Service IDs and ports to provide Destination Port to a Responder. –The conversion is an API issue is outside the scope of the spec The same is true for populating Private Data CM formatted fields. –IBTA does not define CM verbs.

15 Discussions - V The use of IP addresses reserved for privileged/kernel users Not an issue since CM REQ message Q- key is setup by CM appropriately. IBTA spec already covers it. –If packet is tampered on a wire then it is analogous to the Ethernet situation. Nothing special to be done. –provided by IB security model

16 Discussion VI 64 bit IP addresses alignment private data starts at 140 byte –IP addresses start at 144 byte –This is 8 byte align IP addresses are 8 byte aligned