Presentation is loading. Please wait.

Presentation is loading. Please wait.

High-Performance Networking With NDIS 6.0, TCP Chimney Offload, and RSS Vik Desai Program Manager Windows Networking Microsoft Corporation.

Similar presentations


Presentation on theme: "High-Performance Networking With NDIS 6.0, TCP Chimney Offload, and RSS Vik Desai Program Manager Windows Networking Microsoft Corporation."— Presentation transcript:

1 High-Performance Networking With NDIS 6.0, TCP Chimney Offload, and RSS Vik Desai Program Manager Windows Networking Microsoft Corporation

2 Appropriate Audience Who should attend this session? Networking product builders Product decision makers Hardware and software engineers Architects Network designers and deployers IT Managers IT Consultants Venture Capitalists and Private Investors Industry analysts

3 Agenda Networking stack challenges Scalable networking goals Scalable networking architecture Receive Side Scaling (RSS) TCP Chimney Offload Scalable networking demo NetXen Demo – Vikram Karvat Broadcom Demo – Uri Elzur Offload roadmap Summary and Call to Action

4 Networking Challenges Receive processing limited to a single CPU on a multi-processor system CPU utilized in Protocol Processing increases with Physical layer speeds Data Movement between network and application buffers is a bottleneck Large number of Interrupts even with Interrupt Moderation

5 Scalable Networking Goals Boost application scalability on 1 GB and 10 GB Ethernet with an integrated architecture That preserves standard infrastructure (1500b MTU) That maintains standard network and server management practices That does not compromise security, server reliability, and application compatibility Enable Ethernet fabric convergence Robustly support new class of protocol offload NICs in Microsoft Windows

6 Receive Side Scaling Networking Challenge Receive processing limited to a single CPU on a multi-processor system Solution Parallelize receive processing by Queuing incoming packets to multiple CPUs Implementing Solution via RSS NIC manages multiple hardware queues NIC hashes incoming TCP segments to different hardware queues NIC driver requests DPCs on appropriate CPUs

7 RSS Description – Non RSS Capable NIC Regular NIC Receive FIFO Interrupt Logic Processor 0 ISRNDIS TCPIP APP DPCDPC Incoming Packet

8 RSS Description – RSS Capable NIC RSS CapableNIC Receive FIFOs Interrupt Logic Incoming Packet Processor 0 ISR NDIS TCPIP APP DPCDPC Processor 1 NDIS TCPIP APP DPCDPC Processor 2 NDIS TCPIP APP DPCDPC Toeplitz Hash

9 TCP Chimney Offload Networking Challenges Data Movement between network and application buffers is a bottleneck Large number of Interrupts even with Interrupt Moderation CPU utilized in Protocol Processing increases with Physical layer speeds Solution Provide Zero Copy solution for pre posted buffers Change interrupts from a per packet basis to a per segment basis Offload Protocol Processing to hardware

10 TCP Chimney Architecture NDIS 5.2 / 6.0 NDIS Miniport Driver TCP Chimney Offload Capable Hardware Framing Layer (Ethernet) Path Layer IPv4 or IPv6 Transport Layer (TCP) Other Misc. Layers Switch Applications State Updates Data Transfer TCP Chimney Interfaces

11 TCP Chimney Interface Details TCP/IP States Divided into Const State – Does not change for connection lifetime Cached State – Controlled by host stack and updated appropriately to offload target Delegated State – Controlled by Offload Target NDIS Supports Offload Capability Advertisement Interface to transfer and update state information Interface to query statistics Interface to transfer data

12 TCP Chimney Initialization Offload Manager determines suitability of connection for offload State from each layer is captured and transferred to offload target Incoming Data packets/outgoing sends are queued Data packets will be replayed to offload targets for successful offload attempts Data packets will be processed by stack for unsuccessful offload attempts Data Transfer Begins

13 TCP Chimney Data Transfer Sends Segment passed to offload target for completion Send Completions after end-to-end TCP Ack Receive If no receive buffers posted indicate data If receive buffers are posted indication occurs as appropriate OOB/Urgent Data passed to Host Stack

14 TCP Chimney Connection Teardown Connections can be uploaded/offloaded at any time Heuristics Manager tracks connections appropriate for upload/offload Half Closed Connections are not uploaded Upload request initiated by offload target Offload target to provide delegated state to host stack Offload target keeps connection state till host sends upload call

15 TCP Chimney Implications IPsec Chimney required for IPsec traffic Will not work with IM drivers incapable of understanding Chimney interfaces Hooking Firewalls Best benefits for Long Lived Connections Pre-posted Receive Buffers Large Application IO Sizes

16 10GbE Chimney Offload Vikram Karvat VP Marketing Faisal Latif Principal Software Engineer

17 NetXen Next generation Ethernet silicon provider focused on server OEMs Chips, Boards, S/W Founded February 2002 Top tier investors Accel, Benchmark, Integral Capital Expertise in semiconductor, software, systems and servers Intelligent NIC™ product line Launched March 27, 2006 REAL products, REAL customers

18 Intelligent NIC Architecture Single-Chip Dual 10GbE Quad GbE Protocol Features TCP/IPRDMAiSCSIVirtualizationSecurity Native 8X PCI-express 1X/4X/8X 10GE Flow Classifier DDR Protocol Processing Engine CAM PCI-E 8X CORE INTERCONNECT FABRIC 10GbE L2 Caches GbE QDR QM

19 NetXen 10GbE Chimney Tx Windows Server 2003 SP1 with SNP Rx 3.4 GHz Xeon 10GbE Switch

20 10GbE Chimney Results Throughput CPU Utilization Configuration: DP Xeon, 3.4GHz, HT off, 2GB 60% Throughput 800% Processor Efficiency

21 Demo Conclusion 10GbE is happening NOW Chimney enables Scalability with balanced system design Increased datacenter power efficiency The Agile Datacenter requires Adaptability, Scalability, Intelligence

22 Broadcom Uri Elzur Director, Advanced Technology Broadcom Gururaj Ananthateerta Senior Staff Engineer Broadcom

23 Scalable TCP Chimney enables Convergence Over Ethernet Scalable TCP Chimney - basis for Convergence over Ethernet TCP based - Socket applications, iSCSI, iSCSI boot, iWARP (RDMA) Microsoft’s SNP enable convergence over Ethernet Secure (Network based security), robust and standard compliant implementation is required Ethernet requires Layer 2 functionality – VLAN, WoL, power management Integrated Management File System TCP/IP NDIS NDIS IM Driver NDIS Miniport Class Driver iSCSI Miniport iSCSI Port Driver. Storage Applications NIC Partition HBA Windows Socket Switch Sockets Applications Windows Sockets RDMA Driver User Mode Kernel Mode RDMA Provider RNIC (iscsiprt sys) C-NIC

24 Broadcom’s C-NIC 2.5G/S NTTCP over 2.5 GB/s TCP Chimney S2 (TX/RX) S1 (TX/RX) HP DL 380G4 server 3.4GHz Intel Xeon CPU 1 GB RAM Windows Server 2003 SP1-SNP build 2670 Two BCM5708S NICs Broadcom Miniport driver v * C-NIC Perfmon BCM5708S BCM5708S fiber cable NTTTCPs HP DL 380G4 server 3.4GHz Intel Xeon CPU 1GB RAM Windows Server 2003 SP1-SNP build 2670 Two BCM5708S NICs Broadcom Miniport driver v * Broadcom 2.5G Switch BCM56580 StrataXGS III

25 TCP Chimney scales… 2.5G/S offers more BW than non-TOE, at 1/6 of the CPU utilization 2.5G/S offers more BW than non-TOE, at 1/6 of the CPU utilization Microsoft’s SNP combined with BCM5708 provides 7.5 times better P/E Microsoft’s SNP combined with BCM5708 provides 7.5 times better P/E Performance Efficiency ( P/E) is network throughput divided by CPU Utilization Performance Efficiency ( P/E) is network throughput divided by CPU Utilization At Gigabit and beyond, TCP Chimney is critical to free up cycles for the applications At Gigabit and beyond, TCP Chimney is critical to free up cycles for the applications Higher is better Lower is better BW improvement TOE vs. L2 CPU Utilization reduction TOE vs. L2 Demo: NTTTCP TOE L2

26 RSS Improves SMP Scalability With RSS web traffic is more evenly distributed on multiple CPUs Web Bench delivers up to 50% more requests/sec Demo: Web Bench 5.0

27 Demo Conclusion Broadcom’s C-NIC with Microsoft’s TCP Chimney is here TODAY TCP Chimney scales to accommodate the needs of the server and applications TCP Chimney is the basis for the future of Networking in Windows Architecture allows for IPsec based security RSS provides for a better load spreading on SMP servers

28 Scalable Networking Pack Partners

29 Future Chimney Offloads IPsec Chimney RDMA Chimney SSL Chimney

30 Call To Action Develop low cost TCP Chimney Offload and RSS hardware for Windows Vista and Windows Server codenamed “Longhorn” Deploy TCP Chimney Offload and RSS hardware in enterprise and personal computing environments

31 Additional Resources Web Resources Documentation, White Papers, and software bits available today for TCP Chimney Offload and RSS: Specs: DDK and Documentation will available on: White Paper: Other Resources: Related Sessions Net088 – Technical Overview of Microsoft’s NetDMA Architecture Please send to with questions microsoft.com

32 © 2006 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

33


Download ppt "High-Performance Networking With NDIS 6.0, TCP Chimney Offload, and RSS Vik Desai Program Manager Windows Networking Microsoft Corporation."

Similar presentations


Ads by Google