Reconfigurable Network Topologies at Rack Scale

Slides:



Advertisements
Similar presentations
M A Wajid Tanveer Infrastructure M A Wajid Tanveer
Advertisements

Software-defined networking: Change is hard Ratul Mahajan with Chi-Yao Hong, Rohan Gandhi, Xin Jin, Harry Liu, Vijay Gill, Srikanth Kandula, Mohan Nanduri,
A Novel 3D Layer-Multiplexed On-Chip Network
SDN + Storage.
Presentation of Designing Efficient Irregular Networks for Heterogeneous Systems-on-Chip by Christian Neeb and Norbert Wehn and Workload Driven Synthesis.
Slick: A control plane for middleboxes Bilal Anwer, Theophilus Benson, Dave Levin, Nick Feamster, Jennifer Rexford Supported by DARPA through the U.S.
Brocade VDX 6746 switch module for Hitachi Cb500
Pelican: A Building Block for Exascale Cold Data Storage
Module 1: Demystifying Software Defined Networking Module 2: Realizing SDN - Microsoft’s Software Defined Networking Solutions with Windows Server 2012.
Towards Virtual Routers as a Service 6th GI/ITG KuVS Workshop on “Future Internet” November 22, 2010 Hannover Zdravko Bozakov.
High Performance Router Architectures for Network- based Computing By Dr. Timothy Mark Pinkston University of South California Computer Engineering Division.
1 13-Jun-15 S Ward Abingdon and Witney College LAN design CCNA Exploration Semester 3 Chapter 1.
1 BGL Photo (system) BlueGene/L IBM Journal of Research and Development, Vol. 49, No. 2-3.
CAD and Design Tools for On- Chip Networks Luca Benini, Mark Hummel, Olav Lysne, Li-Shiuan Peh, Li Shang, Mithuna Thottethodi,
1 Evgeny Bolotin – ICECS 2004 Automatic Hardware-Efficient SoC Integration by QoS Network on Chip Electrical Engineering Department, Technion, Haifa, Israel.
Dynamic routing – QoS routing Load sensitive routing QoS routing.
ProActive Routing In Scalable Data Centers with PARIS Joint work with Dushyant Arora + and Jennifer Rexford* + Arista Networks *Princeton University Theophilus.
FireFly: A Reconfigurable Wireless Datacenter Fabric using Free-Space Optics Navid Hamedazimi, Zafar Qazi, Himanshu Gupta, Vyas Sekar, Samir Das, Jon.
A Scalable, Commodity Data Center Network Architecture Mohammad Al-Fares, Alexander Loukissas, Amin Vahdat Presented by Gregory Peaker and Tyler Maclean.
Jennifer Rexford Princeton University MW 11:00am-12:20pm Data-Center Traffic Management COS 597E: Software Defined Networking.
A Scalable, Commodity Data Center Network Architecture.
Storage area network and System area network (SAN)
Performance and Power Efficient On-Chip Communication Using Adaptive Virtual Point-to-Point Connections M. Modarressi, H. Sarbazi-Azad, and A. Tavakkol.
Practical TDMA for Datacenter Ethernet
C OLUMBIA U NIVERSITY Lightwave Research Laboratory Embedding Real-Time Substrate Measurements for Cross-Layer Communications Caroline Lai, Franz Fidler,
Identifying and Using Energy Critical Paths Nedeljko Vasić with Dejan Novaković, Satyam Shekhar, Prateek Bhurat, Marco Canini, and Dejan Kostić EPFL, Switzerland.
Networking Virtualization Using FPGAs Russell Tessier, Deepak Unnikrishnan, Dong Yin, and Lixin Gao Reconfigurable Computing Group Department of Electrical.
EVOLVING TRENDS IN HIGH PERFORMANCE INFRASTRUCTURE Andrew F. Bach Chief Architect FSI – Juniper Networks.
Copyright 2009 Fujitsu America, Inc. 0 Fujitsu PRIMERGY Servers “Next Generation HPC and Cloud Architecture” PRIMERGY CX1000 Tom Donnelly April
CustomerSegment and workloads Your Datacenter Active Directory SharePoint SQL Server.
Network Aware Resource Allocation in Distributed Clouds.
IT Infrastructure Chap 1: Definition
David G. Andersen CMU Guohui Wang, T. S. Eugene Ng Rice Michael Kaminsky, Dina Papagiannaki, Michael A. Kozuch, Michael Ryan Intel Labs Pittsburgh 1 c-Through:
SMART: A Single- Cycle Reconfigurable NoC for SoC Applications -Jyoti Wadhwani Chia-Hsin Owen Chen, Sunghyun Park, Tushar Krishna, Suvinay Subramaniam,
DARD: Distributed Adaptive Routing for Datacenter Networks Xin Wu, Xiaowei Yang.
InfiniSwitch Company Confidential. 2 InfiniSwitch Agenda InfiniBand Overview Company Overview Product Strategy Q&A.
Lessons Learned The Hard Way: FPGA  PCB Integration Challenges Dave Brady & Bruce Riggins.
Scalable Reconfigurable Interconnects Ali Pinar Lawrence Berkeley National Laboratory joint work with Shoaib Kamil, Lenny Oliker, and John Shalf CSCAPES.
LAN Switching and Wireless – Chapter 1
1 LAN design- Chapter 1 CCNA Exploration Semester 3 Modified by Profs. Ward and Cappellino.
© 1999, Cisco Systems, Inc. Module 9: Understanding Virtual LANs.
LAN Switching and Wireless – Chapter 1 Vilina Hutter, Instructor
VL2: A Scalable and Flexible Data Center Network Albert Greenberg, James R. Hamilton, Navendu Jain, Srikanth Kandula, Changhoon Kim, Parantap Lahiri, David.
RF network in SoC1 SoC Test Architecture with RF/Wireless Connectivity 1. D. Zhao, S. Upadhyaya, M. Margala, “A new SoC test architecture with RF/wireless.
Computer Networks: Switching and Queuing Ivan Marsic Rutgers University Chapter 4 – Switching and Queuing Delay Models.
Software Defined Networks for Dynamic Datacenter and Cloud Environments.
Lecture 12: Reconfigurable Systems II October 20, 2004 ECE 697F Reconfigurable Computing Lecture 12 Reconfigurable Systems II: Exploring Programmable Systems.
Intradomain Traffic Engineering By Behzad Akbari These slides are based in part upon slides of J. Rexford (Princeton university)
Symbiotic Routing in Future Data Centers Hussam Abu-Libdeh Paolo Costa Antony Rowstron Greg O’Shea Austin Donnelly MICROSOFT RESEARCH Presented By Deng.
Advanced Computer Networks Lecturer: E EE Eng. Ahmed Hemaid Office: I 114.
Interconnect Networks Basics. Generic parallel/distributed system architecture On-chip interconnects (manycore processor) Off-chip interconnects (clusters.
Theophilus Benson*, Ashok Anand*, Aditya Akella*, Ming Zhang + *University of Wisconsin, Madison + Microsoft Research.
R2C2: A Network Stack for Rack-scale Computers Paolo Costa, Hitesh Ballani, Kaveh Razavi, Ian Kash Microsoft Research Cambridge EECS 582 – W161.
1 Scalability and Accuracy in a Large-Scale Network Emulator Nov. 12, 2003 Byung-Gon Chun.
Runtime Reconfigurable Network-on- chips for FPGA-based systems Mugdha Puranik Department of Electrical and Computer Engineering
XFabric: a Reconfigurable In-Rack Network for Rack-Scale Computers Sergey Legtchenko, Nicholas Chen, Daniel Cletheroe, Antony Rowstron, Hugh Williams,
Yiting Xia, T. S. Eugene Ng Rice University
CIS 700-5: The Design and Implementation of Cloud Networks
Flamingo: Enabling Evolvable HDD-based Near-Line Storage
Campus Communications Fabric
ElasticTree Michael Fruchtman.
Anne Pratoomtong ECE734, Spring2002
Israel Cidon, Ran Ginosar and Avinoam Kolodny
NTHU CS5421 Cloud Computing
Storage area network and System area network (SAN)
Specialized Cloud Architectures
Network-on-Chip Programmable Platform in Versal™ ACAP Architecture
Computer Networks: Switching and Queuing
NetFPGA - an open network development platform
Towards Predictable Datacenter Networks
Presentation transcript:

Reconfigurable Network Topologies at Rack Scale Sergey Legtchenko, Xiaohan Zhao, Daniel Cletheroe, Ant Rowstron Microsoft Research Cambridge

Networking for Rack-Scale Computers Trend: density in the rack is increasing HP Moonshot: 360 cores in 4.3U Boston Viridis: 192 cores in 2U MSR Pelican: 9PB of storage/rack [OSDI 2014] Systems-on-a-Chip (SoC) Challenge for in-rack networking Traditional racks: 40-80 servers + Top of Rack (ToR) switch Rack-scale computers: 100s/1,000s servers Hard to build 1,000-port ToRs Hard to add too many ToRs Pelican rack Distributed network fabrics SoCs with embedded packet switching no ToR: switching distributed across SoCs Direct uplinks to datacenter Cheap, low power, small physical space Uplink to datacenter XFabric: Reconfigurable network topologies at rack scale

How to choose the topology? Topology impacts performance Topology must fit the workload Workloads vary: Different traffic patterns Clustered, uniform… Different requirements Latency, bandwidth sensitive… Variability over time daily patterns, bursts… 125 SoCs, 6 links/SoC Shortest path routing Challenge: No topology fits all workloads XFabric: Reconfigurable network topologies at rack scale

Looking for solutions… Design the network for a workload? Lack of flexibility: one network fabric per workload Overprovision the network? Higher cost One static topology for all workloads? Less performant HP Moonshot: 4 separate fabrics! Servers to ToR switches (Radial) Between servers (2D-Torus) Servers to Storage (Custom) Management (Radial) Requirements: Flexibility: One network fabric for all workloads Performance: Topology must be adapted to the workload Low cost: No overprovisioning, hardware available today Solution: reconfigurable topology XFabric: Reconfigurable network topologies at rack scale

A Reconfigurable Topology Principle: packet switching over circuit switching Physical Logical Commodity crossbar switch ASICs 144x144 @ 10 Gbps No queuing Electrical signal forwarding Cost : $3/port Physical circuit PCB track Crossbar switch Building blocks: SoCs with packet switches Crossbar switch N ports, each connected to a SoC physical circuits between SoCs Can be reconfigured at runtime N Logical Physical Crossbar switch N Logical Physical Crossbar switch N Logical Physical Crossbar switch

Circuit Switching Cost Rack-scale fabric with N SoCs and d links/SoC Do we need one crossbar with N x d ports? We can do better: d crossbars of size N (typically d < 6) Possibility to connect each link of a SoC to any other SoC Any d-regular topology XFabric: Reconfigurable network topologies at rack scale

XFabric Architecture Overview Traffic monitoring SoCs … 1 2 3 n L uplinks Controller Printed Circuit Board Nx(d+L)+L tracks Utility function Generate topology Analyse traffic Configure XSwitches 1 2 d d + 1 d+L … … Control plane Crossbar Switches Speed to reconfigure Instantiate Uplink map XFabric: Reconfigurable network topologies at rack scale

Controller: Challenges Optimal topology for a given traffic? NP-Hard problem Time constraints (needs to run online) Current approach: lightweight greedy algorithm Start with simple topology Add links that maximize utility How to reconfigure at runtime without stopping traffic? Inconsistent forwarding state in the network Current approach: controller-driven switch reconfiguration Manageable at rack-scale Lower inconsistency period: avoids distributed link state discovery XFabric: Reconfigurable network topologies at rack scale

XFabric: Does It Work? Building a rack-scale SoC emulator Goals: 27 servers 7 NICs/server, emulating SoC functionality Supports unmodified applications Goals: Understand how to build SoCs How to build rack-scale systems XSwitch hardware: Gen 1: 32x 1 Gbps Gen 2: 36x 40 Gbps (in progress) Gen 1 XSwitch microcontroller 32 Gigabit Ethernet ports Non blocking 40x40 @ 1 Gbps/port XFabric: Reconfigurable network topologies at rack scale

Performance of XFabric Lower is better Flow-based simulation 125 SoCs, 6 links/SoC Utility function used: minimizing path length Production workload trace How stable are the workloads? Hourly reconfiguration 2.7x path length reduction XFabric: Reconfigurable network topologies at rack scale

XFabric: Reconfigurable network topologies at rack scale Conclusion Reconfigurable network topology Packet switching over circuit switching Benefits: Flexibility, performance, low cost Low cost: all components available today Perspectives: exploring rack-scale design How to deliver performance without overprovisioning? Building proof-of-concept rack hardware [Pelican, OSDI 2014] Rethinking hardware and software at rack scale Flexible network stacks Tighter integration with storage, compute 10 XFabric: Reconfigurable network topologies at rack scale