Chuanxiong Guo, et al, Microsoft Research Asia, SIGCOMM 2008

Slides:



Advertisements
Similar presentations
Comparison Of Network On Chip Topologies Ahmet Salih BÜYÜKKAYHAN Fall.
Advertisements

COMPUTER NETWORK TOPOLOGIES
Computer Network Topologies
Chuanxiong Guo, Haitao Wu, Kun Tan, Lei Shi, Yongguang Zhang, Songwu Lu Microsoft Research Asia, Tsinghua University, UCLA 1 DCell: A Scalable and Fault-Tolerant.
BCube: A High Performance, Server-centric Network Architecture for Modular Data Centers Chuanxiong Guo1, Guohan Lu1, Dan Li1, Haitao Wu1, Xuan Zhang2,
1 Greedy Forwarding in Dynamic Scale-Free Networks Embedded in Hyperbolic Metric Spaces Dmitri Krioukov CAIDA/UCSD Joint work with F. Papadopoulos, M.
1 Exploring Efficient and Scalable Multicast Routing in Future Data Center Networks Dan Li, Jiangwei Yu, Junbiao Yu, Jianping Wu Tsinghua University Presented.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Emery Berger University of Massachusetts Amherst Operating Systems CMPSCI 377 Lecture.
1 Meshes of Trees (MoT) and Applications in Integer Arithmetic Panagiotis Voulgaris Petros Mol Course: Parallel Algorithms.
Towards Virtual Routers as a Service 6th GI/ITG KuVS Workshop on “Future Internet” November 22, 2010 Hannover Zdravko Bozakov.
Datacenter Network Topologies
Chuanxiong Guo, Haitao Wu, Kun Tan,
Data Center Network Topologies: FatTree
Department of Computer Science, Jinan University, Guangzhou, P.R. China Lijun Lyu, Junjie Xie, Yuhui Deng, Yongtao Zhou ICA3PP 2014: The 14th International.
A Scalable, Commodity Data Center Network Architecture Mohammad Al-Fares, Alexander Loukissas, Amin Vahdat Presented by Gregory Peaker and Tyler Maclean.
Ji-Yong Shin * Bernard Wong +, and Emin Gün Sirer * * Cornell University + University of Waterloo 2 nd ACM Symposium on Cloud ComputingOct 27, 2011 Small-World.
A Scalable, Commodity Data Center Network Architecture Mohammad AI-Fares, Alexander Loukissas, Amin Vahdat Presented by Ye Tao Feb 6 th 2013.
A Scalable, Commodity Data Center Network Architecture
ElasticTree: Saving Energy in Data Center Networks 許倫愷 2013/5/28.
Network Support for Cloud Services Lixin Gao, UMass Amherst.
1 The Google File System Reporter: You-Wei Zhang.
Network Aware Resource Allocation in Distributed Clouds.
Routing & Architecture
1 Department of Computer Science, Jinan University 2 School of Computer Science & Technology, Huazhong University of Science & Technology Junjie Xie 1,
A Scalable, Commodity Data Center Network Architecture Jingyang Zhu.
DARD: Distributed Adaptive Routing for Datacenter Networks Xin Wu, Xiaowei Yang.
© Copyright 2010 Hewlett-Packard Development Company, L.P. 1 Jayaram Mudigonda, HP Labs Praveen Yalagandula, HP Labs Mohammad Al-Fares, UCSD Jeff Mogul,
LAN Switching and Wireless – Chapter 1
Floodless in SEATTLE : A Scalable Ethernet ArchiTecTure for Large Enterprises. Changhoon Kim, Matthew Caesar and Jenifer Rexford. Princeton University.
VL2: A Scalable and Flexible Data Center Network Albert Greenberg, James R. Hamilton, Navendu Jain, Srikanth Kandula, Changhoon Kim, Parantap Lahiri, David.
InterConnection Network Topologies to Minimize graph diameter: Low Diameter Regular graphs and Physical Wire Length Constrained networks Nilesh Choudhury.
Quality of Service Routing Anunay Tiwari Anirudha Sahoo.
Dual Centric Data Center Network Architectures DAWEI LI, JIE WU (TEMPLE UNIVERSITY) ZHIYONG LIU, AND FA ZHANG (CHINESE ACADEMY OF SCIENCES) ICPP 2015.
Chuanxiong Guo, Haitao Wu, Kun Tan, Lei Shi, Yongguang Zhang, Songwu Lu SIGCOMM 2008 Presented by Ye Tian for Course CS05112.
Subways: A Case for Redundant, Inexpensive Data Center Edge Links Vincent Liu, Danyang Zhuo, Simon Peter, Arvind Krishnamurthy, Thomas Anderson University.
SYMBIOTIC ROUTING IN FUTURE DATA CENTER 工科三 陳泰穎. Outline 1. CamCube 1. Traditional data center 2. The problems 3. CamCube philosophy 4. Feature 5. What’s.
Interconnect Networks Basics. Generic parallel/distributed system architecture On-chip interconnects (manycore processor) Off-chip interconnects (clusters.
SecondNet: A Data Center Network Virtualization Architecture with Bandwidth Guarantees Chuanxiong Guo 1, Guohan Lu 1, Helen J. Wang 2, Shuang Yang 3, Chao.
Jiaxin Cao, Rui Xia, Pengkun Yang, Chuanxiong Guo,
Click to edit Master title style Literature Review Interconnection Architectures for Petabye-Scale High-Performance Storage Systems Andy D. Hospodor, Ethan.
Data Centers and Cloud Computing 1. 2 Data Centers 3.
VL2: A Scalable and Flexible Data Center Network
Data Center Architectures
Yiting Xia, T. S. Eugene Ng Rice University
CIS 700-5: The Design and Implementation of Cloud Networks
Data Center Network Topologies II
Data Center Network Architectures
A Survey of Data Center Network Architectures By Obasuyi Edokpolor
Hydra: Leveraging Functional Slicing for Efficient Distributed SDN Controllers Yiyang Chang, Ashkan Rezaei, Balajee Vamanan, Jahangir Hasan, Sanjay Rao.
Computer Network Topologies
FAR: A Fault-avoidance Routing Method for Data Center Networks with Regular Topology Please send.
NTHU CS5421 Cloud Computing
A Scalable, Commodity Data Center Network Architecture
IS3120 Network Communications Infrastructure
BCube: A High Performance, Server-centric Network Architecture for Modular Data Centers Chuanxiong Guo1, Guohan Lu1, Dan Li1, Haitao Wu1, Xuan Zhang2,
SCOPE: Scalable Consistency in Structured P2P Systems
Chuanxiong Guo, Haitao Wu, Kun Tan,
湖南大学-信息科学与工程学院-计算机与科学系
Dingming Wu+, Yiting Xia+*, Xiaoye Steven Sun+,
Data Center Networking
Generic and Automatic Address Configuration for Data Center Networks
Interconnection Network Design Lecture 14
NTHU CS5421 Cloud Computing
Jellyfish: Networking Data Centers Randomly
 What is Topology  Categories of Topology  Definition, structure, advantage and disadvantage of all of the following topologies: o Mesh o Bus o Ring.
Data Center Architectures
Centralized Arbitration for Data Centers
In-network computation
Elmo Muhammad Shahbaz Lalith Suresh, Jennifer Rexford, Nick Feamster,
Data Center Traffic Engineering
Presentation transcript:

Dcell : A Scalable and Fault Tolerant Network Structure for Data Centers Chuanxiong Guo, et al, Microsoft Research Asia, SIGCOMM 2008 Presented by Jinyoung Han 2009/03/26

Outline DCN motivation DCell Routing in DCell Experiments Results Conclusion

Data Center Networking (DCN) Provide increasingly popular online application services, such as search, e-mails, web 2.0, and gaming, etc Data center network Network infrastructure inside a data center, which connects a large number of servers via high-speed links and switches

Observations for DCN Ever increasing scale Network capacity Google has 450,000 servers (in 30 data centers) in 2006 Microsoft doubles its number of servers in 14 months The expansion rate exceeds Moore’s Law When data centers scale, failures become the norm Using high-end switches/routers to scale up is costly Network capacity Many bandwidth hungry data-centric applications MapReduce (Google) File replication in GFS (Google File System)

Design goals for DCN Scalable to a large number of servers Fault tolerant Provide high network capacity

Interconnection Structure for Data Centers Existing tree structure does not scale Expensive high-end switches to scale up Single point of failure and bandwidth bottleneck ? New network structure -> DCell

DCell Ideas #1: Use mini-switches to scale out #2: Leverage servers be part of the routing infrastructure Servers have multiple ports and need to forward packets #3: Use recursion to scale and build complete graph to increase capacity

DCell structure DCell is a recursively defined structure DCell_0 Have n servers and a mini-switch. All servers are connected to switch. DCell_1 n+1 Dcell_0s 2-tuple [a1, a0], a1 for level 1 / a0 for level 0 two servers with 2-tuples [i, j − 1] and [j, i] are connected with a link for every i and every j > i DCell_k Recursively defined

DCell: the Construction n=2, k=2 DCell_1 n=2, k=1 Dcell_0 Server Mini-switch n servers in a DCell_0 n=2, k=0

DCell: The Properties Scalability: The number of servers scales doubly exponentially as the node degree increases For example, number of servers in a DCell_0 is 8 (n=8) and the number of server ports is 4 (i.e., k=3) -> N=27,630,792 Fault-tolerance: The bisection width is larger than Bisection width Minimal number of links to be removed to partition a network into two parts of equal size Large bisection implies high network capacity and fault-tolerance

DCell Routing src n1 n2 dst

DCell Routing (cont.) Network diameter: The maximum path length using DCellRouting in a DCellk is at most But: DCellRouting is NOT a shortest-path routing is NOT a tight diameter bound for DCell The mean and max path lengths of shortest-path and DCellRouting n k N Shortest-path DCellRouting Mean Max 4 2 420 4.87 7 5.16 5 930 5.22 5.50 6 1806 5.48 5.73 3 176,820 9.96 15 11.29 865,830 10.74 11.98 3,263,442 11.31 12.46 Yet: DCellRouting is close to shortest-path routing

DFR: DCell Fault-tolerant Routing Design goal Support millions of servers Advantages to take DCellRouting and DCell topology

DFR: DCell Fault-tolerant Routing src dst m1 m2 n2 n1 r1 DCellb i1 i2 p1 q2 i3 DCellb p2 q1 Proxy Proxy s2 s1 Servers in a same share local link-state

Experimental Results: Testbed DCell1: 20 servers, 5 DCell0s DCell0: 4 servers Ethernet wires 8-port mini-switches, 50$ each

Experimental Results: Fault Tolerance DCell fault-tolerant routing can handle various failures Link failure Server/switch failure Rack failure Link failure Server shutdown

Experimental Results: Network Capacity All to all traffic: each server sends 5GB file to every other servers

Summary DCell: Benefits: Use commodity mini-switches to scale out Let servers be part of the routing infrastructure Use recursion to reduce the node degree and complete graph to increase network capacity Benefits: Scales doubly exponentially High aggregate bandwidth capacity Fault tolerance Cost saving