1 Finding Constant From Change: Revisiting Network Performance Aware Optimizations on IaaS Clouds Yifan Gong, Bingsheng He, Dan Li Nanyang Technological.

Slides:



Advertisements
Similar presentations
Energy-efficient Task Scheduling in Heterogeneous Environment 2013/10/25.
Advertisements

Making Time-stepped Applications Tick in the Cloud Tao Zou, Guozhang Wang, Marcos Vaz Salles*, David Bindel, Alan Demers, Johannes Gehrke, Walker White.
Towards Predictable Datacenter Networks
VSMC MIMO: A Spectral Efficient Scheme for Cooperative Relay in Cognitive Radio Networks 1.
Cloud Computing at GES DISC Presented by: Long Pham Contributors: Aijun Chen, Bruce Vollmer, Ed Esfandiari and Mike Theobald GES DISC UWG May 11, 2011.
SLA-Oriented Resource Provisioning for Cloud Computing
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada ISP-Friendly Peer Matching without ISP Collaboration Mohamed Hefeeda (Joint.
A Parallel Computational Model for Heterogeneous Clusters Jose Luis Bosque, Luis Pastor, IEEE TRASACTION ON PARALLEL AND DISTRIBUTED SYSTEM, VOL. 17, NO.
Scaling Distributed Machine Learning with the BASED ON THE PAPER AND PRESENTATION: SCALING DISTRIBUTED MACHINE LEARNING WITH THE PARAMETER SERVER – GOOGLE,
CPSC 689: Discrete Algorithms for Mobile and Wireless Systems Spring 2009 Prof. Jennifer Welch.
Introduction Future wireless systems will be characterized by their heterogeneity - availability of multiple access systems in the same physical space.
Dept. of Computer Science & Engineering, CUHK1 Trust- and Clustering-Based Authentication Services in Mobile Ad Hoc Networks Edith Ngai and Michael R.
© nCode 2000 Title of Presentation goes here - go to Master Slide to edit - Slide 1 Reliable Communication for Highly Mobile Agents ECE 7995: Term Paper.
On the Construction of Energy- Efficient Broadcast Tree with Hitch-hiking in Wireless Networks Source: 2004 International Performance Computing and Communications.
An Authentication Service Against Dishonest Users in Mobile Ad Hoc Networks Edith Ngai, Michael R. Lyu, and Roland T. Chin IEEE Aerospace Conference, Big.
Design, Implementation, and Evaluation of Differentiated Caching Services Ying Lu, Tarek F. Abdelzaher, Avneesh Saxena IEEE TRASACTION ON PARALLEL AND.
Measuring Network Performance of Multi-Core Multi-Cluster (MCMCA) Norhazlina Hamid Supervisor: R J Walters and G B Wills PUBLIC.
Heterogeneous and Grid Computing2 Communication models u Modeling the performance of communications –Huge area –Two main communities »Network designers.
1 Algorithms for Bandwidth Efficient Multicast Routing in Multi-channel Multi-radio Wireless Mesh Networks Hoang Lan Nguyen and Uyen Trang Nguyen Presenter:
Bandwidth Measurements for VMs in Cloud Amit Gupta and Rohit Ranchal Ref. Cloud Monitoring Framework by H. Khandelwal, R. Kompella and R. Ramasubramanian.
1 ProActive performance evaluation with NAS benchmarks and optimization of OO SPMD Brian AmedroVladimir Bodnartchouk.
ElasticTree: Saving Energy in Data Center Networks 許倫愷 2013/5/28.
Self-Adaptive QoS Guarantees and Optimization in Clouds Jim (Zhanwen) Li (Carleton University) Murray Woodside (Carleton University) John Chinneck (Carleton.
HeteroPar 2013 Optimization of a Cloud Resource Management Problem from a Consumer Perspective Rafaelli de C. Coutinho, Lucia M. A. Drummond and Yuri Frota.
A User Experience-based Cloud Service Redeployment Mechanism KANG Yu.
CS492: Special Topics on Distributed Algorithms and Systems Fall 2008 Lab 3: Final Term Project.
Predicting performance of applications and infrastructures Tania Lorido 27th May 2011.
Presenter: Hung-Fu Li HPDS Lab. NKUAS vCUDA: GPU Accelerated High Performance Computing in Virtual Machines Lin Shi, Hao Chen and Jianhua.
Network Aware Resource Allocation in Distributed Clouds.
Minimal Hop Count Path Routing Algorithm for Mobile Sensor Networks Jae-Young Choi, Jun-Hui Lee, and Yeong-Jee Chung Dept. of Computer Engineering, College.
Rate-based Data Propagation in Sensor Networks Gurdip Singh and Sandeep Pujar Computing and Information Sciences Sanjoy Das Electrical and Computer Engineering.
IEEE Globecom 2010 Tan Le Yong Liu Department of Electrical and Computer Engineering Polytechnic Institute of NYU Opportunistic Overlay Multicast in Wireless.
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Tekin Bicer Gagan Agrawal 1.
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Graduate Student Department Of CSE 1.
 Collectives on Two-tier Direct Networks EuroMPI – 2012 Nikhil Jain, JohnMark Lau, Laxmikant Kale 26 th September, 2012.
BOF: Megajobs Gracie: Grid Resource Virtualization and Customization Infrastructure How to execute hundreds of thousands tasks concurrently on distributed.
Advanced Spectrum Management in Multicell OFDMA Networks enabling Cognitive Radio Usage F. Bernardo, J. Pérez-Romero, O. Sallent, R. Agustí Radio Communications.
Job scheduling algorithm based on Berger model in cloud environment Advances in Engineering Software (2011) Baomin Xu,Chunyan Zhao,Enzhao Hua,Bin Hu 2013/1/251.
Zibin Zheng DR 2 : Dynamic Request Routing for Tolerating Latency Variability in Cloud Applications CLOUD 2013 Jieming Zhu, Zibin.
Test Architecture Design and Optimization for Three- Dimensional SoCs Li Jiang, Lin Huang and Qiang Xu CUhk Reliable Computing Laboratry Department of.
Implementation of Collection Tree Protocol in QualNet
Dr. Sudharman K. Jayaweera and Amila Kariyapperuma ECE Department University of New Mexico Ankur Sharma Department of ECE Indian Institute of Technology,
MMAC: A Mobility- Adaptive, Collision-Free MAC Protocol for Wireless Sensor Networks Muneeb Ali, Tashfeen Suleman, and Zartash Afzal Uzmi IEEE Performance,
Performance of Adaptive Beam Nulling in Multihop Ad Hoc Networks Under Jamming Suman Bhunia, Vahid Behzadan, Paulo Alexandre Regis, Shamik Sengupta.
Network Computing Laboratory 1 Vivaldi: A Decentralized Network Coordinate System Authors: Frank Dabek, Russ Cox, Frans Kaashoek, Robert Morris MIT Published.
MPI implementation – collective communication MPI_Bcast implementation.
Scheduling MPI Workflow Applications on Computing Grids Juemin Zhang, Waleed Meleis, and David Kaeli Electrical and Computer Engineering Department, Northeastern.
University of Westminster – Checkpointing Mechanism for the Grid Environment K Sajadah, G Terstyanszky, S Winter, P. Kacsuk University.
Efficient Resource Allocation for Wireless Multicast De-Nian Yang, Member, IEEE Ming-Syan Chen, Fellow, IEEE IEEE Transactions on Mobile Computing, April.
Global Clock Synchronization in Sensor Networks Qun Li, Member, IEEE, and Daniela Rus, Member, IEEE IEEE Transactions on Computers 2006 Chien-Ku Lai.
Data Consolidation: A Task Scheduling and Data Migration Technique for Grid Networks Author: P. Kokkinos, K. Christodoulopoulos, A. Kretsis, and E. Varvarigos.
A Bandwidth Scheduling Algorithm Based on Minimum Interference Traffic in Mesh Mode Xu-Yajing, Li-ZhiTao, Zhong-XiuFang and Xu-HuiMin International Conference.
On the Placement of Web Server Replicas Yu Cai. Paper On the Placement of Web Server Replicas Lili Qiu, Venkata N. Padmanabhan, Geoffrey M. Voelker Infocom.
A Two-Tier Heterogeneous Mobile Ad Hoc Network Architecture and Its Load-Balance Routing Problem C.-F. Huang, H.-W. Lee, and Y.-C. Tseng Department of.
EuroSys Doctoral Workshop 2011 Resource Provisioning of Web Applications in Heterogeneous Cloud Jiang Dejun Supervisor: Guillaume Pierre
Zhaoxia Fu, Yan Han Measurement Volume 45, Issue 4, May 2012, Pages 650–655 Reporter: Jing-Siang, Chen.
Chen Qian, Xin Li University of Kentucky
Presented by Edith Ngai MPhil Term 3 Presentation
Hydra: Leveraging Functional Slicing for Efficient Distributed SDN Controllers Yiyang Chang, Ashkan Rezaei, Balajee Vamanan, Jahangir Hasan, Sanjay Rao.
Authors: Sajjad Rizvi, Xi Li, Bernard Wong, Fiodar Kazhamiaka
Memory Management for Scalable Web Data Servers
A Framework for Automatic Resource and Accuracy Management in A Cloud Environment Smita Vijayakumar.
Bandwidth Measurements for VMs in Cloud
Pei Fan*, Ji Wang, Zibin Zheng, Michael R. Lyu
Congestion Control in SDN-Enabled Networks
Motion-Aware Routing in Vehicular Ad-hoc Networks
Resource Allocation for Distributed Streaming Applications
Congestion Control in SDN-Enabled Networks
Requirements of Computing in Network
Towards Predictable Datacenter Networks
Presentation transcript:

1 Finding Constant From Change: Revisiting Network Performance Aware Optimizations on IaaS Clouds Yifan Gong, Bingsheng He, Dan Li Nanyang Technological University Tsinghua University

Outline Introduction RPCA-based approach Evaluation results Conclusions 2

Outline Introduction RPCA-based approach Evaluation results Conclusions 3

4 Infrastructure as a Service is Pervasive Many cloud providers supply IaaS clouds Amazon EC2 Google Compute Engine Rackspace IaaS clouds have emerged as a popular computing infrastructure for many distributed applications Life Science [Nagavaram 2011] Physics [Nunez 2010] Big data processing [He 2013]

5 Network Heterogeneity in IaaS cloud Topology of data center in IaaS cloud switch machines rack Network Heterogeneity switch machine rack Low Performance High Performance Heterogeneity

Network Performance-aware Optimization: State-of-the-art Network Performance-aware Optimization is an effective approach to optimize distributed applications on cloud MPI Collective Operation [Gong 2013] Topology Mapping [Hoefler 2011] Workflow Management [Spooner 2005 ] 6 Definition of Network Performance-aware Optimization Estimating or measuring the all-link network performance and carefully selecting communication links for minimizing the network transfer time of the application

Example of MPI Broadcast MPI Broadcast Binominal Tree Algorithm Root Machine data 7

Network Performance-aware Optimization of MPI Broadcast m0 Topology m1 m6 m7 m2 m3 m4 m Total Cost=6+6+6= Total Cost= =12 m0m1m2m3m4m5m6m7 m m m m m m m m Performance Matrix

Network Interference Problem 9 Machine 0 Application Machine 1 Application Background Application Network Interference Interference Network performance is changing as the time goes by –network interference

Distribution of Latency and Bandwidth between a pair of instance (virtual machine) in a long period Cloud Performance Dynamics: Latency/Bandwidth The range of Bandwidth(45~110MB/s) 10 Network performance is changing as the time goes by Interference largely impacts the network performance Network performance-aware optimization could not work

Our Work 11 Our solution: Robust Principal Component Analysis  Decouple the constant component from the dynamic network performance while minimizing the difference between the network performance and the constant component  Use the interference component to determine the effectiveness of network performance aware optimizations  Utilize the constant component to guide network performance aware optimizations Problem  How to distinguish network interference in the cloud?  How to enable existing network performance-aware optimizations in the dynamic cloud environment?

12 Robust Principal Component Analysis(RPCA) Frame 1 at t 1 Frame 2 at t 2 RPCA Low rank matrix D Sparse matrix E Mathematic Problem minimize rank(D) + λ ∥ E ∥ subject to A = D + E Video surveillance Figures are cited from [Wright 2009] * Frame 3 at t 3 Frame 4 at t 4 + Video

Outline Introduction RPCA-based approach Evaluation results Conclusions 13

Definition (b)TP-Matrix A Performance Matrix We can measure the performance of each link for a set of instances and the all-link network performance are defined as Performance Matrix Temporal Performance Matrix (TP-matrix) Performance matrix can only reflect a snapshot of network performance at a certain point of time. We define Temporal Performance Matrix as the combination of Performance matrix in a continuous time interval.

15 Analogy to RPCA in computer vision Frame 1 at t 1 Frame 2 at t 2 Frame 3 at t 3 Frame 4 at t 4 Performance Matrix at t Reshape t1t1 Performance Matrix at t Reshape t2t2 Combine t 1 t 2, …, t t1t1 t3t3 t2t2 t4t t3t t4t4 Temporal Performance Matrix Video

RPCA Video (Original matrix A) Analogy to RPCA in computer vision (con’t) RPCA Temporal Performance Matrix A t1t1 t3t3 t2t2 t4t4 matrix D (Low rank ) matrix E (Sparse) + Matrix D (Constant) Matrix E (Interference)

17 RPCA-based Approach Reshape Matrix D (Constant) Matrix E (Interference) Performance Matrix 0.1 Network Performance-aware Optimization Calculate quantity of interference

A is the temporal performance matrix from direct measurement Use Robust PCA to distinguish the interference minimize rank(D) + λ ∥ E ∥ subject to A = D + E D is constant and E represents the interference Use matrix E to calculate quantity of interference in the cluster and predict the efficiency of network performance- aware optimizations Norm (E) = ∥ E ∥ / ∥ A ∥ Use matrix D to optimize some applications 18 Basic procedure

Model calibration How to reduce the calibration overhead for each performance matrix? How many performance matrix to calibrate? Update maintenance When to re-calibrate the network performance? 19 Implementation Details Detailed techniques are in the paper.

Outline Introduction RPCA-based approach Evaluation results Conclusions 20

Experiment Setup 21 Experiments with Amazon EC2 196 instances m1.medium Simulation with ns-2 Tree-structured topology 1024 machines: totally 32 racks (10Gb/s) and each rack contains 32 servers (1Gb/s) Application MPI Broadcast/Scatter Topology Mapping Conjugate gradient (CG) N-body

Comparisons 22 Baseline This simulates the scenario of running directly in the cloud environment, essentially without network performance aware optimizations Heuristics We capture the TP-matrix and use the average value of each link to optimize the applications. RPCA The network performance aware optimizations are guided by the constant part captured by our RPCA approach.

23 The impact of interference on different applications in the virtual cluster (RPCA approach) The impact of interference on different approaches in the virtual cluster (MPI Scatter) 25% 32% 37% 24% 32% Norm(E) The impact of interference The interference in Amazon EC2 is relatively low (about 0.1) Performance improvement in Amazon EC2 is about 25%-37%

Overall Performance of CG 24 31%14% The trend of improvement When the vector size is small, our algorithm is slower than MPICH2-based CG, due to the calibrating and calculation overheads. As the vector size increases, the performance gain compensates the overhead, with 31% and 14% performance improvement over baseline and Heuristics.

Background traffic simulation 25 Choose the links and then vary two parameters to control the background traffic  The distribution of waiting time between sending the message  poisson distribution and the expected value is λ  Message size The impact of network interference on Norm(E) Norm(E) (a) Different expected values (λ) of Poisson Distribution (b) Different message size Norm(E) clearly has positive correlations with the background traffic in the simulated cluster.

Outline Introduction RPCA-based approach Evaluation results Conclusions 26

Conclusions Network is dynamic in the cloud and network performance- aware optimizations could not be directly used because of network interference. We distinguish the constant and interference from the network and calculate the quantity of interference Norm (E) = ∥ E ∥ / ∥ A ∥ We study the relationship between the interference and the network aware optimization efficiency We propose approaches to enable existing/new network performance-aware optimizations 27

Thank you!