Look Who’s Talking: Discovering Dependencies between Virtual Machines Using CPU Utilization HotCloud 10 Presented by Xin.

Slides:



Advertisements
Similar presentations
Key Metrics for Effective Storage Performance and Capacity Reporting.
Advertisements

High Availability Deep Dive What’s New in vSphere 5 David Lane, Virtualization Engineer High Point Solutions.
University of Minnesota Optimizing MapReduce Provisioning in the Cloud Michael Cardosa, Aameek Singh†, Himabindu Pucha†, Abhishek Chandra
Lecture 5: Cloud Security: what’s new? Xiaowei Yang (Duke University)
Ragib Hasan Johns Hopkins University en Spring 2010 Lecture 3 02/15/2010 Security and Privacy in Cloud Computing.
System Center 2012 R2 Overview
Locality-Aware Dynamic VM Reconfiguration on MapReduce Clouds Jongse Park, Daewoo Lee, Bokyeong Kim, Jaehyuk Huh, Seungryoul Maeng.
A Hadoop Overview. Outline Progress Report MapReduce Programming Hadoop Cluster Overview HBase Overview Q & A.
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
Performance Anomalies Within The Cloud 1 This slide includes content from slides by Venkatanathan Varadarajan and Benjamin Farley.
Pankaj Kumar Qinglan Zhang Sagar Davasam Sowjanya Puligadda Wei Liu
Cloud Computing Part #3 Zigmunds Buliņš, Mg. sc. ing 1.
Xen , Linux Vserver , Planet Lab
Virtualization in HPC Minesh Joshi CSC 469 Dr. Box Feb 1, 2012.
Sandpiper : Black box and Gray-Box resource management for Virtual Machines Journal : Computer Networks: The International Journal of Computer and Telecommunications.
Scalable and Crash-Tolerant Load Balancing based on Switch Migration
SLA-aware Virtual Resource Management for Cloud Infrastructures
COMS E Cloud Computing and Data Center Networking Sambit Sahu
COMS E Cloud Computing and Data Center Networking Sambit Sahu
Authors: Thomas Ristenpart, et at.
Bandwidth Measurements for VMs in Cloud Amit Gupta and Rohit Ranchal Ref. Cloud Monitoring Framework by H. Khandelwal, R. Kompella and R. Ramasubramanian.
Implementing Failover Clustering with Hyper-V
Virtual Network Servers. What is a Server? 1. A software application that provides a specific one or more services to other computers  Example: Apache.
VIRTUALISATION OF HADOOP CLUSTERS Dr G Sudha Sadasivam Assistant Professor Department of CSE PSGCT.
Resource Management in Virtualization-based Data Centers Bhuvan Urgaonkar Computer Systems Laboratory Pennsylvania State University Bhuvan Urgaonkar Computer.
Next Generation of Apache Hadoop MapReduce Arun C. Murthy - Hortonworks Founder and Architect Formerly Architect, MapReduce.
Google Distributed System and Hadoop Lakshmi Thyagarajan.
Take An Internal Look at Hadoop Hairong Kuang Grid Team, Yahoo! Inc
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Black-box and Gray-box Strategies for Virtual Machine Migration Timothy Wood, Prashant.
Real Security for Server Virtualization Rajiv Motwani 2 nd October 2010.
Advanced Topics: MapReduce ECE 454 Computer Systems Programming Topics: Reductions Implemented in Distributed Frameworks Distributed Key-Value Stores Hadoop.
Author : Chengwei Wang, Vanish Talwar*, Karsten Schwan, Parthasarathy Ranganathan* Conference: IEEE 2010 Network Operations and Management Symposium (NOMS)
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Eucalyptus on FutureGrid: A case for Eucalyptus 3 Sharif Islam, Javier Diaz, Geoffrey Fox Gregor von Laszewski Indiana University.
Shilpa Seth.  Centralized System Centralized System  Client Server System Client Server System  Parallel System Parallel System.
Tyson Condie.
Map Reduce: Simplified Data Processing On Large Clusters Jeffery Dean and Sanjay Ghemawat (Google Inc.) OSDI 2004 (Operating Systems Design and Implementation)
Department of Computer Science Engineering SRM University
Bottlenecks: Automated Design Configuration Evaluation and Tune.
Projects. High Performance Computing Projects Design and implement an HPC cluster with one master node and two compute nodes. (Hint: use Rocks HPC Cluster.
Face Detection And Recognition For Distributed Systems Meng Lin and Ermin Hodžić 1.
Systems Support for End-to-End Performance Management Sandip Agarwala PhD Advisor: Karsten Schwan College of Computing Georgia Tech.
IISWC 2007 Panel Benchmarking in the Web 2.0 Era Prashant Shenoy UMass Amherst.
Ragib Hasan University of Alabama at Birmingham CS 491/691/791 Fall 2012 Lecture 4 09/10/2013 Security and Privacy in Cloud Computing.
CSE 548 Advanced Computer Network Security Document Search in MobiCloud using Hadoop Framework Sayan Cole Jaya Chakladar Group No: 1.
COMS E Cloud Computing and Data Center Networking Sambit Sahu
From Virtualization Management to Private Cloud with SCVMM 2012 Dan Stolts Sr. IT Pro Evangelist Microsoft Corporation
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Profiling and Modeling Resource Usage.
Loosely Coupled Parallelism: Clusters. Context We have studied older archictures for loosely coupled parallelism, such as mesh’s, hypercubes etc, which.
Sumit Kumar Archana Kumar Group # 4 CSE 591 : Virtualization and Cloud Computing3/3/2011.
VGreen: A System for Energy Efficient Manager in Virtualized Environments G. Dhiman, G Marchetti, T Rosing ISLPED 2009.
Case Study: A Database Service CSCI 8710 September 25, 2008.
Embedded System Lab. 정범종 A_DRM: Architecture-aware Distributed Resource Management of Virtualized Clusters H. Wang et al. VEE, 2015.
Green Computing Metrics: Power, Temperature, CO2, … Computing system: Many-cores, Clusters, Grids and Clouds Algorithm and model: task scheduling, CFD.
Windows Azure. Azure Application platform for the public cloud. Windows Azure is an operating system You can: – build a web application that runs.
NTU Cloud 2010/05/30. System Diagram Architecture Gluster File System – Provide a distributed shared file system for migration NFS – A Prototype Image.
Virtualization One computer can do the job of multiple computers, by sharing the resources of a single computer across multiple environments. Turning hardware.
MidVision Enables Clients to Rent IBM WebSphere for Development, Test, and Peak Production Workloads in the Cloud on Microsoft Azure MICROSOFT AZURE ISV.
Bring Your Own Security (BYOS™): Deploy Applications in a Manageable Java Container with Waratek Locker on Microsoft Azure MICROSOFT AZURE ISV PROFILE:
Microsoft Azure and DataStax: Start Anywhere and Scale to Any Size in the Cloud, On- Premises, or Both with a Leading Distributed Database MICROSOFT AZURE.
Cloud Computing – UNIT - II. VIRTUALIZATION Virtualization Hiding the reality The mantra of smart computing is to intelligently hide the reality Binary->
Next Generation of Apache Hadoop MapReduce Owen
Data-Centric Systems Lab. A Virtual Cloud Computing Provider for Mobile Devices Gonzalo Huerta-Canepa presenter 김영진.
#msitconf. Damien Caro Technical Evangelist Manager, Что будет, если приложение поместить в контейнер? What happens if the application.
© 2010 VMware Inc. All rights reserved Why Virtualize? Beng-Hong Lim, VMware, Inc.
Mapping/Topology attacks on Virtual Machines
Yarn.
Bandwidth Measurements for VMs in Cloud
Overview Introduction VPS Understanding VPS Architecture
Resource-Efficient and QoS-Aware Cluster Management
Presentation transcript:

Look Who’s Talking: Discovering Dependencies between Virtual Machines Using CPU Utilization HotCloud 10 Presented by Xin

Problem with current cloud Hard to monitor, manage and debug Complex dependency causes butterfly effect

We need to know VM Interdependencies Better VM placement and migration decisions –Being not restricted to the physical machine, isn’t it what we want at the beginning? Better resource allocation –Reduce intra cloud traffic. Better disaster recovery automation –What can we do about this? Better troubleshooting

How to infer VM Interdependencies? In a multi-tier application, VMs have request-response interactions The server’s workload is determined by the clients workload

How to infer VM Interdependencies? Monitor –Sample ‘per VM’ CPU utilization Model –Estimate an model for CPU utilization of each VM Cluster –K-means clusters VMs with similar models VMs together

Monitoring CPU utilization sampled per VM Sampling Period –Too small : increases computation Too large : Might miss relevant spikes Sample size –300 seconds

Modeling Auto Regressive modeling

Modeling If VM1 and VM2 have the same coefficients, we say they are interdependent Why do we need this model? –Can we just compare the time series using a scale factor?

Clustering

Evaluation 31 VMs spread over 5 physical servers Applications –RUBiS : eBay like benchmark 4VMs – Apache, Tomcat, MySQL and RUBiS client –Hadoop MapReduce Framework 3 VMs – 1 master and 3 slave nodes –Iperf : Network testing tool 2 VMs – sender and receiver

Evaluation – 91.67% true positives – 99.08% true negatives

Why it works ? RUBiS –100% accuracy –Lot of request-response interaction between the VMs –‘n-tier’ application model

Why it fails ? Hadoop –false positive and false negative. –Mappers and reducers communicate intermediate results via files

How does the system work?

Discussion Security concerns –Who can monitor the CPU utilization of all VMs? The provider can ues “xentop” command to display real- time information about a Xen system –Can one VM infer the CPU utilization of another VM located at the same physical machine? Infer from CPU cache utilization? –What can an attacker do when it detects the interdependencies of other VMs? Attack the bottleneck Estimate the workload

Discussion Is identifying VM interdependency important? –Yes, debugging, resource allocation –Is locating interdependent VMs on the same machine a good idea? For provider: yes. For user: no –What can we do about disaster recovery? What kind of disaster? What causes it? Can we rank the top 10 reasons? Is VM interdependence on the list?

Discussion Is CPU utilization a good metric? –No. What is a VM runs multiple applications? –No. We cannot infer who depends on whom. What metrics can we use?