Embedded System Lab. 정범종 A_DRM: Architecture-aware Distributed Resource Management of Virtualized Clusters H. Wang et al. VEE, 2015.

Slides:



Advertisements
Similar presentations
Live migration of Virtual Machines Nour Stefan, SCPD.
Advertisements

Module 13: Performance Tuning. Overview Performance tuning methodologies Instance level Database level Application level Overview of tools and techniques.
KAIST Computer Architecture Lab. The Effect of Multi-core on HPC Applications in Virtualized Systems Jaeung Han¹, Jeongseob Ahn¹, Changdae Kim¹, Youngjin.
Application-Aware Memory Channel Partitioning † Sai Prashanth Muralidhara § Lavanya Subramanian † † Onur Mutlu † Mahmut Kandemir § ‡ Thomas Moscibroda.
1 A Hybrid Adaptive Feedback Based Prefetcher Santhosh Verma, David Koppelman and Lu Peng Louisiana State University.
Thread Criticality Predictors for Dynamic Performance, Power, and Resource Management in Chip Multiprocessors Abhishek Bhattacharjee Margaret Martonosi.
SLA-Oriented Resource Provisioning for Cloud Computing
Difference Engine: Harnessing Memory Redundancy in Virtual Machines by Diwaker Gupta et al. presented by Jonathan Berkhahn.
Green Cloud Computing Hadi Salimi Distributed Systems Lab, School of Computer Engineering, Iran University of Science and Technology,
Memory Buddies: Exploiting Page Sharing for Smart Colocation in Virtualized Data Centers Timothy Wood, Gabriel Tarasuk-Levin, Prashant Shenoy, Peter Desnoyers*,
Memory System Characterization of Big Data Workloads
An Adaptable Benchmark for MPFS Performance Testing A Master Thesis Presentation Yubing Wang Advisor: Prof. Mark Claypool.
1 Virtual Private Caches ISCA’07 Kyle J. Nesbit, James Laudon, James E. Smith Presenter: Yan Li.
Disco: Running Commodity Operating Systems on Scalable Multiprocessors Bugnion et al. Presented by: Ahmed Wafa.
Disco Running Commodity Operating Systems on Scalable Multiprocessors.
Copyright © 2002 Pearson Education, Inc. Slide 4-1 Choosing the Hardware for an E-commerce Site  Hardware platform  Refers to all the underlying computing.
ENFORCING PERFORMANCE ISOLATION ACROSS VIRTUAL MACHINES IN XEN Diwaker Gupta, Ludmila Cherkasova, Rob Gardner, Amin Vahdat Middleware '06 Proceedings of.
Techniques for Efficient Processing in Runahead Execution Engines Onur Mutlu Hyesoon Kim Yale N. Patt.
By- Jaideep Moses, Ravi Iyer , Ramesh Illikkal and
5205 – IT Service Delivery and Support
23 September 2004 Evaluating Adaptive Middleware Load Balancing Strategies for Middleware Systems Department of Electrical Engineering & Computer Science.
Presented by : Ran Koretzki. Basic Introduction What are VM’s ? What is migration ? What is Live migration ?
Measuring zSeries System Performance Dr. Chu J. Jong School of Information Technology Illinois State University 06/11/2012 Sponsored in part by Deer &
Exploring the Tradeoffs of Configurability and Heterogeneity in Multicore Embedded Systems + Also Affiliated with NSF Center for High- Performance Reconfigurable.
Virtualization Technology Prof D M Dhamdhere CSE Department IIT Bombay Moving towards Virtualization… Department of Computer Science and Engineering, IIT.
Abstract Cloud data center management is a key problem due to the numerous and heterogeneous strategies that can be applied, ranging from the VM placement.
Department of Computer Science Engineering SRM University
SAIGONTECH COPPERATIVE EDUCATION NETWORKING Spring 2009 Seminar #1 VIRTUALIZATION EVERYWHERE.
A Bandwidth-aware Memory-subsystem Resource Management using Non-invasive Resource Profilers for Large CMP Systems Dimitris Kaseridis, Jeffery Stuecheli,
Virtualization. Virtualization  In computing, virtualization is a broad term that refers to the abstraction of computer resources  It is "a technique.
Cloud Computing Energy efficient cloud computing Keke Chen.
Cluster Reliability Project ISIS Vanderbilt University.
USTH Presentation Power-aware Scheduler for Virtualization TRAN Giang Son Prof. Daniel HAGIMONT Oct 19th, 2011.
Adaptive Cache Partitioning on a Composite Core Jiecao Yu, Andrew Lukefahr, Shruti Padmanabha, Reetuparna Das, Scott Mahlke Computer Engineering Lab University.
Challenges towards Elastic Power Management in Internet Data Center.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Profiling and Modeling Resource Usage.
SAN FRANCISCO, CA, USA Adaptive Energy-efficient Resource Sharing for Multi-threaded Workloads in Virtualized Systems Can HankendiAyse K. Coskun Boston.
Our work on virtualization Chen Haogang, Wang Xiaolin {hchen, Institute of Network and Information Systems School of Electrical Engineering.
High Performance Computing on Virtualized Environments Ganesh Thiagarajan Fall 2014 Instructor: Yuzhe(Richard) Tang Syracuse University.
Performance evaluation of component-based software systems Seminar of Component Engineering course Rofideh hadighi 7 Jan 2010.
OPERETTA: An Optimal Energy Efficient Bandwidth Aggregation System Karim Habak†, Khaled A. Harras‡, and Moustafa Youssef† †Egypt-Japan University of Sc.
Managing Distributed, Shared L2 Caches through OS-Level Page Allocation Jason Bosko March 5 th, 2008 Based on “Managing Distributed, Shared L2 Caches through.
Embedded System Lab 김해천 Thread and Memory Placement on NUMA Systems: Asymmetry Matters.
Disco : Running commodity operating system on scalable multiprocessor Edouard et al. Presented by Vidhya Sivasankaran.
VGreen: A System for Energy Efficient Manager in Virtualized Environments G. Dhiman, G Marchetti, T Rosing ISLPED 2009.
MIAO ZHOU, YU DU, BRUCE CHILDERS, RAMI MELHEM, DANIEL MOSSÉ UNIVERSITY OF PITTSBURGH Writeback-Aware Bandwidth Partitioning for Multi-core Systems with.
Simics: A Full System Simulation Platform Synopsis by Jen Miller 19 March 2004.
Warp-Level Divergence in GPUs: Characterization, Impact, and Mitigation Ping Xiang, Yi Yang, Huiyang Zhou 1 The 20th IEEE International Symposium On High.
Modeling Virtualized Environments in Simalytic ® Models by Computing Missing Service Demand Parameters CMG2009 Paper 9103, December 11, 2009 Dr. Tim R.
Full and Para Virtualization
Embedded System Lab. 오명훈 Addressing Shared Resource Contention in Multicore Processors via Scheduling.
E-MOS: Efficient Energy Management Policies in Operating Systems
Virtualizing a Multiprocessor Machine on a Network of Computers Easy & efficient utilization of distributed resources Goal Kenji KanedaYoshihiro OyamaAkinori.
Quantifying and Controlling Impact of Interference at Shared Caches and Main Memory Lavanya Subramanian, Vivek Seshadri, Arnab Ghosh, Samira Khan, Onur.
© 2004 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Understanding Virtualization Overhead.
Capacity Planning in a Virtual Environment Chris Chesley, Sr. Systems Engineer
LIOProf: Exposing Lustre File System Behavior for I/O Middleware
Running Commodity Operating Systems on Scalable Multiprocessors Edouard Bugnion, Scott Devine and Mendel Rosenblum Presentation by Mark Smith.
- 세부 1 - 이종 클라우드 플랫폼 데이터 관리 브로커 연구 및 개발 Network and Computing Lab.
Unit 2 VIRTUALISATION. Unit 2 - Syllabus Basics of Virtualization Types of Virtualization Implementation Levels of Virtualization Virtualization Structures.
1 Automated Power Management Through Virtualization Anne Holler, VMware Anil Kapur, VMware.
Urban Mobility Management and Emissions Measurement System Boile Maria 1,2 Afroditi Anagnostopoulou 1 Evangelia Papargyri 1 1 Centre for Research and Technology.
Lecture 2: Performance Evaluation
Adaptive Cache Partitioning on a Composite Core
Application Slowdown Model
Overview Introduction VPS Understanding VPS Architecture
Some challenges in heterogeneous multi-core systems
Milad Hashemi, Onur Mutlu, Yale N. Patt
The Performance of Big Data Workloads in Cloud Datacenters
Virtual Memory: Working Sets
Presentation transcript:

Embedded System Lab. 정범종 A_DRM: Architecture-aware Distributed Resource Management of Virtualized Clusters H. Wang et al. VEE, 2015

정 범 종정 범 종 Embedded System Lab. Table of contents Background Problems & Challenges Reference paper A-DRM  A-DRM : Design  A-DRM : Implementation Evaluation Conclusion Reference

정 범 종정 범 종 Embedded System Lab. Backgound Virtualized systems  Virtual machine can interact independently with other devices, applications, data and users as though it were a separate physical resource  DRM(Distributed Resource Management)  Virtualized cluster  Para-Virtualization / Full-Virtualization

정 범 종정 범 종 Embedded System Lab. Backgound Live migration  The process of moving a running virtual machine or application between different physical machines without disconnecting the client or application  High resource utilization and energy savings Microarchitecture / Performance counters  Performance Monitoring Unit  hardware performance counters To provide clear and accurate performance information to the software developer IPC (Instruction Per Cycle)

정 범 종정 범 종 Embedded System Lab. Problems & Challenges DRM schemes usually use operating-system-level metrics  CPU utilization, memory capacity demand and I/O utilization DRM schemes are oblivious to microarchitecture-level resource interference A-DRM takes into account microarchitecture-level resource interference  when making migration decisions in a virtualized cluster

정 범 종정 범 종 Embedded System Lab. Reference Paper Cuanta: Quantifying Effects of Shared On-chip Resource Interference for Consolidated Virtual Machines this paper focus on the performance impact of consolidated applications due to shared on-chip resources such as the lastlevel cache space and memory bandwidth An average prediction error of less than 4% is achieved across a wide variety of benchmark workload

정 범 종정 범 종 Embedded System Lab. A-DRM : Design Profiler  monitor resource usage/demands and report them to the controller periodically  Composition CPU and Memory profiler, architectural resource profiler Controller  detect microarchitecture-level shared resource interference  leverage this information to perform VM migration.  Composition Profiling Engine, Architecture-aware Interference Detector, Architecture-aware DRM policy, Migration Engine

정 범 종정 범 종 Embedded System Lab. A-DRM : Design - Controller Profiling Engine  The profiling engine stores the data collected by the profiler Architecture-aware Interference Dectector  It is invoked at each scheduling interval to detect microarchitecture-level shared resource interference Architecture-aware DRM policy  It is used to determine new VM-to-Host mappings to mitigate the detected interference  computes the increase in LLC miss rates at each potential destination host, to quantify the cost and benefit / Migration Engine  The migration engine is then invoked to achieve the new VM-to-Host mappings via VM migration

정 범 종정 범 종 Embedded System Lab. A-DRM : Implementation A-DRM use the Linux performance monitoring tool perf to access the hardware performance counters Memory Bandwidth Measurement in NUMA System  Cost-Benefit Analysis  Cost  VM Migration  Performance Degradation at dst Benefit  Performance Improvement of vm  Performance Improvement at src

정 범 종정 범 종 Embedded System Lab. Evaluation Workload Characterization  there is no strong correlation between memory capacity demand and memory bandwidth (left figure)  generally, workloads that consume low memory bandwidth exhibit a high LLC hit ratio (right figure)

정 범 종정 범 종 Embedded System Lab. Evaluation A-DRM Case Study we conclude that by migrating VMs appropriately using online measurement of microarchitecture-level resource usage

정 범 종정 범 종 Embedded System Lab. Conclusion A-DRM can enhance the performance of virtual machines by up to 26.55% (average of 9.67%), A-DRM improves the average cluster-wide memory bandwidth utilization by 17% (up to 36%) Results show that being aware of microarchitecture-level shared resource usage can enable A-DRM scheme to make more effective migration decisions

정 범 종정 범 종 Embedded System Lab. Q & A

정 범 종정 범 종 Embedded System Lab. Architecture-aware Interference Detector Architecture-aware DRM policy

정 범 종정 범 종 Embedded System Lab. Evaluation Performance Studies for Heterogeneous Workloads

정 범 종정 범 종 Embedded System Lab. Evaluation Sensitivity to Workload Intensity

정 범 종정 범 종 Embedded System Lab. Evaluation Parameter Sensitivity  The performance of A-DRM can be affected by control knobs such as the MBW_Threshold, live migration timeout, and the sliding window size  evaluate the impact of these different parameters