Presentation is loading. Please wait.

Presentation is loading. Please wait.

Virtual Machine in HPC PAK MARKTHUB (13M54040) 1 VIRTUAL MACHINE IN HPC.

Similar presentations


Presentation on theme: "Virtual Machine in HPC PAK MARKTHUB (13M54040) 1 VIRTUAL MACHINE IN HPC."— Presentation transcript:

1 Virtual Machine in HPC PAK MARKTHUB (13M54040) 1 VIRTUAL MACHINE IN HPC

2 2 Review paper “Evaluation of Virtual Machine Scalability on Distributed Multi/Many-core Processors for Big Data Analytics” [2012 IEEE Conference on Open Systems (ICOS)] Amril Nazir, Yaszrina Mohamad Yassin, Chong Poh Kit, Ettikan Kandasamy Karuppiah MIMOS Bhd, Technology Park Malaysia VIRTUAL MACHINE IN HPC

3 3 Review paper “Performance Overhead among Three Hypervisors: An Experimental Study Using Hadoop Benchmarks” [2013 IEEE International Congress on Big Data (BigData Congress)] Jack Li, Qingyang Wang, Deepal Jayasinghe, Junhee Park, Tao Zhu, Calton Pu School of Computer Science at the College of Computing Georgia Institute of Technology VIRTUAL MACHINE IN HPC

4 4 Review paper “Cooperative VM migration for a virtualized HPC cluster with VMM-bypass I/O devices” [2012 IEEE 8th International Conference on E-Science (e-Science)] Ryousei Takano, Hidemoto Nakada, Takahiro Hirofuchi, Yoshio Tanaka, and Tomohiro Kudoh National Institute of Advanced Industrial Science and Technology (AIST) VIRTUAL MACHINE IN HPC

5 Outline ◦A Brief Introduction to Virtual Machine Technologies ◦Evaluation of Virtual Machine Scalability on Distributed Multi/Many-core Processors for Big Data Analytics ◦Performance Overhead among Three Hypervisors: An Experimental Study Using Hadoop Benchmarks 5 VIRTUAL MACHINE IN HPC

6 A Brief Introduction to Virtual Machine Technologies 6 VIRTUAL MACHINE IN HPC

7 Cloud Computing The phrase is also more commonly used to refer to network- based services which appear to be provided by real server hardware, which in fact are served up by virtual hardware, simulated by software running on one or more real machines. Such virtual servers do not physically exist and can therefore be moved around and scaled up (or down) on the fly without affecting the end user—arguably, rather like a cloud. 7 Source: http://en.wikipedia.org/wiki/Cloud_computing VIRTUAL MACHINE IN HPC

8 Advantages of Cloud Computing ◦Economies of scale ◦Scalability ◦Hardware independent ◦Fault tolerant ◦Etc. 8 VIRTUAL MACHINE IN HPC

9 Cloud Computing & Big Data 9 Source: Gartner VIRTUAL MACHINE IN HPC

10 Virtual Machine (VM) A virtual machine (VM) is a software implementation of a machine (i.e. a computer) that executes programs like a physical machine. ◦2 classifications: ◦System virtual machine ◦Process virtual machine 10 Source: http://en.wikipedia.org/wiki/Virtual_machine VIRTUAL MACHINE IN HPC

11 VM Architecture 11 Source: http://docs.oracle.com/cd/E15458_01 /doc.22/e15444/ovmserver.htm VIRTUAL MACHINE IN HPC

12 Evaluation of Virtual Machine Scalability on Distributed Multi/Many-core Processors for Big Data Analytics 2012 IEEE Conference on Open Systems (ICOS) Amril Nazir, Yaszrina Mohamad Yassin, Chong Poh Kit, Ettikan Kandasamy Karuppiah 12 VIRTUAL MACHINE IN HPC

13 Abstract - Summarized ◦Objective: Evaluate how well large-scale distributed data analysis application can run in virtualized environment. ◦Why?: ◦Data analytics in Cloud computing is very attractive for small and medium organizations. ◦Non-expert user can provision resources as VMs on Cloud very easily. ◦Results: ◦One should minimize the number of VMs deployed for each application. ◦One should equip each VM with sufficient memory and reasonable number of CPU cores. 13 VIRTUAL MACHINE IN HPC

14 Financial Application Architecture 14 Data extraction ◦Process of identifying source and collection data ◦Source of data: Bursa, NASDAQ, LSE, etc. Data pre-processing ◦Process of transforming data into appropriate format for data analysis ◦Ex: Text matching, mismatched/duplicated records identification/elimination. Data analysis ◦Sometimes data are too large for processing in 1 machines ◦Cloud can be useful to distribute tasks on- demand. VIRTUAL MACHINE IN HPC

15 Distributed Financial Application Architecture 15 VIRTUAL MACHINE IN HPC

16 Computational Needs for Financial Application Data intensive ◦Extraction of financial data from source. ◦Need high-memory machines for extraction. Compute intensive ◦Data pre-processing ◦Data analysis 16 VIRTUAL MACHINE IN HPC

17 Experiment – Environment ◦Use 4 physical machines (with Hypervisor) ◦1 Gb Ethernet connection ◦VM (RHEL5 with kernel 2.6.33) ◦Use RapidMiner as a financial application 17 VIRTUAL MACHINE IN HPC

18 Experiment I ◦Increase problem size. ◦Observe how much time is used for finishing the task for each configuration. 18 Task CPU 1 core CPU Task 4 cores CPU VM Task 1 core per VM CPU Task VM 4 cores per VM CPU Task VM 4 VMs 1 core each NameCPUMemoryHypervisor phy44 2.00GHz-Xeon3.2GBNot state VIRTUAL MACHINE IN HPC

19 Experiment I - Result 19 NameCPUMemoryHypervisor phy44 2.00GHz-Xeon3.2GBNot state VIRTUAL MACHINE IN HPC

20 Experiment II ◦Fix the problem size (the size of securities) ◦Vary resources assigned to a VM ◦The number of CPU cores ◦Amount of memory ◦Observe time used for executing the task 20 NameCPUMemoryHypervisor phy18 2.4GHz-Xeon10.5GBXen VIRTUAL MACHINE IN HPC

21 Experiment II - Result 21 Size of SecuritiesMemory (GB)1 core per VM4 cores per VM8 cores per VM 32273(16,17)(119,1430) 321071(12,13)(14,18) NameCPUMemoryHypervisor phy18 2.4GHz-Xeon10.5GBXen VIRTUAL MACHINE IN HPC

22 Experiment III ◦Fix the problem size (32 securities) ◦Fix VM configuration (4 cores, 2 GB) ◦Vary physical machines (phy1 VS phy4) ◦Observe the task execution time 22 NameCPUMemoryHypervisor phy18 2.4GHz-Xeon10.5GBXen phy44 2.00GHz-Xeon3.2GBNot state VIRTUAL MACHINE IN HPC

23 Experiment III - Result 23 NameCPUMemoryHypervisor phy18 2.4GHz-Xeon10.5GBXen phy44 2.00GHz-Xeon3.2GBNot state Machine1 core per VM4 cores per VM phy173(16,17) phy4104(25,34) VIRTUAL MACHINE IN HPC

24 Experiment IV ◦Compare the execution time of ◦4 cores per VM ◦2 VMs with 2 cores each 24 NameCPUMemoryHypervisor phy44 2.00GHz-Xeon3.2GBNot state VIRTUAL MACHINE IN HPC

25 Experiment IV - Result 25 NameCPUMemoryHypervisor phy44 2.00GHz-Xeon3.2GBNot state VIRTUAL MACHINE IN HPC

26 Experiment V – VM Provisioning ◦By the previous experiments, the sufficient amount of memory for each CPU core is around 1024 MB. ◦Provision VM using memory per cores constrain VS OpenNebula VM provisioning which uses round robin. 26 VIRTUAL MACHINE IN HPC

27 Experiment V - Result 27 av=the available machine memory m=the optimal memory per core set by user (1024 MB in this experiment) n=the number of cores assigned to VM NameCPUMemoryHypervisor phy18 2.4GHz-Xeon10.5GBXen phy24 2.53GHz-Xeon12GBKVM phy38 3.2GHz-i73GBKVM phy44 2.00GHz-Xeon3.2GBNot state VIRTUAL MACHINE IN HPC

28 Conclusion ◦One should minimize the number of VMs deployed for each application. ◦One should equip a VM with sufficient memory. ◦There is a need for more effective discovery, scheduling, and load balancing than the typical round robin employed by current Cloud middleware. 28 VIRTUAL MACHINE IN HPC

29 My opinions STRONG POINTS ◦Experiment with various configurations of VMs. ◦The results can directly use for improving nowadays VM provisioning software. WEAK POINTS ◦Experimental datasets are too small. ◦Only concern about data- and compute-intensive workload. ◦Experiment V does not concern about the effect of different hypervisors. 29 VIRTUAL MACHINE IN HPC

30 30 Questions Questions And And Answers Answers VIRTUAL MACHINE IN HPC

31 Performance Overhead among Three Hypervisors: An Experimental Study Using Hadoop Benchmarks 2013 IEEE International Congress on Big Data (BigData Congress) Jack Li, Qingyang Wang, Deepal Jayasinghe, Junhee Park, Tao Zhu, Calton Pu 31 VIRTUAL MACHINE IN HPC

32 Abstract - Summarized ◦Objective: Benchmark the performance of 3 popular hypervisors using various type of Hadoop workloads ◦Why?: Hypervisors are widely used in Cloud environment ◦Results: ◦Workload type, workload size, and VM placement yielded performance differences amount different Hypervisors. ◦CPU-bound workload has negligible performance differences among different Hypervisors, however I/O- bound workload has significant differences. 32 VIRTUAL MACHINE IN HPC

33 Testing Environment ◦Benchmark 3 hypervisors: KVM, Xen, and a commercial hypervisor (CVM – not state which one is used due to license). ◦Run Hadoop MapReduce 4- node virtual cluster in the same physical machine. ◦Each VM node has 1 virtual core pinned to a different physical core. 33 VIRTUAL MACHINE IN HPC

34 CPU-Bound Benchmark 34 VIRTUAL MACHINE IN HPC

35 TestDFSIO Write & Filebench Write 35 VIRTUAL MACHINE IN HPC

36 TestDFSIO Read & Filebench Read 36 VIRTUAL MACHINE IN HPC

37 TestDFSIO Read CPU Utilization 37 VIRTUAL MACHINE IN HPC

38 TeraSort Benchmark 38 VIRTUAL MACHINE IN HPC

39 TeraSort – Why Xen is good? 39 VIRTUAL MACHINE IN HPC

40 Effects of Adding VMs on the same Host 40 VIRTUAL MACHINE IN HPC

41 Effects of Adding VMs on the same Host: Read/Write Throughput 41 VIRTUAL MACHINE IN HPC

42 Effect of Changing VM Placement 42 VIRTUAL MACHINE IN HPC

43 Conclusion ◦CPU-bound benchmark: no significant performance difference. ◦KVM is the best for disk reading. ◦CVM is the best for disk writing. ◦Xen is the best for CPU-and-I/O-intensive task. ◦Increasing the number of VMs per host is not good for I/O-intensive workload, but may be good for CPU-and- I/O-intensive one. 43 VIRTUAL MACHINE IN HPC

44 My opinions STRONG POINTS ◦Benchmark datasets are quite large. ◦Compare many aspects of different hypervisors. WEAK POINTS ◦Don’t have any physical-host benchmark results. ◦Putting 4 VMs in the same host does not reflect real- world provisioning pattern. 44 VIRTUAL MACHINE IN HPC

45 45 Questions Questions And And Answers Answers VIRTUAL MACHINE IN HPC

46 46 The End The End VIRTUAL MACHINE IN HPC

47 Supporting Slides 47 VIRTUAL MACHINE IN HPC

48 KVM Overview 48 Source: http://slid.es/braoru/kvm VIRTUAL MACHINE IN HPC

49 KVM – Host Interaction 49 Source: http://slid.es/braoru/kvm Userspace VIRTUAL MACHINE IN HPC

50 KVM – VIRTIO Device 50 Source: http://slid.es/braoru/kvm VIRTUAL MACHINE IN HPC

51 Hadoop Map-Reduce 51 Source: http://architects.dzone.com/articles/how-hadoop-mapreduce-works VIRTUAL MACHINE IN HPC

52 Hadoop Word Count 52 Source: http://kickstarthadoop.blogspot.jp/2011/04/word-count-hadoop-map-reduce-example.html VIRTUAL MACHINE IN HPC


Download ppt "Virtual Machine in HPC PAK MARKTHUB (13M54040) 1 VIRTUAL MACHINE IN HPC."

Similar presentations


Ads by Google