NTU Cloud 2010/05/30. System Diagram Architecture Gluster File System – Provide a distributed shared file system for migration NFS – A Prototype Image.


1 NTU Cloud 2010/05/30

2 System Diagram

3 Architecture
Gluster File System
– Provides a distributed shared file system for migration
NFS
– Storage space for the Prototype Image
[Diagram labels: Node, Gluster File System, Compute Img (C-Img), Storage Img (S-Img), NFS, Prototype Img]

4 Architecture
Prototype Image
– Original image, e.g. Hadoop, MPI
Compute Image
– Modified images for the user
– Content is not preserved after cluster shutdown
[Diagram labels: Node, Gluster File System, Compute Img (C-Img), Storage Img (S-Img), NFS, Prototype Img]

5 Xen
A hypervisor
Virtualization

6 Cloud Master
Monitors system state
Scheduling
Uses NFS to store the Prototype Image
Web server

7 OpenNebula
A middleware
Provides an interface to manage virtual infrastructure (computation and network)
VM migration
=> We use OpenNebula to manage VM deployment and migration, and to set up a virtual local area network (VLAN).

8 Gluster file system
User-level distributed file system
Client/server architecture
Uses TCP/IP to transfer data
=> We use GlusterFS to build our shared file system environment for VM live migration.
=> Our deployment is "symmetrical": every machine is both a server and a client.

9 System Flow

10 Hadoop Benchmark
Case 1
– M1: Master + Slave-01 + Slave-02
Case 2
– M1: Master
– M2: Slave-01 + Slave-02
Case 3
– M1: Master
– M3: Slave-01 + Slave-02
Case 4
– M1: Master
– M2: Slave-01
– M3: Slave-02

11 [Chart: Hadoop benchmark run time per placement; axis labels: Sec, Iteration]
Placement                         Sec
All in M1                         215.45
Slave-01, 02 in M2                188
Slave-01, 02 in M3                191.86
Slave-01 in M2, Slave-02 in M3    139.59
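To compare the four placements, each run time can be expressed as a percentage of the all-in-M1 time. The following is a minimal Python sketch, assuming the four chart values read 215.45, 188, 191.86 and 139.59 seconds in case order; the dictionary keys and the function name are illustrative and do not come from the slides.

```python
# Run times in seconds as read from the chart (assumed to be in Case 1..4 order).
run_times = {
    "Case 1: all in M1": 215.45,
    "Case 2: slaves in M2": 188.0,
    "Case 3: slaves in M3": 191.86,
    "Case 4: slaves in M2 and M3": 139.59,
}


def relative_to_baseline(times, baseline_key):
    """Return each run time as a percentage of the baseline run time."""
    base = times[baseline_key]
    return {name: 100.0 * seconds / base for name, seconds in times.items()}


relative = relative_to_baseline(run_times, "Case 1: all in M1")
for name, pct in relative.items():
    print(f"{name}: {run_times[name]:.2f} s ({pct:.1f}% of Case 1)")
```

Under these assumed readings, spreading the two slaves over two separate hosts gives the shortest run, and keeping everything on M1 gives the longest.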

12 Set 1
Set                  VM      Host Machine  VCPU  Mem   Purpose
1.1 Single machine   Master  M1            1     2.2G  Namenode+Datanode+Jobtracker+Tasktracker
1.1 Single machine   Worker  M1            1     1.2G  Datanode+Tasktracker
1.2 Two machines     Master  M1            1     2.2G  Namenode+Datanode+Jobtracker+Tasktracker
1.2 Two machines     Worker  M2            1     1.2G  Datanode+Tasktracker
M1 and M2 have the same CPU and memory size.
HADOOP_HEAPSIZE=500MB
mapred.child.java.opts=100MB
RandomWriter: 10M for 30 maps
Sorting: HDFS_BYTES_READ=210543161, HDFS_BYTES_WRITTEN=210541669
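The two Hadoop settings above can be written out as configuration entries. The sketch below is only an illustration in Python, under these assumptions: the slide's "mapred.child.java.opts=100MB" means a 100 MB maximum heap for child task JVMs (written as `-Xmx100m`), the property lives in a Hadoop 0.20-style `mapred-site.xml`, and `HADOOP_HEAPSIZE` is set in `hadoop-env.sh` with its value in megabytes. The slides do not state the Hadoop version or the file layout, and the helper name `hadoop_property` is hypothetical.

```python
import xml.etree.ElementTree as ET


def hadoop_property(name, value):
    """Build one <property> element in the Hadoop configuration file format."""
    prop = ET.Element("property")
    ET.SubElement(prop, "name").text = name
    ET.SubElement(prop, "value").text = value
    return prop


# Assumption: "100MB" on the slide stands for a 100 MB max heap per child JVM.
configuration = ET.Element("configuration")
configuration.append(hadoop_property("mapred.child.java.opts", "-Xmx100m"))
mapred_site_xml = ET.tostring(configuration, encoding="unicode")

# Assumption: the daemon heap size goes into hadoop-env.sh, in megabytes.
hadoop_env_line = "export HADOOP_HEAPSIZE=500"

print(mapred_site_xml)
print(hadoop_env_line)
```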

13 Sort
Therefore, putting the two VMs on one machine slows the sort down:
two machine / single machine = 88.92% (ratio of run times)
Launched reduce tasks=4
Others=3
Reduce shuffle bytes=203039958
Reduce shuffle bytes=199629523
Exactly the same!

14 Set 2
Set                  VM      Host Machine  VCPU  Mem   Purpose
2.1 Single machine   Master  M1            2     2.2G  Namenode+Datanode+Jobtracker+Tasktracker
2.1 Single machine   Worker  M1            2     1.2G  Datanode+Tasktracker
2.2 Two machines     Master  M1            2     2.2G  Namenode+Datanode+Jobtracker+Tasktracker
2.2 Two machines     Worker  M2            2     1.2G  Datanode+Tasktracker
1. RandomWriter: 10M for 30 maps
2. Sort
HADOOP_HEAPSIZE=500MB
mapred.child.java.opts=100MB

15 RandomWriter
Therefore, putting the two VMs on one machine slows RandomWriter down:
two machine / single machine = 80.70% (ratio of average run times, 46.00 s / 57.00 s)

16 RandomWriter
                 Single machine              Two machine
Iteration        Sec    HDFS_BYTES_WRITTEN   Sec    HDFS_BYTES_WRITTEN
1                55     210522479            49     210545910
2                55     210542022            46     210549359
3                57     210548147            47     210505092
4                55     210545917            44     210578791
5                63     210562466            44     210508035
Avg.             57.00  210544206.20         46.00  210537437.40
Avg. on 1,2,4    55.00  210536806.00         46.33  210558020.00
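The averages in this table and the 80.70% figure quoted for RandomWriter can be reproduced from the five iterations. Below is a minimal Python sketch; the tuple layout and the names `runs`, `mean` and `summarize` are illustrative choices, while the numbers are the ones on the slide.

```python
# Each row: (iteration, single_sec, single_bytes_written, two_sec, two_bytes_written)
runs = [
    (1, 55, 210522479, 49, 210545910),
    (2, 55, 210542022, 46, 210549359),
    (3, 57, 210548147, 47, 210505092),
    (4, 55, 210545917, 44, 210578791),
    (5, 63, 210562466, 44, 210508035),
]


def mean(values):
    """Arithmetic mean of an iterable of numbers."""
    values = list(values)
    return sum(values) / len(values)


def summarize(rows):
    """Average the run time and bytes written for both configurations."""
    return {
        "single_sec": mean(r[1] for r in rows),
        "single_bytes": mean(r[2] for r in rows),
        "two_sec": mean(r[3] for r in rows),
        "two_bytes": mean(r[4] for r in rows),
    }


all_runs = summarize(runs)
subset = summarize([r for r in runs if r[0] in (1, 2, 4)])

# Ratio of average run times: two-machine time over single-machine time.
ratio = all_runs["two_sec"] / all_runs["single_sec"]
print(f"Avg. single: {all_runs['single_sec']:.2f} s, two: {all_runs['two_sec']:.2f} s")
print(f"two machine / single machine = {ratio * 100:.2f}%")
```

The ratio is taken over all five iterations; the "Avg. on 1,2,4" row is available in `subset` for comparison.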

17 Sort

18 Current Progress
Xen 4.0 is ready on each node.
We can offer two kinds of images
– Hadoop
– MPI
VMs are started on the destination node automatically.
The MPI and Hadoop environments are configured for use automatically.

