Grid Appliance The World of Virtual Resource Sharing Group # 14 Dhairya Gala Priyank Shah.

Slides:



Advertisements
Similar presentations
INTRODUCTION TO SIMULATION WITH OMNET++ José Daniel García Sánchez ARCOS Group – University Carlos III of Madrid.
Advertisements

Education and training on FutureGrig Salt Lake City, Utah July 18 th 2011 Presented by Renato Figueiredo
SLA-Oriented Resource Provisioning for Cloud Computing
Locality-Aware Dynamic VM Reconfiguration on MapReduce Clouds Jongse Park, Daewoo Lee, Bokyeong Kim, Jaehyuk Huh, Seungryoul Maeng.
SALSA HPC Group School of Informatics and Computing Indiana University.
Advanced Computing and Information Systems laboratory Educational Virtual Clusters for On- demand MPI/Hadoop/Condor in FutureGrid Renato Figueiredo Panoat.
IMapReduce: A Distributed Computing Framework for Iterative Computation Yanfeng Zhang, Northeastern University, China Qixin Gao, Northeastern University,
Fault-tolerant Adaptive Divisible Load Scheduling Xuan Lin, Sumanth J. V. Acknowledge: a few slides of DLT are from Thomas Robertazzi ’ s presentation.
Parallel K-Means Clustering Based on MapReduce The Key Laboratory of Intelligent Information Processing, Chinese Academy of Sciences Weizhong Zhao, Huifang.
VIRTUALISATION OF HADOOP CLUSTERS Dr G Sudha Sadasivam Assistant Professor Department of CSE PSGCT.
Reproducible Environment for Scientific Applications (Lab session) Tak-Lon (Stephen) Wu.
SALSASALSASALSASALSA Performance Analysis of High Performance Parallel Applications on Virtualized Resources Jaliya Ekanayake and Geoffrey Fox Indiana.
Iterative computation is a kernel function to many data mining and data analysis algorithms. Missing in current MapReduce frameworks is collective communication,
Ch 4. The Evolution of Analytic Scalability
10 -1  The Term Project demands in-depth research and investigated reporting. All reported contents, figures, and tables must be originally generated.
G RID R ESOURCE BROKER FOR SCHEDULING COMPONENT - BASED APPLICATIONS ON DISTRIBUTED RESOURCES Reporter : Yi-Wei Wu.
Applying Twister to Scientific Applications CloudCom 2010 Indianapolis, Indiana, USA Nov 30 – Dec 3, 2010.
Cyberaide Virtual Appliance: On-demand Deploying Middleware for Cyberinfrastructure Tobias Kurze, Lizhe Wang, Gregor von Laszewski, Jie Tao, Marcel Kunze,
CS492: Special Topics on Distributed Algorithms and Systems Fall 2008 Lab 3: Final Term Project.
Advanced Computing and Information Systems laboratory Plug-and-play Virtual Appliance Clusters Running Hadoop Dr. Renato Figueiredo ACIS Lab - University.
Connecting OurGrid & GridSAM A Short Overview. Content Goals OurGrid: architecture overview OurGrid: short overview GridSAM: short overview GridSAM: example.
Mantychore Oct 2010 WP 7 Andrew Mackarel. Agenda 1. Scope of the WP 2. Mm distribution 3. The WP plan 4. Objectives 5. Deliverables 6. Deadlines 7. Partners.
Location-aware MapReduce in Virtual Cloud 2011 IEEE computer society International Conference on Parallel Processing Yifeng Geng1,2, Shimin Chen3, YongWei.
Submitted by: Shailendra Kumar Sharma 06EYTCS049.
การติดตั้งและทดสอบการทำคลัสเต อร์เสมือนบน Xen, ROCKS, และไท ยกริด Roll Implementation of Virtualization Clusters based on Xen, ROCKS, and ThaiGrid Roll.
Grid Appliance – On the Design of Self-Organizing, Decentralized Grids David Wolinsky, Arjun Prakash, and Renato Figueiredo ACIS Lab at the University.
An Autonomic Framework in Cloud Environment Jiedan Zhu Advisor: Prof. Gagan Agrawal.
1 Time & Cost Sensitive Data-Intensive Computing on Hybrid Clouds Tekin Bicer David ChiuGagan Agrawal Department of Compute Science and Engineering The.
1 A Framework for Data-Intensive Computing with Cloud Bursting Tekin Bicer David ChiuGagan Agrawal Department of Compute Science and Engineering The Ohio.
Large Scale Sky Computing Applications with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes – Bretagne Atlantique Rennes, France
Simplifying Resource Sharing in Voluntary Grid Computing with the Grid Appliance David Wolinsky Renato Figueiredo ACIS Lab University of Florida.
A Cloudy View on Computing Workshop and CReSIS Field Data Accessibility Jerome E. Mitchell Indiana University.
SALSA HPC Group School of Informatics and Computing Indiana University.
BOF: Megajobs Gracie: Grid Resource Virtualization and Customization Infrastructure How to execute hundreds of thousands tasks concurrently on distributed.
A Hierarchical MapReduce Framework Yuan Luo and Beth Plale School of Informatics and Computing, Indiana University Data To Insight Center, Indiana University.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Scalarm: Scalable Platform for Data Farming D. Król, Ł. Dutka, M. Wrzeszcz, B. Kryza, R. Słota and J. Kitowski ACC Cyfronet AGH KU KDM, Zakopane, 2013.
Flexibility, Manageability and Performance in a Grid Storage Appliance John Bent, Venkateshwaran Venkataramani, Nick Leroy, Alain Roy, Joseph Stanley,
1 Development of a High-Throughput Computing Cluster at Florida Tech P. FORD, R. PENA, J. HELSBY, R. HOCH, M. HOHLMANN Physics and Space Sciences Dept,
Using Map-reduce to Support MPMD Peng
 Frequent Word Combinations Mining and Indexing on HBase Hemanth Gokavarapu Santhosh Kumar Saminathan.
Supporting Load Balancing for Distributed Data-Intensive Applications Leonid Glimcher, Vignesh Ravi, and Gagan Agrawal Department of ComputerScience and.
Big data Usman Roshan CS 675. Big data Typically refers to datasets with very large number of instances (rows) as opposed to attributes (columns). Data.
MapReduce & Hadoop IT332 Distributed Systems. Outline  MapReduce  Hadoop  Cloudera Hadoop  Tutorial 2.
Copyright © 2012, SAS Institute Inc. All rights reserved. SAS ® GRID AT PHAC SAS OTTAWA PLATFORM USERS SOCIETY, NOVEMBER 2012.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Performance guided scheduling in GENIE through ICENI
Using Map-reduce to Support MPMD Peng
Evaluating Clouds for Smart Grid Computing: early Results using GE MARS App Ketan Maheshwari
Group # 14 Dhairya Gala Priyank Shah. Introduction to Grid Appliance The Grid appliance is a plug-and-play virtual machine appliance intended for Grid.
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
Introduction to Makeflow and Work Queue Nicholas Hazekamp and Ben Tovar University of Notre Dame XSEDE 15.
Building on virtualization capabilities for ExTENCI Carol Song and Preston Smith Rosen Center for Advanced Computing Purdue University ExTENCI Kickoff.
Item Based Recommender System SUPERVISED BY: DR. MANISH KUMAR BAJPAI TARUN BHATIA ( ) VAIBHAV JAISWAL( )
Implementation of Classifier Tool in Twister Magesh khanna Vadivelu Shivaraman Janakiraman.
Big Data is a Big Deal!.
Introduction to Distributed Platforms
By Chris immanuel, Heym Kumar, Sai janani, Susmitha
Dynamic Deployment of VO Specific Condor Scheduler using GT4
GWE Core Grid Wizard Enterprise (
Distributed Operating Systems
Overview Introduction VPS Understanding VPS Architecture
Introduction to Makeflow and Work Queue
B534 distributed computing
Applying Twister to Scientific Applications
湖南大学-信息科学与工程学院-计算机与科学系
Ch 4. The Evolution of Analytic Scalability
Twister4Azure : Iterative MapReduce for Azure Cloud
Group 15 Swathi Gurram Prajakta Purohit
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

Grid Appliance The World of Virtual Resource Sharing Group # 14 Dhairya Gala Priyank Shah

Agenda Introduction to Grid Appliances Implementation Results and Conclusions Knowledge Gained Implementation Timeline Future Work Acknowledgements

Introduction to Grid Appliance The Grid appliance is a plug-and-play virtual machine appliance intended for Grid computing to execute many long-running simulations concurrently in resources across virtual machines that can be distributed across the world.

Features  Setting up on demand ad-hoc virtual resource pools.  OS independent.  Use of VM’s helps run multiple operating systems simultaneously and isolated from each other, but on the same computer.  Application independent.  Easy to configure and use.  Plug and play solution that can run on the desktop.

Grid Appliance - Insight  Virtual machine with its own operating system.  Linux and Grid middleware  Connects individual appliances in the virtual network using IPOP at P2P level.  Uses Condor for job scheduling.

Implementation  Configure Twister Appliance environment  Package Twister Iterative Map-Reduce along with the Grid Appliance framework  Run distributed applications over the configured Twister Appliance Virtual network  Run distributed applications with same configuration over a standard cluster  Evaluate the computational performance of the two environments

Evaluation Parameters  Applications  Kmeans Clustering  WordCount  PageRank Algorithm  Number of compute nodes  Nodes  Input data size  Kmeans Clustering - 80K to 3 million (data points)  WordCount GB to 0.5 GB (text file size)  PageRank Algorithm - 50K to 300K (number of URL’s)

Results(1) Execution Time: Grid Appliance > Standard Cluster

Results(2) With increase in the number of nodes, it behaves as any other environment running distributed application would.

Results(3) Speed-up: Not ideal but acceptable.

Conclusions  Grid Appliance brings with it a small but an acceptable dip in the system performance,  But is accompanied by,  Independence from cloud service providers  Expected results  Ease of configuration  Ease of resource availability due to its ad-hoc nature  Real World Applicability

Knowledge Gained  In-depth understanding of Grid Appliance Framework  Closer look at Twister Framework  Team Work  Designing a Poster

Timeline TaskSchedule Configure Twister Appliance environmentOct 24, 2011 Running distributed applications over a standard cluster Oct 31, 2011 Run distributed applications over the configured Twister Appliance Virtual network Nov 14, 2011 Evaluate the computational performance of the two environments Nov 28, 2011

Future Work Perform intensive Testing by:  Running more data parallel applications  Increasing the number of compute nodes  Increasing the data size used for the applications  Running the application for a large number of times and at different times of the day  Conducting some kind of a reliability check

Acknowledgements  Prof. Judy Qiu  Stephen Wu  FutureGrid Team Special Thanks to Prof. Renato Figueiredo (University of Florida)

Thank You !! Questions ??