Advanced Computing and Information Systems laboratory Educational Virtual Clusters for On- demand MPI/Hadoop/Condor in FutureGrid Renato Figueiredo Panoat.

Slides:



Advertisements
Similar presentations
Sponsors and Acknowledgments This work is supported in part by the National Science Foundation under Grants No. OCI , IIP and CNS
Advertisements

Advanced Computing and Information Systems laboratory Virtual Appliances and Education using Clouds Dr. Renato Figueiredo ACIS Lab - University of Florida.
Education and training on FutureGrig Salt Lake City, Utah July 18 th 2011 Presented by Renato Figueiredo
FutureGrid related presentations at TG and OGF Sun. 17th: Introduction to FutireGrid (OGF) Mon. 18th: Introducing to FutureGrid (TG) Tue. 19th –Educational.
Advanced Computing and Information Systems laboratory Virtual Private Clusters: Virtual Appliances and Networks in the Cloud Renato Figueiredo ACIS Lab.
SLA-Oriented Resource Provisioning for Cloud Computing
SALSA HPC Group School of Informatics and Computing Indiana University.
Implementing a menu based application in FutureGrid
Aneka: A Software Platform for .NET-based Cloud Computing
Center for Autonomic Computing Intel Portland, April 30, 2010 Autonomic Virtual Networks and Applications in Cloud and Collaborative Computing Environments.
Reproducible Environment for Scientific Applications (Lab session) Tak-Lon (Stephen) Wu.
Design Discussion Rain: Dynamically Provisioning Clouds within FutureGrid Geoffrey Fox, Andrew J. Younge, Gregor von Laszewski, Archit Kulshrestha, Fugang.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
Utility Computing Casey Rathbone 1http://cyberaide.org.edu.
Cyberaide Virtual Appliance: On-demand Deploying Middleware for Cyberinfrastructure Tobias Kurze, Lizhe Wang, Gregor von Laszewski, Jie Tao, Marcel Kunze,
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Customized cloud platform for computing on your terms !
Building service testbeds on FIRE D5.2.5 Virtual Cluster on Federated Cloud Demonstration Kit August 2012 Version 1.0 Copyright © 2012 CESGA. All rights.
Advanced Computing and Information Systems laboratory Plug-and-play Virtual Appliance Clusters Running Hadoop Dr. Renato Figueiredo ACIS Lab - University.
Promoting Open Source Software Through Cloud Deployment: Library à la Carte, Heroku, and OSU Michael B. Klein Digital Applications Librarian
Creating an EC2 Provisioning Module for VCL Cameron Mann & Everett Toews.
Science Clouds and FutureGrid’s Perspective June Science Clouds Workshop HPDC 2012 Delft Geoffrey Fox
Grid Appliance – On the Design of Self-Organizing, Decentralized Grids David Wolinsky, Arjun Prakash, and Renato Figueiredo ACIS Lab at the University.
Presented by: Sanketh Beerabbi University of Central Florida COP Cloud Computing.
1 Evolution of OSG to support virtualization and multi-core applications (Perspective of a Condor Guy) Dan Bradley University of Wisconsin Workshop on.
The Open Source Virtual Lab: a Case Study Authors: E. Damiani, F. Frati, D. Rebeccani, M. Anisetti, V. Bellandi and U. Raimondi University of Milan Department.
Grids, Clouds and the Community. Cloud Technology and the NGS Steve Thorn Edinburgh University Matteo Turilli, Oxford University Presented by David Fergusson.
FutureGrid Dynamic Provisioning Experiments including Hadoop Fugang Wang, Archit Kulshrestha, Gregory G. Pike, Gregor von Laszewski, Geoffrey C. Fox.
Advanced Computing and Information Systems laboratory Virtual Appliances for Training and Education in FutureGrid Renato Figueiredo Arjun Prakash, David.
Large Scale Sky Computing Applications with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes – Bretagne Atlantique Rennes, France
608D CloudStack 3.0 Omer Palo Readiness Specialist, WW Tech Support Readiness May 8, 2012.
Simplifying Resource Sharing in Voluntary Grid Computing with the Grid Appliance David Wolinsky Renato Figueiredo ACIS Lab University of Florida.
Center for Autonomic Computing Intel Portland, April 30, 2010 Autonomic Virtual Networks and Applications in Cloud and Collaborative Computing Environments.
ETICS All Hands meeting Bologna, October 23-25, 2006 NMI and Condor: Status + Future Plans Andy PAVLO Peter COUVARES Becky GIETZEL.
Magellan: Experiences from a Science Cloud Lavanya Ramakrishnan.
SALSA HPC Group School of Informatics and Computing Indiana University.
FutureGrid Cyberinfrastructure for Computational Research.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Grid and Cloud Computing Globus Provision Dr. Guy Tel-Zur.
Grid Computing at Yahoo! Sameer Paranjpye Mahadev Konar Yahoo!
SALSASALSASALSASALSA FutureGrid Venus-C June Geoffrey Fox
Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University.
NA61/NA49 virtualisation: status and plans Dag Toppe Larsen CERN
GLIDEINWMS - PARAG MHASHILKAR Department Meeting, August 07, 2013.
Virtual Appliances CTS Conference 2011 Philadelphia May Geoffrey Fox
Design Discussion Rain: Dynamically Provisioning Clouds within FutureGrid PI: Geoffrey Fox*, CoPIs: Kate Keahey +, Warren Smith -, Jose Fortes #, Andrew.
The Eucalyptus Open-source Cloud Computing System Daniel Nurmi Rich Wolski, Chris Grzegorczyk, Graziano Obertelli, Sunil Soman, Lamia Youseff, Dmitrii.
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Grid Appliance The World of Virtual Resource Sharing Group # 14 Dhairya Gala Priyank Shah.
CSE 548 Advanced Computer Network Security Trust in MobiCloud using Hadoop Framework Updates Sayan Kole Jaya Chakladar Group No: 1.
Application Programming in Cloud via Swift Swift Tutorial, CCGrid 2013, Hour 2 Ketan Maheshwari.
Grid testing using virtual machines Stephen Childs*, Brian Coghlan, David O'Callaghan, Geoff Quigley, John Walsh Department of Computer Science Trinity.
| Basel Fabric Management with Virtual Machine Manager Philipp Witschi – Cloud Architect & Microsoft vTSP Thomas Maurer – Cloud Architect & Microsoft MVP.
Group # 14 Dhairya Gala Priyank Shah. Introduction to Grid Appliance The Grid appliance is a plug-and-play virtual machine appliance intended for Grid.
EGI-InSPIRE RI EGI Webinar EGI-InSPIRE RI Porting your application to the EGI Federated Cloud 17 Feb
Claudio Grandi INFN Bologna Virtual Pools for Interactive Analysis and Software Development through an Integrated Cloud Environment Claudio Grandi (INFN.
Information Initiative Center, Hokkaido University North 11, West 5, Sapporo , Japan Tel, Fax: General.
DIRAC for Grid and Cloud Dr. Víctor Méndez Muñoz (for DIRAC Project) LHCb Tier 1 Liaison at PIC EGI User Community Board, October 31st, 2013.
Architecture of a platform for innovation and research Erik Deumens – University of Florida SC15 – Austin – Nov 17, 2015.
Introductory Tutorial: OpenStack, Chef, Hadoop, Hbase, Pig I590 Data Science Curriculum Big Data Open Source Software and Projects September Geoffrey.
Building on virtualization capabilities for ExTENCI Carol Song and Preston Smith Rosen Center for Advanced Computing Purdue University ExTENCI Kickoff.
Progress on NA61/NA49 software virtualisation Dag Toppe Larsen Wrocław
ATLAS Cloud Operations
Haiyan Meng and Douglas Thain
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Sky Computing on FutureGrid and Grid’5000
Module 01 ETICS Overview ETICS Online Tutorials
Using and Building Infrastructure Clouds for Science
Sky Computing on FutureGrid and Grid’5000
Presentation transcript:

Advanced Computing and Information Systems laboratory Educational Virtual Clusters for On- demand MPI/Hadoop/Condor in FutureGrid Renato Figueiredo Panoat Chuchaisri, David Wolinsky ACIS Lab - University of Florida

Advanced Computing and Information Systems laboratory 2 Goals and Approach A flexible, extensible platform for hands- on, education on parallel and distributed systems Focus on usability – lower entry barrier Plug and play, open-source Virtualization + virtual networking to create educational sandboxes Virtual appliances: self-contained, pre- packaged execution environments Group VPNs: simple management of virtual clusters by students and educators

Advanced Computing and Information Systems laboratory 3 Guiding principles Full-blown, pre-configured, plug-and-play well-known middleware stacks Condor – high-throughput, workflows MPI – parallel Hadoop – data parallel, Map/Reduce Quick user start – minutes to first job Gain access to FutureGrid, or use desktop VMs Shared playground cluster Isolated sandbox clusters; flexibility Allow individuals, groups complete control over their virtual cluster - root access

Advanced Computing and Information Systems laboratory 4 Outline Overview Virtual appliances, networks FutureGrid resources FutureGrid accounts Deploying an appliance and connecting to playground virtual cluster Condor self-configuration Deploying MPI parallel jobs Deploying Hadoop pools

Advanced Computing and Information Systems laboratory 5 What is a virtual appliance? An appliance that packages software and configuration needed for a particular purpose into a virtual machine “image” The virtual appliance has no hardware – just software and configuration The image is a (big) file It can be instantiated on hardware Desktops: VMware, VirtualBox Clouds: FutureGrid, Amazon EC2

Advanced Computing and Information Systems laboratory 6 Grid appliances Baseline image: self-configures Condor copy instantiate Appliance image A Condor node Another Condor node Repeat… Virtualization Layer

Advanced Computing and Information Systems laboratory 7 Virtual network, configuration P2P overlay used to self-organize virtual private network (VPN) of appliances Akin to Skype Virtual cluster; assign IP addresses in virtual space through DHCP – support existing middleware (Condor, MPI, Hadoop) P2P overlay also used to self-configure middleware Akin to Bonjour/UPnP Condor manager advertises itself; Condor workers discover and register with manager

Advanced Computing and Information Systems laboratory 8 FutureGrid resources Nimbus Eucalyptus Appliance image Education Training

Advanced Computing and Information Systems laboratory 9 Using FutureGrid – accounts 101 Create a portal account Can access and post content, manage profile Identity verification – no resources allocated, but users can interact with portal E.g. cloud computing class community page Create or join a project Request needs to be authorized, and resources granted Portal users can then be added to the project E.g. a cloud class instructor submits a project request; students request portal accounts; instructor uses portal to add students to class

Advanced Computing and Information Systems laboratory Web site – FG account 10

Advanced Computing and Information Systems laboratory 11 Using FutureGrid – cloud 101 Once a user has a portal account and project, he/she can use Nimbus or Eucalyptus to instantiate appliances on the different FutureGrid Clouds Tutorials show steps to deploy appliances with a single-line command Refer to portal.futuregrid.org Under “User information”: Getting started – to get accounts Using Clouds – Nimbus, Eucalyptus Pointers to relevant tutorials

Advanced Computing and Information Systems laboratory User perspective – first steps Deploying the baseline Grid appliance: Nimbus: cloud-client.sh --run --name grid-appliance gz --hours 24 Eucalyptus: euca-run-instance -k mykey -t c1.medium emi- E4ED1880 Wait a few minutes ssh You are connected to a pre-deployed ‘playground’ Condor cluster condor_status 12

Advanced Computing and Information Systems laboratory cloud_client.sh Joining Condor pool 13 Shared Playground Join P2P network Get DHCP address Discover Condor manager

Advanced Computing and Information Systems laboratory User perspective – running MPI User can install MPI on their appliance “Vanilla” MPI – just run a script to build Advanced classes - user can also deploy custom MPI stacks Condor is used to bootstrap MPI rings on demand with help of a script Takes executable and number of nodes Dispatches MPI daemons as Condor jobs Waits for all nodes to report Creates configuration based on nodes Submits MPI task Nodes auto-mount the MPI binaries over NFS 14

Advanced Computing and Information Systems laboratory MPI dynamic pools 15 mpi_submit.py –n 4 HelloWorld Shared Playground NFS read-only automount –n 2 HelloWorld NFS read-only automount

Advanced Computing and Information Systems laboratory User perspective – running Hadoop User can install Hadoop on their appliance “Vanilla” Hadoop – pre-installed Advanced classes - user can also deploy custom Hadoop stacks Condor is used to bootstrap Hadoop pools Takes number of nodes as input Dispatches namenodes, task trackers Waits for all nodes to report Creates configuration based on nodes Nodes auto-mount the Hadoop binaries over NFS After pool is configured, submit tasks, use Hadoop HDFS 16

Advanced Computing and Information Systems laboratory Hadoop dynamic pools - create 17 hadoop_condor.py –n 4 start Shared Playground NFS read-only automount –n 2 start NFS read-only automount

Advanced Computing and Information Systems laboratory Hadoop dynamic pools - run 18 hdfs dfsadmin hadoop jar app1 args Shared Playground NFS read-only automount hdfs dfsadmin hadoop jar app2 args NFS read-only automount

Advanced Computing and Information Systems laboratory Hadoop dynamic pools - teardown 19 hadoop_condor.py –n 4 stop Shared Playground NFS read-only automount –n 2 start NFS read-only automount

Advanced Computing and Information Systems laboratory 20 One appliance, multiple ways to run Allow same logical cluster environment to instantiate on a variety of platforms Local desktop, clusters; FutureGrid; EC2 Avoid dependence on host environment Make minimum assumptions about VM and provisioning software Desktop: VMware, VirtualBox; KVM Para-virtualized VMs (e.g. Xen) and cloud stacks – need to deal with idiosyncrasies Minimum assumptions about networking Private, NATed Ethernet virtual network interface

Advanced Computing and Information Systems laboratory 21 Creating private clusters The default ‘playground’ environment allows new users to quickly get started Users and instructors can also deploy their own private clusters The Condor pool becomes a dedicated resource Same appliance – what changes is a configuration file that specifies which virtual cluster to connect to Web interface to create groups

Advanced Computing and Information Systems laboratory Web site – GroupVPN 22

Advanced Computing and Information Systems laboratory Deploying private virtual pools 23 Dedicated Virtual pool cloud_client.sh –n 7 upload groupVPN configuration Student 1 Student 2

Advanced Computing and Information Systems laboratory 24 Summary Hands-on experience with clusters is essential for education and training Virtualization, clouds simplify software packaging/configuration Grid appliance allows users to easily deploy hands-on virtual clusters FutureGrid provides resources and cloud stacks for educators to easily deploy their own virtual clusters Towards a community-based marketplace of educational appliances

Advanced Computing and Information Systems laboratory 25 Thank you! More information: This document was developed with support from the National Science Foundation (NSF) under Grant No to Indiana University for "FutureGrid: An Experimental, High-Performance Grid Test-bed." Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the NSF

Advanced Computing and Information Systems laboratory 26