Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University.

Slides:



Advertisements
Similar presentations
Sponsors and Acknowledgments This work is supported in part by the National Science Foundation under Grants No. OCI , IIP and CNS
Advertisements

Advanced Computing and Information Systems laboratory Virtual Appliances and Education using Clouds Dr. Renato Figueiredo ACIS Lab - University of Florida.
Education and training on FutureGrig Salt Lake City, Utah July 18 th 2011 Presented by Renato Figueiredo
FutureGrid related presentations at TG and OGF Sun. 17th: Introduction to FutireGrid (OGF) Mon. 18th: Introducing to FutureGrid (TG) Tue. 19th –Educational.
Future Grid Early Projects 2010 User Advisory Board Meeting Pittsburgh, PA.
Clouds from FutureGrid’s Perspective April Geoffrey Fox Director, Digital Science Center, Pervasive.
FutureGrid Training, Education and Outreach Bloomington Indiana January Presented by Renato Figueiredo
Advanced Computing and Information Systems laboratory Educational Virtual Clusters for On- demand MPI/Hadoop/Condor in FutureGrid Renato Figueiredo Panoat.
Implementing a menu based application in FutureGrid
FutureGrid Image Repository: A Generic Catalog and Storage System for Heterogeneous Virtual Machine Images Javier Diaz, Gregor von Laszewski, Fugang Wang,
SALSASALSASALSASALSA Digital Science Center June 25, 2010, IIT Geoffrey Fox Judy Qiu School.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Design Discussion Rain: Dynamically Provisioning Clouds within FutureGrid Geoffrey Fox, Andrew J. Younge, Gregor von Laszewski, Archit Kulshrestha, Fugang.
Panel Session The Challenges at the Interface of Life Sciences and Cyberinfrastructure and how should we tackle them? Chris Johnson, Geoffrey Fox, Shantenu.
Big Data and Clouds: Challenges and Opportunities NIST January Geoffrey Fox
Eucalyptus on FutureGrid: A case for Eucalyptus 3 Sharif Islam, Javier Diaz, Geoffrey Fox Gregor von Laszewski Indiana University.
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
FutureGrid: an experimental, high-performance grid testbed Craig Stewart Executive Director, Pervasive Technology Institute Indiana University
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Cyber-Infrastructure in Education South Carolina State University Cyberinfrastructure Day March Geoffrey Fox
© Spinnaker Labs, Inc. Google Cluster Computing Faculty Training Workshop Open Source Tools for Teaching.
 Prototype for Course on Web Security ETEC 550.  Huge topic covering both system/network architecture and programming techniques.  Identified lack.
MapReduce TG11 BOF FutureGrid Team (Geoffrey Fox) TG11 19 July 2011 Downtown Marriott Salt Lake City.
Experimenting with FutureGrid CloudCom 2010 Conference Indianapolis December Geoffrey Fox
Advanced Computing and Information Systems laboratory The case for UF in PRAGMA Jose Fortes (also on behalf of Renato Figueiredo and Reed Beaman)
Science Clouds and FutureGrid’s Perspective June Science Clouds Workshop HPDC 2012 Delft Geoffrey Fox
Presented by: Sanketh Beerabbi University of Central Florida COP Cloud Computing.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
James Williams e: eTutor Project SUMMARY OF KEY FINDINGS for 2 Pilot studies of the.
FutureGrid Dynamic Provisioning Experiments including Hadoop Fugang Wang, Archit Kulshrestha, Gregory G. Pike, Gregor von Laszewski, Geoffrey C. Fox.
Large Scale Sky Computing Applications with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes – Bretagne Atlantique Rennes, France
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
FutureGrid Connection to Comet Testbed and On Ramp as a Service Geoffrey Fox Indiana University Infra structure.
Magellan: Experiences from a Science Cloud Lavanya Ramakrishnan.
FutureGrid Cyberinfrastructure for Computational Research.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Performance Evaluation of Image Conversion Module Based on MapReduce for Transcoding and Transmoding in SMCCSE Speaker : 吳靖緯 MA0G IEEE.
RAIN: A system to Dynamically Generate & Provision Images on Bare Metal by Application Users Presented by Gregor von Laszewski Authors: Javier Diaz, Gregor.
SALSASALSASALSASALSA FutureGrid Venus-C June Geoffrey Fox
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
Virtual Appliances CTS Conference 2011 Philadelphia May Geoffrey Fox
Design Discussion Rain: Dynamically Provisioning Clouds within FutureGrid PI: Geoffrey Fox*, CoPIs: Kate Keahey +, Warren Smith -, Jose Fortes #, Andrew.
Award # funded by the National Science Foundation Award #ACI Jetstream: A Distributed Cloud Infrastructure for.
Computing Research Testbeds as a Service: Supporting large scale Experiments and Testing SC12 Birds of a Feather November.
OOI-CYBERINFRASTRUCTURE OOI Cyberinfrastructure Education and Public Awareness Plan Cyberinfrastructure Design Workshop October 17-19, 2007 University.
Grid Appliance The World of Virtual Resource Sharing Group # 14 Dhairya Gala Priyank Shah.
Architecture & Cybersecurity – Module 3 ELO-100Identify the features of virtualization. (Figure 3) ELO-060Identify the different components of a cloud.
2005 GRIDS Community Workshop1 Learning From Cyberinfrastructure Initiatives Grid Research Integration Development & Support
Remarks on MOOC’s Open Grid Forum BOF July 24 OGF38B at XSEDE13 San Diego Geoffrey Fox Informatics, Computing.
SALSASALSASALSASALSA Digital Science Center February 12, 2010, Bloomington Geoffrey Fox Judy Qiu
Vignesh Ravindran Sankarbala Manoharan. Infrastructure As A Service (IAAS) is a model that is used to deliver a platform virtualization environment with.
This material is based upon work supported by the National Science Foundation under Grant No. ANT Any opinions, findings, and conclusions or recommendations.
SALSASALSA Dynamic Virtual Cluster provisioning via XCAT on iDataPlex Supports both stateful and stateless OS images iDataplex Bare-metal Nodes Linux Bare-
Directions in eScience Interoperability and Science Clouds June Interoperability in Action – Standards Implementation.
Grappling Cloud Infrastructure Services with a Generic Image Repository Javier Diaz Andrew J. Younge, Gregor von Laszewski, Fugang.
© 2012 Eucalyptus Systems, Inc. Cloud Computing Introduction Eucalyptus Education Services 2.
Architecture of a platform for innovation and research Erik Deumens – University of Florida SC15 – Austin – Nov 17, 2015.
Introductory Tutorial: OpenStack, Chef, Hadoop, Hbase, Pig I590 Data Science Curriculum Big Data Open Source Software and Projects September Geoffrey.
Building on virtualization capabilities for ExTENCI Carol Song and Preston Smith Rosen Center for Advanced Computing Purdue University ExTENCI Kickoff.
CyVerse Workshop Discovery Environment Overview. Welcome to the Discovery Environment A Simple Interface to Hundreds of Bioinformatics Apps, Powerful.
Organizations Are Embracing New Opportunities
Private Public FG Network NID: Network Impairment Device
Joslynn Lee – Data Science Educator
Status and Challenges: January 2017
NSF : CIF21 DIBBs: Middleware and High Performance Analytics Libraries for Scalable Data Science PI: Geoffrey C. Fox Software: MIDAS HPC-ABDS.
Sky Computing on FutureGrid and Grid’5000
Clouds from FutureGrid’s Perspective
Using and Building Infrastructure Clouds for Science
Sky Computing on FutureGrid and Grid’5000
Presentation transcript:

Hosting Cloud, HPC and Grid Educational Activities on FutureGrid Renato Figueiredo – U. of Florida Geoffrey Fox, Barbara Ann O’Leary – Indiana University

Introduction Some of the pressing needs faced by educators when teaching Cloud, Grid, HPC: Availability of suitable resources A flexible environment Fast turn-around times Reuse and sharing of educational materials FutureGrid sees education as one of its main use cases and seeks to address these needs Goals of this BOF: Highlight key aspects of FG and its use to date Benefit from your input and discussions to make it a primary resource used by you and the community for education activities 2

Needs addressed by FutureGrid Resources: FutureGrid is an XSEDE resource available to the community at large which provides unique opportunities for education Thousands of cores, multiple sites Heterogeneous: supporting HPC, cloud, Grid Flexibility: Cloud, virtualization, dynamic provisioning Environment can adapt to the user, rather than expect user to adapt to the environment 3

Needs addressed by FutureGrid Fast turn-around times Reduce barriers to entry and engage new users Web 2.0 portal to create class project and manage users; fast turn-around Reuse and sharing of educational materials Use of encapsulated “appliances” as a primary delivery mechanism of education/training modules Promoting reuse, replication, and sharing 4

Guiding principles Fidelity: activities should use full-fledged, executable software: education/training modules Learn using the proper tools Reproducibility: Creators of content should be able to install, configure, and test their modules once, and be assured of the same functional behavior regardless of where the module is deployed Incentive to invest effort in developing, testing and documenting new modules 5

Guiding principles Deployability: Students and users should be able to deploy modules in a simple manner, and in a variety of resources Reduce barriers to entry; avoid dependences upon a particular infrastructure Community-oriented: Modules should be simple to share, discover, reuse, and expand Create conditions for “viral” growth 6

FutureGrid Web Portal How classes are setup and managed Create a Web account Submit a project proposal: define a class, workshop, short course, tutorial Education is one of the key use cases of FG; eager to have more projects!!! Needs approval, but not tied to a fixed schedule Typically, turn-around of days Add users to project Users create own accounts using the portal Project leaders authorize them to gain access Students can then interactively use FG resources (e.g. to start VMs) Share and reuse – appliances, documents 7

Educational appliances A flexible, extensible platform for hands-on, lab-oriented education on FutureGrid Support clustering of resources Virtual private clusters: Virtual machines + virtual networking to create isolated, repeatable environments Virtual appliances: self-contained, pre-packaged VMs Virtual networks: simple management of virtual clusters by students and educators 8

9 Virtual appliance example Virtual appliance + virtual network copy instantiate Hadoop + Virtual Network A Hadoop worker Another Hadoop worker Repeat… Virtual machine Virtual network

Use of FutureGrid in classes Cloud/distributed computing classes U.of Florida, U. Central Florida, U. of Puerto Rico, Univ. of Piemonte Orientale (Italy), Univ. of Mostar (Croatia) Distributed scientific computing Louisiana State University Tutorials, workshops: Big Data for Science summer school A cloudy view on computing SC’11 tutorial – Clouds for science Science Cloud Summer School (July/Aug’12) 10

Cloud computing classes Massimo Canonico, U. Piemonte Orientale Difficulties to overcome: Hardware issues: find enough free physical machines able to host virtual machines Software issues: time to install/configure as many as possible different cloud platforms University was not able to provide me the necessary hardware and software support Students started to play with FutureGrid After attending few lessons, they were able to start/stop virtual instance with several Cloud Computing platforms 11

Cloud computing classes Students used Eucalyptus, OpenStack and Nimbus Half were not computer scientists. As FutureGrid freely shares their physical machines and their cloud platforms, decided to freely share all materials of my class. Hands-out, configuration files and link to useful documentation are available computing-class 12

Cloud computing classes Graduate-level “Cloud computing for Data-Intensive Sciences” (Judy Qiu, Fall 2010) Virtualization technologies and tools Infrastructure as a service Parallel programming (MPI, Hadoop) FutureGrid provided a set of software options that made it possible for students to work on different projects along the system stack. 13

Cloud Storage #8 Cloud Storage Survey (Xiaoming, Nixiaogang) Cloud Storage #8 Cloud Storage Survey (Xiaoming, Nixiaogang) Iterative MapReduce #3 LDA (Changsi, Yang) #4 MemCache (Saliya, Yiming,Jerome) #5 Avro (Yuduo, Yuan, patanachai) #6 PageRank (Shuo-Huan,Parag) Iterative MapReduce #3 LDA (Changsi, Yang) #4 MemCache (Saliya, Yiming,Jerome) #5 Avro (Yuduo, Yuan, patanachai) #6 PageRank (Shuo-Huan,Parag) Virtualization #9 Hypervisor Performance Analysis Project (James, Andrew) Virtualization #9 Hypervisor Performance Analysis Project (James, Andrew) Cloud Platform Cloud Infrastructu re Cloud Infrastructure #7 Nimbus, Eucalyptus (Stephen, Sonali, Shakeela) Cloud Infrastructure #7 Nimbus, Eucalyptus (Stephen, Sonali, Shakeela) Hypervisor/ Virtualizatio n Dryad/DryadLINQ #1 Matrix Multiplication (Swapnil,Amit,Pradnay) #2 PhyloD (Ratul,Adrija,Chengming) Dryad/DryadLINQ #1 Matrix Multiplication (Swapnil,Amit,Pradnay) #2 PhyloD (Ratul,Adrija,Chengming) Higher Level Languages Term Projects (Slide courtesy of Judy Qiu)

University of Arkansas Indiana University University of California at Los Angeles Penn State Iowa State Univ.Illinois at Chicago University of Minnesota Michigan State Notre Dame University of Texas at El Paso IBM Almaden Research Center Washington University San Diego Supercomputer Center University of Florida Johns Hopkins July 26-30, 2010 NCSA Summer School Workshop Students (200 on sites from 10 institutes; 100 online) IU MapReduce and UF Virtual Appliance technologies are supported by FutureGrid. (Slide courtesy of Judy Qiu) Big Data for Science

16 Uploading and sharing images APIs available to upload images, customize, save, and share images Community education pages are available FutureGrid Web portal allows users to publish their own content Tutorials, presentations on Web portal; VMs on image repositories

Discussion starters At a high-level, does FutureGrid provide capabilities that are aligned with your expectations for an CI useful to hosting educational activities? What would you need to evaluate whether using FutureGrid is a good idea? 17

Cyberinfrastructure What cyber-infrastructure features does FutureGrid have that may help you with educational needs? What cyber-infrastructure features may be missing which would be important? 18

Educational materials What documentation do educators, students need to use this infrastructure? What formats are effective? Tutorials, manuals, videos? 19

Topics IaaS cloud computing? Map/reduce? HPC? Grid? Applications? 20

Outreach, Community-building What information may be missing, unclear, hindering ability to reach out? Ways to attract/foster a community that will share and benefit from sharing educational materials? Within and beyond XSEDE 21

22 Questions? More information: This document was developed with support from the National Science Foundation (NSF) under Grant No to Indiana University for "FutureGrid: An Experimental, High-Performance Grid Test-bed." Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the NSF