Virtualization, Cloud Computing, and TeraGrid Kate Keahey (University of Chicago, ANL) Marlon Pierce (Indiana University)

Slides:



Advertisements
Similar presentations
First Steps in the Clouds
Advertisements

Open Science Grid Living on the Edge: OSG Edge Services Framework Kate Keahey Abhishek Rana.
A Scalable Approach to Deploying and Managing Appliances Kate Keahey Rick Bradshaw, Narayan Desai, Tim Freeman Argonne National Lab, University of Chicago.
Virtual Appliances for Scientific Applications Kate Keahey Argonne National Laboratory University of Chicago.
Wei Lu 1, Kate Keahey 2, Tim Freeman 2, Frank Siebenlist 2 1 Indiana University, 2 Argonne National Lab
Cloud Computing and Virtualization with Globus Oakland, May 2008 Kate Keahey Tim Freeman University of Chicago.
Sponsors and Acknowledgments This work is supported in part by the National Science Foundation under Grants No. OCI , IIP and CNS
Science Clouds: Early Experiences in Cloud Computing for Scientific Applications Chicago, October 2008 Kate Keahey, Renato Figueiredo, Jose Fortes, Tim.
Globus Virtual Workspaces An Update SC 2007, Reno, NV Kate Keahey Argonne National Laboratory University of Chicago
Nimbus or an Open Source Cloud Platform or the Best Open Source EC2 No Money Can Buy ;-) Kate Keahey Tim Freeman University of Chicago.
Cloud Computing with Nimbus FNAL, January 2009 Kate Keahey University of Chicago Argonne National Laboratory.
Globus Virtual Workspaces HEPiX Fall 2007, St Louis Kate Keahey Argonne National Laboratory University of Chicago
Infrastructure-as-a-Service Cloud Computing for Science
University of St Andrews School of Computer Science Experiences with a Private Cloud St Andrews Cloud Computing co-laboratory James W. Smith Ali Khajeh-Hosseini.
1/16 Distributed Systems Architecture Research Group Universidad Complutense de Madrid An Introduction to Virtualization and Cloud Technologies to Support.
1 Cloud Computing Prof. Ravi Sandhu Executive Director and Endowed Chair April 12, © Ravi Sandhu World-Leading.
The Case for Enterprise Ready Virtual Private Clouds Timothy Wood, Alexandre Gerber *, K.K. Ramakrishnan *, Jacobus van der Merwe *, and Prashant Shenoy.
Obstacles Security Culture Cloud Cloud Computing will be the primary delivery model, the big question is how fast is going to get there. The cost is the.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 4.
M.A.Doman Model for enabling the delivery of computing as a SERVICE.
Architecture overview 6/03/12 F. Desprez - ISC Cloud Context : Development of a toolbox for deploying application services providers with a hierarchical.
Kate Keahey Argonne National Laboratory University of Chicago Globus Toolkit® 4: from common Grid protocols to virtualization.
Cloud Computing Why is it called the cloud?.
What Happens When Cloud Computing Meets HPC Dr. Dan Fraser Director, CDIGS (Community Driven Improvement of Globus Software)
TG RoundTable, Purdue RP Update October 11, 2008 Carol Song Purdue RP PI Rosen Center for Advanced Computing.
Ocean Observatories Initiative Common Execution Environment Kate Keahey OOI Cyberinfrastructure Life Cycle Objectives Milestone Review, Release 1 San Diego,
1/8 Enhancing Grid Infrastructures with Virtualization and Cloud Technologies Ignacio M. Llorente Business Workshop EGEE’09 September 21st, 2009 Distributed.
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Purdue RP Highlights TeraGrid Round Table September 23, 2010 Carol Song Purdue TeraGrid RP PI Rosen Center for Advanced Computing Purdue University.
+ CS 325: CS Hardware and Software Organization and Architecture Cloud Architectures.
DISTRIBUTED COMPUTING
M.A.Doman Short video intro Model for enabling the delivery of computing as a SERVICE.
Grid Appliance – On the Design of Self-Organizing, Decentralized Grids David Wolinsky, Arjun Prakash, and Renato Figueiredo ACIS Lab at the University.
Presented by: Sanketh Beerabbi University of Central Florida COP Cloud Computing.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks C. Loomis (CNRS/LAL) M.-E. Bégin (SixSq.
Grids, Clouds and the Community. Cloud Technology and the NGS Steve Thorn Edinburgh University Matteo Turilli, Oxford University Presented by David Fergusson.
Large Scale Sky Computing Applications with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes – Bretagne Atlantique Rennes, France
Globus Virtual Workspaces OOI Cyberinfrastructure Design Meeting, San Diego, October Kate Keahey University of Chicago Argonne National Laboratory.
Magellan: Experiences from a Science Cloud Lavanya Ramakrishnan.
What is the cloud ? IT as a service Cloud allows access to services without user technical knowledge or control of supporting infrastructure Best described.
Virtual Workspaces Kate Keahey Argonne National Laboratory.
Ian Gable University of Victoria 1 Deploying HEP Applications Using Xen and Globus Virtual Workspaces A. Agarwal, A. Charbonneau, R. Desmarais, R. Enge,
The Eucalyptus Open-source Cloud Computing System Daniel Nurmi Rich Wolski, Chris Grzegorczyk, Graziano Obertelli, Sunil Soman, Lamia Youseff, Dmitrii.
| nectar.org.au NECTAR TRAINING Module 1 Overview of cloud computing and NeCTAR services.
7. Grid Computing Systems and Resource Management
Chapter 8 – Cloud Computing
Web Technologies Lecture 13 Introduction to cloud computing.
© Copyright AARNet Pty Ltd PRAGMA Update & some personal observations James Sankar Network Engineer - Middleware.
Microsoft Cloud Solution.  What is the cloud?  Windows Azure  What services does it offer?  How does it all work?  How to go about using it  Further.
1/23 Distributed Systems Architecture Research Group Universidad Complutense de Madrid Nuevos modelos de provisión de recursos para infrestructuras GRID:
Purdue RP Highlights TeraGrid Round Table May 20, 2010 Preston Smith Manager - HPC Grid Systems Rosen Center for Advanced Computing Purdue University.
© 2012 Eucalyptus Systems, Inc. Cloud Computing Introduction Eucalyptus Education Services 2.
Information Initiative Center, Hokkaido University North 11, West 5, Sapporo , Japan Tel, Fax: General.
Nimbus Update March 2010 OSG All Hands Meeting Kate Keahey Nimbus Project University of Chicago Argonne National Laboratory.
Building on virtualization capabilities for ExTENCI Carol Song and Preston Smith Rosen Center for Advanced Computing Purdue University ExTENCI Kickoff.
Workspace Management Services Kate Keahey Argonne National Laboratory.
New Paradigms: Clouds, Virtualization and Co.
Grid and Cloud Computing
Cloud Technology and the NGS Steve Thorn Edinburgh University (Matteo Turilli, Oxford University)‏ Presented by David Fergusson.
Cloud Computing with Nimbus
Cloud Computing with Nimbus
Sky Computing on FutureGrid and Grid’5000
Virtualization, Cloud Computing, and TeraGrid
Cloud Computing: Concepts
Fundamental Concepts and Models
Using and Building Infrastructure Clouds for Science
Sky Computing on FutureGrid and Grid’5000
Presentation transcript:

Virtualization, Cloud Computing, and TeraGrid Kate Keahey (University of Chicago, ANL) Marlon Pierce (Indiana University)

Virtual Workspaces: http//workspace.globus.org Virtualization and Cloud Computing l The Virtues of Virtualization u Portable environments, enforcement and isolation, fast to deploy, suspend/resume, migration… l Cloud computing: a nebulous concept… u SaaS: software as a service u Service: provide me with a workspace u Virtualization makes it easy to provide a workspace/VM l Cloud computing u resource leasing, utility computing, elastic computing u Amazons Elastic Compute Cloud (EC2) l Is this real? Or is this just a proof-of-concept? u Successfully used commercially on a large scale u More experience for scientific applications

Virtual Workspaces: http//workspace.globus.org What is a Cloud? Two major types of cloud (at least) l Compute and Data Cloud u EC2, Google Map Reduce, Science clouds u Provision platform for running science codes u Open source infrastructure: workspace, eucalyptus, hub0 u Virtualization: providing environments as VMs l Hosting Cloud u GoogleApp Engine u Highly-available, fault tolerance, robustness, etc for Web capabilities u Community example: IU hosting environment (quarry)

Virtual Workspaces: http//workspace.globus.org The Science Clouds: A Case Study l Objectives: u Make it easy for scientific projects to experiment with cloud computing l You too can run on the cloud! (we can give you cycles) l You too can be a cloud provider! (we can give you open source software) u Evolve software in response to the needs of scientific projects Start with EC2 - Refine SLAs - One-click virtual clusters (contextualization) - Lower adoption barriers - Miscellaneous useful new features

Virtual Workspaces: http//workspace.globus.org The Science Clouds l Powered by workspace tools l EC2-like interfaces (PKI credential vs credit card) l More clouds on the way l Stratus University of Florida 16x4 nodes Nimbus University of Chicago 16x2 nodes Public IPs Private IPs (via VPN)

Virtual Workspaces: http//workspace.globus.org Who Runs on the Science Clouds? l Nimbus utilization breakdown since March 4th l ~30 DNs (a DN represents a community)

Virtual Workspaces: http//workspace.globus.org STAR l Motivation for STAR u Resources with the right configuration are hard to find l Complex environments: correct versions of operating systems, libraries, tools, etc all have to be installed. l Require validation l Virtual Workspace: an OSG STAR cluster u OSG cluster l OSG CE (headnode), gridmapfiles, host certificates, NSF, PBS u STAR worker nodes: SL4 + STAR conf l Requirements u One-click virtual clusters u Migration: nimbus/scientific resources -> EC2

Virtual Workspaces: http//workspace.globus.org STAR (cntd) l From proof-of-concept to production runs u ~2 years ago: proof-of-concept u Last September: EC2 runs of up to 100 nodes (production scale) u Testing for full production deployment l Performance u Within 10% of expected performance for applications l Work by Jerome Lauret, Doug Olson, Leve Hajdu, Lidia Didenko l Long-lived community of many l Similar work for other HEP communities (Alice and Atlas), bioinformatics, geofest, and others

Virtual Workspaces: http//workspace.globus.org Virtual Network Overlays l Motivation u CS research: investigate latency-sensitive apps l Virtual workspace: ViNE router + app VM l Requirements: access to distributed resources l First steps in creating a federated cloud l Work by Mauricio Tsugawa, Andrea Matsunaga, Jose Fortes and others l Medium-lived community of a few Stratus Nimbus ViNE router ViNE router

Virtual Workspaces: http//workspace.globus.org Scalability Testing l Motivation u Test scalability of various Globus components u Test on a different platform l Workspaces u Globus others l Requirements u very short-term but flexible access to diverse platforms l Work by various members of the Globus Toolkit (Tom Howe and John Bresnahan) l Typically very short-lived communities of one

Virtual Workspaces: http//workspace.globus.org Resource Providers: Scientific computing providers: Science Clouds Commercial providers: EC2 Grid Providers? Users, Communities, Providers Appliance Providers: All communities large and small commercial and open marketplaces Appliance management software available Appliance Deployment: appliances -> leased compute resources Coordinating creation of virtual resources Software layers: an evolving middleware for clouds

Why isnt TeraGrid like this? (science cloud user)

Are there any benefits of this approach that would be relevant to you as a user? l What do you hate about supercomputers? l What would convince you to go to the hassle of providing a VM image for your community and giving it a shot? l What problems does it solve? l What problems does it create? l (Are we overall in the black on that?)

Virtual Workspaces: http//workspace.globus.org Are there any benefits of this approach that would be relevant to you as a provider? l What would have to happen to convince you to provide a part of your resource to the user community as a VM-serving platform? l What problems does it solve? l What problems does it create? l That balance sheet again?

Virtual Workspaces: http//workspace.globus.org What Should We Do? l Establish interest group to coordinate and communicate TG activities? u Evaluate existing software l What are the gaps? l What are the best solutions for various problems? u What are the problems? l Overhead is really an issue? l What about security? l How do you deal with big data? l What are interesting projects that we can do?