What Happens When Cloud Computing Meets HPC Dr. Dan Fraser Director, CDIGS (Community Driven Improvement of Globus Software)

Slides:



Advertisements
Similar presentations
Virtualization, Cloud Computing, and TeraGrid Kate Keahey (University of Chicago, ANL) Marlon Pierce (Indiana University)
Advertisements

Wei Lu 1, Kate Keahey 2, Tim Freeman 2, Frank Siebenlist 2 1 Indiana University, 2 Argonne National Lab
Science Clouds: Early Experiences in Cloud Computing for Scientific Applications Chicago, October 2008 Kate Keahey, Renato Figueiredo, Jose Fortes, Tim.
Nimbus or an Open Source Cloud Platform or the Best Open Source EC2 No Money Can Buy ;-) Kate Keahey Tim Freeman University of Chicago.
The future of Globus (Grid meets Cloud) Ian Foster Computation Institute University of Chicago & Argonne National Laboratory.
Virtual Machine Technology Dr. Gregor von Laszewski Dr. Lizhe Wang.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
Seminar Grid Computing ‘05 Hui Li Sep 19, Overview Brief Introduction Presentations Projects Remarks.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Globus Toolkit 4 hands-on Gergely Sipos, Gábor Kecskeméti MTA SZTAKI
Seminar Grid Computing ‘06 Hui Li Sep 18, Overview Brief Introduction Presentations –Architecture –Functionality/Middleware –Applications Projects.
1-2.1 Grid computing infrastructure software Brief introduction to Globus © 2010 B. Wilkinson/Clayton Ferner. Spring 2010 Grid computing course. Modification.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
4b.1 Grid Computing Software Components of Globus 4.0 ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson, slides 4b.
GT4 Introductory and Advanced Practicals Rachana Ananthakrishnan, Charles Bacon, Lisa Childers Argonne National Laboratory University of Chicago.
Globus Computing Infrustructure Software Globus Toolkit 11-2.
1 Globus Developments Malcolm Atkinson for OMII SC 18 th January 2005.
Globus 4 Guy Warner NeSC Training.
Kate Keahey Argonne National Laboratory University of Chicago Globus Toolkit® 4: from common Grid protocols to virtualization.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
Virtual Infrastructure in the Grid Kate Keahey Argonne National Laboratory.
OSG End User Tools Overview OSG Grid school – March 19, 2009 Marco Mambelli - University of Chicago A brief summary about the system.
Virtual Machine Hosting for Networked Clusters: Building the Foundations for “Autonomic” Orchestration Based on paper by Laura Grit, David Irwin, Aydan.
A Swift Talk about Globus Technology: What Can It Do for Me? OOI Cyberinfrastructure Design Meeting, San Diego, October The Globus Team (presented.
OPEN GRID SERVICES ARCHITECTURE AND GLOBUS TOOLKIT 4
DISTRIBUTED COMPUTING
M.A.Doman Short video intro Model for enabling the delivery of computing as a SERVICE.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Presented by: Sanketh Beerabbi University of Central Florida COP Cloud Computing.
Grid Security Issues Shelestov Andrii Space Research Institute NASU-NSAU, Ukraine.
Ian Foster on behalf of the Globus Alliance Computation Institute Argonne National Lab & University of Chicago Globus: State of the Union.
Through the development of advanced middleware, Grid computing has evolved to a mature technology in which scientists and researchers can leverage to gain.
Grids, Clouds and the Community. Cloud Technology and the NGS Steve Thorn Edinburgh University Matteo Turilli, Oxford University Presented by David Fergusson.
Globus Virtual Workspaces OOI Cyberinfrastructure Design Meeting, San Diego, October Kate Keahey University of Chicago Argonne National Laboratory.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
Introduction to Grid Computing Ed Seidel Max Planck Institute for Gravitational Physics
Tools for collaboration How to share your duck tales…
Virtual Workspaces Kate Keahey Argonne National Laboratory.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University.
Turning Software Projects into Production Solutions Dan Fraser, PhD Production Coordinator Open Science Grid OU Supercomputing Symposium October 2009.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Grid Security: Authentication Most Grids rely on a Public Key Infrastructure system for issuing credentials. Users are issued long term public and private.
GRIDS Center Middleware Overview Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
EVGM081 Multi-Site Virtual Cluster: A User-Oriented, Distributed Deployment and Management Mechanism for Grid Computing Environments Takahiro Hirofuchi,
CEOS Working Group on Information Systems and Services - 1 Data Services Task Team Discussions on GRID and GRIDftp Stuart Doescher, USGS WGISS-15 May 2003.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
Cole David Ronnie Julio. Introduction Globus is A community of users and developers who collaborate on the use and development of open source software,
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
Portal Update Plan Ashok Adiga (512)
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
Alain Roy Computer Sciences Department University of Wisconsin-Madison Condor & Middleware: NMI & VDT.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
Securing the Grid & other Middleware Challenges Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
1/23 Distributed Systems Architecture Research Group Universidad Complutense de Madrid Nuevos modelos de provisión de recursos para infrestructuras GRID:
Status of Globus activities Massimo Sgaravatto INFN Padova for the INFN Globus group
Globus Security: Features and Roadmap & Building Secure VOs using Globus Toolkit Frank Siebenlist Rachana Ananthakrishnan Computation Institute, University.
Holding slide prior to starting show. Lessons Learned from the GECEM Portal David Walker Cardiff University
DataGrid is a project funded by the European Commission EDG Conference, Heidelberg, Sep 26 – Oct under contract IST OGSI and GT3 Initial.
Nimbus Update March 2010 OSG All Hands Meeting Kate Keahey Nimbus Project University of Chicago Argonne National Laboratory.
Building on virtualization capabilities for ExTENCI Carol Song and Preston Smith Rosen Center for Advanced Computing Purdue University ExTENCI Kickoff.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Windows 2012R2 Hyper-V and System Center 2012
New Paradigms: Clouds, Virtualization and Co.
Viet Tran Institute of Informatics Slovakia
Cloud Computing with Nimbus
Virtualization, Cloud Computing, and TeraGrid
Presentation transcript:

What Happens When Cloud Computing Meets HPC Dr. Dan Fraser Director, CDIGS (Community Driven Improvement of Globus Software)

2 Outline l Intro to Cloud Computing and Concepts l Cloud Computing’s Impact on HPC l A Brief Look at Grid, Globus, and Clouds u Globus Incubator Program u Open Source EC2-like Capability l Impact and Opportunity for Supercomputing Centers l Dan’s Head in the Clouds

3 “Cloud” Computing is ~1 yr old Michael Sheehan’s GoGrid Blog, July 25,

4 Sorting out the Pieces Cloud SaaS Grid Cluster Utility Computing SaaS = Software as a Service ? ? ? Globus

5 One can categorize each component Cloud SaaS Grid Cluster Utility Computing SaaS = Software as a Service Globus Usage Model Infrastructure BUT…

6 Clouds can have any/all of these SaaS Grid Cluster Utility Computing And the descriptions often overlap ! Globus

7 What makes a Cloud? l Virtual Machines l VM Manager (Amazon EC2, …) u Scalability l File system Infrastructure l Remote access (portal) l Cost? u One reason the EC2 is successful is because of the low cost for cpu/data movement. l Security? Key Parts of Cloud Definition

8 Where is the value? l Much of the value is in the Virtual Machines l What are VMs used for? u Server Consolidation (Fermilab) u Disaster recovery (commercial) u Component Isolation (sandboxing) u Hardware Independence (any OS on any Box) u Cluster Computing l E.g. Deploy a classroom environment l E.g. Deploy a multi-use cluster with ROCKS l Adding VM Management takes this to the “clouds” u Access resources on-demand u Isolate Users from each other u Schedule VM usage

9 Where is the HPC value? l Much of the value is in the Virtual Machines l What are VMs used for? u Server Consolidation (Fermilab) u Disaster recovery (commercial) u Component Isolation (sandboxing) u Hardware Independence (any OS on any Box) u Cluster Computing l E.g. Deploy a classroom environment l E.g. Deploy a multi-use cluster with ROCKS l Adding VM Management takes this to the “clouds” u Access resources on-demand u Isolate Users from each other u Schedule VM usage √ x x x √ √ √ √

10 What is a Grid? Enable “coordinated resource sharing & problem solving in dynamic, multi- institutional virtual organizations.” (Source: “The Anatomy of the Grid”)

11 What does Globus do? l Globus provides a u Secure… u Uniform Remote Job Submission Interface… u Plus numerous capabilities that make the environment “useful.” l Data movement, Job monitoring, Service discovery, Security credential mgmt, Uniform data interfaces, … l Many Globus components can be used as stand-alone software products l GridFTP, RLS, Index service, MyProxy

12 Creating a Useful Environment Uniform interfaces, security mechanisms, Web service transport, monitoring ComputersStorage Specialized resource User Application GRAMGridFTP Host Env User Svc DAIS Database Tool Workflow Credent. Host Env User Svc Registry

13 Grid Communication Protocol Service Description Service Workflow Service Registry Security Metadata Management Resource Management Functions Management ID Resolution Transport Cancer Biomedical Informatics Grid Schema Management GSI CAS myProxy Globus OGSA-DAI GlobusGRAM Globus Toolkit Globus BPEL Mobius caCORE Slide credit: Peter Covitz, National Institutes of Health Spans 60 NIH cancer centers across the U.S.

14 Incubator Projects Globus Software: dev.globus.org Security Execution Mgmt Info Services Common Runtime Globus Projects Other MPICH G2 GridWay Data Mgmt Incubation Mgmt GridFTP Reliable File Transfer OGSA-DAI GRAM MDS4 CAS Data Rep Delegation Replica Location Java Runtime C Runtime Python Runtime GT4 C SecGT4 Docs GSI- OpenSSH MyProxy

15 Incubator Projects l Contributed from teams around the world u Must utilize a “Globus” open source License u Code can be sold, used by others, adapted… l Each project has its own “Committers” u Committers govern the project l Globus Provides Infrastructure & Oversight u Project site, lists, some publicity u Overall project approval, & follow-up l You can add your Incubator: u

16 Incubator Projects Globus Software: dev.globus.org Security Execution Mgmt Info Services Common Runtime Globus Projects Other MPICH G2 GridWay Data Mgmt Incubation Mgmt Cog WF LRMA GAARDS OGROGDTEUGP HOC-SAPURSE GridShib Introduce Dyn Acct WEEP Gavia JSC Gavia MS DDM Virt WkSp SGGC Metrics ServMark GridFTP Reliable File Transfer OGSA-DAI GRAM MDS4 CAS Data Rep Delegation Replica Location Java Runtime C Runtime Python Runtime GT4 C SecGT4 Docs MEDICUS GSI- OpenSSH MyProxy SwiftMonMan NetLogger GEMLCARAVI

17 Globus & Cloud Computing l Virtual Workspaces is a Globus Incubator l An Open Source EC2-like Management System u You can run on the cloud u You can even build your own cloud

18 http//workspace.globus.org Science Clouds l Powered by workspace tools l EC2-like interfaces (PKI credential vs credit card) l More clouds on the way l “Stratus” University of Florida 16x4 nodes “Nimbus” University of Chicago 16x2 nodes Public IPs Private IPs (via VPN)

19 Who Runs on the Science Clouds? l Nimbus utilization breakdown since March 4th l ~30 Communities http//workspace.globus.org

20 Interacting With Workspaces Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node (1) The workspace service allows users to deploy and manage workspaces on a pool of nodes through a WSRF interface (3) Information on each workspace is published as WSRF Resource Properties ao that users can find out information about their workspace (e.g. what IP the workspace was bound to) or subscribe to notifications on changes VWS Service (2) Each pool node requires a VMM and a lightweight management script http//workspace.globus.org

21 STAR l Motivation for STAR u Resources with the right configuration are hard to find l Complex environments: correct versions of operating systems, libraries, tools, etc all have to be installed. l Require validation l Virtual Workspace: an OSG STAR cluster u OSG cluster l OSG CE (headnode), gridmapfiles, host certificates, NSF, PBS u STAR worker nodes: SL4 + STAR conf l Requirements u One-click virtual clusters u Migration: nimbus/scientific resources -> EC2 http//workspace.globus.org

22 STAR (cont.) l From proof-of-concept to production runs u ~2 years ago: proof-of-concept u Last September: EC2 runs of up to 100 nodes (production scale) u Testing for full production deployment l Performance u Within 10% of expected performance for applications l Work by Jerome Lauret, Doug Olson, Leve Hajdu, Lidia Didenko l Long-lived community of many l Similar work for other HEP communities (Alice and Atlas), bioinformatics, geofest, and others http//workspace.globus.org

23 The Supercomputing Center “Threat” l Grid computing provides uniform access to computational resources u Computational resources become commodities u Supercomputing Centers offer a variety of applications, libraries, and support l Cloud Computing Makes Use of Virtual Machines where applications, libraries and dependencies can be hidden u Supercomputing Centers can become commodities in themselves l Ok so “threat” may be a bit overstated u Problems don’t go away quite so easily (shell game) u But shake-outs can/do happen along the way…

24 The Opportunity l Be the Supercomputing Center that enables cloud computing! u (Gradually) turn the center into a big cloud l Today’s clouds have only ~16 VMs u Conduct Research in VMs, VM Management, and VM Maintenance u Develop Tools to make Cloud Computing accessible to the scientists u Become the center of HPC Cloud expertise

25 So what happens when HPC meets Cloud computing? We don’t really know because the possibilities are just now emerging!

26 Dan’s Head in the Clouds l What if… scientists could: u Download and use a VM that would make it easy to parallelize their application; u And test it in parallel right on their laptop. l What if… scientists could: u Run a converter to change one VM type to another; u Or enable a VM created at one center to automatically run other places even though the infrastructure may be different (VMWare, Xen, RPATH, …) l What if… scientists could: u Select applications and components from a list; u Select some of their own applications; u Push a button to create a cluster-ready VM image; u Then push another button to automatically deploy them. l And the list goes on …

27 Conclusion l HPC cloud computing is an emerging technology l There are big opportunities for leadership to develop in this space. l Using VMs is only the beginning. There must also be collections of tools for managing and maintaining VMs …