Presentation is loading. Please wait.

Presentation is loading. Please wait.

What Happens When Cloud Computing Meets HPC Dr. Dan Fraser Director, CDIGS (Community Driven Improvement of Globus Software)

Similar presentations


Presentation on theme: "What Happens When Cloud Computing Meets HPC Dr. Dan Fraser Director, CDIGS (Community Driven Improvement of Globus Software)"— Presentation transcript:

1 What Happens When Cloud Computing Meets HPC Dr. Dan Fraser Director, CDIGS (Community Driven Improvement of Globus Software) http://www.cdigs.org

2 2 Outline l Intro to Cloud Computing and Concepts l Cloud Computing’s Impact on HPC l A Brief Look at Grid, Globus, and Clouds u Globus Incubator Program u Open Source EC2-like Capability l Impact and Opportunity for Supercomputing Centers l Dan’s Head in the Clouds

3 3 “Cloud” Computing is ~1 yr old Michael Sheehan’s GoGrid Blog, July 25, 2008 http://linux.sys-con.com/node/587717

4 4 Sorting out the Pieces Cloud SaaS Grid Cluster Utility Computing SaaS = Software as a Service ? ? ? Globus

5 5 One can categorize each component Cloud SaaS Grid Cluster Utility Computing SaaS = Software as a Service Globus Usage Model Infrastructure BUT…

6 6 Clouds can have any/all of these SaaS Grid Cluster Utility Computing And the descriptions often overlap ! Globus

7 7 What makes a Cloud? l Virtual Machines l VM Manager (Amazon EC2, …) u Scalability l File system Infrastructure l Remote access (portal) l Cost? u One reason the EC2 is successful is because of the low cost for cpu/data movement. l Security? Key Parts of Cloud Definition

8 8 Where is the value? l Much of the value is in the Virtual Machines l What are VMs used for? u Server Consolidation (Fermilab) u Disaster recovery (commercial) u Component Isolation (sandboxing) u Hardware Independence (any OS on any Box) u Cluster Computing l E.g. Deploy a classroom environment l E.g. Deploy a multi-use cluster with ROCKS l Adding VM Management takes this to the “clouds” u Access resources on-demand u Isolate Users from each other u Schedule VM usage

9 9 Where is the HPC value? l Much of the value is in the Virtual Machines l What are VMs used for? u Server Consolidation (Fermilab) u Disaster recovery (commercial) u Component Isolation (sandboxing) u Hardware Independence (any OS on any Box) u Cluster Computing l E.g. Deploy a classroom environment l E.g. Deploy a multi-use cluster with ROCKS l Adding VM Management takes this to the “clouds” u Access resources on-demand u Isolate Users from each other u Schedule VM usage √ x x x √ √ √ √

10 10 What is a Grid? Enable “coordinated resource sharing & problem solving in dynamic, multi- institutional virtual organizations.” (Source: “The Anatomy of the Grid”)

11 11 What does Globus do? l Globus provides a u Secure… u Uniform Remote Job Submission Interface… u Plus numerous capabilities that make the environment “useful.” l Data movement, Job monitoring, Service discovery, Security credential mgmt, Uniform data interfaces, … l Many Globus components can be used as stand-alone software products l GridFTP, RLS, Index service, MyProxy

12 12 Creating a Useful Environment Uniform interfaces, security mechanisms, Web service transport, monitoring ComputersStorage Specialized resource User Application GRAMGridFTP Host Env User Svc DAIS Database Tool Workflow Credent. Host Env User Svc Registry

13 13 Grid Communication Protocol Service Description Service Workflow Service Registry Security Metadata Management Resource Management Functions Management ID Resolution Transport Cancer Biomedical Informatics Grid Schema Management GSI CAS myProxy Globus OGSA-DAI GlobusGRAM Globus Toolkit Globus BPEL Mobius caCORE Slide credit: Peter Covitz, National Institutes of Health Spans 60 NIH cancer centers across the U.S.

14 14 Incubator Projects Globus Software: dev.globus.org Security Execution Mgmt Info Services Common Runtime Globus Projects Other MPICH G2 GridWay Data Mgmt Incubation Mgmt GridFTP Reliable File Transfer OGSA-DAI GRAM MDS4 CAS Data Rep Delegation Replica Location Java Runtime C Runtime Python Runtime GT4 C SecGT4 Docs GSI- OpenSSH MyProxy

15 15 Incubator Projects l Contributed from teams around the world u Must utilize a “Globus” open source License u Code can be sold, used by others, adapted… l Each project has its own “Committers” u Committers govern the project l Globus Provides Infrastructure & Oversight u Project site, e-mail lists, some publicity u Overall project approval, & follow-up l You can add your Incubator: u http://dev.globus.org/

16 16 Incubator Projects Globus Software: dev.globus.org Security Execution Mgmt Info Services Common Runtime Globus Projects Other MPICH G2 GridWay Data Mgmt Incubation Mgmt Cog WF LRMA GAARDS OGROGDTEUGP HOC-SAPURSE GridShib Introduce Dyn Acct WEEP Gavia JSC Gavia MS DDM Virt WkSp SGGC Metrics ServMark GridFTP Reliable File Transfer OGSA-DAI GRAM MDS4 CAS Data Rep Delegation Replica Location Java Runtime C Runtime Python Runtime GT4 C SecGT4 Docs MEDICUS GSI- OpenSSH MyProxy SwiftMonMan NetLogger GEMLCARAVI

17 17 Globus & Cloud Computing l Virtual Workspaces is a Globus Incubator l An Open Source EC2-like Management System u You can run on the cloud u You can even build your own cloud

18 18 http//workspace.globus.org Science Clouds l Powered by workspace tools l EC2-like interfaces (PKI credential vs credit card) l More clouds on the way l http://workspace.globus.org/clouds “Stratus” University of Florida 16x4 nodes “Nimbus” University of Chicago 16x2 nodes Public IPs Private IPs (via VPN)

19 19 Who Runs on the Science Clouds? l Nimbus utilization breakdown since March 4th l ~30 Communities http//workspace.globus.org

20 20 Interacting With Workspaces Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node (1) The workspace service allows users to deploy and manage workspaces on a pool of nodes through a WSRF interface (3) Information on each workspace is published as WSRF Resource Properties ao that users can find out information about their workspace (e.g. what IP the workspace was bound to) or subscribe to notifications on changes VWS Service (2) Each pool node requires a VMM and a lightweight management script http//workspace.globus.org

21 21 STAR l Motivation for STAR u Resources with the right configuration are hard to find l Complex environments: correct versions of operating systems, libraries, tools, etc all have to be installed. l Require validation l Virtual Workspace: an OSG STAR cluster u OSG cluster l OSG CE (headnode), gridmapfiles, host certificates, NSF, PBS u STAR worker nodes: SL4 + STAR conf l Requirements u One-click virtual clusters u Migration: nimbus/scientific resources -> EC2 http//workspace.globus.org

22 22 STAR (cont.) l From proof-of-concept to production runs u ~2 years ago: proof-of-concept u Last September: EC2 runs of up to 100 nodes (production scale) u Testing for full production deployment l Performance u Within 10% of expected performance for applications l Work by Jerome Lauret, Doug Olson, Leve Hajdu, Lidia Didenko l Long-lived community of many l Similar work for other HEP communities (Alice and Atlas), bioinformatics, geofest, and others http//workspace.globus.org

23 23 The Supercomputing Center “Threat” l Grid computing provides uniform access to computational resources u Computational resources become commodities u Supercomputing Centers offer a variety of applications, libraries, and support l Cloud Computing Makes Use of Virtual Machines where applications, libraries and dependencies can be hidden u Supercomputing Centers can become commodities in themselves l Ok so “threat” may be a bit overstated u Problems don’t go away quite so easily (shell game) u But shake-outs can/do happen along the way…

24 24 The Opportunity l Be the Supercomputing Center that enables cloud computing! u (Gradually) turn the center into a big cloud l Today’s clouds have only ~16 VMs u Conduct Research in VMs, VM Management, and VM Maintenance u Develop Tools to make Cloud Computing accessible to the scientists u Become the center of HPC Cloud expertise

25 25 So what happens when HPC meets Cloud computing? We don’t really know because the possibilities are just now emerging!

26 26 Dan’s Head in the Clouds l What if… scientists could: u Download and use a VM that would make it easy to parallelize their application; u And test it in parallel right on their laptop. l What if… scientists could: u Run a converter to change one VM type to another; u Or enable a VM created at one center to automatically run other places even though the infrastructure may be different (VMWare, Xen, RPATH, …) l What if… scientists could: u Select applications and components from a list; u Select some of their own applications; u Push a button to create a cluster-ready VM image; u Then push another button to automatically deploy them. l And the list goes on …

27 27 Conclusion l HPC cloud computing is an emerging technology l There are big opportunities for leadership to develop in this space. l Using VMs is only the beginning. There must also be collections of tools for managing and maintaining VMs …


Download ppt "What Happens When Cloud Computing Meets HPC Dr. Dan Fraser Director, CDIGS (Community Driven Improvement of Globus Software)"

Similar presentations


Ads by Google