Presentation is loading. Please wait.

Presentation is loading. Please wait.

Grid Computing Anda Iamnitchi Federated Distributed Systems, Fall ‘06 Including slides adapted from presentations by Ian Foster, Lee Liming, Paul Jeffreys.

Similar presentations


Presentation on theme: "Grid Computing Anda Iamnitchi Federated Distributed Systems, Fall ‘06 Including slides adapted from presentations by Ian Foster, Lee Liming, Paul Jeffreys."— Presentation transcript:

1 Grid Computing Anda Iamnitchi Federated Distributed Systems, Fall ‘06 Including slides adapted from presentations by Ian Foster, Lee Liming, Paul Jeffreys

2 Front page FT, 7th March 2000

3

4

5 But…

6 What is the Grid? “ Resource sharing & coordinated problem solving in dynamic, multi- institutional virtual organizations” “When the network is as fast as the computer's internal links, the machine disintegrates across the net into a set of special purpose appliances” (George Gilder) “The Anatomy of the Grid”, Foster, Kesselman, Tuecke, 2001

7 Motivation (1): Revolution in Science Pre-Internet –Theorize &/or experiment, alone or in small teams; publish paper Post-Internet –Construct and mine large databases of observational or simulation data –Develop simulations & analyses –Access specialized devices remotely –Exchange information within distributed multidisciplinary teams

8 Motivation (2): Revolution in Business Pre-Internet –Central data processing facility Post-Internet –Enterprise computing is highly distributed, heterogeneous, inter-enterprise (B2B) –Business processes increasingly computing- & data-rich –Outsourcing becomes feasible => service providers of various sorts

9 The (Power) Grid: On-Demand Access to Electricity Time Quality, economies of scale

10 By Analogy, A Computing Grid Decouple production and consumption –Enable on-demand access –Achieve economies of scale –Enhance consumer flexibility –Enable new devices

11 Not Exactly a New Idea … “The time-sharing computer system can unite a group of investigators …. one can conceive of such a facility as an … intellectual public utility.” –Fernando Corbato and Robert Fano, 1966 “We will perhaps see the spread of ‘computer utilities’, which, like present electric and telephone utilities, will service individual homes and offices across the country.” –Len Kleinrock, 1967

12 But Things are Different Now …

13 Computing isn’t Really Like Electricity I import electricity but must export data “Computing” is not interchangeable but highly heterogeneous: data, sensors, services, … This complicates things; but also means that the sum can be greater than the parts –Real opportunity: Construct new capabilities dynamically from distributed services Raises fundamental questions –Achieving economies of scale –Quality of service across distributed services –Applications that exploit synergies

14 How Can We Tell Hype from Facts? Everyday problem, isn’t it? Learn/verify the facts Know the context –Multi-institutional (== federated) Thus, a cluster? (Sun Grid Engine!!!) –Dynamic (somewhat) Look at results –Research innovation (in computer and computational science) –Scientific discovery –Existing/deployed grids

15 “We must address scale & failure” P2P and Grids: Resource Sharing Across Administrative Domains “We need infrastructure” “On Death, Taxes and the Convergence of P2P and Grids”, Foster, Iamnitchi 2003

16 Compare & Contrast (1): Definitions Grid: P2P: “Infrastructure that provides dependable, consistent, pervasive, and inexpensive access to high-end computational capabilities” (1998) “A system that coordinates resources not subject to centralized control, using open, general-purpose protocols to deliver nontrivial QoS” (2002) “Applications that takes advantage of resources at the edges of the Internet” (2000) “Decentralized, self-organizing distributed systems, in which all or most communication is symmetric” (2002)

17 Compare and Contrast (2): Details of Deployed Systems Target communities and incentives Resources engaged Applications Scale and failure Services and infrastructure

18 Target Communities & Incentives Grid Established communities –Science, some industry –Homogeneous –Restricted participation Good behavior: –Implicit incentives –Means to enforce it Consequences: Trust Well-defined “tax base” Less flexibility? P2P Anonymous individuals No implicit incentives for good behavior Consequences: No trust Free riding Implicit incentives for cheating: Seti@home, music sharing

19 Resources Grid More diverse (in type): –Files, storage, computing power, network, instruments More powerful Good availability Well connected Technical support Consequence: Costly resource integration P2P Computing cycles XOR files Less powerful Intermittent participation –Gnutella: avg. lifetime 1h (‘01) –MojoNation: 1/6 users always on –Overnet: 50% nodes available 70% of time over a week (‘02) Variably connected Some technical support as community effort Consequence: Ease of integration of new resources an early priority

20 Applications Grid Often complex & involving various combinations of –Data manipulation –Computation –Tele-instrumentation Wide range of computational models, e.g. –Embarrassingly || –Tightly coupled –Workflow Consequences: –Complexity often inherent in the application itself –(Inevitably?) Complex infrastructure to support applications P2P Some –File sharing –Number crunching –Content distribution –Measurements “Toy” applications only? –Albeit very popular “toys”! Consequence: –Complexity often derives from scale

21 Scale and Failure Grid Moderate number of entities –100s institutions, 1000s users Large amounts of activity –4.5 TB/day (D0 experiment) Approaches to failure reflect assumptions –E.g., centralized components P2P Large numbers of entities: –Millions of users Moderate activity –E.g., 1-2 TB in Gnutella (’01) Diverse approaches to failure –Some centralized (SETI, …) –Some highly self-configuring FastTrack3,488,719 eDonkey1,661,132 iMesh1,211,965 Overnet1,146,880 MP2P250,927 Gnutella219,009 DirectConnect204,237 (www.slyck.com, January 25, 2004)

22 Grids for Physics: LHC Computing Grid

23 Services and Infrastructure Grid Standard protocols (Global Grid Forum, etc.) De facto standard software (open source Globus Toolkit) Shared infrastructure (authentication, discovery, resource access, etc.) Consequences: Reusable services Large developer & user communities Interoperability & code reuse P2P Each application defines & deploys completely independent “infrastructure” JXTA, BOINC, XtremWeb? Efforts started to define common APIs, albeit with limited scope to date Consequences: New (albeit simple) install per application Interoperability & code reuse not achieved

24 Convergent Environment: Large, Dynamic, Self-Configuring Grids Scale & volatility Functionality & infrastructure Grids P2P Large scale Weaker trust assumptions Ease of integration No centralized authority Intermittent resource/user participation Diversity in: Shared resources Sharing characteristics Variable technical support Infrastructure (sharable services) Support for diverse applications

25 Existing Technologies are Helpful, but Not Complete Solutions Peer-to-peer technologies –Limited scope and mechanisms Enterprise-level distributed computing –Limited cross-organizational support Databases –Vertically integrated solutions Web services –Not dynamic Semantic web –Limited focus

26 What’s Missing is Support for … Sharing & integration of resources, via –Discovery –Provisioning –Access (computation, data, …) –Security –Policy –Fault tolerance –Management In dynamic, scalable, multi-organizational settings

27 Building the Grid Open source software –Globus Toolkit ®, UK OGSA DAI, Condor, … Open standards –OGSA, other GGF, IETF, W3C standards, … Open communities –Global Grid Forum, Globus International, collaborative projects, … Open infrastructure –UK eScience, NSF Cyberinfrastructure, StarLight, AP- Grid, …

28 Globus Toolkit ® History DARPA, NSF, and DOE begin funding Grid work NASA begins funding Grid work, DOE adds support The Grid: Blueprint for a New Computing Infrastructure published GT 1.0.0 Released Early Application Successes Reported NSF & European Commission Initiate Many New Grid Projects Anatomy of the Grid Paper Released Significant Commercial Interest in Grids Physiology of the Grid Paper Released GT 2.0 Released Does not include downloads from: NMI, UK eScience, EU Datagrid, IBM, Platform, etc.

29 How It Started While helping to build/integrate a diverse range of distributed applications, the same problems kept showing up over and over again. –Too hard to keep track of authentication data (ID/password) across institutions –Too hard to monitor system and application status across institutions –Too many ways to submit jobs –Too many ways to store & access files and data –Too many ways to keep track of data –Too easy to leave “dangling” resources lying around (robustness)

30 Forget Homogeneity! Trying to force homogeneity on users is futile. Everyone has their own preferences, sometimes even dogma. The Internet provides the model…

31 What Does the Globus Toolkit Cover? Goal Today

32 Theory -> Practice

33 building a grid (in practice)

34 Methodology Building a Grid system or application is currently an exercise in software integration. –Define user requirements –Derive system requirements or features –Survey existing components –Identify useful components –Develop components to fit into the gaps –Integrate the system –Deploy and test the system –Maintain the system during its operation This should be done iteratively, with many loops and eddys in the flow.

35 How it Really Happens Web Browser Compute Server Data Catalog Data Viewer Tool Certificate authority Chat Tool Credential Repository Web Portal Compute Server Resources implement standard access & management interfaces Collective services aggregate &/or virtualize resources Users work with client applications Application services organize VOs & enable access to other services Database service Database service Database service Simulation Tool Camera Telepresence Monitor Registration Service

36 How it Really Happens (without Globus) Web Browser Compute Server Data Catalog Data Viewer Tool Certificate authority Chat Tool Credential Repository Web Portal Compute Server Resources implement standard access & management interfaces Collective services aggregate &/or virtualize resources Users work with client applications Application services organize VOs & enable access to other services Database service Database service Database service Simulation Tool Camera Telepresence Monitor Registration Service A B C D E Application Developer 10 Off the Shelf12 Globus Toolkit0 Grid Community 0

37 How it Really Happens (with Globus) Web Browser Compute Server Globus MCS/RLS Data Viewer Tool Certificate Authority CHEF Chat Teamlet MyProxy CHEF Compute Server Resources implement standard access & management interfaces Collective services aggregate &/or virtualize resources Users work with client applications Application services organize VOs & enable access to other services Database service Database service Database service Simulation Tool Camera Telepresence Monitor Globus Index Service Globus GRAM Globus DAI Application Developer 2 Off the Shelf9 Globus Toolkit4 Grid Community 4

38 What Is the Globus Toolkit? The Globus Toolkit is a collection of solutions to problems that frequently come up when trying to build collaborative distributed applications. Not turnkey solutions, but building blocks and tools for application developers and system integrators. –Some components (e.g., file transfer) go farther than others (e.g., remote job submission) toward end-user relevance. To date (v1.0 - v4.0), the Toolkit has focused on simplifying heterogeneity for application developers. The goal has been to capitalize on and encourage use of existing standards (IETF, W3C, OASIS, GGF). –The Toolkit also includes reference implementations of new/proposed standards in these organizations.

39 How To Use the Globus Toolkit By itself, the Toolkit has surprisingly limited end user value. –There’s very little user interface material there. –You can’t just give it to end users (scientists, engineers, marketing specialists) and tell them to do something useful! The Globus Toolkit is useful to application developers and system integrators. –You’ll need to have a specific application or system in mind. –You’ll need to have the right expertise. –You’ll need to set up prerequisite hardware/software. –You’ll need to have a plan.

40 Data Management Security Common Runtime Execution Management Information Services Web Services Components Non-WS Components Pre-WS Authentication Authorization GridFTP Grid Resource Allocation Mgmt (Pre-WS GRAM) Monitoring & Discovery System (MDS2) C Common Libraries GT2GT2 WS Authentication Authorization Reliable File Transfer OGSA-DAI [Tech Preview] Grid Resource Allocation Mgmt (WS GRAM) Monitoring & Discovery System (MDS4) Java WS Core Community Authorization Service GT3GT3 Replica Location Service XIO GT3GT3 Credential Management GT4GT4 Python WS Core [contribution] C WS Core Community Scheduler Framework [contribution] Delegation Service GT4GT4 Globus Toolkit Components

41 Increased functionality, standardization Custom solutions 1990199520002005 Open Grid Services Arch Real standards Multiple implementations Web services, etc. Managed shared virtual systems Computer science research Globus Toolkit Defacto standard Single implementation Internet standards The Emergence of Open Grid Standards 2010

42 Grid Communities Global Grid Forum –Standards, information exchange, advocacy –1000+ participants in tri-annual meetings Application communities –E.g., physics, earthquake engineering, biomedical, etc. Software development and support –NSF Middleware Initiative, UK eScience, Globus Toolkit, EGEE, …

43 Grid Communities & Technologies Yesterday –Small, static communities, primarily in science –Focus on sharing of computing resources –Globus Toolkit as technology base Today –Larger communities in science; early industry –Focused on sharing of data and computing –Open Grid Services Architecture Tomorrow –Large, dynamic, diverse communities that share a wide variety of services, resources, data –Challenging computer science research issues

44 Grid Dynamics: Vision vs. Reality Vision: On-demand access to computing –New communities form easily –On-demand resources from providers –Adapt easily to new missions, requirements Reality: Much manual configuration, e.g.: –Manually deployed services on dedicated hardware –Manually maintained access control lists –Sysadmin-maintained allocation policies –Human-mediated resource reservation

45 Reading Sources http://www- fp.mcs.anl.gov/~foster/talks.htm http://www.globus.org/ The Grid Book (other links on the course page)


Download ppt "Grid Computing Anda Iamnitchi Federated Distributed Systems, Fall ‘06 Including slides adapted from presentations by Ian Foster, Lee Liming, Paul Jeffreys."

Similar presentations


Ads by Google