GridLab Resource Management System (GRMS) Jarek Nabrzyski GridLab Project Coordinator Poznań Supercomputing and.


1 GridLab Resource Management System (GRMS) Jarek Nabrzyski GridLab Project Coordinator naber@man.poznan.pl office@gridlab.org Poznań Supercomputing and Networking Center

2 GGF7, Tokyo, March 4-7, 2003 GridLab
- EU-funded project involving 11 European and 3 American partners (Globus and Condor teams), January 2002 – December 2004
- Main goal: to develop a Grid Application Toolkit (GAT) and a set of grid services and tools (resource management (GRMS), data management, monitoring, adaptive components, mobile user support, security services, portals, ...) and to test them on a real testbed with real applications

3 GridLab Members
- PSNC (Poznan) - coordination
- AEI (Potsdam)
- ZIB (Berlin)
- Univ. of Lecce
- Cardiff University
- Vrije Univ. (Amsterdam)
- SZTAKI (Budapest)
- Masaryk Univ. (Brno)
- NTUA (Athens)
- Sun Microsystems
- Compaq (HP)
- ANL (Chicago, I. Foster)
- ISI (LA, C. Kesselman)
- UoWisconsin (M. Livny)
Collaborating with:
- Users! EU Astrophysics Network, DFN TiKSL/GriKSL, NSF ASC Project
- Other Grid projects: Globus, Condor, GrADS, PROGRESS, GriPhyN/iVDGL, CrossGrid and all the other European Grid Projects (GRIDSTART)
- other...

4 GridLab Applications
- Cactus (www.cactuscode.org)
- Triana (www.triana.co.uk)

5 What our users want...
- Two primary applications: Cactus and Triana
  - other application communities are also being engaged
- Application-oriented environment
- Resources (grid) on demand
- Adaptive applications, adaptive grid environment
  - job checkpoint, migration, spawn off a new job when needed
- Open, pervasive, not even restricted to a single Virtual Organization
- The ability to work in a disconnected environment
  - start my job on a disconnected laptop; migrate it to the grid when it becomes available
  - from laptops to fully deployed Virtual Organisations
- Mobile working
- Security

6 What our users want... (cont.)
- The infrastructure must provide capabilities to customise the choice of service implementation (e.g. using efficiency, reliability, first succeeding, all)
- Advance reservation of resources
- The ability to express preferences regarding their jobs on the one hand, and to understand the resource policies on the other
- Policy information and negotiation mechanisms
  - What is the usage policy of the remote resources?
- Prediction-based information
  - How long will my job run on a particular resource?
  - What resources do I need to complete the job before the deadline?

7 The Grid is complex …
[Diagram labels: Application asking “Is there a better resource I could be using?”; Monitoring, Resource Management, Information, Security, Data Management, GLOBUS, Application Manager, Logging, Notification, Migration, Profiling; SOAP, WSDL, Corba, OGSA, Other; Other Grid Infrastructure]

8 …need to make it easier to use
[Diagram labels: Application asking “Is there a better resource I could be using?”; GAT with the call GAT_FindResource( ); The Grid]

9 The Same Application …
[Diagram labels: Application with GAT shown three times, for Laptop, The Grid and Super Computer; “No network!”; “Firewall issues!”]

10 GAT: What is It?
- GAT: Grid Application Toolkit
  - Implements the GAT-API
  - Used by applications (different languages)
- GAT Adaptors
  - Connect to capabilities/services
- GAT Engine
  - Provides the function bindings for the GAT-API
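
The split between engine and adaptors can be illustrated with a minimal Python sketch. This is not the GAT-API itself: the `Engine` class, the `find_resource` function and both adaptors are hypothetical names invented for illustration. The engine tries the registered adaptors in order and returns the first result that succeeds, which is one plausible reading of the "first succeeding" choice mentioned on slide 6.

```python
from typing import Callable, Dict, List


class AdaptorError(Exception):
    """Raised by an adaptor that cannot serve a request."""


class Engine:
    """Hypothetical stand-in for the GAT Engine: binds API calls to adaptors."""

    def __init__(self) -> None:
        self._adaptors: Dict[str, List[Callable[..., object]]] = {}

    def register(self, capability: str, adaptor: Callable[..., object]) -> None:
        # Several adaptors may offer the same capability.
        self._adaptors.setdefault(capability, []).append(adaptor)

    def call(self, capability: str, *args, **kwargs):
        # "First succeeding" policy: return the first adaptor result.
        errors = []
        for adaptor in self._adaptors.get(capability, []):
            try:
                return adaptor(*args, **kwargs)
            except AdaptorError as exc:
                errors.append(str(exc))
        raise AdaptorError(f"no adaptor could serve {capability!r}: {errors}")


def broker_adaptor(requirements):
    # Assumed failure mode: the laptop has no network connection.
    raise AdaptorError("no network")


def local_adaptor(requirements):
    # Fallback that only knows about the local machine.
    return ["localhost"]


engine = Engine()
engine.register("find_resource", broker_adaptor)
engine.register("find_resource", local_adaptor)


def find_resource(requirements):
    """Application-facing call; the application never sees the adaptors."""
    return engine.call("find_resource", requirements)
```

With this arrangement the same application code runs unchanged on a disconnected laptop and on a fully connected grid; only the set of adaptors that succeed differs.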

11 Grid Application Toolkit
- The GAT provides functionality through a carefully constructed set of generic high-level APIs, through which an application will be able to call the underlying grid services
- Set of application developer APIs for Grid tools, services and software libraries (and example implementations) that support the development of grid-enabled applications (open source!)
- Usable from any high-level “application” (any generic code, Cactus, Triana, Portals, Scripts, …)

12 GAT More or less …
- Set of calls: GAT_ToolOrService(arguments)
- Your chosen tools/services: resource brokers, information servers, application managers, grid monitoring, data managers, notification, etc.
- Set of APIs for dealing with the GAT (registration, information, errors, fault tolerance)

13 GridLab Architecture

14 GridLab RMS approach
- Grid resources are not only machines, but also databases, files, users, administrators, instruments, mobile devices, jobs/applications...
- Many metrics for scheduling: throughput, cost, latency, deadline, other time and cost metrics...
- Grid resource management (GRM) consists of job/resource scheduling, security (authorization services, ...), local policies, negotiations, accounting, ...
- GRM is a negotiation process driven by both users and resource owners, and thus a multicriteria decision-making process

15 GRMS is a bag of services
- Core GRMS Services: Job Receiver Service, Resource Discovery, Resource Evaluation, Brokering, Prediction Service, QoS/SLA Service, Advanced Reservation, Resource Estimation, Job Execution Service, Job/Application Mgmt, Scheduler, Distributed Workflow
- Infrastructure Services: Job logging and tracking service, Security service (WP6), System Monitoring (WP11), Grid Information System (WP10), Adaptive Services (WP7), Data mgmt services (WP8)
- GRMS Policy/Configuration Services: System Configuration Mgmt, System Policy Mgmt

16 GridLab RMS
[Diagram labels: Job Receiver, Jobs Queue, BROKER, Execution Unit, Resource Discovery, Scheduler, Resource Reservation, Prediction Unit, File Transfer Unit, Information Services, Data Management, Authorization System, Adaptive, Workflow Manager, SLA Negotiation, GRMS, Monitoring, GLOBUS, other, Local Resources (Managers), Application Manager]

17 GRMS and SLA

18 GRMS and SLA (cont.)

19 Research focus of GRMS
- A focus on infrastructure alone is not enough for efficient GRM
- Focus on policies
- Focus on multicriteria aspects of GRM
  - users, their preferences and applications
  - resource owners’ preferences
  - preference models, multicriteria decision making, knowledge will be crucial for efficient resource management
- Focus on AI techniques for GRM
- Focus on business models, economy grids
  - Cost negotiation mechanisms could be part of the SLA negotiation process
- contradictory in nature

20 Multicriteria RM in GridLab
- Gathering of information
  - application requirements (resource requirements, environment, etc.)
  - user preferences (which criteria and how important)
  - user support, preference modeling tools
- Selection phase
  - choose the best resources (schedule) based on the information provided and on resource availability (estimates, predictions)
  - from simple matchmaking to multiple optimisation techniques
- Execution phase
  - file staging, execution control, job monitoring, migration, usually re-selection of resources, application adaptation (application managers, adaptive services from GridLab)
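
As an illustration of the selection phase, the sketch below ranks candidate resources with a weighted sum over normalised criteria. This is one simple multicriteria technique, not necessarily the method GRMS uses; the resource names, criteria and weights are all invented for the example.

```python
def normalise(values, lower_is_better):
    """Scale values to [0, 1] so that 1.0 is always the preferred end."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [1.0 for _ in values]
    if lower_is_better:
        return [(hi - v) / (hi - lo) for v in values]
    return [(v - lo) / (hi - lo) for v in values]


def rank_resources(resources, weights, lower_is_better):
    """Return resource names ordered from best to worst weighted score."""
    names = list(resources)
    scores = {name: 0.0 for name in names}
    for criterion, weight in weights.items():
        column = [resources[name][criterion] for name in names]
        scaled = normalise(column, lower_is_better[criterion])
        for name, value in zip(names, scaled):
            scores[name] += weight * value
    return sorted(names, key=lambda name: scores[name], reverse=True)


# Hypothetical candidates with an assumed cost and a predicted runtime.
resources = {
    "cluster-a": {"cost": 10.0, "predicted_runtime_h": 4.0},
    "cluster-b": {"cost": 4.0, "predicted_runtime_h": 6.0},
    "cluster-c": {"cost": 7.0, "predicted_runtime_h": 8.0},
}
lower_is_better = {"cost": True, "predicted_runtime_h": True}

# User preferences: which criteria matter, and how much.
time_sensitive = {"cost": 0.3, "predicted_runtime_h": 0.7}
cost_sensitive = {"cost": 0.7, "predicted_runtime_h": 0.3}

best_for_speed = rank_resources(resources, time_sensitive, lower_is_better)
best_for_cost = rank_resources(resources, cost_sensitive, lower_is_better)
```

Changing only the weights changes the ranking, which is the point of collecting user preferences before the selection phase.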

21 Policy representation
- Local agents responsible for gathering policy information
- Interface for the GRMS (VO) policy configuration
- Local queue configurations and global VO policies are represented in the form of rules which can be read by the GRMS
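
The slide does not give the rule format, so the following is only a sketch of how rules for local queues and for the whole VO might be represented and checked. The `Rule` fields, the queue names and the limits are assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    resource: str    # queue or resource the rule applies to; "*" means VO-wide
    attribute: str   # job attribute the rule constrains
    maximum: float   # largest value the policy allows


def allowed(job, resource, rules):
    """Return True if the job violates no rule that applies to the resource."""
    for rule in rules:
        if rule.resource in ("*", resource):
            if job.get(rule.attribute, 0) > rule.maximum:
                return False
    return True


# Hypothetical policies: one global VO rule and one local queue rule.
rules = [
    Rule("*", "cpus", 64),
    Rule("queue-short", "walltime_h", 2),
]
job = {"cpus": 16, "walltime_h": 5}

fits_short_queue = allowed(job, "queue-short", rules)
fits_long_queue = allowed(job, "queue-long", rules)
```

Keeping the rules as plain data means a local agent can publish them and the broker can evaluate them without knowing how each site enforces its policy.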

22 Current implementation
- Runs at rage1.man.poznan.pl, the front-end to the Linux cluster, and uses Tomcat and Axis as a hosting environment
- httpg://rage1.man.poznan.pl:8443/axis/services/gsiScenarioBroker
- The WSDL document and the client code are available at http://www.gridlab.org/WorkPackages/wp-9/ in the section Resources/Our Software

23 Current implementation
- submitJob - submits a new job
- migrateJob - migrates an existing job
- getMyJobsList - returns the list of jobs belonging to the user
- registerApplicationAccess - registers application access
- getJobStatus - returns the GRMS status of the job
- getHostName - returns the name of the host on which the job is/was running
- getJobInfo - returns a structure describing the job
- findResources - returns resources matching the user's requirements
- cancelJob - cancels the job
- getServiceDescription - returns a description of a service
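
To show how a client might use a few of these operations together, here is a toy in-memory stand-in written in Python. Only the operation names come from the slide. The parameters, return values and status strings are assumptions, and the real service is reached through the WSDL interface rather than a local class.

```python
import itertools


class InMemoryBroker:
    """Toy stand-in for the broker; parameters and statuses are assumptions."""

    def __init__(self):
        self._ids = itertools.count(1)
        self._jobs = {}

    def submitJob(self, user, description, host):
        # Assumed to return an identifier for the new job.
        job_id = f"job-{next(self._ids)}"
        self._jobs[job_id] = {
            "user": user,
            "description": description,
            "host": host,
            "status": "RUNNING",
        }
        return job_id

    def migrateJob(self, job_id, new_host):
        self._jobs[job_id]["host"] = new_host

    def cancelJob(self, job_id):
        self._jobs[job_id]["status"] = "CANCELLED"

    def getJobStatus(self, job_id):
        return self._jobs[job_id]["status"]

    def getHostName(self, job_id):
        return self._jobs[job_id]["host"]

    def getMyJobsList(self, user):
        return [j for j, info in self._jobs.items() if info["user"] == user]


broker = InMemoryBroker()
first = broker.submitJob("alice", "cactus run", "laptop")
second = broker.submitJob("bob", "triana workflow", "cluster-a")
broker.migrateJob(first, "cluster-b")   # e.g. the grid became available
broker.cancelJob(second)
```

The migration call mirrors the user scenario from slide 5: a job started on a laptop is moved to a grid resource once one is reachable.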

