Presentation is loading. Please wait.

Presentation is loading. Please wait.

Holding slide prior to starting show. Scheduling Parametric Jobs on the Grid Jonathan Giddy

Similar presentations


Presentation on theme: "Holding slide prior to starting show. Scheduling Parametric Jobs on the Grid Jonathan Giddy"— Presentation transcript:

1 Holding slide prior to starting show

2 Scheduling Parametric Jobs on the Grid Jonathan Giddy J.P.Giddy@wesc.ac.uk

3 Parametric computation Scientifically: –Study the behaviour of output variables against a range of different input scenarios Computationally: –Execute an application multiple times, each time with a different combination of input parameters

4 Why use the Grid? Parametric computations –Require high performance computational resources –Require large numbers of computational resources –Generate large amounts of concurrency –Generate uncoupled computations –Tolerate high latencies

5 Nimrod/G CostDeadline

6 Minimise Cost Increasing price 7 4 1 8 5 2 6 3 Node 4 Node 3 Node 2 Node 1 Time Jobs Budget Cost 8 20 0 7 19 1 6 18 2 5 17 3 4 15 5 3 13 7 2 11 9 1 8 12 0 5 15

7 7 Minimise Time Increasing price 4 2 1 6 5 3 Node 4 Node 3 Node 2 Node 1 Time Jobs Budget Cost Budget / Job 8 20 0 2.5 7 19 1 2.71 6 17 3 2.83 5 16 4 3.2 4 13 7 3.25 3 11 9 3.67 2 8 12 4.0 1 4 16 4.0 0 20 8

8 Globus 1.1 GRAM API int globus_gram_client_job_check( char *resource_manager_contact, const char *description, const float conf_percentage, globus_gram_client_time_t *estimate, globus_gram_client_time_t *interval) Note: This is not yet implemented This function returns an estimate of the time it would take for a job of the description provided to reach an ACTIVE state.

9 Historical profiling Examine characteristics of all jobs in queue against historical profiles in order to determine expected start time of a job Returns start time and error estimate Warren Smith, Ian T. Foster, Valerie E. Taylor: Predicting Application Run Times Using Historical Information. Job Scheduling Strategies for Parallel Processing Workshop (JSSPP) 1998: 122-142

10 Information Overload Too many variables: –Number of CPUs –CPU speed –Processor architecture –Operating system –Real memory –Disk speed –Bandwidth –Latency –Other users

11 Extrapolation of completion rate A B C 2 jobs/hr 3 jobs/hr 6 jobs/hr 1 hr2 hr

12 0 10 20 30 40 50 60 70 80 02.557.51012.51517.520 Time Average No. Processors 20 Hour deadline 15 hour deadline 10 hour deadline

13 Assumptions Compute time >> Network time All jobs are the same length on any particular resource Price of a resource is constant over time Not much wriggle room during the end- game –Both scheduling schemes push up against the limit that they’re not minimising –Heuristic nature of completion time

14 What we really want… Guaranteed completion time –globus_gram_client_job_check() with teeth –Requires scheduler to internally reserve space for job in advance Advance reservation –As above, but with external interface

15 And this too… A real grid economy –Incentive for providers to provide resources –Incentive for consumers to describe requirements accurately –Incentive for consumers to use resources judiciously –Price mechanism budget as a timely global information parameter universally understood enables trade-offs in making QoS decisions

16 A final point Optimising is really hard in a wide area network –Requires centralised decision maker –Information is missing –Information is not contemporaneous –Information is out-of-date

17 Scalable information …is slow to change Budget and deadline are (relatively) constant and can be propagated far and wide in a timely manner Slow information comes from specifying requirements in the real world Satisfying (instead of optimising) a requirement is relatively simple –A resource can so it does –A resource can’t so it doesn’t

18


Download ppt "Holding slide prior to starting show. Scheduling Parametric Jobs on the Grid Jonathan Giddy"

Similar presentations


Ads by Google