Presentation is loading. Please wait.

Presentation is loading. Please wait.

INFSO-RI-508833 Enabling Grids for E-sciencE www.eu-egee.org EGEE Middleware The Resource Broker EGEE project members.

Similar presentations


Presentation on theme: "INFSO-RI-508833 Enabling Grids for E-sciencE www.eu-egee.org EGEE Middleware The Resource Broker EGEE project members."— Presentation transcript:

1 INFSO-RI-508833 Enabling Grids for E-sciencE www.eu-egee.org EGEE Middleware The Resource Broker EGEE project members

2 Enabling Grids for E-sciencE INFSO-RI-508833 EGEE ResourceBroker 2 Contents Short review of concepts Requirements of the applications communities Overview of the main grid services A closer look

3 Enabling Grids for E-sciencE INFSO-RI-508833 EGEE ResourceBroker 3 Current production middleware Logging & Book-keeping ResourceBrokerStorageElementComputingElement InformationService Job Status DataSets info Author. &Authen. Job Submit Event Job Query Job Status Input “sandbox” Input “sandbox” + Broker Info Output “sandbox” Publish SE & CE info “User interface” LCG FileCatalogue (LFC)

4 Enabling Grids for E-sciencE INFSO-RI-508833 EGEE ResourceBroker 4 Building on basic tools and Information Service Example JDL file Executable = “gridTest”; StdError = “stderr.log”; StdOutput = “stdout.log”; InputSandbox = {“/home/joda/test/gridTest”}; OutputSandbox = {“stderr.log”, “stdout.log”}; … Submit job to grid via the “resource broker”, edg_job_submit my.jdl

5 Enabling Grids for E-sciencE INFSO-RI-508833 EGEE ResourceBroker 5 User Interface node The user’s interface to the Grid Command-line interface to –Proxy server –Job operations  To submit a job  Monitor its status  Retrieve output – Data operations  Upload file to SE  Create replica  Discover replicas –Other grid services Also C++ and Java APIs To run a job user creates a JDL (Job Description Language) file UI JDL

6 Enabling Grids for E-sciencE INFSO-RI-508833 EGEE ResourceBroker 6 Example JDL file Executable = “gridTest”; StdError = “stderr.log”; StdOutput = “stdout.log”; InputSandbox = {“/home/joda/test/gridTest”}; OutputSandbox = {“stderr.log”, “stdout.log”}; InputData = “lfn:/grid/VOname/mydir/testbed0.00019”; DataAccessProtocol = “gridftp”; Requirements = other.Architecture==“INTEL” && \ other.OpSys==“LINUX” && other.FreeCpus >=4; Rank = “other.GlueHostBenchmarkSF00”; Building on basic tools and Information Service Submit job to grid via the “resource broker (RB)”, edg_job_submit my.jdl Returns a “job-id” used to monitor job, retrieve output

7 Enabling Grids for E-sciencE INFSO-RI-508833 EGEE ResourceBroker 7 Example JDL file Executable = “gridTest”; StdError = “stderr.log”; StdOutput = “stdout.log”; InputSandbox = {“/home/joda/test/gridTest”}; OutputSandbox = {“stderr.log”, “stdout.log”}; InputData = “lfn:/grid/VOname/mydir/testbed0-00019”; DataAccessProtocol = “gridftp”; Requirements = other.Architecture==“INTEL” && \ other.OpSys==“LINUX” && other.FreeCpus >=4; Rank = “other.GlueHostBenchmarkSF00”; Building on basic tools and Information Service Submit job to grid via the “resource broker”, edg_job_submit my.jdl Returns a “job-id” used to monitor job, retrieve output lfn: logical file name RB uses Catalog to find replica locations

8 Enabling Grids for E-sciencE INFSO-RI-508833 EGEE ResourceBroker 8 Example JDL file Executable = “gridTest”; StdError = “stderr.log”; StdOutput = “stdout.log”; InputSandbox = {“/home/joda/test/gridTest”}; OutputSandbox = {“stderr.log”, “stdout.log”}; InputData = “lfn:testbed0-00019”; DataAccessProtocol = “gridftp”; Requirements = other.Architecture==“INTEL” && \ other.OpSys==“LINUX” && other.FreeCpus >=4; Rank = “other.GlueHostBenchmarkSF00”; Building on basic tools and Information Service Submit job to grid via the “resource broker”, edg_job_submit my.jdl Returns a “job-id” used to monitor job, retrieve output Uses BDII Information System

9 9 UI Network Server Job Contr. - CondorG Workload Manager LFC Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status Job submission

10 10 UI Network Server Job Contr. - CondorG Workload Manager LFC Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status Job Status UI: allows users to access the functionalities of the WMS (via command line, GUI, C++ and Java APIs) WMS: Workload Management System

11 11 UI Network Server Job Contr. - CondorG Workload Manager Replica Location Server Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status edg-job-submit myjob.jdl Myjob.jdl JobType = “Normal”; Executable = "$(CMS)/exe/sum.exe"; InputSandbox = {"/home/user/WP1testC","/home/file*”, "/home/user/DATA/*"}; OutputSandbox = {“sim.err”, “test.out”, “sim.log"}; Requirements = other. GlueHostOperatingSystemName == “linux" && other. GlueHostOperatingSystemRelease == "Red Hat 7.3“ && other.GlueCEPolicyMaxCPUTime > 10000; Rank = other.GlueCEStateFreeCPUs; submitted Job Status Job Description Language (JDL) to specify job characteristics and requirements

12 12 UI Network Server Job Contr. - CondorG Workload Manager LFC Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status RB storage Input Sandbox files Job waiting submitted Job Status NS: network daemon responsible for accepting incoming requests

13 13 UI Network Server Job Contr. - CondorG Workload Manager LFC Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status RB storage waiting submitted Job Status WM: responsible to take the appropriate actions to satisfy the request Job

14 14 Job submission UI Network Server Job Contr. - CondorG Workload Manager LFC Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status RB storage waiting submitted Job Status Match- Maker/ Broker Where must this job be executed ?

15 15 Job submission UI Network Server Job Contr. - CondorG Workload Manager LFC Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status RB storage waiting submitted Job Status Match- Maker/ Broker Matchmaker: responsible to find the “best” CE where to submit a job

16 16 Job submission UI Network Server Job Contr. - CondorG Workload Manager LFC Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status RB storage waiting submitted Job Status Match- Maker/ Broker Where are (which SEs) the needed data ? What is the status of the Grid ?

17 17 Job submission UI Network Server Job Contr. - CondorG Workload Manager LFC Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status RB storage waiting submitted Job Status Match- Maker/ Broker CE choice

18 18 Job submission UI Network Server Job Contr. - CondorG Workload Manager LFC Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status RB storage waiting submitted Job Status Job Adapter JA: responsible for the final “touches” to the job before performing submission (e.g. creation of wrapper script, etc.)

19 19 Job submission UI Network Server Job Contr. - CondorG Workload Manager LFC Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status RB storage Job Status JC: responsible for the actual job management operations (done via CondorG) Job submitted waiting ready

20 20 Job submission UI Network Server Job Contr. - CondorG Workload Manager LFC Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status RB storage Job Status Job Input Sandbox files submitted waiting ready scheduled

21 21 UI Network Server Job Contr. - CondorG Workload Manager LFC Inform. Service Computing Element Storage Element RB node RB storage Job Status Input Sandbox submitted waiting ready scheduled running “Grid enabled” data transfers/ accesses Job

22 22 UI Network Server Job Contr. - CondorG Workload Manager LFC Inform. Service Computing Element Storage Element RB node RB storage Job Status Output Sandbox files submitted waiting ready scheduled running done

23 23 UI Network Server Job Contr. - CondorG Workload Manager LFC Inform. Service Computing Element Storage Element RB node RB storage Job Status Output Sandbox submitted waiting ready scheduled running done edg-job-get-output

24 24 UI Network Server Job Contr. - CondorG Workload Manager LFC Inform. Service Computing Element Storage Element RB node RB storage Job Status Output Sandbox files submitted waiting ready scheduled running done cleared

25 25 Job monitoring UI Log Monitor Logging & Bookkeeping Network Server Job Contr. - CondorG Workload Manager Computing Element RB node LM: parses CondorG log file (where CondorG logs info about jobs) and notifies LB LB: receives and stores job events; processes corresponding job status Log of job events edg-job-status edg-job-get-logging-info Job status

26 Enabling Grids for E-sciencE INFSO-RI-508833 EGEE ResourceBroker 26 Possible job states Flag Meaning SUBMITTEDsubmission logged in the LB WAITjob match making for resources READYjob being sent to executing CE SCHEDULEDjob scheduled in the CE queue manager RUNNINGjob executing on a WN of the selected CE queue DONEjob terminated without grid errors CLEAREDjob output retrieved ABORTjob aborted by middleware, check reason


Download ppt "INFSO-RI-508833 Enabling Grids for E-sciencE www.eu-egee.org EGEE Middleware The Resource Broker EGEE project members."

Similar presentations


Ads by Google