Presentation is loading. Please wait.

Presentation is loading. Please wait.

CSF4, SGE and Gfarm Integration Zhaohui Ding Jilin University.

Similar presentations


Presentation on theme: "CSF4, SGE and Gfarm Integration Zhaohui Ding Jilin University."— Presentation transcript:

1 CSF4, SGE and Gfarm Integration Zhaohui Ding zhaohui.ding@email.jlu.edu.cn zding@sdsc.edu Jilin University

2 2 Agenda CSF4 CSF4 integrate with SGE(Done at SDSC) CSF4 integrate with Gfarm(Done at SDSC) CSF4 other new functionalities(Done at SDSC)

3 3 CSF4 What is CSF CSF4 Functionalities & Services CSF4 Architecture

4 4 CSF4 – What is CSF What is CSF Full name: Community Scheduler Framework Full name: Community Scheduler Framework CSF is a meta-scheduler working at grid level contributed by Platform (a CA software company/LSF) CSF is a meta-scheduler working at grid level contributed by Platform (a CA software company/LSF) The first version of CSF, CSF3, was developed based on GT3-OGSI The first version of CSF, CSF3, was developed based on GT3-OGSI CSF4 is the GT4-WSRF compliant version of CSF with providing the same functionalities with CSF3 CSF4 is the GT4-WSRF compliant version of CSF with providing the same functionalities with CSF3 CSF is an open source project and can be accessed at http://sourceforge.net (the cvs mainline code is csf4) CSF is an open source project and can be accessed at http://sourceforge.net (the cvs mainline code is csf4)http://sourceforge.net The development team of CSF4 is from Jilin University, PRC The development team of CSF4 is from Jilin University, PRC

5 5 CSF4 – What is CSF ( cont.) CSF4 is a contribution to GT4

6 6 What is CSF ( cont.) Meta-scheduler vs Local Resource Manger In a Grid-computing environment, there is a common requirement for users to query, negotiate access and manage resources existing within different administrative domains at Grid level. The meta- scheduler is designed to perform such global wide policies. Typically different Resource Management soft wares (RMs) like LSF, PBS, and Sun Grid Engine are responsible for load balancing and resource sharing within each local administrative domain. Typically different Resource Management soft wares (RMs) like LSF, PBS, and Sun Grid Engine are responsible for load balancing and resource sharing within each local administrative domain.

7 7 What is CSF ( cont.) A typical deployment for Meta-Scheduler and RMs.

8 8 CSF4 – Functionalities Functionalities provided by CSF4 Submit, control and monitor jobs at a grid level. Submit, control and monitor jobs at a grid level. Create and manage advanced reservations at the grid level. (only support LSF now) Create and manage advanced reservations at the grid level. (only support LSF now) Send job and advanced reservation operations to local resource managers. Send job and advanced reservation operations to local resource managers. A plug in scheduler interface so that site and user specific scheduling policies can be implemented regardless of the underlying resource manager. A plug in scheduler interface so that site and user specific scheduling policies can be implemented regardless of the underlying resource manager. Create queues of jobs, each with separately define- able scheduling policies. Create queues of jobs, each with separately define- able scheduling policies.

9 9 CSF4 – Services ( cont.) CSF4 will start the following services in GT4 container - Services available for grid users: JobService JobService csf-job-create, csf-job-start, csf-job-submit, csf-job-status, csf-job-stop, csf-job-resume, …, ReservationService ReservationService csf-rsv-create, csf-rsv-status, csf-rsv-cancel csf-job-submit, … QueuingService QueuingService csf-queue-create, csf-queue-conf, csf-queue-data

10 10 CSF4 – Services ( cont.) Services for internal use only: ResourceManagerFactoryService ResourceManagerFactoryService Used by Job Service to Start a job via RM Adapter (Configuration needed for LSF Adapters) ResourceManagerLsfService ResourceManagerLsfService Used by Job Service to start a job in LSF via LSF RM Adapter ResourceManagerGramService (new service) ResourceManagerGramService (new service) Used by Job Service to start a job in Gatekeeper via Java Cog Kit. They are not supposed to be used by the user directly, and there is no user client provided.

11 11 CSF4 – Schedule plugin & scheduling policies Scheduling polices are defined in a queue Each policy is implemented inside a scheduling plugin module A queue can load multiple plugin modules FCFS plugin is mandatory for all the queues and is always loaded by CSF Throttle plugin is an optional plugin provided by CSF to constraint the number of jobs lunched in one scheduling cycle Users can write their own plugins to realize customized scheduling policies

12 12 CSF4 – Architecture

13 13 CSF4 – Architecture ( cont. ) Terms RM Gram Adapter Services to submit jobs to resource manager via GRAM protocol. (LSF/PBS/Condor/SGE) Specific RM Adapter MetaScheduler sends job/reservation requests to RM. Every resource manager needs to implement its own RM Adapter. Now only LSF Adapter is available. RM LSF Adapter is able to talk with remote LSF clusters. Support more functionalities: advance reservation, complex job control.

14 14 CSF4 Integrate with SGE for SGE6 Before Globus Toolkit 4 released, the scheming documents said GT4 wouldn't offer scheduler adapter for SGE, so CSF4 didn't support SGE, either. A third-part adapter developed by London e-Science Centre, Gridwise Technologies and MCNC is released. The adapter only supports SGE6.0. With extendable architecture, CSF4 can support SGE easily.

15 15 CSF4 Integrate with SGE for SGE5.3 Now SGE5.3 can only integrate with GT2.x. Most clusters in SDSC is using GT2 and SGE 5.3. GT2 is still popular. CSF4 Supports GT2 gatekeeper is significant. How to support GT2? Java CoG Kit: The Java Commodity Grid Kit provides convenient access to Grid middleware through the Java framework. It supports GT2.

16 16 CSF4 Integrate with SGE for SGE5.3(Cont.) ResourceManagerGramService (new service) ResourceManagerGramService (new service) Used by Job Service to start a job in Gatekeeper via Java Cog Kit. Used by Job Service to start a job in Gatekeeper via Java Cog Kit. Config gatekeeper at resourcemanager-config.xml for example: Config gatekeeper at resourcemanager-config.xml for example: gatekeeper32 gatekeeper32 GRAM GRAM rocks-32.sdsc.edu/jobmanager-fork rocks-32.sdsc.edu/jobmanager-fork 2119 2119 <version>2.4</version></cluster>

17 17 CSF4 integrate with Gfarm Gfarm Security Share Secure Key Share Secure Key GSI Authentication GSI Authentication User certificate User certificate Delegate Delegate Proxy certificate Proxy certificate

18 18 CSF4 integrate with Gfarm(Cont.) Introduce four terms Two kinds of proxy Full proxy : Generally, is a proxy that has been created by grid-proxy-init or a proxy created from such a proxy by full delegation mechanisms. Full proxy : Generally, is a proxy that has been created by grid-proxy-init or a proxy created from such a proxy by full delegation mechanisms. Limited proxy : is a proxy that is created from a full Proxy when it delegated with the limited delegation mechanism. The first time a proxy is created by the limited delegation mechanism a level 1 Limited Proxy is created. Any subsequent delegation (limited or full) of a level N Limited Proxy creates a level N+1 limited proxy. Limited proxy : is a proxy that is created from a full Proxy when it delegated with the limited delegation mechanism. The first time a proxy is created by the limited delegation mechanism a level 1 Limited Proxy is created. Any subsequent delegation (limited or full) of a level N Limited Proxy creates a level N+1 limited proxy.

19 19 CSF4 integrate with Gfarm(Cont.) Two Kinds of Delegation Full delegation : Full delegation of a Full Proxy results in a Full Proxy on the remote side. Full delegation of a level N Limited Proxy results in a level N+1 Limited Proxy Full delegation : Full delegation of a Full Proxy results in a Full Proxy on the remote side. Full delegation of a level N Limited Proxy results in a level N+1 Limited Proxy Limited delegation : Limited Delegation of a Full Proxy results in a level 1 Limited Proxy. Limited delegation of a level N Limited Proxy results in a level N+1 Limited Proxy. Limited delegation : Limited Delegation of a Full Proxy results in a level 1 Limited Proxy. Limited delegation of a level N Limited Proxy results in a level N+1 Limited Proxy.

20 20 CSF4 integrate with Gfarm(Cont.) Three Server authentication flags Default, In this mode a Full Proxy or a level 1 Limited Proxy will be accepted for authentication. (e.g. Gfarm1.1) (middle) Default, In this mode a Full Proxy or a level 1 Limited Proxy will be accepted for authentication. (e.g. Gfarm1.1) (middle) GSS_C_GLOBUS_LIMITED_PROXY_FLAG, With this flag only a Full Proxy will be accepted for authentication. This mode should be used by applications that do job start- up (e.g. the gatekeeper and ws-gram). (strict) GSS_C_GLOBUS_LIMITED_PROXY_FLAG, With this flag only a Full Proxy will be accepted for authentication. This mode should be used by applications that do job start- up (e.g. the gatekeeper and ws-gram). (strict) GSS_C_GLOBUS_LIMITED_PROXY_MANY_FLAG, With this flag any Full Proxy or Limited Proxy (of any level) will be accepted. (e.g. GridFTP, Gfarm1.2) GSS_C_GLOBUS_LIMITED_PROXY_MANY_FLAG, With this flag any Full Proxy or Limited Proxy (of any level) will be accepted. (e.g. GridFTP, Gfarm1.2) (loose) (loose)

21 21 CSF4 integrate with Gfarm(Cont.) Two factors WS-Gram and Gatekeeper client delegate user certificate with limited delegation. WS-Gram and Gatekeeper client delegate user certificate with limited delegation. gfsd (Gfarm1.1 file node deamon) only accept 1 level limited proxy and full proxy gfsd (Gfarm1.1 file node deamon) only accept 1 level limited proxy and full proxy Two conclusions CSF4 must delegate to local scheduler with full delegation CSF4 must delegate to local scheduler with full delegation CSF4 can’t re-use WS-GRAM and Gatekeeper client library CSF4 can’t re-use WS-GRAM and Gatekeeper client library

22 22 CSF4 integrate with Gfarm(Cont.) How to Support Full Delegation for WS-GRAM Delegation Service : A new component of GT4, this component provides an interface for delegation of credentials to a hosting environment. This enables a single delegated credential to be shared across multiple invocations of services on that hosting environment Delegation Service : A new component of GT4, this component provides an interface for delegation of credentials to a hosting environment. This enables a single delegated credential to be shared across multiple invocations of services on that hosting environment jobcredentialEndpoint: In the Job Description, a new schema element is supported, an EndpointReference which points to the deletated credential resource (i.e. DelegationService Resource). jobcredentialEndpoint: In the Job Description, a new schema element is supported, an EndpointReference which points to the deletated credential resource (i.e. DelegationService Resource).

23 23 CSF4 integrate with Gfarm(Cont.) How to enable WS-GRAM run the job with full delegation? CSF4 will access the DelegationFactoryService of the hosting which the job will be submit to firstly, get a DelegationService EndpointReference. (With the EndpointReference, user can retrieve a resource property that contain a full proxy), CSF add the EndpointReferce to job description as jobCredentialEndpoint, then the job will be delegate to GRAM Server with full delegation. CSF4 will access the DelegationFactoryService of the hosting which the job will be submit to firstly, get a DelegationService EndpointReference. (With the EndpointReference, user can retrieve a resource property that contain a full proxy), CSF add the EndpointReferce to job description as jobCredentialEndpoint, then the job will be delegate to GRAM Server with full delegation.

24 24 CSF4 integrate with Gfarm(Cont.)

25 25 CSF4 integrate with Gfarm(Cont.) How to Support Full Delegation for GateKeeper Java CoG Kit support full delegation A class org.globus.gram.GramJob which represents a simple gram job A class org.globus.gram.GramJob which represents a simple gram job The class support full delegation. The class support full delegation.

26 26 CSF4 new functionality One meta-scheduler work with Multiple-site via GRAM

27 27 CSF4 new functionality(Cont.) a new interface to query all clusters available A new command line csf-job-RmInfo will print all clusters information available

28 28 CSF4 – Demo environment Set up GT4/CSF4 at rocks-110(frontend) Set up GT2 at rocks-32 cluster Set up SGE6.0u4 at rocks-110 cluster and SGE adapter Set up SGE5.3 at rocks-32 cluster and SGE adapter Set up Gfarm1.2 at rocks-32 and rocks-110 clusters

29 29 CSF4 – Demo Configuration for CSF Resource Manager (resourcemanager-config.xml) (resourcemanager-config.xml)<cluster> gatekeeper32 gatekeeper32 GRAM GRAM rocks-32.sdsc.edu/jobmanager-fork rocks-32.sdsc.edu/jobmanager-fork 2119 2119 <version>2.4</version></cluster><cluster> sge32 sge32 GRAM GRAM rocks-32.sdsc.edu/jobmanager-sge rocks-32.sdsc.edu/jobmanager-sge 2119 2119 <version>2.4</version></cluster>

30 30 CSF4 – Demo (Job Execution) Demo 1 – Query Clusters available Demo 2 – Ran a job in local SGE6.0 Demo 3 – Run a job (need full delegation) in remote GT2 gatekeeper with different delegation Demo 4 – Run a Gfarm job in remote SGE5.3

31 31 CSF4 – Related work Condor-G Condor-G integrate Condor project and Globus project. Condor-G integrate Condor project and Globus project. USE GRAM protocol, only support Globus Toolkit 2.x USE GRAM protocol, only support Globus Toolkit 2.x

32 32 CSF4 – Related work Moab Grid Scheduler (SILVER)

33 33 CSF4 – Our Future Plans A new policy plug-in to Support work flow We’re considering to integrate CSF with INFORMNET(http://grid-devel.sdsc.edu/informnet/) http://grid-devel.sdsc.edu/informnet/ A new policy plug-in to schedule Gfarm job in CSF4 Data aware plug-in Data aware plug-in Make grid level & cluster level data-aware scheduling work together efficiently

34 34 Seeking for Collaboration Opportunities Q/AThanks


Download ppt "CSF4, SGE and Gfarm Integration Zhaohui Ding Jilin University."

Similar presentations


Ads by Google