Presentation is loading. Please wait.

Presentation is loading. Please wait.

ALCF Argonne Leadership Computing Facility GridFTP Roadmap Bill Allcock (on behalf of the GridFTP team) Argonne National Laboratory.

Similar presentations


Presentation on theme: "ALCF Argonne Leadership Computing Facility GridFTP Roadmap Bill Allcock (on behalf of the GridFTP team) Argonne National Laboratory."— Presentation transcript:

1 ALCF Argonne Leadership Computing Facility GridFTP Roadmap Bill Allcock (on behalf of the GridFTP team) Argonne National Laboratory

2 – Usability & Performance Packaging GridFTP as RPM GWFTP GridFTP GUI Automatic Firewall Traversal Sync feature for globus-url-copy

3 Argonne National Laboratory Packaging GridFTP as RPM Modify packaging of GridFTP and its dependencies Make it suitable for packaging as an RPM Make it compatible with major Linux distribution standards Eventually some distribution might pick it up GridFTP available as part of standard Linux distribution –Attract a whole new set of users –Put it in par with scp, standard ftp in terms of availability

4 Argonne National Laboratory GridFTP Where there’s FTP (GWFTP) GridFTP has been in existence for some time and has proven to be quite robust and useful Only few GridFTP clients available FTP has innumerable clients GWFTP - created to leverage the FTP clients A proxy between FTP clients and GridFTP servers

5 Argonne National Laboratory GWFTP FTP Client GWFTP (GSI Credential) wiggum.mcs.anl.gov GridFTP Server (2811) USER ::gsiftp://wiggum.mcs.anl.gov:2811/ PASS GSI Authentication Get request Data

6 Argonne National Laboratory GUI Client 08/14/2008Computation Institute

7 Argonne National Laboratory GridFTP GUI A Java Web Start Application –Updates automatically –Users always use the latest release Transfer files and directories Third-party transfer Multiple concurrent transfers Support authentication through MyProxy Manage local and remote files and directories –Browse –Create and delete

8 Argonne National Laboratory Automatic Firewall Traversal Control channel port is statically assigned Data channel ports are dynamically assigned GridFTP Protocol Changes New commands to communicate the 4 tuple (src ip, src port, dst ip, dst port) to both ends of transfer Use simultaneous Open/TCP splicing or Use a broker to open ports temporarily Hooks in GridFTP to contact a broker at the right time

9 Argonne National Laboratory Firewall GridFTP Source Server GridFTP Dest Server Client TCP 2811 DATA

10 Argonne National Laboratory Automatic traversal using a connection Broker GridFTP Source Server GridFTP Dest Server Client TCP 2811 CB DATA IP 4 tuple Temporary hole

11 Argonne National Laboratory Sync feature for globus-url- copy Check for the existence of a file at the destination before transferring If exists, determine whether the source version is different from that of the destination Based on how much the source has changed, optimize the transfer Research into developing a logic that does not involve any changes to the GridFTP protocol

12 Argonne National Laboratory – Reliability & Security Improved restart mechanism Improved memory management algorithm Load balancing Data channel security for SSH based GridFTP GUMS authorization callout

13 Argonne National Laboratory Improved Restart Mechanism globus-url-copy can recover from server and network failures Can not recover from its own failure Number of users including ESG, APS and SNS use this client to transfer large data sets with complex directory structures Develop methods to enable globus-url-copy to recover from its failure

14 Argonne National Laboratory Gfork architecture Server Host GFork Server GridFTP Plugin GridFTP Server Instance Fork GridFTP Server Instance GridFTP Server Instance State Sharing Link Client Inherited Links Control Channel Connections Client

15 Argonne National Laboratory Memory Management Optimistic memory provisioning by operating system –possible that under heavy loads GridFTP server can consume all of systems memory resources. Gfork – xinted like super server daemon –Allows state to be maintained across connections GridFTP plugin for Gfork has a simple memory limiting option –90% of the memory to the first 10% of the allowed connections –Remaining connections receive half of what is available Develop an improved memory management algorithm

16 Argonne National Laboratory Load balancing capabilities The separation of processes buys the ability to proxy –Allows for load balancing –Frontend can choose from a pool of DPIs to service a client request Client DPI IPC DPI Frontend DPI

17 Argonne National Laboratory sshd SSH based GridFTP (GridFTP- Lite) Client GridFTP Server 2811 Port 22 ROOT USER ssh Stdin/out (control channel)

18 Argonne National Laboratory Data Channel Security for SSH based GridFTP SSH based GridFTP does not have data channel security Investigate and prototype a way to let a client send a shared secret to both source and destination GridFTP servers Used to secure the data channel(s) between the two servers Shared secret can be used to authenticate, integrity-protect and encrypt the data channel This feature will increase the adoption of SSH based GridFTP

19 Argonne National Laboratory GUMS Authorization Callout GUMS – Grid User Management System –Grid identity mapping service –Maps grid identity to local site identity –Used in OSG GUMS server 3. Obtain local identity from GUMS server /DC=org/DC=doegrids/OU=People/CN=John Bresnahanz bresnaha GridFTP Client GUMS callout 1. Authentication 2. Data transfer operations Disk 4. Access data as local identity

20 Argonne National Laboratory GUMS Authorization Callout Role based authorization using voms extended proxy GUMS server 3. Obtain local identity from GUMS server /DC=org/DC=doegrids/OU=People/CN=John Bresnahanz usatlasdev GridFTP Client GUMS callout 1. Authentication 2. Data transfer operations Disk 4. Access data as local identity /VO=ATLAS/Group=USATLAS/Role=developer

21 Argonne National Laboratory – Quality of Service Information provider Provision end-point GridFTP resources Integrate network provisioning Integrate storage provisioning Co-schedule data transfer resources

22 Argonne National Laboratory GridFTP information provider service –Max connections –Open connections –Load Higher level services can utilize this information for scheduling data transfers –Help with selecting the appropriate replica of data Information Provider

23 Argonne National Laboratory Provision end-point resources GridFTP Server GridFTP Info Provider CPU MemoryBW Resource Limiter Ad Control Channel Data Movement Service (RFT replacement) Data Point GFTP Resource Broker Provision GridFTP

24 Argonne National Laboratory Integrate Network Provisioning GridFTP Server GridFTP Info Provider CPU MemoryBW Resource Limiter Ad Control Channel Data Movement Service Data Point GFTP Resource Broker Provision GridFTP Network Reservation Service Reserve Bandwidth Bandwidth Token

25 Argonne National Laboratory Integrate Storage Provisioning GridFTP Server GridFTP Info Provider CPU MemoryBW Resource Limiter Ad Control Channel Data Movement Service Data Point GFTP Resource Broker Provision GridFTP Network Reservation Service Provision Bandwidth Bandwidth Token File System Lotman Provision Storage

26 Argonne National Laboratory Co-schedule Data Transfer Resources Data Movement Service Network Reservation Service Provision Bandwidth Source Data Point Destination Data Point Provision GridFTP and Storage resources


Download ppt "ALCF Argonne Leadership Computing Facility GridFTP Roadmap Bill Allcock (on behalf of the GridFTP team) Argonne National Laboratory."

Similar presentations


Ads by Google