Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations
What is the TeraGrid?
TGUP (TeraGrid User Portal)
Accessing TeraGrid User Portal
Portal Overview
Portal: MyTeraGrid
Portal: Resources
Portal: Documentation
Accessing Resources
Web-based SSO via Portal
SSO (Non-Portal) from a TeraGrid Resource
Example - SSH to tg-login.ncsa.teragrid.org or another resource that you have SSH access. - grid-proxy-info - myproxy-logon -l username -grid-proxy-info -gsissh tg-login.purdue.teragrid.org
TeraGrid Resources – Sorted by site – Sorted by machine type – My TeraGrid -> Accounts – Resources s/ s/ – URLs here contain detailed user documentation
Moving data to/from TeraGrid systems Sftp clients from your office to TeraGrid – Command line sftp on Linux – GUI sftp clients GSI-SSHTerm sftp button – Resources -> File Manager [beta] High speed gridftp between TeraGrid systems – globus-url-copy – Uberftp
File Transfers: Small ( <100 MB) Files
Large ( >100 MB) File Transfers: globus-url-copy The globus-url-copy client program is a GridFTP client for transferring files from the command line. Usage: – globus-url-copy where or is of the format: if local file, file: if remote file, gsiftp:// /
Example – Two Party Transfer Logon to NCSA Abe Cluster. Create a large file on NCSA Abe Cluster: – dd bs=100MB count=1 if=/dev/zero of=testfile Copy this file to Purdue Steele Cluster: – globus-url-copy –vb file:///u/ac/username/testfile gsiftp://tg- steele.purdue.teragrid.org:2811/autohome/u108/userna me/
Example – Third Party Transfer Logon to NCSA Abe Cluster Copy the testfile at Purdue Steele Cluster to NCAR Frost Cluster – globus-url-copy -vb gsiftp://tg- steele.purdue.teragrid.org:2811/autohome/u108/username/test file gsiftp://gridftp.frost.ncar.teragrid.org:2811//home/username/ GridFTP server addresses for each site are listed at: – ion.php#deployment ion.php#deployment
Optimized Data Transfer with globus- url-copy Using large TCP windows – globus-url-copy –vb -tcp-bs file:///u/ac/username/testfile gsiftp://tg- steele.purdue.teragrid.org:2811/autohome/u108/username/ Using large memory buffers – globus-url-copy –vb -bs file:///u/ac/username/testfile gsiftp://tg- steele.purdue.teragrid.org:2811/autohome/u108/username/ Using multiple parallel streams – globus-url-copy –vb –p 4 file:///u/ac/username/testfile gsiftp://tg- steele.purdue.teragrid.org:2811/autohome/u108/username/
Large ( >100 MB) File Transfers: UberFTP UberFTP is an interactive GridFTP file transfer client. Opens a session with a remote host, within which files may be transfered and directories and files may be manipulated. Requires GSI authentication. Hands-On: – Login to NCSA Abe – uberftp – open tg-steele.rcac.purdue.edu – parallel 2 – tcpbuf – ls/lls/put/get/…
Imaginations unbound Data movement tips To move a collection of small files, make an archive and move it instead of moving the files individually – tar – zip For high bandwidth links and moderate file or archive size, do not compress, it's usually faster to just move the data [compression is a time waster] For low bandwidth links, compression is usually a time saver – tar z or j options for compression – zip – ssh -C, sftp -C
Permanent Storage at NCSA The larger TeraGrid sites provide persistent high-capacity storage Details vary by site, consult local site documentation for specifics Refer to: – for detailed information about different quotas, policies, and tools (such as SRB, HPSS) at each site.
Managing Your Environment: Softenv
Softenv
Managing Your Environment: Modules * Try at tg-steele.rcac.purdue.edu
Softenv and Modules: Which do I use? Rule of thumb: go with the default on a given machine – When you login for the first time issue ‘softenv’ and ‘module list’ commands – In general, only one should be active by default: go with that one – If you have questions or run into any problems contact
29 Grid Job Management using Globus Common WS interface to schedulers – Unix, Condor, LSF, PBS, SGE, … More generally: interface for process execution management – Lay down execution environment – Stage data – Monitor & manage lifecycle – Kill it, clean up
30 Grid Job Management Goals Provide a service to securely: Create an environment for a job Stage files to/from environment Cause execution of job process(es) – Via various local resource managers Monitor execution Signal important state changes to client Enable client access to output files – Streaming access during execution
31 GRAM GRAM: Globus Resource Allocation and Management GRAM is a Globus Toolkit component – For Grid job management GRAM is a unifying remote interface to Resource Managers – Yet preserves local site security/control Remote credential management File staging via RFT and GridFTP
32 A Simple Example First, login to queenbee.loni-lsu.teragrid.orgqueenbee.loni-lsu.teragrid.org Command example: % globusrun-ws -submit -c /bin/date Submitting job...Done. Job ID: uuid:002a6ab d9-bae6-0002a5ad41e5 Termination time: 01/07/ :55 GMT Current job state: Active Current job state: CleanUp Current job state: Done Destroying job...Done. A successful submission will create a new ManagedJob resource with its own unique EPR for messaging Use – o option to create the EPR file % globusrun-ws -submit –o job.epr -c /bin/date
33 A Simple Example(2) To see the output, use –s (stream) option % globusrun-ws -submit –s -c /bin/date Termination time: 06/14/ :07 GMT Current job state: Active Current job state: CleanUp-Hold Wed Jun 13 14:07:54 EDT 2007 Current job state: CleanUp Current job state: Done Destroying job...Done. Cleaning up any delegated credentials...Done. If you want to send the output to a file, use –so option % globusrun-ws -submit –s –so job.out -c /bin/date … % cat job.out Wed Jun 13 14:07:54 EDT 2007
34 A Simple Example(3) Submitting your job to different schedulers – Fork % globusrun-ws -submit -Ft Fork -s -c /bin/date (Actually, the default is Fork. So, you can skip it in this case.) – SGE % globusrun-ws -submit -Ft PBS-s -c /bin/date Submitting to a remote site % globusrun-ws -submit -F tg- login.frost.ncar.teragrid.org -c /bin/date
35 Batch Job Submissions % globusrun-ws -submit -batch -o job_epr -c /bin/sleep 50 Submitting job...Done. Job ID: uuid:f c5-11d9-97e3-0002a5ad41e5 Termination time: 01/08/ :05 GMT % globusrun-ws -status -j job_epr Current job state: Active % globusrun-ws -status -j job_epr Current job state: Done % globusrun-ws -kill -j job_epr Requesting original job description...Done. Destroying job...Done.
36 Resource Specification Language (RSL) RSL is the language used by the clients to submit a job. All job submission parameters are described in RSL, including the executable file and arguments. You can specify the type and capabilities of resources to execute your job. You can also coordinate Stage-in and Stage-out operations through RSL.
37 Submitting a job through RSL Command: % globusrun-ws -submit -f touch.xml Contents of touch.xml file: /bin/touch touched_it
Security - Basics
How to get Help First, try searching the Knowledge Base or other Documentation If that doesn’t help, submit a ticket – Send an to – Use the TeraGrid User Portal ‘Consulting’ tab Can also call TeraGrid Help Desk 24/7:
Submitting a Ticket
More Info TeraGrid Resource User Guides – p p File Transfers and Data Management on TeraGrid – More Training – nlinetraining nlinetraining