Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations.

Slides:



Advertisements
Similar presentations
Globus Workshop at CoreGrid Summer School 2006 Dipl.-Inf. Hamza Mehammed Leibniz Computing Centre.
Advertisements

Cross-site data transfer on TeraGrid using GridFTP TeraGrid06 Institute User Introduction to TeraGrid June 12 th by Krishna Muriki
Grid Resource Allocation Management (GRAM) GRAM provides the user to access the grid in order to run, terminate and monitor jobs remotely. The job request.
CERN LCG Overview & Scaling challenges David Smith For LCG Deployment Group CERN HEPiX 2003, Vancouver.
GridFTP: File Transfer Protocol in Grid Computing Networks
USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
Globus Toolkit 4 hands-on Gergely Sipos, Gábor Kecskeméti MTA SZTAKI
2-1.1 Job Submission © 2010 B. Wilkinson/Clayton Ferner. Spring 2010 Grid computing course. Modification date: Jan 18, 2010.
Slides for Grid Computing: Techniques and Applications by Barry Wilkinson, Chapman & Hall/CRC press, © Chapter 1, pp For educational use only.
6a.1 Globus Toolkit Execution Management. Data Management Security Common Runtime Execution Management Information Services Web Services Components Non-WS.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
Sun Grid Engine Grid Computing Assignment – Fall 2005 James Ruff Senior Department of Mathematics and Computer Science Western Carolina University.
Assignment 3 Using GRAM to Submit a Job to the Grid James Ruff Senior Western Carolina University Department of Mathematics and Computer Science.
Grid Services at NERSC Shreyas Cholia Open Software and Programming Group, NERSC NERSC User Group Meeting September 17, 2007.
1 Short Course on Grid Computing Jornadas Chilenas de Computación 2010 INFONOR-CHILE 2010 November 15th - 19th, 2010 Antofagasta, Chile Dr. Barry Wilkinson.
1c.1 Assignment 2 Preliminaries Review (Full details in assignment write-up.)‏ © 2011 B. Wilkinson/Clayton Ferner. Fall 2011 Grid computing course. Modification.
Asynchronous Solution Appendix Eleven. Training Manual Asynchronous Solution August 26, 2005 Inventory # A11-2 Chapter Overview In this chapter,
Grids and Globus at BNL Presented by John Scott Leita.
1b.1 Globus Toolkit 4.0 Grid Resource Allocation Manager (GRAM) Job submission ITCS 4146/5146 Grid Computing, 2007, UNC-Charlotte, B. Wilkinson. Jan 24,
Simo Niskala Teemu Pasanen
Evaluation of the Globus GRAM Service Massimo Sgaravatto INFN Padova.
Globus 4 Guy Warner NeSC Training.
Resource Management Reading: “A Resource Management Architecture for Metacomputing Systems”
GridFTP Guy Warner, NeSC Training.
Part Three: Data Management 3: Data Management A: Data Management — The Problem B: Moving Data on the Grid FTP, SCP GridFTP, UberFTP globus-URL-copy.
High Performance Louisiana State University - LONI HPC Enablement Workshop – LaTech University,
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
© 2008 Pittsburgh Supercomputing Center So you have a TeraGrid Allocation What now?
NeSC Apps Workshop July 20 th, 2002 Customizable command line tools for Grids Ian Kelley + Gabrielle Allen Max Planck Institute for Gravitational Physics.
Grids and Portals for VLAB Marlon Pierce Community Grids Lab Indiana University.
Job Submission Condor, Globus, Java CoG Kit Young Suk Moon.
Grid Resource Allocation and Management (GRAM) Execution management Execution management –Deployment, scheduling and monitoring Community Scheduler Framework.
COMP3019 Coursework: Introduction to GridSAM Steve Crouch School of Electronics and Computer Science.
GRAM5 - A sustainable, scalable, reliable GRAM service Stuart Martin - UC/ANL.
1 Overview of the Application Hosting Environment Stefan Zasada University College London.
Rochester Institute of Technology Job Submission Andrew Pangborn & Myles Maxfield 10/19/2015Service Oriented Cyberinfrastructure Lab,
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
2-1.1 Job Submission Slides for Grid Computing: Techniques and Applications by Barry Wilkinson, Chapman & Hall/CRC press, © Chapter 2, pp
Globus GridFTP and RFT: An Overview and New Features Raj Kettimuthu Argonne National Laboratory and The University of Chicago.
© 2008 Pittsburgh Supercomputing Center So you have a TeraGrid Allocation What now?
August 13, 2003Eric Hjort Getting Started with Grid Computing in STAR Eric Hjort, LBNL STAR Collaboration Meeting August 13, 2003.
1 TeraGrid Data Transfer Jeffrey P. Gardner Pittsburgh Supercomputing Center
Grid NERSC demo Shreyas Cholia Open Software and Programming NERSC User Group Meeting September 19, 2007.
1 Short Course on Grid Computing Jornadas Chilenas de Computación 2010 INFONOR-CHILE 2010 November 15th - 19th, 2010 Antofagasta, Chile Dr. Barry Wilkinson.
Grid Security: Authentication Most Grids rely on a Public Key Infrastructure system for issuing credentials. Users are issued long term public and private.
July 11-15, 2005Lecture3: Grid Job Management1 Grid Compute Resources and Job Management.
Part Five: Globus Job Management A: GRAM B: Globus Job Commands C: Laboratory: globusrun.
SAN DIEGO SUPERCOMPUTER CENTER Inca TeraGrid Status Kate Ericson November 2, 2006.
APST Internals Sathish Vadhiyar. apstd daemon should be started on the local resource Opens a port to listen for apst client requests Runs on the host.
GridFTP Richard Hopkins
Creating and running an application.
Leveraging the InCommon Federation to access the NSF TeraGrid Jim Basney Senior Research Scientist National Center for Supercomputing Applications University.
SAN DIEGO SUPERCOMPUTER CENTER Inca Control Infrastructure Shava Smallen Inca Workshop September 4, 2008.
Job Submission with Globus, Condor, and Condor-G Selim Kalayci Florida International University 07/21/2009 Note: Slides are compiled from various TeraGrid.
© Geodise Project, University of Southampton, Geodise Middleware Graeme Pound, Gang Xue & Matthew Fairman Summer 2003.
Introduction to Grid Computing and its components.
AERG 2007Grid Data Management1 Grid Data Management GridFTP Carolina León Carri Ben Clifford (OSG)
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Remote & Collaborative Visualization. TACC Remote Visualization Systems Longhorn – Dell XD Visualization Cluster –256 nodes, each with 48 GB (or 144 GB)
Open Science Grid Build a Grid Session Siddhartha E.S University of Florida.
Status of Globus activities Massimo Sgaravatto INFN Padova for the INFN Globus group
GridFTP Guy Warner, NeSC Training Team.
1 GridFTP and SRB Guy Warner Training, Outreach and Education Team, Edinburgh e-Science.
A System for Monitoring and Management of Computational Grids Warren Smith Computer Sciences Corporation NASA Ames Research Center.
Advanced Computing Facility Introduction
Creating and running applications on the NGS
TeraGrid Data Transfer
Globus Job Management. Globus Job Management Globus Job Management A: GRAM B: Globus Job Commands C: Laboratory: globusrun.
File Transfer Olivia Irving and Cameron Foss
Part Three: Data Management
Presentation transcript:

Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations

What is the TeraGrid?

TGUP (TeraGrid User Portal)

Accessing TeraGrid User Portal

Portal Overview

Portal: MyTeraGrid

Portal: Resources

Portal: Documentation

Accessing Resources

Web-based SSO via Portal

SSO (Non-Portal) from a TeraGrid Resource

Example - SSH to tg-login.ncsa.teragrid.org or another resource that you have SSH access. - grid-proxy-info - myproxy-logon -l username -grid-proxy-info -gsissh tg-login.purdue.teragrid.org

TeraGrid Resources – Sorted by site – Sorted by machine type – My TeraGrid -> Accounts – Resources s/ s/ – URLs here contain detailed user documentation

Moving data to/from TeraGrid systems Sftp clients from your office to TeraGrid – Command line sftp on Linux – GUI sftp clients GSI-SSHTerm sftp button – Resources -> File Manager [beta] High speed gridftp between TeraGrid systems – globus-url-copy – Uberftp

File Transfers: Small ( <100 MB) Files

Large ( >100 MB) File Transfers: globus-url-copy The globus-url-copy client program is a GridFTP client for transferring files from the command line. Usage: – globus-url-copy where or is of the format: if local file, file: if remote file, gsiftp:// /

Example – Two Party Transfer Logon to NCSA Abe Cluster. Create a large file on NCSA Abe Cluster: – dd bs=100MB count=1 if=/dev/zero of=testfile Copy this file to Purdue Steele Cluster: – globus-url-copy –vb file:///u/ac/username/testfile gsiftp://tg- steele.purdue.teragrid.org:2811/autohome/u108/userna me/

Example – Third Party Transfer Logon to NCSA Abe Cluster Copy the testfile at Purdue Steele Cluster to NCAR Frost Cluster – globus-url-copy -vb gsiftp://tg- steele.purdue.teragrid.org:2811/autohome/u108/username/test file gsiftp://gridftp.frost.ncar.teragrid.org:2811//home/username/ GridFTP server addresses for each site are listed at: – ion.php#deployment ion.php#deployment

Optimized Data Transfer with globus- url-copy Using large TCP windows – globus-url-copy –vb -tcp-bs file:///u/ac/username/testfile gsiftp://tg- steele.purdue.teragrid.org:2811/autohome/u108/username/ Using large memory buffers – globus-url-copy –vb -bs file:///u/ac/username/testfile gsiftp://tg- steele.purdue.teragrid.org:2811/autohome/u108/username/ Using multiple parallel streams – globus-url-copy –vb –p 4 file:///u/ac/username/testfile gsiftp://tg- steele.purdue.teragrid.org:2811/autohome/u108/username/

Large ( >100 MB) File Transfers: UberFTP UberFTP is an interactive GridFTP file transfer client. Opens a session with a remote host, within which files may be transfered and directories and files may be manipulated. Requires GSI authentication. Hands-On: – Login to NCSA Abe – uberftp – open tg-steele.rcac.purdue.edu – parallel 2 – tcpbuf – ls/lls/put/get/…

Imaginations unbound Data movement tips To move a collection of small files, make an archive and move it instead of moving the files individually – tar – zip For high bandwidth links and moderate file or archive size, do not compress, it's usually faster to just move the data [compression is a time waster] For low bandwidth links, compression is usually a time saver – tar z or j options for compression – zip – ssh -C, sftp -C

Permanent Storage at NCSA The larger TeraGrid sites provide persistent high-capacity storage Details vary by site, consult local site documentation for specifics Refer to: – for detailed information about different quotas, policies, and tools (such as SRB, HPSS) at each site.

Managing Your Environment: Softenv

Softenv

Managing Your Environment: Modules * Try at tg-steele.rcac.purdue.edu

Softenv and Modules: Which do I use? Rule of thumb: go with the default on a given machine – When you login for the first time issue ‘softenv’ and ‘module list’ commands – In general, only one should be active by default: go with that one – If you have questions or run into any problems contact

29 Grid Job Management using Globus Common WS interface to schedulers – Unix, Condor, LSF, PBS, SGE, … More generally: interface for process execution management – Lay down execution environment – Stage data – Monitor & manage lifecycle – Kill it, clean up

30 Grid Job Management Goals Provide a service to securely: Create an environment for a job Stage files to/from environment Cause execution of job process(es) – Via various local resource managers Monitor execution Signal important state changes to client Enable client access to output files – Streaming access during execution

31 GRAM GRAM: Globus Resource Allocation and Management GRAM is a Globus Toolkit component – For Grid job management GRAM is a unifying remote interface to Resource Managers – Yet preserves local site security/control Remote credential management File staging via RFT and GridFTP

32 A Simple Example First, login to queenbee.loni-lsu.teragrid.orgqueenbee.loni-lsu.teragrid.org Command example: % globusrun-ws -submit -c /bin/date Submitting job...Done. Job ID: uuid:002a6ab d9-bae6-0002a5ad41e5 Termination time: 01/07/ :55 GMT Current job state: Active Current job state: CleanUp Current job state: Done Destroying job...Done. A successful submission will create a new ManagedJob resource with its own unique EPR for messaging Use – o option to create the EPR file % globusrun-ws -submit –o job.epr -c /bin/date

33 A Simple Example(2) To see the output, use –s (stream) option % globusrun-ws -submit –s -c /bin/date Termination time: 06/14/ :07 GMT Current job state: Active Current job state: CleanUp-Hold Wed Jun 13 14:07:54 EDT 2007 Current job state: CleanUp Current job state: Done Destroying job...Done. Cleaning up any delegated credentials...Done. If you want to send the output to a file, use –so option % globusrun-ws -submit –s –so job.out -c /bin/date … % cat job.out Wed Jun 13 14:07:54 EDT 2007

34 A Simple Example(3) Submitting your job to different schedulers – Fork % globusrun-ws -submit -Ft Fork -s -c /bin/date (Actually, the default is Fork. So, you can skip it in this case.) – SGE % globusrun-ws -submit -Ft PBS-s -c /bin/date Submitting to a remote site % globusrun-ws -submit -F tg- login.frost.ncar.teragrid.org -c /bin/date

35 Batch Job Submissions % globusrun-ws -submit -batch -o job_epr -c /bin/sleep 50 Submitting job...Done. Job ID: uuid:f c5-11d9-97e3-0002a5ad41e5 Termination time: 01/08/ :05 GMT % globusrun-ws -status -j job_epr Current job state: Active % globusrun-ws -status -j job_epr Current job state: Done % globusrun-ws -kill -j job_epr Requesting original job description...Done. Destroying job...Done.

36 Resource Specification Language (RSL) RSL is the language used by the clients to submit a job. All job submission parameters are described in RSL, including the executable file and arguments. You can specify the type and capabilities of resources to execute your job. You can also coordinate Stage-in and Stage-out operations through RSL.

37 Submitting a job through RSL Command: % globusrun-ws -submit -f touch.xml Contents of touch.xml file: /bin/touch touched_it

Security - Basics

How to get Help First, try searching the Knowledge Base or other Documentation If that doesn’t help, submit a ticket – Send an to – Use the TeraGrid User Portal ‘Consulting’ tab Can also call TeraGrid Help Desk 24/7:

Submitting a Ticket

More Info TeraGrid Resource User Guides – p p File Transfers and Data Management on TeraGrid – More Training – nlinetraining nlinetraining