Congreso Cuidad, Spain May 15, 2007 GridWay 1/28, Installation and Basic Configuration GridWay Distributed Systems Architecture Group Universidad Complutense.

Slides:



Advertisements
Similar presentations
Building Portals to access Grid Middleware National Technical University of Athens Konstantinos Dolkas, On behalf of Andreas Menychtas.
Advertisements

TeraGrid Deployment Test of Grid Software JP Navarro TeraGrid Software Integration University of Chicago OGF 21 October 19, 2007.
Experiences with GridWay on CRO NGI infrastructure / EGEE User Forum 2009 Experiences with GridWay on CRO NGI infrastructure Emir Imamagic, Srce EGEE User.
Distributed Systems Architecture Research Group Universidad Complutense de Madrid EGEE UF4/OGF25 Catania, Italy March 2 nd, 2009 State and Future Plans.
Three types of remote process invocation
A3.1 Assignment 3 Simple Job Submission Using GT 4 GRAM.
Grid Resource Allocation Management (GRAM) GRAM provides the user to access the grid in order to run, terminate and monitor jobs remotely. The job request.
1 OBJECTIVES To generate a web-based system enables to assemble model configurations. to submit these configurations on different.
Setting up of condor scheduler on computing cluster Raman Sehgal NPD-BARC.
Globus Toolkit 4 hands-on Gergely Sipos, Gábor Kecskeméti MTA SZTAKI
2 nd GADA Workshop / OTM 2005 Conferences Eduardo Huedo Rubén S. Montero Ignacio M. Llorente Advanced Computing Laboratory Center for.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 8: Implementing and Managing Printers.
Systems Architecture, Fourth Edition1 Internet and Distributed Application Services Chapter 13.
Asynchronous Solution Appendix Eleven. Training Manual Asynchronous Solution August 26, 2005 Inventory # A11-2 Chapter Overview In this chapter,
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 8 Introduction to Printers in a Windows Server 2008 Network.
Condor Overview Bill Hoagland. Condor Workload management system for compute-intensive jobs Harnesses collection of dedicated or non-dedicated hardware.
Globus Computing Infrustructure Software Globus Toolkit 11-2.
1 Chapter Overview Introduction to Windows XP Professional Printing Setting Up Network Printers Connecting to Network Printers Configuring Network Printers.
Massive Ray Tracing in Fusion Plasmas on EGEE J.L. Vázquez-Poletti, E. Huedo, R.S. Montero and I.M. Llorente Distributed Systems Architecture Group Universidad.
VMware vCenter Server Module 4.
Sharepoint Portal Server Basics. Introduction Sharepoint server belongs to Microsoft family of servers Integrated suite of server capabilities Hosted.
16.1 © 2004 Pearson Education, Inc. Exam Managing and Maintaining a Microsoft® Windows® Server 2003 Environment Lesson 16: Examining Software Update.
Talend 5.4 Architecture Adam Pemble Talend Professional Services.
Microsoft ® Official Course Module 9 Configuring Applications.
Chapter 5 Roles and features. objectives Performing management tasks using the Server Manager console Understanding the Windows Server 2008 roles Understanding.
Apache Tomcat Web Server SNU OOPSLA Lab. October 2005.

Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
Dynamic Firewalls and Service Deployment Models for Grid Environments Gian Luca Volpato, Christian Grimm RRZN – Leibniz Universität Hannover Cracow Grid.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Grid Resource Allocation and Management (GRAM) Execution management Execution management –Deployment, scheduling and monitoring Community Scheduler Framework.
Grid Computing I CONDOR.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting October 10-11, 2002.
GRAM5 - A sustainable, scalable, reliable GRAM service Stuart Martin - UC/ANL.
3-2.1 Topics Grid Computing Meta-schedulers –Condor-G –Gridway Distributed Resource Management Application (DRMAA) © 2010 B. Wilkinson/Clayton Ferner.
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
Introduction to the Adapter Server Rob Mace June, 2008.
Grid Workload Management Massimo Sgaravatto INFN Padova.
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks S. Natarajan (CSU) C. Martín (UCM) J.L.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks, An Overview of the GridWay Metascheduler.
TeraGrid CTSS Plans and Status Dane Skow for Lee Liming and JP Navarro OSG Consortium Meeting 22 August, 2006.
First attempt for validating/testing Testbed 1 Globus and middleware services WP6 Meeting, December 2001 Flavia Donno, Marco Serra for IT and WPs.
The Roadmap to New Releases Derek Wright Computer Sciences Department University of Wisconsin-Madison
1 Grid Portal for VN-Grid Cu Nguyen Phuong Ha. 2 Outline Some words about portals in principle Overview of OGCE GridPortlets.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks, Novelties and Features around the GridWay.
July 11-15, 2005Lecture3: Grid Job Management1 Grid Compute Resources and Job Management.
SAN DIEGO SUPERCOMPUTER CENTER Inca TeraGrid Status Kate Ericson November 2, 2006.
National Center for Supercomputing ApplicationsNational Computational Science Grid Packaging Technology Technical Talk University of Wisconsin Condor/GPT.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
Creating and running an application.
Portal Update Plan Ashok Adiga (512)
Linux Operations and Administration
EGI Technical Forum Amsterdam, 16 September 2010 Sylvain Reynaud.
Status of Globus activities Massimo Sgaravatto INFN Padova for the INFN Globus group
Microsoft ® Official Course Module 6 Managing Software Distribution and Deployment by Using Packages and Programs.
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
Geant4 GRID production Sangwan Kim, Vu Trong Hieu, AD At KISTI.
A System for Monitoring and Management of Computational Grids Warren Smith Computer Sciences Corporation NASA Ames Research Center.
Congreso Cuidad, Spain May 15, 2007 GridWay 1/27, Programming with the DRMAA OGF Standard GridWay Distributed Systems Architecture Group Universidad Complutense.
CE design report Luigi Zangrando
Consulting Services JobScheduler Architecture Decision Template
Simulation Production System
Management of Virtual Machines in Grids Infrastructures
Open Source distributed document DB for an enterprise
Consulting Services JobScheduler Architecture Decision Template
The ETICS Build and Test Service
Installation Smooth sailing....
Presentation transcript:

Congreso Cuidad, Spain May 15, 2007 GridWay 1/28, Installation and Basic Configuration GridWay Distributed Systems Architecture Group Universidad Complutense de Madrid

2/28 GridWay DSA Group GridWay DSA Group Contents 1.Philosophy 2.Installation Directories and their Meanings 3.Required Software 4.Platform Notes 5.Configuration 6.Logging 7.Scheduling Policies 8.MAD Configuration

3/28 GridWay DSA Group GridWay DSA Group Philosophy Globus Services GRAM, GridFTP, MDS Local Clusters / Resources Site Users System Administrator GridWay Globus Services GRAM, GridFTP, MDS Resources Other Sites Other Partner Grids User Policies (Urgent jobs, Shares…) Resource Policies (Usage, Fixed Priority…) Accounting User Policies (Urgent jobs, Shares…) Resource Policies (Usage, Fixed Priority…) Accounting Roles

4/28 GridWay DSA Group GridWay DSA Group Philosophy  Standard Deployment  User access the GridWay host  Standard Deployment + gsissh component  User submit, control and monitor jobs from their computers  Standard Deployment + gridgateway component  User access using any GRAM compatible client  Standard Deployment + NFS  User submit, control and monitor jobs from their computers  Standard deployment + portal  User access through a Web Page SSH Alternative Interfaces GSISSH GridGateWay Shared Homes Portal

5/28 GridWay DSA Group GridWay DSA Group Philosophy GridWay is also a tool for interoperability Documents available on how to configure GridWay to interface:  EGEE  TeraGrid  Open Science Grid that can be found in Interoperability achieved through the use of adapters. Interoperability

6/28 GridWay DSA Group GridWay DSA Group Contents 1.Philosophy 2.Installation Directories and their Meanings 3.Required Software 4.Platform Notes 5.Configuration 6.Logging 7.Scheduling Policies 8.MAD Configuration

7/28 GridWay DSA Group GridWay DSA Group Installation Directories and their Meanings $GW_LOCATION/ bin/ doc/ etc/ share/ include/ lib/ libexec/ test/ var/ executables documentation (optional) gwd and default job template license & usage examples (optional) header files compiled libraries wrapper and monitor scripts test suite (optional) log, port and log files

8/28 GridWay DSA Group GridWay DSA Group Installation Directories and their Meanings  Uncompress gw_src.tar.gz  Set $GW_LOCATION  PATH = $PATH:$GW_LOCATION/bin ./configure --prefix=$GW_LOCATION  There are more options -- check them out in the manual  make  make install √ Since June 2007, GridWay is included in Globus Toolkit , and can be installed as part of Globus. Installation Procedure

9/28 GridWay DSA Group GridWay DSA Group 1.Philosophy 2.Installation Directories and their Meanings 3.Required Software 4.Platform Notes 5.Configuration 6.Logging 7.Scheduling Policies 8.MAD Configuration Contents

10/28 GridWay DSA Group GridWay DSA Group Required Software  C Compiler  Tested versions: gcc 3.4.2, 3.4.4, and  Globus C Libraries  globus_gram_client, globus_ftp_client and globus_gass_copy  $GLOBUS_LOCATION must be set  Globus JAVA Development Libraries  J2SE  Versions 1.4.2_10+ (Builds higher than 10) or  GNU Make  Sudo command  Only required for multiple-user mode  Berkeley Database Library –Version (only required to compile the accounting module)‏

11/28 GridWay DSA Group GridWay DSA Group Contents 1.Philosophy 2.Installation Directories and their Meanings 3.Required Software 4.Platform Notes 5.Configuration 6.Logging 7.Scheduling Policies 8.MAD Configuration

12/28 GridWay DSA Group GridWay DSA Group Platform Notes  Fedora Core  FC4 and Sun's Java 1.4.2: Upgrade to Java 1.5+  32 bits JSDK binaries on AMD64 architectures  Debian Testing  No known issues  Mac OS X  No known issues  Tested on Mac OS X 1.4 (Tiger)  Solaris 10  No known issues  Other Linux/UNIX flavours  Should run smoothly  Have you tested GridWay on other flavours? Let us know!

13/28 GridWay DSA Group GridWay DSA Group Contents 1.Philosophy 2.Installation Directories and their Meanings 3.Required Software 4.Platform Notes 5.Configuration 6.Logging 7.Scheduling Policies 8.MAD Configuration

14/28 GridWay DSA Group GridWay DSA Group Configuration  $GW_LOCATION/etc/gwd.conf  Configuration options for GridWay daemon (GWD)‏  $GW_LOCATION/etc/sched.conf  Configuration options for GridWay built-in scheduling policies  $GW_LOCATION/etc/job_template.default  Default values for job templates  $GW_LOCATION/etc/gwrc  Default environment variables for MADs = [value] Overview

15/28 GridWay DSA Group GridWay DSA Group Connection Options  GWD_PORT: TCP/IP port where GWD will listen for client requests.  TCP/IP port being used by GWD can be found at $GW_LOCATION/etc/gwd.port  MAX_NUMBER_OF_CLIENTS: Max number of simultaneous client connections.  NUMBER_OF_JOBS: Max number of jobs handled by GridWay.  NUMBER_OF_ARRAYS: Max number of array-jobs handled by GridWay.  NUMBER_OF_USERS: Max number of different users.  SCHEDULING_INTERVAL: Seconds between two scheduling actions.  DISCOVERY_INTERVAL: Seconds between searches for new hosts on the Grid (Information Manager).  MONITORING_INTERVAL: Seconds between host information updates (Information Manager).  POLL_INTERVAL: Seconds between underlying Grid middleware queries for job state. GWD Configuration Pool Options Intervals Configuration

16/28 GridWay DSA Group GridWay DSA Group Middleware Access Driver (MAD) Options  IM_MAD: Information Manager MADs.  TM_MAD: Information Manager MADs.  EM_MAD: Execution Manager MADs.  MAX_ACTIVE_IM_QUERIES: Max number (soft limit) of active IM queries  Each query spawns one process  DM_SCHED: Scheduling module. GWD Configuration Scheduler Options Configuration

17/28 GridWay DSA Group GridWay DSA Group Contents 1.Installation Directories and their Meanings 2.Required Software 3.Platform Notes 4.Configuration 5.Logging 6.Scheduling Policies 7.MAD Configuration

18/28 GridWay DSA Group GridWay DSA Group Logging gwd.log: System level log –MADs –Jobs (coarse-grain) sched.log: Scheduler log –Fit scheduler policies to your organization needs $JOBID/job.log: Detailed job log information –Details of job resource usage and performance acct: Accounting information –gwacct accesses the databases (needs Berkeley DB Library version ).lock: Prevents from running more than one instance of GWD gwd.port: TCP/IP port listening for client connections globus-gw.log: Used to encapsulate GridWay in a GRAM service $GW_LOCATION/var/

19/28 GridWay DSA Group GridWay DSA Group 1.Philosophy 2.Installation Directories and their Meanings 3.Required Software 4.Platform Notes 5.Configuration 6.Logging 7.Scheduling Policies 8.MAD Configuration Contents

20/28 GridWay DSA Group GridWay DSA Group Assigns fixed priority to each job (00 – 19). User: All jobs of a user are given a fixed priority. Group: All jobs of a user in the specified group are given a fixed priority. User priority prevails over group priority. Users can set priority of their own jobs without exceeding the limit gwsubmit -p Urgent Job Policy Grid administrator can set a fixed priority of 20 to a job. –This job becomes urgent and bypasses all scheduling policies. Jobs Fixed Priority Policy (FP) Scheduling Policies

21/28 GridWay DSA Group GridWay DSA Group Scheduling Policies Allows to establish a dispatching ratio among users of a scheduling domain. Job submission considered, not resource usage. SH_WINDOW_DEPTH: Time intervals considered for evaluation. SH_WINDOW_SIZE: Duration of each interval (day). Allows to prevent low-priority jobs to starve. Jobs with long pending state will be eventually submitted to the Grid. Job submission deadlines may be specified. Job priority will be increased when deadline approaches. DL_HALF: When half of the max priority should be assigned (days) Jobs Fair-Share Policy (SH) Waiting-time Policy (WT) Deadline Policy (DL)

22/28 GridWay DSA Group GridWay DSA Group Scheduling Policies Assigns fixed priority to each resource (01 – 99). Example: Resources of intranet are used more than those from grid. Priority 00 bans a resource. Prioritizes resources suitable for a job from its point of view. Configured via the RANK attribute in job template. Reflects behavior of Grid resources based on job execution statistics. History contribution: During a period of time. Last job contribution: Considering last job on that resource. UG_HISTORY_WINDOW: Days used for statistics in history contribution. UG_HISTORY_RATIO: Weight of history contribution statistics Resources Fixed Resource Priority Policy (FP) Rank Policy (RA) Usage Policy (UG)

23/28 GridWay DSA Group GridWay DSA Group Scheduling Policies Exponential linear back-off strategy in case of resource failure. Resources with persistent failures are discarded (for a given user). FR_MAX_BANNED_TIME: Max time a resource can be banned. FR_BANNED_C: Constant that sets how fast the max banned time is reached. RESCHEDULING POLICIES A better resource is discovered. A job has been waiting in the remote queue system more than a given threshold. Requirements changed by application. Performance degradation is detected. Resources Failure Rate Policy (FR)

24/28 GridWay DSA Group GridWay DSA Group Contents 1.Installation Directories and their Meanings 2.Required Software 3.Platform Notes 4.Configuration 5.Logging 6.Basic Troubleshooting 7.Scheduling Policies 8.MAD Configuration

25/28 GridWay DSA Group GridWay DSA Group MAD Configuration Job execution and management. Provided MADs: –Pre-WS GRAM (>GT 2.4). –WS GRAM (GT 4.0). $GW_LOCATION/var/gwd.conf mad_name: tag for identifying MAD. Useful for logging. path_to_mad: name of MAD executable. Must be placed in $GW_LOCATION/bin/. args: additional arguments to be passed to MAD. rsl|rsl_nsh|rsl2: Language used for describing job requests. –rsl: Pre-WS GRAM. –rsl_nsh: Pre-WS GRAM non-shared home directories (i.e. LCG). –rsl2: WS GRAM. Execution Driver EM_MAD = : : :

26/28 GridWay DSA Group GridWay DSA Group File staging, remote working directory set-up and remote host clean up. Provided MADs: –GridFTP server (>version 1.1.2). –Dummy Transfer driver (clusters without shared home). $GW_LOCATION/var/gwd.conf mad_name: tag for identifying MAD. Useful for logging. path_to_mad: name of MAD executable. Must be placed in $GW_LOCATION/bin/. arg: additional argument to be passed to MAD. File Transfer Driver TM_MAD = : :[args] MAD Configuration

27/28 GridWay DSA Group GridWay DSA Group Host discovery and monitoring. Provided MADs: –Static host information data. –MDS2 with MDS schema (GT 2.4)‏ –MDS2 with GLUE schema (GT 2.4 and LCG middleware)‏ –MDS4 (GT 4.0)‏ $GW_LOCATION/var/gwd.conf mad_name: tag for identifying MAD. Useful for logging. path_to_mad: name of MAD executable. Must be placed in $GW_LOCATION/bin/. args: additional argument to be passed to MAD. nice: integer to be added to RANK (i.e. for prioritizing resources). tm_mad_name: File Transfer driver to be used by managed hosts. em_mad_name: Execution driver to be used by managed hosts. Information Driver IM_MAD = : :[args]:[nice]: : MAD Configuration

28/28 GridWay DSA Group GridWay DSA Group Thank you for your attention! Installation and Basic Configuration