Gilbert Thomas Grid Computing & Sun Grid Engine “Basic Concepts”

Slides:



Advertisements
Similar presentations
CSF4 Meta-Scheduler Tutorial 1st PRAGMA Institute Zhaohui Ding or
Advertisements

© 2007 Open Grid Forum Grids in the IT Data Center OGF 21 - Seattle Nick Werstiuk October 16, 2007.
What is Cloud Computing? Massive computing resources, deployed among virtual datacenters, dynamically allocated to specific users and tasks and accessed.
What is Cloud Computing? Massive computing resources, deployed among virtual datacenters, dynamically allocated to specific users and tasks and accessed.
Operating System.
1 Chapter 11: Data Centre Administration Objectives Data Centre Structure Data Centre Structure Data Centre Administration Data Centre Administration Data.
SLA-Oriented Resource Provisioning for Cloud Computing
Introduction CSCI 444/544 Operating Systems Fall 2008.
Southgreen HPC system Concepts Cluster : compute farm i.e. a collection of compute servers that can be shared and accessed through a single “portal”
Tunis, Tunisia, 28 April 2014 Business Values of Virtualization Mounir Ferjani, Senior Product Manager, Huawei Technologies 2.
GridFlow: Workflow Management for Grid Computing Kavita Shinde.
6/2/20071 Grid Computing Sun Grid Engine (SGE) Manoj Katwal.
Understanding Operating Systems 1 Overview Introduction Operating System Components Machine Hardware Types of Operating Systems Brief History of Operating.
Chapter 8 Operating System Support
High Performance Computing (HPC) at Center for Information Communication and Technology in UTM.
Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters Christine MORIN PARIS project-team, IRISA/INRIA.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
VAP What is a Virtual Application ? A virtual application is an application that has been optimized to run on virtual infrastructure. The application software.
Parallel Computing The Bad News –Hardware is not getting faster fast enough –Too many architectures –Existing architectures are too specific –Programs.
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
December 8 & 9, 2005, Austin, TX SURA Cyberinfrastructure Workshop Series: Grid Technology: The Rough Guide Configuring Resources for the Grid Jerry Perez.
Operating System. Architecture of Computer System Hardware Operating System (OS) Programming Language (e.g. PASCAL) Application Programs (e.g. WORD, EXCEL)
Operating Systems.
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
Resource management system for distributed environment B4. Nguyen Tuan Duc.
Sun Grid Engine. Grids Grids are collections of resources made available to customers. Compute grids make cycles available to customers from an access.
11 SYSTEM PERFORMANCE IN WINDOWS XP Chapter 12. Chapter 12: System Performance in Windows XP2 SYSTEM PERFORMANCE IN WINDOWS XP  Optimize Microsoft Windows.
DISTRIBUTED COMPUTING
Grid Computing - AAU 14/ Grid Computing Josva Kleist Danish Center for Grid Computing
©Brooks/Cole, 2003 Chapter 7 Operating Systems. ©Brooks/Cole, 2003 Define the purpose and functions of an operating system. Understand the components.
McGraw-Hill/Irwin © The McGraw-Hill Companies, All Rights Reserved BUSINESS PLUG-IN B17 Organizational Architecture Trends.
◦ What is an Operating System? What is an Operating System? ◦ Operating System Objectives Operating System Objectives ◦ Services Provided by the Operating.
OSes: 1. Intro 1 Operating Systems v Objectives –introduce Operating System (OS) principles Certificate Program in Software Development CSE-TC and CSIM,
Recall: Three I/O Methods Synchronous: Wait for I/O operation to complete. Asynchronous: Post I/O request and switch to other work. DMA (Direct Memory.
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. Business Plug-In B17 Organizational Architecture Trends.
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
Batch Scheduling at LeSC with Sun Grid Engine David McBride Systems Programmer London e-Science Centre Department of Computing, Imperial College.
Chapter 7 Operating Systems. Define the purpose and functions of an operating system. Understand the components of an operating system. Understand the.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
Copyright ©2003 Digitask Consultants Inc., All rights reserved Cluster Concepts Digitask Seminar November 29, 1999 Digitask Consultants, Inc.
The BioBox Initiative: Bio-ClusterGrid Maddie Wong Technical Marketing Engineer Sun APSTC – Asia Pacific Science & Technology Center.
Chapter 5 McGraw-Hill/Irwin Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved.
1October 9, 2001 Sun in Scientific & Engineering Computing Grid Computing with Sun Wolfgang Gentzsch Director Grid Computing Cracow Grid Workshop, November.
GVis: Grid-enabled Interactive Visualization State Key Laboratory. of CAD&CG Zhejiang University, Hangzhou
Operating System Principles And Multitasking
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Chapter 1 Basic Concepts of Operating Systems Introduction Software A program is a sequence of instructions that enables the computer to carry.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN IT Monitoring and Data Analytics Pedro Andrade (IT-GT) Openlab Workshop on Data Analytics.
CS4315A. Berrached:CMS:UHD1 Introduction to Operating Systems Chapter 1.
Tackling I/O Issues 1 David Race 16 March 2010.
Latest Improvements in the PROOF system Bleeding Edge Physics with Bleeding Edge Computing Fons Rademakers, Gerri Ganis, Jan Iwaszkiewicz CERN.
Capacity Planning in a Virtual Environment Chris Chesley, Sr. Systems Engineer
CLIENT SERVER COMPUTING. We have 2 types of n/w architectures – client server and peer to peer. In P2P, each system has equal capabilities and responsibilities.
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
Claudio Grandi INFN Bologna Virtual Pools for Interactive Analysis and Software Development through an Integrated Cloud Environment Claudio Grandi (INFN.
Chapter 16 Client/Server Computing Dave Bremer Otago Polytechnic, N.Z. ©2008, Prentice Hall Operating Systems: Internals and Design Principles, 6/E William.
Grid Computing: An Overview and Tutorial Kenny Daily BIT Presentation 22/09/2016.
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
Clouds , Grids and Clusters
Current Generation Hypervisor Type 1 Type 2.
CLUSTER COMPUTING Presented By, Navaneeth.C.Mouly 1AY05IS037
Where are being used the OS?
Grid Computing.
Introduction to Operating System (OS)
University of Technology
Introduction to Cloud Computing
Chapter 1: Introduction
QNX Technology Overview
Sun Grid Engine.
Presentation transcript:

Gilbert Thomas Grid Computing & Sun Grid Engine “Basic Concepts”

Agenda ● Introduction ● Grid Computing ● Sun Grid Engine (SGE)

Problem : Not using Scientists/Engineers efficiently Solution: A Grid makes it easy for the engineers to submit jobs. They run more tests— product design cycle improves. Benefits: Increase productivity which leads to shorter time to market, higher quality and lower costs The Productivity Challenge

Grid Computing A New Computing Utility Model Problem-solving through resource pooling in virtual systems: Virtualization of… Transparent scalability of… Access that is... Resources into a dynamic, single compute resource CPU cycles, storage Dependable, consistent, pervasive, inexpensive

Stages of Sun Grid Computing Cluster Grid Departmental Computing Simplest Grid deployment Maximum utilization of departmental resources Resources allocated based on priorities Campus Grid Enterprise Computing Resources shared within the enterprise Policies ensure computing on demand Gives multiple groups seamless access to enterprise resources Global Grid Internet Computing Resources shared over the Internet Global view of distributed datasets Growth path for enterprise Campus Grids

Grid Computing Model Cluster Grids Usage Simplest Grid deployment Single team: Project Department Single site firewall Benefit Optimal alignment of resources, tasks, and budgets Industry Examples Automotive—More simulations for safer cars Entertainment—Faster image-frame rendering Life Sciences—Pattern matching against huge datasets EDA—Increased design iterations create more powerful devices

Grid Computing Model Campus Grids Usage Multiple teams in organization share one or more Cluster Grids Single site to enterprise-wide Benefit Maximum ROI and utility Industry Examples Manufacturing—Collaborative engineering projects Oil and Gas—Mining-distributed databases Finance—More Monte Carlo simulations for uncovering new business

Grid Computing Model Global Grids N1 Usage Linked Cluster and Campus Grid Models across many organizations Typically used for research Benefit Creates large virtual system Facilitates collaboration between organizations Industry Examples Medicine—Provides expert teams access to medical instruments and distributed computing resources Academia—Facilitates collaboration between geographically dispersed groups Research—Enables compute- intensive projects beyond the firewall

Grid Computing Adoption Trends Campus Grids Multiple teams Single organization Global Grids Multiple teams Multiple organizations Cluster Grids Single team Single organization

Key Software Technologies for the Grid Cluster Grid: Sun Grid Engine Campus Grid: Sun Grid Engine, Enterprise Edition Global Grid: Globus, Avaki = Sun Grid Computing software

How it Works Grid Hardware and Software Components Resource management services above OS layer to integrate systems Hardware/OS systems are unchanged Minimal management software/tool costs Connecting people, departments,organization s, communities

Cluster Grid Solution Sun™ Grid Engine Maximize resources for single projects, teams, departments Prioritize jobs Manage jobs from start to finish Free download for Solaris and Linux Operating Environments

Sun Grid Engine Free Downloads First Year Fast becoming the most-used Distributed Resource Manager (DRM) tool 3016 unique sites 118,000 CPUs worldwide run Sun Grid Engine 1 new CPU every 5 minutes Over 90 countries 60% never used Grid software before 92% rated Sun Grid Engine as Good, Very Good, or Excellent

Existing Problem In Clusters Bottleneck Idle Overloaded

Load Balancing – Ensure no single compute resource is overloaded – SGE automatically finds the resource with the least load for every new job – If no free resource is found, the job is queued till a free resource is available – Implication: Jobs run and finish faster! Solution : Sun Grid Engine

Job types - a mixture of: – Batch – Interactive (qsh, qrsh, qlogin) – Parallel (mpi, pvm...) – Checkpointing – Array Jobs (unlimited size, massive scalability) Dynamically changeable while pending (prior to execution) Job Types

Monitoring ● Qmon ● Mail notification ● Qstat

Qmon: SGE’s GUI

Configuring Queues

Checking Queue Status

Submitting Jobs

Checking Job Status

Qstat Display all info about queues > qstat -f State column: - r= running- s= suspended - q = queued- w= waiting

Qmod Control the status of the queues in your cluster - qmod –dDisable a queue - qmod –eEnable a queue - qmod –sSuspend a queue - qmod –us Resume a suspended queue - qmod –cClear the error states of a queue - qstat –alarm Show the alarm state of a queue

Complexes Set host-specific attributes: - Number of slots - Maximum amount of memory that can be used - Maximum number of diskblocks that can be used - Maximum load for that host Set requestable values to a queue: - Software licences - Available memory - Available disk space - Specific data-sets

Parallel Environments ● Parallel Virtual Machine (PVM) ● Message Parsing Interface (MPI) ● A parallel environment allows execution of shared memory and distributed memory applications.

Parallel Environments Advantages of tight integration with SGE: - Correct accounting - Full job control, i.e.: suspending tasks - Resource limits - Cleaning up/killing all tasks

References Sun Grid Engine Home Sun Grid Engine Open Source Sun Grid Engine Web-Based Training

Gilbert Thomas Associate Engineer Thank you! Thank You For further enquiries,