

MiG Projects DM75

Runtime Environment DB
Problem:
– Many Grid jobs assume that the binaries and libraries are already available
– Even if this is the case, where are those files placed?
Solution:
– Build a database for maintaining runtime environments
– Define rules for environment settings
– Allow automatic testing for correctness of those settings

Runtime Environment DB
Example:
– POV-Ray
– Define POV_RAY_EXE_PATH /usr/local/bin/pov34/bin
– Define POV_RAY_LIB_PATH /usr/local/bin/pov34/include
– Test correctness
  which povray34 = /usr/local/bin/povray34/bin/povray34?
  find $POV_RAY_LIB_PATH colors.inc = lib/colors.inc
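
A minimal sketch of how one database entry and its correctness tests might look in Python. The class name `RuntimeEnvironment`, its fields and the function `verify` are illustrative assumptions, not part of MiG; the checks mirror the `which` and `find` tests on the slide.

```python
import os
import shutil
from dataclasses import dataclass, field


@dataclass
class RuntimeEnvironment:
    """One database entry: variable settings plus the checks that validate them."""
    name: str
    variables: dict = field(default_factory=dict)    # variable name -> directory
    executables: dict = field(default_factory=dict)  # program -> variable holding its directory
    files: dict = field(default_factory=dict)        # file name -> variable holding its directory


def verify(env):
    """Return a list of failure messages; an empty list means the settings pass."""
    failures = []
    for program, var in env.executables.items():
        directory = env.variables.get(var, "")
        # Like `which`, but restricted to the directory the variable points at.
        if shutil.which(program, path=directory) is None:
            failures.append(f"{program} not found in ${var} ({directory})")
    for filename, var in env.files.items():
        directory = env.variables.get(var, "")
        if not os.path.isfile(os.path.join(directory, filename)):
            failures.append(f"{filename} not found in ${var} ({directory})")
    return failures
```

An entry for the POV-Ray example would map `povray34` to `POV_RAY_EXE_PATH` and `colors.inc` to `POV_RAY_LIB_PATH`; a resource passes when `verify` returns an empty list.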

Remote File Access Proxy
Problem:
– Not all systems have HTTPS access from the nodes
Solution:
– Assuming that the nodes and the front-end can communicate, place a proxy server on the front-end

Remote File Access Proxy
[Figure: Direct Access compared with Proxy Access]

Remote File Access Proxy
Issues:
– Security token handling
– Performance
Potential:
– Caching
– Prefetching without CPU interference
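
The caching and prefetching potential can be illustrated with a small in-memory sketch. The class `CachingProxy` and the `fetch` callable are assumptions made for illustration; a real proxy on the front-end would also have to deal with the security tokens listed under Issues.

```python
class CachingProxy:
    """Front-end proxy sketch: fetch each file once, then serve the nodes from memory."""

    def __init__(self, fetch):
        self._fetch = fetch  # hypothetical callable(url) -> bytes, e.g. an HTTPS download
        self._cache = {}
        self.hits = 0
        self.misses = 0

    def get(self, url):
        """Return the file content, downloading it only on the first request."""
        if url in self._cache:
            self.hits += 1
        else:
            self.misses += 1
            self._cache[url] = self._fetch(url)
        return self._cache[url]

    def prefetch(self, urls):
        """Warm the cache before the job starts, so the node does not wait on transfers."""
        for url in urls:
            if url not in self._cache:
                self._cache[url] = self._fetch(url)
```

Because prefetching runs on the front-end, the transfer work stays off the compute node's CPU.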

Resource specification detection
Problem:
– A resource is defined by a large set of parameters:
  Architecture, memory, disk space, …
  Access rights, user-id, node-access, queue-system
  Runtime environments
Solutions:
– Have the sysadm add all the information automatically
– Run a program that identifies as many components as possible

Resource specification detection
Examples:
– OS = `uname`
– if OS == 'Linux'
    cat /proc/cpuinfo | grep CPU | awk '{print $4}'
  tempdrive = `mount | grep /tmp`
  if tempdrive == '' tempdrive = '/'
  space = `df $tempdrive`
  gcc_ver = `gcc -v`
  if gcc_ver != '' gcc_env = gcc_ver
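
The same kind of detection can be sketched in Python using only the standard library. The function name `detect_resource` and the dictionary keys are assumptions for illustration; the sketch covers only the operating system, architecture, temporary disk space and gcc checks from the slide.

```python
import os
import platform
import shutil
import subprocess


def detect_resource():
    """Collect a few resource parameters automatically; None marks anything not found."""
    spec = {
        "os": platform.system(),            # same information as `uname`
        "architecture": platform.machine(),
        "cpu_count": os.cpu_count(),
    }
    # Free space on the file system holding /tmp, falling back to the root directory.
    tmp = "/tmp" if os.path.isdir("/tmp") else os.sep
    spec["tmp_free_bytes"] = shutil.disk_usage(tmp).free
    # Record a gcc runtime environment only if the compiler is actually installed.
    gcc = shutil.which("gcc")
    if gcc is None:
        spec["gcc_version"] = None
    else:
        result = subprocess.run([gcc, "-dumpversion"], capture_output=True, text=True)
        spec["gcc_version"] = result.stdout.strip() or None
    return spec
```

Calling `detect_resource()` on a node returns a dictionary that a sysadmin could review before it is registered.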

Monitor
We would like to make nice presentations of the state of MiG:
– # users
– # jobs
– # resources
  ID of resources that are not anonymous
– Estimated time to start execution
All sorted, filtered and presented as the user requests

Accounting
We need to do reliable accounting.
When a job is submitted to a queue, the server must ask a bank to deposit credits corresponding to the maximum use.
After execution, the server must ask to be given the credits corresponding to the resources that were actually used.

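Accounting
[Diagram: message exchange between Server and Bank]
Server: Job: 10h, 1GB mem
Server to Bank: Reserve (10,1)
Bank to Server: Confirm
Server: Run Job; actual use (x,y) <= (10,1)
Server to Bank: Debit (x,y)
Bank to Server: Confirm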
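
The reserve-then-debit exchange can be sketched as a small in-memory bank. The class `Bank`, its method names and the use of a single credit number are assumptions for illustration; the slides do not say how a request such as (10,1) is converted into credits, and the sketch does not address crash recovery.

```python
class Bank:
    """Holds user balances and outstanding reservations (illustrative only)."""

    def __init__(self, balances):
        self.balances = dict(balances)
        self.reserved = {}   # reservation id -> (user, credits held)
        self._next_id = 0

    def reserve(self, user, credits):
        """Set aside the maximum cost of a job before it is queued."""
        if self.balances.get(user, 0) < credits:
            raise ValueError("insufficient credits")
        self.balances[user] -= credits
        self._next_id += 1
        self.reserved[self._next_id] = (user, credits)
        return self._next_id

    def debit(self, reservation_id, used):
        """Charge the actual use and return the unused part of the reservation."""
        user, held = self.reserved[reservation_id]
        if used > held:
            raise ValueError("actual use exceeds reservation")
        del self.reserved[reservation_id]
        self.balances[user] += held - used
```

A server would call `reserve` when the job is submitted and `debit` after execution; a reservation that is never debited stays in `reserved`, which is where crash handling would have to hook in.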

Accounting
Secure
Reliable
What happens if
– The job crashes
– The server crashes
– The bank crashes?

Grid Units
We need to be able to define the performance of a system:
– Processing speed
– IO performance
– Networking performance
Units:
– Generic single CPU
  Balanced CPU speed and IO
– Generic MPP
  Balance of all 3
– Each of the 3 individually

Grid Units
The definition of a system should be determined automatically by a program.
A user should be able to run his applications and get an idea of the Grid units they use:
– time a.out
  Tells us disk need and CPU need
Determining network dependency is harder!!!
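
A rough Python counterpart to `time a.out`, assuming a Unix system where the standard `resource` module is available. The function name `measure` and the returned keys are illustrative; the block counts are whatever the operating system reports and are not yet Grid units.

```python
import resource
import subprocess
import time


def measure(command):
    """Run a command and report its wall time, CPU time and block I/O."""
    before = resource.getrusage(resource.RUSAGE_CHILDREN)
    start = time.monotonic()
    subprocess.run(command, check=True)
    wall = time.monotonic() - start
    after = resource.getrusage(resource.RUSAGE_CHILDREN)
    return {
        "wall_seconds": wall,
        # User plus system CPU time consumed by the child process.
        "cpu_seconds": (after.ru_utime - before.ru_utime)
                       + (after.ru_stime - before.ru_stime),
        "blocks_read": after.ru_inblock - before.ru_inblock,
        "blocks_written": after.ru_oublock - before.ru_oublock,
    }
```

For example, `measure(["./a.out"])` gives CPU and disk figures for one run; network use does not appear in these counters, which matches the remark that network dependency is harder to determine.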

Applications
Dalton
POV-Ray
BLAST
Others…

Dalton
Very important application in chemistry
Fairly small input files
Fairly small output files
Huge runtime
Local expertise
Very well suited for a Web-portal!!!

Dalton Portal

POV-Ray
Popular
Simple
Can be parallelized using Grid
Fairly small input
Medium to small output
Very well suited for a Web-portal!!!

POV-Ray

BLAST
Very important
– Right now these guys eat a lot of the time on Horseshoe
National expertise
Large input files
Small output files
Should be scriptable
But portals are also interesting

Bio-BLAST E.Coli vs. Human 1GB 512 MB

Shared data-structures for Grid
There are many scenarios where Grid jobs could communicate through shared data-structures
Examples:
– Single variables
– Bounded buffers
– Arrays
– Objects
All access must be secure!!!

Interfacing with other Grid Implementations
It is interesting for MiG to accept other Grids as
– Users
– Resources
Examples:
– NorduGrid
– gLite
– Gridbus
– Unicore
– OfficeGrid

Supporting more Queuing systems
Different resources use different queuing systems
Examples:
– PBS/Torque
– LSF
– LoadLeveler
– OfficeGRID
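
One way to hide the differences is a small table that maps each queuing system to its submit command. The sketch below assumes the standard commands `qsub` (PBS/Torque), `bsub` (LSF, which reads the job script from standard input) and `llsubmit` (LoadLeveler); the function `submit_command` is hypothetical, and OfficeGRID is left out because its interface is not described here.

```python
# Queue system -> (submit program, whether the script is passed on standard input)
SUBMIT_COMMANDS = {
    "pbs": ("qsub", False),          # qsub job.sh
    "lsf": ("bsub", True),           # bsub < job.sh
    "loadleveler": ("llsubmit", False),  # llsubmit job.cmd
}


def submit_command(queue_system, script_path):
    """Return (argv, stdin_path) describing how to submit script_path."""
    try:
        program, via_stdin = SUBMIT_COMMANDS[queue_system.lower()]
    except KeyError:
        raise ValueError(f"unsupported queuing system: {queue_system}") from None
    if via_stdin:
        return [program], script_path
    return [program, script_path], None
```

Adding support for another queuing system then means adding one table entry rather than touching the job-handling code.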

Programmers API
It is interesting for programmers to be able to Grid-enable their applications directly:
– Access Grid files
– Submit jobs
– Retrieve results
For this, a library with these features must be designed and implemented
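
As a starting point for discussion, the three features could be expressed as an abstract interface. Everything in this sketch is hypothetical: `GridClient`, its method names and the in-memory stand-in `InMemoryGridClient` are not an existing MiG library, only one possible shape for it.

```python
from abc import ABC, abstractmethod


class GridClient(ABC):
    """Hypothetical interface covering the three features listed above."""

    @abstractmethod
    def read_file(self, path):
        """Access a Grid file and return its content."""

    @abstractmethod
    def submit_job(self, command):
        """Submit a job and return an identifier for it."""

    @abstractmethod
    def get_result(self, job_id):
        """Retrieve the result of a job, or None if it has not finished."""


class InMemoryGridClient(GridClient):
    """Stand-in that keeps files and jobs in dictionaries, useful for testing callers."""

    def __init__(self, files=None):
        self._files = dict(files or {})
        self._jobs = {}

    def read_file(self, path):
        return self._files[path]

    def submit_job(self, command):
        job_id = len(self._jobs) + 1
        self._jobs[job_id] = {"command": command, "result": None}
        return job_id

    def complete(self, job_id, result):
        """Test helper: mark a job as finished with the given result."""
        self._jobs[job_id]["result"] = result

    def get_result(self, job_id):
        return self._jobs[job_id]["result"]
```

An application written against `GridClient` can be tested with the in-memory version and later switched to an implementation that talks to a real Grid server.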

Statistics
Just like monitoring, it is interesting to obtain statistics on Grid
Examples:
– Usage
– # users
– Turn-over time
– Activation time
– # resources
– etc…

Many others Including your own proposals!