Grid’5000 Introduction to Grid'5000 1 Grid’5000 A Nation Wide Experimental Grid.

Slides:



Advertisements
Similar presentations
Building a CFD Grid Over ThaiGrid Infrastructure Putchong Uthayopas, Ph.D Department of Computer Engineering, Faculty of Engineering, Kasetsart University,
Advertisements

Challenges for Interactive Grids a point of view from Int.Eu.Grid project Remote Instrumentation Services in Grid Environment RISGE BoF Manchester 8th.
All-in-one graphical tool for grid middleware management Eddy Caron, Abdelkader Amar, Frédéric Desprez, David Loureiro LIP ENS Lyon, INRIA Rhône-Alpes,
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
Grid’5000 A Nation Wide Experimental Grid. Grid’5000 Grid raises a lot of research issues: Security, Performance, Fault tolerance, Load Balancing, Fairness,
Towards a Virtual European Supercomputing Infrastructure Vision & issues Sanzio Bassini
High Performance Computing Course Notes Grid Computing.
Grid’5000 GdX Grid'5000 and Grid eXplorer 1 Large Scale Experimental Grids Grid’5000 Grid eXplorer & Franck Cappello INRIA
A Nation Wide Experimental Grid The Grid’5000 project: architecture and objectives Building a nation wide experimental platform for Grid researchers –
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Distributed components
Chapter 1: Introduction
Problem-Solving Environments: The Next Level in Software Integration David W. Walker Cardiff University.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Grids and Grid Technologies for Wide-Area Distributed Computing Mark Baker, Rajkumar Buyya and Domenico Laforenza.
Grid’5000 Grid' DAS-3 workshop 104/12/06 Grid’5000 * DAS-3 – Grid'5000 workshop December 4th, *5000 CPUs Pierre NEYRON - INRIA.
Architecture overview 6/03/12 F. Desprez - ISC Cloud Context : Development of a toolbox for deploying application services providers with a hierarchical.
Overview of grid / cloud research in France Michel DAYDÉ Scientific Delegate at INS2/CNRS in charge of HPC / Grid / cloud Université de Toulouse - IRIT.
Tunis, Tunisia, June 2012 Cloud Research Activities Pr. Mohamed JEMNI Computing Center Al Khawarizmi (CCK) Research Laboratory LaTICE
WORKFLOWS IN CLOUD COMPUTING. CLOUD COMPUTING  Delivering applications or services in on-demand environment  Hundreds of thousands of users / applications.
Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters Christine MORIN PARIS project-team, IRISA/INRIA.
Eddy Caron Join work with Jonathan Rouzaud-Cornabas, Frédéric Desprez, Rajesh Palanichamy and the DIET Team Ecole Normale Supérieure de Lyon AVALON Research.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 1: Introduction What is an Operating System? Mainframe Systems Desktop Systems.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Distributed Real-Time Systems for the Intelligent Power Grid Prof. Vincenzo Liberatore.
4.x Performance Technology drivers – Exascale systems will consist of complex configurations with a huge number of potentially heterogeneous components.
PrimoGENI Tutorial Miguel Erazo, Neil Goldman, Nathanael Van Vorst, and Jason Liu Florida International University Other project participants: Julio Ibarra.
Optimized Java computing as an application for Desktop Grid Olejnik Richard 1, Bernard Toursel 1, Marek Tudruj 2, Eryk Laskowski 2 1 Université des Sciences.
Abstractions: Programming and deploying apps. on Grids Franck Cappello INRIA* (*this is my own opinion!) CCGRID’08 - Panel.
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
A Lightweight Platform for Integration of Resource Limited Devices into Pervasive Grids Stavros Isaiadis and Vladimir Getov University of Westminster
DISTRIBUTED COMPUTING
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
Lecture 3: Sun: 16/4/1435 Distributed Computing Technologies and Middleware Lecturer/ Kawther Abas CS- 492 : Distributed system.
Programming Models & Runtime Systems Breakout Report MICS PI Meeting, June 27, 2002.
Large Scale Sky Computing Applications with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes – Bretagne Atlantique Rennes, France
Peer-to-Peer Distributed Shared Memory? Gabriel Antoniu, Luc Bougé, Mathieu Jan IRISA / INRIA & ENS Cachan/Bretagne France Dagstuhl seminar, October 2003.
1 Introduction to Middleware. 2 Outline What is middleware? Purpose and origin Why use it? What Middleware does? Technical details Middleware services.
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal Grid eXplorer (GdX) An Instrument for eXploring the GRID F. Cappello,
Evaluation of Agent Teamwork High Performance Distributed Computing Middleware. Solomon Lane Agent Teamwork Research Assistant October 2006 – March 2007.
The Globus Project: A Status Report Ian Foster Carl Kesselman
Programming Parallel and Distributed Systems for Large Scale Numerical Simulation Application Christian Perez INRIA researcher IRISA Rennes, France.
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
Tools for collaboration How to share your duck tales…
1October 9, 2001 Sun in Scientific & Engineering Computing Grid Computing with Sun Wolfgang Gentzsch Director Grid Computing Cracow Grid Workshop, November.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
Emulation in Data Grid eXplorer. Emulation problematic Distributed applicationTarget environment Simulation Emulation App. Model Env. model Formal analysis.
1 BBN Technologies Quality Objects (QuO): Adaptive Management and Control Middleware for End-to-End QoS Craig Rodrigues, Joseph P. Loyall, Richard E. Schantz.
7. Grid Computing Systems and Resource Management
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
1 OASIS Team, INRIA Sophia-Antipolis/I3S CNRS, Univ. Nice Christian Delbé Data Grid Explorer 15/09/03 Large Scale Emulation Mobility in ProActive.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
SYSTEM MODELS FOR ADVANCED COMPUTING Jhashuva. U 1 Asst. Prof CSE
SmartGRID Decentralized, dynamic grid scheduling framework on swarm agent-based intelligence Seminar in HUST, Wuhan, China. Oct. 22, 2008 Ye HUANG, Amos.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
Grid Institute Scientific Council, September 10, 2008
Chapter 1 Characterization of Distributed Systems
Clouds , Grids and Clusters
Introduction to Distributed Platforms
Grid Computing.
Abstract Machine Layer Research in VGrADS
University of Technology
(with all project members)
Architectures of distributed systems Fundamental Models
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

Grid’5000 Introduction to Grid' Grid’5000 A Nation Wide Experimental Grid

Grid’5000 Introduction to Grid' Grid raises a lot of research issues: Security, Performance, Fault tolerance, Load Balancing, Fairness, Coordination, Message passing, Data storage, Programming, Communication protocols and architecture, Deployment, etc. Theoretical models and simulators cannot capture real life conditions Production platforms have strong difficulties to reproduce experimental conditions How to test and compare? Fault tolerance protocols Security mechanisms Deployment tools etc. Grid: Distributed System Problematic renewal

Grid’5000 Introduction to Grid' log(cost) log(realism) mathsimulation emulation live systems Models: Sys, apps, Platforms, conditions Real systems Real applications Real platforms Real conditions Tools for Distributed System Studies To investigate Distributed System issues, we need: 1) Tools (model, simulators, emulators, experi. Platforms) 2) Strong interaction between these research tools Tools for Large Scale Distributed Systems Real systems Real applications “In-lab” platforms Synthetic conditions Key system mecas. Algo, app. kernels Virtual platforms Synthetic conditions

Grid’5000 Introduction to Grid' log(cost) log(realism) mathsimulation emulation live systems SimGrid MicroGrid Bricks NS, etc. Model Protocol proof Grid eXplorer WANinLab Emulab Grid’5000 TERAGrid PlanetLab Naregi Testbed We need a Grid experimental platform According to the current knowledge: There is no large scale testbed dedicated to Grid experiments  Grid’5000 as a live system  Grid eXplorer as a large scale emulator

Grid’5000 Introduction to Grid' )Remotely controllable Grid nodes installed in geographically distributed laboratories 2)A « Controllable » and « Monitorable » Network between the Grid nodes 3)A middleware infrastructure connecting the nodes (security) 4)A playground to prepare experiments 5)A toolkit to deploy, manage, run experiments and collect results What do we need for Grid experiments ?

Grid’5000 Introduction to Grid' )Building a nation wide experimental platform for Grid researches (like a particle accelerator for the computer scientists) 10/11 geographically distributed sites every site hosts a cluster (from 256 CPUs to 1K CPUs) All sites are connected by RENATER (French Academ. Network) RENATER hosts probes to trace network condition load Design and develop a system/middleware environment for safely test and repeat experiments 2) Use the platform for Grid experiments Address critical issues of Grid system/middleware: Programming, Scalability, Fault Tolerance, Scheduling Address critical issues of Grid Networking High performance transport protocols, Qos Port and test applications Investigate original mechanisms P2P resources discovery, Desktop Grids The Grid’5000 Project

Grid’5000 Introduction to Grid' Lab’s Network LAB/Firewall Router Test Cluster Control Master Site 1 Site 2 Site 3 Users (ssh loggin + password) Firewall/nat Control Slave Test Cluster Front end Control Slave Control site Grid’5000 Big Picture Gateway +VPN (192. For all nodes) One machine Can be seen as a Virtual Grid Gateway

Grid’5000 Introduction to Grid' Thierry Priol (ACI Grid Director) -Brigitte Plateau (President of ACI Grid SC) -Dani Vandrome (Director of Renater) -Frédéric Desprez (Lyon) -Michel Daydé (Toulouse) -Yvon Jégou (Rennes) -Stéphane Lantéri (Sophia) -Raymond Namyst (Bordeaux) -Pascale Primet (Lyon) -Olivier Richard (Grenoble) Steering Committee: (organizer: Franck Cappello, Orsay) Technical Committee: -David Gueldrech (Sophia) -Jean Claude Barbet (Orsay) -Franck Bonnassieux (UREC) -Julien le duc (Grenoble) -Fred Desprez (Lyon) -Yvon Jégou (Rennes) -Olivier Coulaud (Bordeaux) -Frédéric Barbaresco (Toulouse) Forums: Deployment/exploitation: Franck Cappello (AS1, RTP8) Programming models: Raymond Namyst (AS2, RTP8) Grid’5000 Committees

Grid’5000 Introduction to Grid' Grid’5000 Schedule Grid’5000 Hardware Call for proposals Sept03 Selection of 7 sites Nov03 ACI GRID Funding Jan04 Call for Expression Of Interest March04 Vendor selection Jun/July 04 Instal. First tests Spt 04 Final review Oct 04 Fisrt Demo (SC04) Nov 04 Grid’5000 System/middleware Forum Security Prototypes Control Prototypes Grid’5000 Programming Forum Grid’5000 Builder Community Grid’5000 Experiments

Grid’5000 Introduction to Grid' Grid’5000 Funding (ACI + Local District/Prefecture) Grid’ ,6M€ ~0,4€ ~0,35€ ~0,5€ ~0,35€ ~0,3?€ ~0,5€ ~3M€ for hardware only

Grid’5000 Introduction to Grid' Grid’5000 in September’2004 Grid 5000 nodes (soon 4 ) 3

Grid’5000 Introduction to Grid' Summary of Grid5000 XPs Networking –End Host Communication layer –High performance long distance protocols –High Speed Network Emulation –Grid Networking Layer Middleware / OS –Grid’5000 control/access/experiment automation –Scheduling / data distribution in Grid –Fault tolerance in Grid –Resource management –Computational Steering –Grid SSI OS and Grid I/O –Desktop Grid/P2P systems Programming –Component programming for the Grid (Java, Corba) –GRID-RPC –GRID-MPI –Code Coupling Applications –Multi-parametric applications (Climate modeling/Functional Genomic) –Large scale experimentation of distributed applications (Electromagnetism, multi-material fluid mechanics, parallel optimization algorithms, CFD, astrophysics –Medical images, Collaborating tools in virtual 3D environment

Grid’5000 Introduction to Grid' Middleware1(XP)Grid5000 Grid’5000 control - Computing Environment deployment (Ka-tools) - Experiment automation (security and control) - VGrid « mapping a virtual Grid on a real testbed » - Monitoring, benchmarking, performance characterization and analysis Grid Scheduling / data distribution - Scheduling : Data transfers, global communications, work stealing,... - Data re-distribution in Grid - Task distribution and load balancing in heterogeneous Grid - Mixed Parallelism (task and data parallelism) - Mixing data management and task scheduling - Hierarchical and Distributed Scheduling Fault tolerance in Grid - Fault tolerant Grid-RPC (RPC-V) - Hierarchical Fault tolerant MPI (MPICH-V) - Fault tolerant in data-flow approach (Athapascan) Grid SSI OS and Grid I/O - Grid file system (NFSG) - Grid-aware OS (Kerrighed) - Coupling Computational Grid with Reality Center XP: eXPeriments on

Grid’5000 Introduction to Grid' Middleware2(XP)Grid5000 Grid Management -AROMA tool : resources management over a Grid of clusters with different classes of services -Mobile agents for open Grid management -Management of Grids and hosted services (security, QoS, monitoring & control, dynamic configuration, …) -Optimization for wide area distributed query processing -Tools to support the development, administration and usage of heterogeneous resources over the Grid -Virtualization of data storage on Grids -Automatic Deployment of GridRPC middle tier. - Multiclusters and lightweights Grid resource management (OAR/CIGRI) Global Computing/P2P Middleware - Executing Web Services on Desktop Grid Workers (XtremWeb) - Distributing the Coordination in Desktop Grids (XtremWeb) - Harnessing Clusters as parallel Workers - Probabilistic certification in peer-to-peer systems - Large Scale Data Sharing Service based on JXTA (JuxMem) - Management services for textual document in P2P systems

Grid’5000 Introduction to Grid' End Host Communication layer - Communication libraries: Madeleine, MPICH/Madeleine - Intelligent Usage of NICs for local and wide area communications - Direct file access over Myrinet : ORFA/NFS and ORFA/LUSTRE High performance long distance protocols - Alternative Transport for very high speed networks (backpressure) - Differentiated transport with delay control on WAN - Reliable active and non active Multicast - Network Bandwidth optimization in Grid (VTHD++, Paco++). - High performance communication across heterogeneous networks - Fast forwarding and Multiplexing of data on gateway nodes High Speed Network Emulation - Automatic Deployment of emulated high speed domains - Experiment design for grid flow interactions studies Grid Networking Layer - Network Resource and QoS on demand - Grid Overlay and Programmable Routers - Measurement Services for network aware middleware Network(XP)Grid5000

Grid’5000 Introduction to Grid' Component programming on the grid - ProActive : a JAVA library (parallel, distributed, concurrent computing with security and mobility) - Assessment of scalability, deployment, security and fault tolerance issues - Hierarchical components architecture - PadicoTM/Paco++ combining parallel and distributed computing RPC Environment - Large scale experimentation of the DIET platform (Distributed Interactive Engineering Toolbox) - Client/Agent/Server model following the GridRPC standard with distributed scheduling agents MPI Environment - Time sharing Grid resources - Migration over Clusters with heterogeneous high speed networks Code Coupling - Application coupling with Athapascan - Communication / method invocation rescheduling into ORB (HOMA) - Fluid transfer simulation and geological code with PadicoTM/Paco++ Programming(XP)Grid5000

Grid’5000 Introduction to Grid' Applications1(XP)Grid5000 Multi-parametric applications - ACI GRID-TLSE Project : expertise site for sparse linear algebra - Climate modeling and Global Change -DataGène Project : Functional genomic Large scale experimentation of distributed applications –MECAGRID (ACI GRID project, Smash project-team) Massively parallel computations in multi-material fluid mechanics Study of numerical algorithms for heterogeneous computing platforms –Grid computing for medical applications (Epidaure project-team) Interoperable medical image registration grid service –Optimal design of complex systems (Coprin project-team) Evaluation of parallel optimization algorithms based on interval analysis techniques Study of load balancing strategies on heterogeneous resources -Fluid mechanics, molecular dynamics and host-parasite systems in population dynamics, etc. + CFD, astrophysics,… applications + Collaborating tools in virtual 3D environment.

Grid’5000 Introduction to Grid' Applications2(XP)Grid5000 Steering –JECS: a JAVA Environment for Computational Steering Distributed computing and interactive visualization of 3D numerical simulations (Caiman and Oasis project-teams) Collaborative environment Computational Electromagnetism application (JEM3D) –Steering of numerical simulations (ACI GRID-EPSN Project) Parallel on-line visualization / monitoring Data Redistribution Computational Steering by direct image manipulation