Eddy Caron Join work with Jonathan Rouzaud-Cornabas, Frédéric Desprez, Rajesh Palanichamy and the DIET Team Ecole Normale Supérieure de Lyon AVALON Research.

Slides:



Advertisements
Similar presentations
Building a CFD Grid Over ThaiGrid Infrastructure Putchong Uthayopas, Ph.D Department of Computer Engineering, Faculty of Engineering, Kasetsart University,
Advertisements

A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
All-in-one graphical tool for grid middleware management Eddy Caron, Abdelkader Amar, Frédéric Desprez, David Loureiro LIP ENS Lyon, INRIA Rhône-Alpes,
Workflow management within DIET Raphaël Bolze LIP ENS Lyon, CNRS INRIA Rhône-Alpes, GRAAL project
DIET Overview and some recent work A middleware for the large scale deployment of applications over the Grid Frédéric Desprez LIP ENS Lyon / INRIA GRAAL.
High Performance Computing Course Notes Grid Computing.
GridRPC Sources / Credits: IRISA/IFSIC IRISA/INRIA Thierry Priol et. al papers.
A Nation Wide Experimental Grid The Grid’5000 project: architecture and objectives Building a nation wide experimental platform for Grid researchers –
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
Workload Management Massimo Sgaravatto INFN Padova.
Grid Computing Net 535.
Architecture overview 6/03/12 F. Desprez - ISC Cloud Context : Development of a toolbox for deploying application services providers with a hierarchical.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space Cracow Grid Workshop’10 Kraków, October 11-13,
1 1 Hybrid Cloud Solutions (Private with Public Burst) Accelerate and Orchestrate Enterprise Applications.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
MobSched: An Optimizable Scheduler for Mobile Cloud Computing S. SindiaS. GaoB. Black A.LimV. D. AgrawalP. Agrawal Auburn University, Auburn, AL 45 th.
KARMA with ProActive Parallel Suite 12/01/2009 Air France, Sophia Antipolis Solutions and Services for Accelerating your Applications.
Technology Overview. Agenda What’s New and Better in Windows Server 2003? Why Upgrade to Windows Server 2003 ?  From Windows NT 4.0  From Windows 2000.
All-in-one graphical tool for the management of DIET a GridRPC middleware Eddy Caron, Frédéric Desprez, David Loureiro, Benjamin Depardon, Aurélien Cedeyn.
Abstractions: Programming and deploying apps. on Grids Franck Cappello INRIA* (*this is my own opinion!) CCGRID’08 - Panel.
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
A Distributed Computing System Based on BOINC September - CHEP 2004 Pedro Andrade António Amorim Jaime Villate.
DISTRIBUTED COMPUTING
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
WP9 Resource Management Current status and plans for future Juliusz Pukacki Krzysztof Kurowski Poznan Supercomputing.
Through the development of advanced middleware, Grid computing has evolved to a mature technology in which scientists and researchers can leverage to gain.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Jean-Sébastien Gay LIP ENS Lyon, Université Claude Bernard Lyon 1 INRIA Rhône-Alpes GRAAL Research Team Join work with DIET TEAM D istributed I nteractive.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
Peer-to-Peer Distributed Shared Memory? Gabriel Antoniu, Luc Bougé, Mathieu Jan IRISA / INRIA & ENS Cachan/Bretagne France Dagstuhl seminar, October 2003.
Javascript Cog Kit By Zhenhua Guo. Grid Applications Currently, most grid related applications are written as separate software. –server side: Globus,
Evaluation of Agent Teamwork High Performance Distributed Computing Middleware. Solomon Lane Agent Teamwork Research Assistant October 2006 – March 2007.
Master Worker Paradigm Support in Software Component Models Hinde Bouziane, Christian Pérez PARIS Research Team INRIA/IRISA Rennes ANR CIGC LEGO (ANR-05-CICG-11)
1 Andreea Chis under the guidance of Frédéric Desprez and Eddy Caron Scheduling for a Climate Forecast Application ANR-05-CIGC-11.
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
The Grid computing Presented by:- Mohamad Shalaby.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
Middleware for Grid Computing and the relationship to Middleware at large ECE 1770 : Middleware Systems By: Sepehr (Sep) Seyedi Date: Thurs. January 23,
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
ServiceSs, a new programming model for the Cloud Daniele Lezzi, Rosa M. Badia, Jorge Ejarque, Raul Sirvent, Enric Tejedor Grid Computing and Clusters Group.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GVis: Grid-enabled Interactive Visualization State Key Laboratory. of CAD&CG Zhejiang University, Hangzhou
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Nguyen Tuan Anh. VN-Grid: Goals  Grid middleware (focus of this presentation)  Tuan Anh  Grid applications  Hoai.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
WP1 : Applications Océan / atmosphère Cosmologie Site d'expertise algèbre linéaire creuse TLSE.
International Symposium on Grid Computing (ISGC-07), Taipei - March 26-29, 2007 Of 16 1 A Novel Grid Resource Broker Cum Meta Scheduler - Asvija B System.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
1 VLDB - Data Management in Grids B. Del-Fabbro, D. Laiymani, J.M. Nicod and L. Philippe Laboratoire d’Informatique de l’Université de Franche-Comté Séoul,
Tier3 monitoring. Initial issues. Danila Oleynik. Artem Petrosyan. JINR.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Pour Michel Hello, Tu peux trouver dans ce ppt 3 parties, je te laisse te servir. - L’outil réalisé par GRAAL et pour la communauté de Grid’5000: GRUDU.
EGI Technical Forum Amsterdam, 16 September 2010 Sylvain Reynaud.
+ Support multiple virtual environment for Grid computing Dr. Lizhe Wang.
Distributed Geospatial Information Processing (DGIP) Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Breaking the frontiers of the Grid R. Graciani EGI TF 2012.
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
- DAG Scheduling with Reliability - - GridSolve - - Fault Tolerance In Open MPI - Asim YarKhan, Zhiao Shi, Jack Dongarra VGrADS Workshop April 2007.
XtreemOS IP project is funded by the European Commission under contract IST-FP Scientific coordinator Christine Morin, INRIA Presented by Ana.
- Eddy Caron.
Workload Management Workpackage
GWE Core Grid Wizard Enterprise (
Globus —— Toolkits for Grid Computing
The Globus Toolkit™: Information Services
Wide Area Workload Management Work Package DATAGRID project
Presentation transcript:

Eddy Caron Join work with Jonathan Rouzaud-Cornabas, Frédéric Desprez, Rajesh Palanichamy and the DIET Team Ecole Normale Supérieure de Lyon AVALON Research Team SysFera

DIET’s Goals Our goals To develop a toolbox for the deployment of environments using the Application Service Provider (ASP) paradigm with different applications Use as much as possible public domain and standard software To obtain a high performance and scalable environment Implement and validate our more theoretical results Scheduling for heterogeneous platforms, data (re)distribution and replication, performance evaluation, algorithmic for heterogeneous and distributed platforms, … Based on CORBA and our own software developments FAST for performance evaluation, LogService for monitoring, VizDIET for the visualization, GoDIET for the deployment Several applications in different fields (simulation, bioinformatics, …) Release 2.8 available on the web since november ACI Grid ASP, RNTL GASP, ANR LEGO CIGC-05-11, ANR Gwendia, Celtic-plus Project SEED4C 6/03/12DIET a Cloud Middleware for Seed4C

RPC and Grid-Computing: Grid-RPC One simple idea – Implementing the RPC programming model over the grid – Using resources accessible through the network – Mixed parallelism model (data-parallel model at the server level and task parallelism between the servers) Features needed – Load-balancing (resource localization and performance evaluation, scheduling), – IDL, – Data and replica management, – Security, – Fault-tolerance, – Interoperability with other systems, – … Design of a standard interface – within the OGF (Grid-RPC and SAGA WG) – Existing implementations: NetSolve/GridSolve, Ninf, DIET, OmniRPC 6/03/12DIET a Cloud Middleware for Seed4C

RPC and Grid Computing: Grid-RPC AGENT(s) S1S2 S3 S4 A, B, C Answer (C) S2 ! Request Op(C, A, B) Client 6/03/12DIET a Cloud Middleware for Seed4C

Client and server interface Client side So easy … Multi-interface (C, C++, Fortran, Java, Python, Scilab, Web, etc.) Grid-RPC compliant Server side Install and submit new server to agent (LA) Problem and parameter description Client IDL transfer from server Dynamic services new service new version security update outdated service Etc. 6/03/12DIET a Cloud Middleware for Seed4C

Architecture overview 6/03/12DIET a Cloud Middleware for Seed4C

Workflow Management Workflow representation Direct Acyclic Graph (DAG) Each vertex is a task Each directed edge represents communication between tasks Goals Build and execute workflows Use different heuristics to solve scheduling problems Extensibility to address multi-workflows submission and large grid platform Manage heterogeneity and variability of environment ANR Gwendia Language definition (MOTEUR & MADAG) Comparison on Grid’5000 vs EGI Idle timeData transfertExecution time EGI (Glite)32.857s s s Grid’5000 (DIET) 0.214s s s 6/03/12DIET a Cloud Middleware for Seed4C

DIET Scheduling: Plug-in Schedulers SeD level Performance estimation function Estimation Metric Vector - dynamic collection of performance estimation values Performance measures available through DIET FAST-NWS performance metrics Time elapsed since the last execution CoRI (Collector of Resource Information) Developer defined values Aggregation Methods Defining mechanism to sort SeD responses: associated with the service and defined at SeD level Tunable comparison/aggregation routines for scheduling Priority Scheduler Performs pairwise server estimation comparisons returning a sorted list of server responses; Can minimize or maximize based on SeD estimations and taking into consideration the order in which the request for those performance estimations was specified at SeD level. 6/03/12DIET a Cloud Middleware for Seed4C

Data Management DAGDA Joining task scheduling and data management Standardized through GridRPC OGF WG. Data Arrangement for Grid and Distributed Applications Explicit data replication: Using the API. Implicit data replication. Data replacement algorithm: LRU, LFU AND FIFO Transfer optimization by selecting the more convenient source. Storage resources usage management. Data status backup/restoration. 6/03/12DIET a Cloud Middleware for Seed4C Join work with Gaël LeMahec (UPJV/MIS)

Parallel & sequential jobs transparent for the user system dependent submission SeD Batch Many batch systems Batch schedulers behaviour Internal scheduling process Monitoring & Performance prediction Simulation (Simbatch) Parallel and batch submissions SeD OAROGE LSF PBSLoadleveler SeD Batch SeD // NFS 6/03/12DIET a Cloud Middleware for Seed4C SLURM

DIET Cloud: Prototype Inside the Cloud DIET platform is virtualized inside the cloud. (as Xen image for example) Very flexible and scalable as DIET nodes can be launched Scheduling is more complex DIET as a Cloud manager Eucalyptus interface Eucalyptus is treated as a new Batch System Provide a new implementation for the BatchSystem abstract class 6/03/12DIET a Cloud Middleware for Seed4C

DIET Cloud: New Architecture 6/03/12DIET a Cloud Middleware for Seed4C

Building a nation wide experimental platform for Grid & P2P researches (like a particle accelerator for the computer scientists) 9 geographically distributed sites hosting clusters with 256 CPUs to 1K CPUs) All sites are connected by RENATER (French Res. and Edu. Net.) Design and develop a system/middleware environment for safely test and repeat experiments Use the platform for Grid experiments in real life conditions 4 main features: A high security for Grid’5000 and the Internet, despite the deep reconfiguration feature Single sign-on High-performance LRMS: OAR A user toolkit to reconfigure the nodes and monitor experiment: Kadeploy Grid’5000 Grid’5000 DIET deployment over a maximum of processors 1 MA, 8 LA, 540 SeDs 1120 clients on 140 machines DGEMM requests (2000x2000 matrices) Simple round-robin scheduling 6/03/12DIET a Cloud Middleware for Seed4C

Conclusions Grid-RPC Interesting approach for several applications Simple, flexible, and efficient Many interesting research issues (scheduling, data management, resource discovery and reservation, deployment, fault-tolerance, …) DIET Scalable, open-source, and multi-application platform Concentration on several issues like resource discovery, scheduling (distributed scheduling and plugin schedulers), deployment (GoDIET), performance evaluation (CoRI), monitoring (LogService and VizDIET), data management and replication (DAGDA), Cloud support Large scale validation on the Grid’5000 platform A middleware designed and tunable for different applications 6/03/12DIET a Cloud Middleware for Seed4C

Results A complete Middleware for heterogeneous infrastructure DIET is light to use Dedicated to many applications Designed for Grid and Cloud Efficient even in comparison to commercial tools DIET is high tunability middleware Used in production The DIET Team SysFera Compagny (16 persons today) 6/03/12DIET a Cloud Middleware for Seed4C