Www.eu-eela.eu E-science grid facility for Europe and Latin America OurGrid and the co-existence with gLite Alexandre Duarte Universidade Federal de Campina.

Slides:



Advertisements
Similar presentations
Distributed Processing, Client/Server and Clusters
Advertisements

1 Bogotá, EELA-2 1 st Conference, On the Co-existence of Service and Opportunistic Grids Francisco Brasileiro Universidade Federal.
MyGrid: A User-Centric Approach for Grid Computing Walfredo Cirne Universidade Federal da Paraíba.
The OurGrid Project Walfredo Cirne Universidade Federal de Campina Grande.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
E-science grid facility for Europe and Latin America JRA1 Status Report Development of Services for Applications and Infrastructure Francisco.
Chapter 1: Introduction
Technical Architectures
Running Thor over MyGrid Walfredo Cirne Universidade Federal de Campina Grande.
Labs of The World, Unite!!! Walfredo Cirne Universidade Federal de Campina Grande.
1/16/2008CSCI 315 Operating Systems Design1 Introduction Notice: The slides for this lecture have been largely based on those accompanying the textbook.
Cambodia-India Entrepreneurship Development Centre - : :.... :-:-
© 2001 by Prentice Hall8-1 Local Area Networks, 3rd Edition David A. Stamper Part 3: Software Chapter 8 Client/Server Architecture.
SOFTWARE.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
SICSA student induction day, 2009Slide 1 Social Simulation Tutorial Session 6: Introduction to grids and cloud computing International Symposium on Grid.
Technology Overview. Agenda What’s New and Better in Windows Server 2003? Why Upgrade to Windows Server 2003 ?  From Windows NT 4.0  From Windows 2000.
Connecting OurGrid & GridSAM A Short Overview. Content Goals OurGrid: architecture overview OurGrid: short overview GridSAM: short overview GridSAM: example.
E-science grid facility for Europe and Latin America Bridging OurGrid-based and gLite-based Grid Infrastructures Abmar de Barros, Adabriand.
DISTRIBUTED COMPUTING
Grid Initiatives for e-Science virtual communities in Europe and Latin America DIRAC TEAM CPPM – CNRS DIRAC Grid Middleware.
E-science grid facility for Europe and Latin America OurGrid E2GRIS1 Rafael Silva Universidade Federal de Campina.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Miguel Branco CERN/University of Southampton Enabling provenance on large-scale e-Science applications.
Unit – I CLIENT / SERVER ARCHITECTURE. Unit Structure  Evolution of Client/Server Architecture  Client/Server Model  Characteristics of Client/Server.
E-science grid facility for Europe and Latin America Developing e-Infrastructure services for e-Science applications: the EELA-2 experience.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Loosely Coupled Parallelism: Clusters. Context We have studied older archictures for loosely coupled parallelism, such as mesh’s, hypercubes etc, which.
E-science grid facility for Europe and Latin America JRA1 – Activity Report and Plans Francisco Brasileiro Universidade Federal de Campina.
OurGrid: A Simple Solution for Running Bag-of-Tasks Applications on Grids Marcelo Meira, Walfredo Cirne (marcelo, Universidade.
Alexandre Duarte Gustavo Wagner Francisco Brasileiro Walfredo Cirne Multi-Environment Software Testing on the Grid Universidade Federal de Campina Grande.
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
1 Catania, 4 th EEGE User Forum/OGF 25, OurGrid integration with gLite based grids in EELA-2 Francisco Brasileiro Universidade.
E-science grid facility for Europe and Latin America E2GRIS1 Gustavo Miranda Teixeira Ricardo Silva Campos Laboratório de Fisiologia Computacional.
1 Bogotá, EELA-2 1 st Conference, The OurGrid Approach for Opportunistic Grid Computing Francisco Brasileiro Universidade Federal.
E-science grid facility for Europe and Latin America Bridging the High Performance Computing Gap with OurGrid Francisco Brasileiro Universidade.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
BalticGrid-II Project BalticGrid-II Kick-off Meeting, , Vilnius1 Joint Research Activity Enhanced Application Services on Sustainable e-Infrastructure.
BalticGrid-II Project The Second BalticGrid-II All-Hands Meeting, Riga, May, Joint Research Activity Enhanced Application Services on Sustainable.
E-infrastructure shared between Europe and Latin America Interoperability between EELA and OurGrid Alexandre Duarte CERN and UFCG 1 st.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
E-science grid facility for Europe and Latin America JRA1 – Annual Activity Report Francisco Brasileiro Universidade Federal de Campina.
Uppsala, April 12-16th 2010EGEE 5th User Forum1 A Business-Driven Cloudburst Scheduler for Bag-of-Task Applications Francisco Brasileiro, Ricardo Araújo,
Application Software System Software.
Distributed Data for Science Workflows Data Architecture Progress Report December 2008.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Data Communications and Networks Chapter 9 – Distributed Systems ICT-BVF8.1- Data Communications and Network Trainer: Dr. Abbes Sebihi.
E-infrastructure shared between Europe and Latin America Interoperability between EELA and OurGrid Alexandre Duarte CERN IT-GD EELA Project.
DIRAC Project A.Tsaregorodtsev (CPPM) on behalf of the LHCb DIRAC team A Community Grid Solution The DIRAC (Distributed Infrastructure with Remote Agent.
E-science grid facility for Europe and Latin America JRA1 role and its interaction with SA1 and NA3 Francisco Brasileiro Universidade Federal.
ETICS An Environment for Distributed Software Development in Aerospace Applications SpaceTransfer09 Hannover Messe, April 2009.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
E-science grid facility for Europe and Latin America The GILDA t-Infrastructure Riccardo Bruno INFN, Sez. CT Joint EELA-2/EGEE-III Tutorial.
EGI-InSPIRE RI An Introduction to European Grid Infrastructure (EGI) March An Introduction to the European Grid Infrastructure.
8 th International Desktop Grid Federation Workshop, Hannover, Germany, August 17 th, 2011 DEGISCO Desktop Grids for International Scientific Collaboration.
Deploying Research in the Real World: The OurGrid Experience
Grid Computing.
GSAF Grid Storage Access Framework
GSAF Grid Storage Access Framework
Module 01 ETICS Overview ETICS Online Tutorials
Subject Name: Operating System Concepts Subject Number:
ShareGrid: architettura e middleware
Database System Architectures
Presentation transcript:

E-science grid facility for Europe and Latin America OurGrid and the co-existence with gLite Alexandre Duarte Universidade Federal de Campina Grande (Brazil) Joint EELA-2/EGEE-III Tutorial for Trainers 30/06 to 04/07/2008 Part of these slides were created by Francisco Brasileiro (UFCG-Brazil) and Diego Scardaci (INFN-Italy)

Agenda Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/ Introduction The OurGrid Approach –Architecture –Scheduling –Avoiding Free-Riders –Security Concerns –Application Models OurGrid and gLite Co-Existence Conclusions

Introduction Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/2008 3

EELA-2 Objectives Build a powerful, functional and well supported Grid Facility Address a large community of users Assert the financial & management schemes to operate and support the e-Infrastructure on the long range Anticipate the handover of the e-Infrastructure operation and support Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/2008 4

Powerful Grid Facility Dream –An active community of potential grid users –High skilled support team to deploy and manage resource centres –A lot of resource centres with large amounts of computational resource to put in the grid Reality: –An active community of potential grid users –Lack of skilled personnel –A few resource centres with a good amounts of computational resources –A lot of resource centres with small amounts of computational resources Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/2008 5

Making the Dream a Reality User Community –Continue with the good work –Keep finding more interesting applications to support Skilled personnel –That’s why we (you) are here (there) –Training training and training Computational Resources –Buy more computers? $$$ –Buy clusters? $$$$$ –Buy Supercomputers?$$$$$$$$$$$$ –Share idle resources?FREE  Example: UFCG with ±3000 PCs * 16 idle hours / day ≈ 2000 idle PCs!!! Totally free! 6 Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/2008

Sharing Idle Resources Voluntary Computing (eg. –Organisations donate their resources to A given project –Donators normally do not use the available resources for their own purposes –Entrance barrier is high, because one must  invest a good deal of effort in “advertising”  have a very high visibility project  be in a prestigious institution –May be useful when the organisation has access to a large number of desktops Peer2Peer Grid (eg. OurGrid) –Peers donate their resources to other Peers in the grid –Donators normally use the available resources for their own purposes –Entrance barrier is low, just deploy a new Peer in the grid Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/2008 7

An Important Note These are not competing technologies! Each one is more appropriate to a particular subset of the users’ base Each one has its virtues and drawbacks It is very likely that they will be able not only to co- exist, but also to interoperate Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/2008 8

The OurGrid Approach Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/2008 9

The OurGrid Approach Labs can freely join the system without any human intervention –No need for negotiation; no paperwork Clear incentive to join the system –One can’t be worse off by joining the system –Noticeable increased response time –Free-riding resistant Basic dependability properties –Configurable level of security –Resilience to faults –Scalability Easy to install, configure, manage and program –No need for specialized support team Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/

But there is no free lunch To simplify the problem, OurGrid is focused on Bag-of- Tasks (BoT) applications –No need for communication among tasks  Facilitates scheduling and security enforcement –Simple fail-over/retry mechanisms to tolerate faults –No need for QoS guarantees –Script-based programming is natural  Facilitates use Fortunately, many important applications are BoT! –Data mining, Massive search, Bio computing, Parameter sweep, Monte Carlo simulations, Fractal calculations, Image processing and many others Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/

OurGrid Architecture Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/

Finding Resources OurGrid GIS (NodeWiz) allows the execution of rich queries that encompass not only multiple attributes, but also range operators A scheduler might want to locate suitable resources –OS=linux && RAM ≥ 1G && clock > 4GHz && load < 0.5 A user may want to locate a dataset that contains particular data itens –rain_fall && -37º52’ < long < -37º46’ && 144º54’ < lat < 145º03’ && date ≥ 01/01/2007 … Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/

Scheduling with no Information Grid scheduling typically depends on information about the grid (eg. machine speed and load) and the application (eg. task size) However, getting accurate information about all applications and resources is hard in a large scale peer-to-peer grid Can we efficiently schedule tasks without requiring access to information? This would make the system much easier to deploy and simpler to use Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/

Workqueue with Replication Tasks are sent to idle processors When there are no more tasks, running tasks are replicated on idle processors The first replica to finish is the official execution Other replicas are cancelled Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/

Preventing Free-riders It is important to encourage collaboration –In file-sharing, most users free-ride OurGrid uses a reciprocation-based incentive mechanism –Tit-for-tat The Network of Favors –All peers maintain a local balance for all known peers –Peers with greater balances have priority when there is contention for local resources –Under contention, the more one donates, the more one gets back –No additional infrastructure is needed Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/

OurGrid Security How to protect resources from applications? –Leverages on the fact that BoT applications only communicate to receive input and return the output –Input/output is done by the OurGrid Worker Manager that runs within a Java virtual machine –The remote task runs inside a virtual machine, with no network access, and disk access only to a designated partition  Other configurations are possible –A new virtual machine is instantiated before a new task is run How to protect applications from resources? –Increased script language to accommodate an optional check phase  Application may introduce task-dependent water marks Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/

Application Models Script-based –Stage-in/out files. Embedded –Direct access to MyGrid’s API Portal-based –Web interface to Mygrid’s API Framework-based –MyGrid inside of frameworks Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/

OurGrid-enabling an Application Write a script using a very simple language –Simple abstractions  File transfer (put, store, get)  Hide heterogeneity ($PLAYPEN, $STORAGE) –Define constraints (job requirements and grid machine attributes) Write a program that embeds the business logic and may make use of more complex features available through a Java API Deploy a Portal that embeds the application Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/

An Example: Factoring a Number job: label: my_factorial_useless_example requirements: (OS=linux && RAM ≥ 1G && clock > 4GHz && load < 0.5) task: init: store factoring $PLAYPEN remote: factoring output-$JOB-$TASK final: get $PLAYPEN/output-$JOB-$TASK results task: init: store factoring $PLAYPEN remote: factoring output-$JOB-$TASK final: get $PLAYPEN/output-$JOB-$TASK results task: init: store factoring $PLAYPEN remote: factoring output-$JOB-$TASK final: get $PLAYPEN/output-$JOB-$TASK results … Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/

Some Applications Script-based –Risk assessment for agriculture loans (EMBRAPA) –Our own research on computer science  Simulations API-based –SmartPumping (PETROBRAS)  Parallel execution of genetic algorithms for optimizing oil pipeline operation –EPANET-Grid (R&D project)  Grid-enabled version of the EPANET system for simulation of water supply systems –GridVida (R&D Project)  Image processing to support diagnosis by identifying similar cases in the archival database Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/

Some Applications Portal-based : SegHidro (R&D project) –Several uses related to management of water resources in a Brazilian semi-arid area –Academic and industrial users –Allow the configuration of different workflows of simulation models and the execution of them in ensembles –Sharing of computing resources, data and complementary expertise Framework-based: GridUnit (R&D project) –An extension of JUnit –Features  Transparent and Automatic Distribution  Test Case Contamination Avoidance  Environmental coverage  Graphical user interface Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/

OURGRID AND GLITE CO- EXISTENCE OurGrid and gLite Co-Existence Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/

EELA-2 Joint Research Activity Help in fostering the sustainability of the e-Infrastructure –Making the e-Infrastructure more interesting and wide spread by increasing its reach and its usability Promote a continued and increased interaction between research groups in Europe and Latin-America Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/

Increase the Reach by Allowing the scavenging of idle resources –Create the necessary mechanisms to allow resource centres that run the OurGrid middleware to co-exist with resource centres running gLite within the EELA platform –Provide some level of interoperation between these different kinds of resource centres and their associated applications Allowing the execution of the grid middleware on top of Microsoft Windows platforms –Port the gLite middleware to the Windows platform –Leveraging on the multi-platform characteristics of OurGrid Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/

Increase the Usability by Developing new application-oriented grid services –Easy the creation of digital archives and data grid frameworks –Secure storage to solve the insider abuse problem –Support for cooperative workflows –Other selected services required by NA3 applications Leveraging on the grid services provided by the OurGrid middleware to execute bag-of-tasks jobs Facilitating the management of resource centres Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/

Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/ The first step is to allow EELA-2 OurGrid Resource Centres to be created –Provide support for the use of the gLite PKI by OurGrid resource centres OurGrid –gLite Co-Existence

OurGrid –gLite Co-Existence Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/ The second step is to allow idle resources in an EELA-2 gLite resource centre to be exposed as OurGrid resources

Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/ The final step is to allow resources of an OurGrid resource centre to be exposed as gLite resources –This will be achieved in two sub-steps  Firstly, allow clusters to be exposed as a single resource in an OurGrid resource centre

Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/ The final step is to allow resources of an OurGrid resource centre to be exposed as gLite resources –This will be achieved in two sub-steps  Firstly, allow clusters to be exposed as a single resource in an OurGrid resource centre  Secondly, make these resources available at the gLite grid

JRA1 Milestones MJRA1.1: set/2008 –PKI-enabled OurGrid middleware MJRA1.2: jan/2009 –Prototypes of the proposed services MJRA1.3: jan/2009 –Common information system between gLite and OurGrid up & running MJRA1.4: jul/2009 –Prototype of the gateway to transfer jobs from gLite to OurGrid and vice-versa MJRA1.5: jul/2009 –Stable version of the proposed services MJRA1.6: jan/2010 –Stable version of the gateway to transfer jobs from gLite to OurGrid and vice-versa Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/

References About OurGrid –Download the middleware and documentation from –Read “Labs of the world, unite!!! W. Cirne, F. Brasileiro, N. Andrade, L. Costa, A. Andrade, R. Novaes, M. Mowbray. Journal of Grid Computing 4 (3) (2006) ” for more details. About JRA1 –Requests/Comments/Suggestions/Criticisms  Send to either Francisco Brasileiro or Diego Scardaci –Contact the developers at –Download new software distributions from Catania, Joint EELA-2/EGEE-III Tutorial for Trainers, 30/06 to 04/07/