Introduction to Grids and Grid applications Peter Kacsuk and Gergely Sipos MTA SZTAKI www.lpds.sztaki.hu.

Slides:



Advertisements
Similar presentations
SZTAKI Desktop Grid P. Kacsuk MTA SZTAKI
Advertisements

P. Kacsuk, G. Sipos, A. Toth, Z. Farkas, G. Kecskemeti and G. Hermann P. Kacsuk, G. Sipos, A. Toth, Z. Farkas, G. Kecskemeti and G. Hermann MTA SZTAKI.
C3.ca in Atlantic Canada Virendra Bhavsar Director, Advanced Computational Research Laboratory (ACRL) Faculty of Computer Science University of New Brunswick.
What is Grid Computing? Cevat Şener Dept. of Computer Engineering, METU.
High Performance Computing Course Notes Grid Computing.
P-GRADE and WS-PGRADE portals supporting desktop grids and clouds Peter Kacsuk MTA SZTAKI
Introduction to Grids and Grid applications Gergely Sipos MTA SZTAKI
MTA SZTAKI Hungarian Academy of Sciences Grid Computing Course Porto, January Introduction to Grid portals Gergely Sipos
WS-PGRADE: Supporting parameter sweep applications in workflows Péter Kacsuk, Krisztián Karóczkai, Gábor Hermann, Gergely Sipos, and József Kovács MTA.
Hungarian GRID Projects and Cluster Grid Initiative P. Kacsuk MTA SZTAKI
The LHC Computing Grid – February 2008 The Worldwide LHC Computing Grid Dr Ian Bird LCG Project Leader 15 th April 2009 Visit of Spanish Royal Academy.
SICSA student induction day, 2009Slide 1 Social Simulation Tutorial Session 6: Introduction to grids and cloud computing International Symposium on Grid.
Chemistry GRID and its application for air pollution forecast Computer and Automation Research Institute of the Hungarian Academy of Sciences (MTA SZTAKI)
Hungarian Supercomputing GRID
SZTAKI Desktop Grid – a Hierarchical Desktop Grid System P. Kacsuk, A. Marosi, J. Kovacs, Z. Balaton, G. Gombas, G. Vida, A. Kornafeld MTA SZTAKI
Open Science Grid For CI-Days Internet2: Fall Member Meeting, 2007 John McGee – OSG Engagement Manager Renaissance Computing Institute.
National Center for Supercomputing Applications GLORIAD Science Applications Astronomy – Virtual Observatories Global Climate Change Richard M. Crutcher.
INFSO-RI SZTAKI’s Exploitation plan AHM meeting Budapest, 23 June 2009 Peter Kacsuk, Robert Lovas MTA SZTAKI.
1 portal.p-grade.hu Further information on P-GRADE Gergely Sipos MTA SZTAKI Hungarian Academy of Sciences.
SAN DIEGO SUPERCOMPUTER CENTER NUCRI Advisory Board Meeting November 9, 2006 Science Gateways on the TeraGrid Nancy Wilkins-Diehr TeraGrid Area Director.
Open Science Grid For CI-Days Elizabeth City State University Jan-2008 John McGee – OSG Engagement Manager Manager, Cyberinfrastructure.
A short introduction to the Worldwide LHC Computing Grid Maarten Litmaath (CERN)
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Grid projects in Europe Giuseppe Andronico.
From P-GRADE to SCI-BUS Peter Kacsuk, Zoltan Farkas and Miklos Kozlovszky MTA SZTAKI - Computer and Automation Research Institute of the Hungarian Academy.
1 IDGF International Desktop Grid Federation How can you benefit from joining IDGF? Hannover, Peter Kacsuk, MTA SZTAKI, EDGI.
10/24/2015OSG at CANS1 Open Science Grid Ruth Pordes Fermilab
WS-PGRADE portal and its usage in the CancerGrid project M. Kozlovszky, P. Kacsuk Computer and Automation Research Institute of the Hungarian Academy of.
National Computational Science National Center for Supercomputing Applications National Computational Science NCSA Terascale Clusters Dan Reed Director,
Condor: High-throughput Computing From Clusters to Grid Computing P. Kacsuk – M. Livny MTA SYTAKI – Univ. of Wisconsin-Madison
1 Advanced features of the P-GRADE portal Peter Kacsuk, Gergely Sipos Peter Kacsuk, Gergely Sipos MTA.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Code Applications Tamas Kiss Centre for Parallel.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE – paving the way for a sustainable infrastructure.
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
Ian Bird LHC Computing Grid Project Leader LHC Grid Fest 3 rd October 2008 A worldwide collaboration.
MTA SZTAKI Hungarian Academy of Sciences Introduction to Grid portals Gergely Sipos
Introduction to Grids and Grid applications Gergely Sipos MTA SZTAKI
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Applications.
Les Les Robertson LCG Project Leader High Energy Physics using a worldwide computing grid Torino December 2005.
ESFRI & e-Infrastructure Collaborations, EGEE’09 Krzysztof Wrona September 21 st, 2009 European XFEL.
August 3, March, The AC3 GRID An investment in the future of Atlantic Canadian R&D Infrastructure Dr. Virendra C. Bhavsar UNB, Fredericton.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Services for advanced workflow programming.
1 IDGF International Desktop Grid Federation How can you benefit from joining IDGF? Lyon, Peter Kacsuk, MTA SZTAKI, EDGI is.
7. Grid Computing Systems and Resource Management
| nectar.org.au NECTAR TRAINING Module 2 Virtual Laboratories and eResearch Tools.
11 Introduction to EDGI Peter Kacsuk, MTA SZTAKI Start date: Duration: 27 months EDGI.
INFSO-RI Enabling Grids for E-sciencE The EGEE Project Owen Appleton EGEE Dissemination Officer CERN, Switzerland Danish Grid Forum.
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
1 Porting applications to the NGS, using the P-GRADE portal and GEMLCA Peter Kacsuk MTA SZTAKI Hungarian Academy of Sciences Centre for.
The EDGeS project receives Community research funding 1 The EDGeS project: Enabling Desktop Grids for e-Science P. Kacsuk MTA SZTAKI.
The Worldwide LHC Computing Grid Frédéric Hemmer IT Department Head Visit of INTEL ISEF CERN Special Award Winners 2012 Thursday, 21 st June 2012.
Grid Execution Management for Legacy Code Architecture Exposing legacy applications as Grid services: the GEMLCA approach Centre.
27/10/2006, Catania EGEE Industry Day Filling the gaps: Advanced Grid portal and desktop grid technologies for industry Robert Lovas MTA SZTAKI Laboratory.
Northwest Indiana Computational Grid Preston Smith Rosen Center for Advanced Computing Purdue University - West Lafayette West Lafayette Calumet.
1 Open Science Grid: Project Statement & Vision Transform compute and data intensive science through a cross- domain self-managed national distributed.
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
INFSO-RI Enabling Grids for E-sciencE EGEE general project update Fotis Karayannis EGEE South East Europe Project Management Board.
EGI-Engage EGI Webinar - Introduction - Gergely Sipos EGI.eu / MTA SZTAKI 6/26/
DutchGrid KNMI KUN Delft Leiden VU ASTRON WCW Utrecht Telin Amsterdam Many organizations in the Netherlands are very active in Grid usage and development,
Introduction to Grid and Grid applications Peter Kacsuk MTA SZTAKI
Grids and SMEs: Experience and Perspectives Emanouil Atanassov, Todor Gurov, and Aneta Karaivanova Institute for Parallel Processing, Bulgarian Academy.
INTRODUCTION TO HIGH PERFORMANCE COMPUTING AND TERMINOLOGY.
Clouds , Grids and Clusters
Volunteer Computing for Science Gateways
The LHC Computing Grid Visit of Mtro. Enrique Agüera Ibañez
Long-term Grid Sustainability
Introduction to Grid computing and the EGEE project
Grid Computing.
Recap: introduction to e-science
Grid Computing Colton Lewis.
EGI Webinar - Introduction -
Presentation transcript:

Introduction to Grids and Grid applications Peter Kacsuk and Gergely Sipos MTA SZTAKI

What is Grid? ● They are heterogeneous in every aspect Grid ● A Grid is a collection of computers, storages, special devices, services that can dynamically join and leave the Grid Internet ● They are geographically distributed and connected by a wide-area network ● They can be accessed on- demand by a set of users

Why use a Grid? A user has a complex problem that requires many services/resources in order to reduce computation time access large databases access special equipments collaborate with other users Internet

Typical Grid application areas High-performance computing (HPC) to achieve higher performance than individual supercomputers/clusters can provide Reguirement: parallel computing High-throughput computing (HTC) To exploit the spare cycles of various computers connected by wide area networks Collaborative work Several users can jointly and remotely solve complex problems

Two players of the Grid Resource donors = D Resource users = U Relationship between the two characterizes the Grid: if U ~ D => generic Grid model if U >> D=> utility Grid model if U desktop Grid model

Generic Grid modell Internet Donating free resources Requiring resources Inst1 Inst2 Inst4 Inst3

Characteristics of the generic Grid model A volunteer Grid: Anybody can donate resources Heterogeneous resources, that dynamically join and leave Anybody (belonging to the donating institutes) can use the donated resources for solving her/his own applications Symmetric relationship between donors and users: U ~ D Examples: GT-2 grids 1 st version of UK NGS Problems: Installing and maintaining client and server grid software are too complicated Volunteer Grids are not robust and reliable

Desktop Grid model Internet Dynamic resource donation Work package distribution Company/ univ. server Donor: Company/ univ. or private PC Donor: Company/ Univ. or private PC Donor: Company/ univ. or private PC Application

Desktop Grid model – Master/slave parallelism Internet Master Workunit-1 Workunit-2 Workunit-3 Workunit-N DG Server

Characteristics of the desktop Grid model A volunteer Grid: Anybody can donate resources Heterogeneous resources, that dynamically join and leave One or a small number of projects can use the resources Asymmetric relationship between donors and users: U << D Advantage: Donating a PC is extremely easy Setting up and maintaining a DG server is much easier than installing the server sw of utility grids

Types of Desktop Grids Global Desktop Grid Aim is to collect resources for grand-challenge scientific problems Example: BOINC SZTAKI Desktop Grid (SZDG) Local Desktop Grid Aim is to enable the quick and easy creation of grid for any community (company, univ. city, etc.) to solve their own applications Example: Local SZDG

SETI: a global desktop grid ● ● 3.8M users in 226 countries ● 1200 CPU years/day ● 38 TF sustained (Japanese Earth Simulator is 32 TF sustained) ● Highly heterogeneous: >77 different processor types

SZTAKI Desktop Grid global version

TOP 500 entry performance:1645 GFlops URLs: and

SZTAKI Desktop Grid local version Main objective: Enable the creation of local DG for any community Demonstrate how to create such a system Building production Grids requires huge effort and represents a privilege for those organizations where high Grid expertise is available Using the local SZDG package Any organization can build a local DG in a day with minimal effort and with minimal cost (a strong PC is enough as a server machine) The applications of the local community will be executed by the spare PC cycles of the local community There is no limitation for the applied PCs, all the PCs of the organization can be exploited (heterogeneous Grid) You can download the local SZDG package from:

DSP application on a local SZDG in the Univ. of Westminster Digital Signal Processing Appl.: Designing optimal periodic nonuniform sampling sequences Currently more than 100 PCs connected from Westminster and planned to extend over 1000 PCs DSP sizeProductionSZDG ~35min~1h 44min ~7h 23min ~141h~46h 46min The speedup ~5h 4min Sequential ~3h 33min ~41h 53min ~724h

Usage of local SZDG in industry AMRI Hungary Ltd. Drug discovery application Creating enterprise Grid for prediction of ADME/Tox parameters Millions of molecules to test according to potential drug criteria New FP6 EU Grid project: CancerGrid Hungarian Telecom Creating enterprise Grid for supporting large data mining applications where single computer performance is not enough OMSZ (Hungarian Meteorology Service) Creating enterprise Grid for climate modeling

Utility Grid model Internet Donating free resources static 7/24 mode Dynamic resource requirements Inst1 User 1 Inst2 User N Donor and user

Characteristics of the utility Grid model Semi-volunteer Grids: Donors must be “professional” resource providers who provide production service (7/24 mode) Typically homogeneous resources Anybody can use the donated resources for solving her/his own applications Asymmetric relationship between donors and users: U >> D Examples: EGEE -> SEE-Grid, BalticGrid, etc. UK NGS current version, NorduGrid OSG, TeraGrid

The largest production Grid: EGEE Scale > 180 sites in 39 countries ~ CPUs > 5 PB storage > concurrent jobs per day > 60 Virtual Organisations Country participating in EGEE

Dynamic Grid ~ 33 sites, ~1400 CPUS Production Grid Real users, real applications It is in 24/7 operation, unattended by administrators for most of the time NorduGrid

TeraGrid NCSA: Compute IntensiveSDSC: Data IntensivePSC: Compute Intensive IA64 Pwr4 EV68 IA32 EV7 IA64 Sun 10 TF IA large memory nodes 230 TB Disk Storage 3 PB Tape Storage GPFS and data mining 4 TF IA-64 DB2, Oracle Servers 500 TB Disk Storage 6 PB Tape Storage 1.1 TF Power4 6 TF EV68 71 TB Storage 0.3 TF EV7 shared-memory 150 TB Storage Server 1.25 TF IA Viz nodes 20 TB Storage 0.4 TF IA-64 IA32 Datawulf 80 TB Storage Extensible Backplane Network LA Hub Chicago Hub IA32 Storage Server Disk Storage Cluster Shared Memory Visualization Cluster LEGEND 30 Gb/s IA64 30 Gb/s Sun ANL: VisualizationCaltech: Data collection analysis 40 Gb/s Backplane Router PSC integrated Q3 03

Exploiting parallelism ● Single parallel application ● Single-site parallel execution ● Multi-site parallel execution ● Workflow branch parallelism ● Sequential components ● Parallel components ● Two-level single-site parallelism ● Two-level multi-site parallelism ● Parameter sweep (study) applications: ● The same application is executed with many (1000s) different parameter sets ● The application itself can be ● Sequential ● Single parallel ● workflow

How to use a Grid for single-site parallelism? Internet

How to use a Grid for multi-site parallelism? Internet

How to use a Grid for two level single-site parallelism? Internet

How to use a Grid for two level multi-site parallelism? Internet

Master/slave parallelism and parametric studies in utility Grids Internet Master Workunit-1 Workunit-2 Workunit-3 Workunit-N

How to use a Grid for HPC parameter study? Internet

Typical Grid Applications ● Computation intensive ● Interactive simulation (climate modeling) ● Very large-scale simulation and analysis (galaxy formation, gravity waves, battlefield simulation) ● Engineering (parameter studies, linked component models) ● Data intensive ● Experimental data analysis (high-energy physics) ● Image and sensor analysis (astronomy, climate study, ecology) ● Distributed collaboration ● Online instrumentation (microscopes, x-ray devices, etc.) ● Remote visualization (climate studies, biology) ● Engineering (large-scale structural testing, chemical engineering) ● In all cases, the problems were big enough that they required people in several organization to collaborate and share computing resources, data, instruments.

EGEE Applications ● >20 applications from 7 domains ● High Energy Physics ● Biomedicine ● Earth Sciences ● Computational Chemistry ● Astronomy ● Geo-Physics ● Financial Simulation ● Further applications in evaluation Applications now moving from testing to routine and daily usage

An Example Problem tackled by EGEE ● The Large Hadron Collider (LHC) located at CERN, Geneva Switzerland ● Scheduled to go into production in 2007 ● Will generate 10 Petabytes (10 7 Gigabytes) of information per year ● This information must be processed and stored somewhere ● It is beyond the scope of a single institution to manage this problem -> VO is needed

R R R R R R R R R R Virtual Organizations Distributed resources and people Linked by networks, crossing admin domains Sharing resources, common goals Dynamic VO-B VO-A R R R R

Local EGEE related activities Portugal and Spain are part of South West EGEE Federation (SWE) grid.ifca.unican.es/egee-sa1-swe Also involved in “E-infrastructure shared between Europe and Latin America” project (EELA)

OGSA/OGSI Super- computing Network Computing Cluster computing High-throughput computing High-performance computing Web Services CondorGlobus Client/server Progress in Grid Systems OGSA/WSRF Grid Systems 2nd Gen. 3rd Gen. 1st Gen.

Other EU Grid projects Training and Education: ICEAGE International Collaboration to Extend and Advance Grid Education

Structure of the current course ● Day 1 ● Introduction to grid technologies ● Detailed study of GT4 technology and usage ● Day 2 ● Application development on Grids ● Portal technology ● Day 3 ● GT4 Grid installation ● Portal installation