Enrico Fattibene INFN-CNAF

Slides:



Advertisements
Similar presentations
The LHC experiments AuthZ Interoperation requirements GGF16, Athens 16 February 2006 David Kelsey CCLRC/RAL, UK
Advertisements

EGEE-II INFSO-RI Enabling Grids for E-sciencE The gLite middleware distribution OSG Consortium Meeting Seattle,
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
DataGrid is a project funded by the European Union 22 September 2003 – n° 1 EDG WP4 Fabric Management: Fabric Monitoring and Fault Tolerance
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
INFSO-RI Enabling Grids for E-sciencE FloodGrid application Ladislav Hluchy, Viet D. Tran Institute of Informatics, SAS Slovakia.
08/11/908 WP2 e-NMR Grid deployment and operations Technical Review in Brussels, 8 th of December 2008 Marco Verlato.
SICSA student induction day, 2009Slide 1 Social Simulation Tutorial Session 6: Introduction to grids and cloud computing International Symposium on Grid.
Computing for ILC experiment Computing Research Center, KEK Hiroyuki Matsunaga.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Configuring and Maintaining EGEE Production.
HPDC 2007 / Grid Infrastructure Monitoring System Based on Nagios Grid Infrastructure Monitoring System Based on Nagios E. Imamagic, D. Dobrenic SRCE HPDC.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Enabling Grids for E-sciencE ENEA and the EGEE project gLite and interoperability Andrea Santoro, Carlo Sciò Enea Frascati, 22 November.
L ABORATÓRIO DE INSTRUMENTAÇÃO EM FÍSICA EXPERIMENTAL DE PARTÍCULAS Enabling Grids for E-sciencE Grid Computing: Running your Jobs around the World.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
Grid Middleware Tutorial / Grid Technologies IntroSlide 1 /14 Grid Technologies Intro Ivan Degtyarenko ivan.degtyarenko dog csc dot fi CSC – The Finnish.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES GGUS Overview ROC_LA CERN
INFSO-RI Enabling Grids for E-sciencE Introduction to Grid Computing, EGEE and Bulgarian Grid Initiatives - Plovdiv,
Les Les Robertson LCG Project Leader High Energy Physics using a worldwide computing grid Torino December 2005.
Grid User Interface for ATLAS & LHCb A more recent UK mini production used input data stored on RAL’s tape server, the requirements in JDL and the IC Resource.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA1: Grid Operations Maite Barroso (CERN)
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
HLRmon accounting portal DGAS (Distributed Grid Accounting System) sensors collect accounting information at site level. Site data are sent to site or.
Recent improvements in HLRmon, an accounting portal suitable for national Grids Enrico Fattibene (speaker), Andrea Cristofori, Luciano Gaido, Paolo Veronesi.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team Kickoff Meeting.
HLRmon accounting portal The accounting layout A. Cristofori 1, E. Fattibene 1, L. Gaido 2, P. Veronesi 1 INFN-CNAF Bologna (Italy) 1, INFN-Torino Torino.
DataTAG is a project funded by the European Union International School on Grid Computing, 23 Jul 2003 – n o 1 GridICE The eyes of the grid PART I. Introduction.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of gLite, the EGEE middleware Mike Mineter Training Outreach Education National.
DataTAG is a project funded by the European Union CERN, 8 May 2003 – n o 1 / 10 Grid Monitoring A conceptual introduction to GridICE Sergio Andreozzi
II EGEE conference Den Haag November, ROC-CIC status in Italy
HLRmon Enrico Fattibene INFN-CNAF 1EGI-TF Lyon, France19-23 September 2011.
Using HLRmon for advanced visualization of resource usage Enrico Fattibene INFN - CNAF ISCG 2010 – Taipei March 11 th, 2010.
EMI INFSO-RI Servizi Grid per il calcolo e l'accesso ai dati Workshop DUCK – Bologna, Francesco Giacomini INFN-CNAF.
Enabling Grids for E-sciencE INFN Workshop – May 7-11 Rimini 1 Grid Accounting Status at INFN Riccardo Brunetti INFN-TORINO.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
DGAS Accounting – toward national grid infrastructures HPDC workshop on Monitoring, Logging and Accounting, (MLA) in production Grids 10/06/2009, Munich.
Grid Colombia Workshop with OSG Week 2 Startup Rob Gardner University of Chicago October 26, 2009.
EGI-InSPIRE RI An Introduction to European Grid Infrastructure (EGI) March An Introduction to the European Grid Infrastructure.
Claudio Grandi INFN Bologna Workshop congiunto CCR e INFNGrid 13 maggio 2009 Le strategie per l’analisi nell’esperimento CMS Claudio Grandi (INFN Bologna)
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI solution for high throughput data analysis Peter Solagna EGI.eu Operations.
Accessing the VI-SEEM infrastructure
Workload Management Workpackage
Grid Computing: Running your Jobs around the World
Job monitoring and accounting data visualization
Regional Operations Centres Core infrastructure Centres
The EDG Testbed Deployment Details
Il sistema di supporto di INFNGRID e GGUS
OGF PGI – EDGI Security Use Case and Requirements
StoRM: a SRM solution for disk based storage systems
Ian Bird GDB Meeting CERN 9 September 2003
GGF OGSA-WG, Data Use Cases Peter Kunszt Middleware Activity, Data Management Cluster EGEE is a project funded by the European.
Brief overview on GridICE and Ticketing System
Introduction to gLite GRID Enviroment
Monitoring: problems, solutions, experiences
EGEE VO Management.
How to enable computing
Grid2Win: Porting of gLite middleware to Windows XP platform
Sergio Fantinel, INFN LNL/PD
Introduction to Grid Technology
GridICE monitoring for the EGEE infrastructure
A Messaging Infrastructure for WLCG
a VO-oriented perspective
Leigh Grundhoefer Indiana University
LHC Data Analysis using a worldwide computing grid
DGAS Today and tomorrow
HLRmon accounting portal
Information Services Claudio Cherubino INFN Catania Bologna
Presentation transcript:

Enrico Fattibene INFN-CNAF Grid introduction Enrico Fattibene INFN-CNAF 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster Outline The scientific “demand” What is a Grid? Primary Grid components European Grid Infrastructure (EGI) Italian Grid Infrastructure (IGI) IGI Grid management and support 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster eScience Science is becoming increasingly digital, needs to deal with increasing amounts of data and computational needs Simulations get ever more detailed Nanotechnology – design of new materials from the molecular scale Modelling and predicting complex systems (weather forecasting, river floods, earthquake) Decoding the human genome Experimental Science uses ever more sophisticated sensors to make precise measurements Need high statistics Huge amounts of data Serves user communities around the world Science is getting more digital world-wide – LHC as example 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster 3

Calcolo Parallelo su Grid e CSN4cluster LHC at CERN CMS LHCb ATLAS ALICE 40,000,000 collisions/sec in each of the four detectors 100,000 of today’s fastest processors 15 PetaBytes of new data each year 150 times the total content of the Web each year 1 Petabyte (1PB) = 1000TB = 10 times the text content of the World Wide Web** ** Urs Hölzle, VP Operations at Google 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster What is a Grid? Computational Grid  is a collection of distributed, possibly heterogeneous resources which can be used as an ensemble to execute large-scale applications The three fondamental properties of Grid computing: coordinating resources that are not subject to centralized control using standard, open, general-purpose protocols and interfaces delivering nontrivial qualities of service Large-scale coordinated management of resources belonging to different administrative domains (multi-domain vs single domain) Standard, open, multi-purpose protocols and interfaces that provide a range of services (standard vs proprietary) Delivery of complex Quality of Service (QoS): Grid computing allows its constituent resources to be used in a coodinated fashion to deliver various types of QoS, such as resposed time, throughput, avaiability, reliability, security, etc. 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster Primary components The primary components of a production Grid are: Computing resources Storage resources Access points to the grid Core services Other elements are as much fundamental for the working, managing and monitoring of the Grid: Monitoring tools Accounting tools Management and control infrastructure 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster Computing resources Provide the possibility to execute a computation But also: Get the status of the computation Cancel the computation Computing resources are typically provided by possibly large farms of computers - Worker Nodes (WNs) Usually managed by a batch system (e.g. LSF, PBS, Condor) The corresponding Grid abstraction is called a Computing Element (CE) 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster Storage resources Provide the possibility to manage the storage of data Data are typically in the form of files Create, read, write, delete files/directories Storage may be provided using different technologies DPM, Castor, dCache, StoRM for management GridFTP for transfer rfio, gsidcap, posix,... for access The corresponding Grid abstraction is called Storage Element (SE) 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster Authentication A Grid may count hundreds of CEs and SEs. Do I need an account on each of them? No A Grid identity is managed with an X.509 certificate, which represents that user's credentials /C=IT/O=INFN/OU=Personal Certificate/L=CNAF/CN=Enrico Fattibene A Grid identity is transparently mapped to a local identity/account, provided the authorization is granted 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster Authorization Grid users belongs to an experiment and, within that experiment, to different groups On the Grid an experiment is a Virtual Organization (VO) VO, groups and roles can be associated to an identity by a VO Membership Service (VOMS) VO, groups and roles are included in the user's credentials and used, for example, in the local mapping 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster Information System How do I know which resources are available? How do I know which ones I can use? Services publish their existence, characteristics and status in the Information Service The information is published according to an agreed-upon schema, called the GLUE schema The most common implementation is based on LDAP and is called BDII (Berkley Database Information Index) 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster Job Management The Workload Management System (WMS) is responsible for the distribution and management of tasks across Grid resources, in particular Computing Elements, in such a way that applications are conveniently, efficiently and effectively executed Complemented by the Logging&Bookkeeping (LB) Service Keep track of a number of events generated by different components involved in job management Provide the status of a job 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster Monitoring Observing the composition, state and features of available resources Analyzing their behavior and performance Detecting and prevent fault situations 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster Accounting How many resources have I used? How many resources have a certain VO used? An accounting system provides support to give precise answers to such questions Collect information at resource level Propagate the info at higher-levels, where it can be aggregated according to different views 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster Grid schema Information System Data Catalogs User Support Core Services VO Management Job Broker (WMS) File Transfer Service CE SE Site A Site B CE SE Site C CE SE 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster Grid advantages: site Make better use of existing resource Monitoring tools Accounting tools Support for site managers Installation, upgrading, problems, ticketing system Coordination of security aspects 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster Grid advantages: user Can solve larger, more complex problems in a shorter time Easier to collaborate with other organizations Support for users Application porting 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

European e-Infrastructures European Data Grid (EDG) Middlewere developing and testbed deployment Enabling Grid for E-sciencE (EGEE) I-II-III From the prototype to the production infrastructure European Grid Infrastructure (EGI) Towards a sustainable Grid infrastructure Key role of the National Grid Initiatives (NGIs) Based on the gLite/UMD (Unified Middleware Distribution) middleware release http://www.egi.eu/ 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster EGI in numbers Logical CPUs (cores) 248,424 EGI 337,608 All Storage resources 106.7 PB disk 112.8 PB tape Resource Centres 329 EGI 346 All 35% of logical cores provided by the 9 largest Resource Centres Countries 50 EGI 57 All 38 National Grid Infrastructures (NGIs) providing resources 1 European International Research Organisations (EIRO) providing resources (CERN) 19 countries in 4 non-European Operations Centres 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster IGI The Italian Grid Infrastructure (IGI) is part of EGI together with many European National Grid Initiatives (NGIs) It’s one of the widest NGIs http://www.italiangrid.org/ 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster IGI in numbers Logical CPUs (cores) 26,087 Storage resources 24.5 PB disk 5 PB tape Resource Centres 58 Partners 19 Institutes/Universities Users 1100 Job per year 30 millions 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster IGI Grid management IGI Grid management is performed by the Operation Center. The main activities are: Production of the Infngrid middleware release (customization of the gLite/UMD release) and test Deployment of the release to the sites, support to local administrators and sites certification Periodical check of the resources and services status Support at an Italian level Support at an European level Introduction of new Italian sites in the infrastructure Introduction of new regional VOs 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster IGI support About 10 supporters perform a checking activity composed of 1 shift per week, with 2 person per shift: provides a first support to sites and users (1st line supporters team) Specialists of Grid services (2nd line supporters) take place in case of more complex problems The main activities is: Checking of the Grid status and problem warning, tailing them until their solution if possible Checking of the ticket still opened and pressing the expert or the site-managers for answering and solving them In case of problems with IGI infrastructure: Register and submit tickets through https://ticketing.cnaf.infn.it CMT (Central Management Team) is the generic department Evidenziare il cambiamento nel sistema dei turni 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster Monitoring tools Nagios Simplifies Grid resources operations Visualization & management interface on Grid resources status Provides site admin-centric monitoring Issues notifications as soon as problem appears GStat Queries the Information System every 5 minutes The sites and nodes checked are those registered in the GOC DB The inconsistency of the information published and the eventual missing of a service that a site should publish are reported as an error http://gstat.egi.eu 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

HLRmon accounting portal Open section with general accounting data The personal certificate installed in the browser is required Data aggregated per: Grid site VO, groups and roles CA, RA Job type (Grid or local) https://dgas.cnaf.infn.it/hlrmon/report/charts.php Restricted section providing per-user information visible only by registered and authorized users https://dgas.cnaf.infn.it/hlrmon/report/ranking.php 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster

Calcolo Parallelo su Grid e CSN4cluster Thank you Questions ? 26 Settembre 2011 Calcolo Parallelo su Grid e CSN4cluster