20-May-2003, HEPiX Amsterdam. EDG Fabric Management on Solaris. G. Cancio Melia, L. Cons, Ph. Defert, I. Reguero, J. Pelegrin, P. Poznanski, C. Ungil. Presented by M. Guijarro, CERN/IT.

EDG Fabric Management on Solaris
G. Cancio Melia, L. Cons, Ph. Defert, I. Reguero, J. Pelegrin, P. Poznanski, C. Ungil
Presented by M. Guijarro, CERN/IT

Fabric management on Solaris
– EDG: the European DataGRID project
– Fabric Management (WP4)
– Global functioning of installation and configuration
– Configuration
– Installation
– Node Configuration Management
– Solaris port
– Status and plans

EDG
DataGrid is a project funded by the European Union. The objective is to build the next-generation computing infrastructure, providing intensive computation and analysis of shared large-scale databases, from hundreds of terabytes to petabytes, across widely distributed scientific communities.

EDG
Divided into Work Packages:
– Middleware:
  WP1: Work Scheduling
  WP2: Data Management
  WP3: Monitoring services
  WP4: Fabric Management
  WP5: Storage Management
  WP6: Integration Testbed & Support
  WP7: Network
– Applications:
  WP8: Particle Physics
  WP9: Earth Observation
  WP10: Biology

Fabric Management (WP4)
Divided into Tasks:
– Installation
– Configuration
– Monitoring
– Fault Tolerance
– Resource Management
– Gridification
– Integration
Today's subject: Installation and Configuration

WP4: Global idea
[Diagram, EDG group slide. Client nodes: CCM, SPMA (with SPMA.cfg and a package cache), NCM and its components, Cdispd (registration, notification). Servers: CDB; SWRep servers holding packages (RPM, PKG), with Mgmt API and ACLs, reached via NFS, HTTP, or FTP. Installation server: DHCP handling, PXE handling, KS/JS generator, Mgmt API and ACLs, node install and (re)install.]

WP4: Configuration and Installation
Objective: to develop system management tools enabling the deployment of very large computing fabrics […] with reduced sysadmin and operation costs.
Installation task: solutions for
– automated from-scratch node installation
– node configuration/reconfiguration
– software storage, distribution and installation
Configuration task: solutions for
– storing, maintaining and retrieving configuration information

WP4: Configuration
Central Configuration Database (CDB): common store for configuration information
– …including what software packages to deploy from which repository on which nodes
Configuration information can be arranged in templates:
– Template combinations/hierarchies can be created to match service structures
– Each template can be maintained (using a GUI) by a different person
Configuration information is validated and kept under version control using transactions
[Diagram labels: cluster LXBATCH; nodes lxbatch444, lxbatch445, lxbatch446; layers Linux base packages, CC packages, EDG/LCG middleware]
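The template layering on the WP4: Configuration slide can be illustrated with a minimal Python sketch. It is not the Pan language or the CDB implementation: the `merge` and `build_profile` helpers, the dictionary layout, and the package names and versions are all hypothetical, chosen only to show how shared templates might combine into one profile per node.

```python
def merge(base, override):
    """Return a new dict with `override` layered on top of `base`."""
    result = dict(base)
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(result.get(key), dict):
            # Nested sections are merged key by key rather than replaced.
            result[key] = merge(result[key], value)
        else:
            result[key] = value
    return result


def build_profile(templates):
    """Combine an ordered list of templates; later ones take precedence."""
    profile = {}
    for template in templates:
        profile = merge(profile, template)
    return profile


# Hypothetical templates mirroring the layers named on the slide.
linux_base = {"packages": {"base-pkg": "1.0"}}
cc_packages = {"packages": {"cc-pkg": "1.0"}}
grid_middleware = {"packages": {"edg-pkg": "1.0"}}
lxbatch = [linux_base, cc_packages, grid_middleware]

# Each node reuses the cluster templates and adds only its own data.
profiles = {
    node: build_profile(lxbatch + [{"hostname": node}])
    for node in ("lxbatch444", "lxbatch445", "lxbatch446")
}
print(profiles["lxbatch444"])
```

Because each layer is a separate object, a different person could maintain each one, which is the point the slide makes about template ownership.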

WP4: Installation
The Software Package Management and Distribution subsystem is responsible for managing and storing software packages, and for distributing and installing these packages on client nodes.
SWRep (Software Repository):
– Software modules are bundled into packages using a given packaging format, such as RPM for most Linux distributions or PKG for Sun/Solaris.
– The packages themselves are stored in a managed software repository.
– This repository is accessible via protocols such as HTTP or FTP, or via a shared file system.
Node Configuration Management provides a framework for adapting the actual configuration of a node to its desired configuration, as described in the node's profile inside the CDB.
– This target information is made available to the node via a configuration component running on each node.
– Node components are notified by a daemon that polls the CDB.

WP4: NCM
● NCM – Node Configuration Management
● Client software running on the node which takes care of “implementing” what is in the configuration
● Configurations are centrally stored, managed and accessed (CDB), using XML profiles (per node)
● “Components” (like SUE features) are responsible for updating local config files, and notifying services if needed
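The registration and notification pattern behind NCM components can be sketched as follows. This is a hypothetical illustration, not the NCM or Cdispd code: the `Dispatcher` class, the flat profile dictionaries, and the `ntp` and `syslog` section names are assumptions made for the example.

```python
class Dispatcher:
    """Toy dispatcher: components register for a profile section and are
    called only when that section differs between two profiles."""

    def __init__(self):
        self._registrations = {}  # section name -> list of callables

    def register(self, section, component):
        self._registrations.setdefault(section, []).append(component)

    def notify(self, old_profile, new_profile):
        """Run components whose section changed; return changed sections."""
        changed = []
        for section, components in self._registrations.items():
            if old_profile.get(section) != new_profile.get(section):
                changed.append(section)
                for component in components:
                    component(new_profile.get(section))
        return changed


# Hypothetical components; real ones would rewrite local config files
# and notify the affected services.
applied = []
dispatcher = Dispatcher()
dispatcher.register("ntp", lambda config: applied.append(("ntp", config)))
dispatcher.register("syslog", lambda config: applied.append(("syslog", config)))

old = {"ntp": {"server": "a.example"}, "syslog": {"level": "info"}}
new = {"ntp": {"server": "b.example"}, "syslog": {"level": "info"}}
changed = dispatcher.notify(old, new)
print(changed, applied)
```

Only the `ntp` component runs here, because the `syslog` section is identical in both profiles.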

Solaris: CDB
Configuration Database
– CDB stores the hardware and software configuration in a configuration server
– PAN is used to compile HLD to LLD
– Clients (NCM, AII) access the CDB using the Node View Access API (Configuration Cache Manager)
Solaris
– Global Schema has to be adapted to Solaris
– PAN, CDB and CCM already ported

Solaris: AII
Automated Installation Infrastructure
– Installs machines according to the configuration in the CDB
– 3 modules:
  DHCP
  NBP (Network Bootstrap Program)
  OS Installer
Solaris
– Loader: PXELINUX on Linux -> OpenBoot on Solaris
– OS installer: Anaconda/Kickstart on Linux -> JumpStart on Solaris

Solaris: SPMA
SPMA
– Reads the list of installed packages (OS)
– Gets the list of packages to be installed (CDB)
– Computes the differences
– Determines the list of operations to perform
– Calls the package installer/de-installer
Solaris port
– Reads the pkg database (-> rpm)
– pkgt installs/de-installs (-> rpmt)
– ASIS apps packaged with pkg (-> rpm)
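The SPMA steps listed on this slide (read installed packages, get the target list, compute the differences, derive the operations) can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not SPMA itself: the `plan_operations` function, the name-to-version dictionaries, and the version numbers are hypothetical, and every installed package is assumed to be under SPMA's management.

```python
def plan_operations(installed, target):
    """Compare installed and target package sets (name -> version) and
    return a sorted list of (operation, name, version) tuples."""
    operations = []
    for name, version in target.items():
        if name not in installed:
            operations.append(("install", name, version))
        elif installed[name] != version:
            operations.append(("upgrade", name, version))
    for name, version in installed.items():
        if name not in target:
            # Assumes the package is managed; unmanaged ones would be skipped.
            operations.append(("remove", name, version))
    return sorted(operations)


# Hypothetical package states; the versions are made up for the example.
installed = {"ASIS-ASIS-applog": "1.0", "old-tool": "2.0"}
target = {"ASIS-ASIS-applog": "1.1", "new-tool": "3.0"}

plan = plan_operations(installed, target)
for operation in plan:
    print(operation)
```

The resulting list is what would then be handed to the platform's package installer/de-installer (rpmt on Linux, pkgt on Solaris).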

Solaris: Implementing SPMA
spma-target.cf
Example: /afs/.cern.ch/asis/PKGS/sun4x_58 ASIS-ASIS-applog sun4x_58 ...
spma-managed-packages
Example: - ASIS-ASIS-applog sun4x_58 ...
[Diagram labels: differences; list of operations; check_conflicts; store_actions; arrange_actions; execute_actions; pkgt]
SPMA is an application relying on a set of libraries, all of them system-independent except the (virtual) Packager class, which is inherited by a platform-dependent class (SysVPkgr for Solaris).
PKG Transactions is a tool to install, upgrade and remove Solaris packages in one transaction.
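The split between a virtual Packager class and a platform-dependent subclass, together with the all-or-nothing behaviour of a package transaction, can be sketched as below. Only the class names `Packager` and `SysVPkgr` come from the slide; the method names, the in-memory package table, and the staging logic are assumptions for illustration, and no real Solaris packaging command is invoked.

```python
from abc import ABC, abstractmethod


class Packager(ABC):
    """Platform-independent interface the rest of the tool would rely on."""

    @abstractmethod
    def installed_packages(self):
        """Return a dict mapping package name to installed version."""

    @abstractmethod
    def execute(self, operations):
        """Apply all (operation, name, version) tuples, or none of them."""


class SysVPkgr(Packager):
    """In-memory stand-in for the Solaris back end."""

    def __init__(self, installed):
        self._installed = dict(installed)

    def installed_packages(self):
        return dict(self._installed)

    def execute(self, operations):
        staged = dict(self._installed)  # work on a copy first
        for operation, name, version in operations:
            if operation in ("install", "upgrade"):
                staged[name] = version
            elif operation == "remove":
                if name not in staged:
                    raise ValueError(f"cannot remove {name}: not installed")
                del staged[name]
            else:
                raise ValueError(f"unknown operation {operation}")
        self._installed = staged  # commit only if every step succeeded


packager = SysVPkgr({"ASIS-ASIS-applog": "1.0"})
packager.execute([("upgrade", "ASIS-ASIS-applog", "1.1"),
                  ("install", "new-tool", "3.0")])
after_success = packager.installed_packages()

try:
    packager.execute([("install", "other-tool", "1.0"),
                      ("remove", "missing-tool", "1.0")])
    failed = False
except ValueError:
    failed = True
after_failure = packager.installed_packages()
print(after_success, failed, after_failure)
```

The second call fails on its last step, so the earlier install in the same batch is discarded and the package table is left exactly as it was after the first call.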

Solaris: NCM
Node Configuration Manager
– Updates the configuration of the machine when the configuration in the CDB changes
– Provides the framework; components are needed for the different local services
Solaris
– Many Linux components are reusable (if designed with portability in mind)
– Some specific components are needed (30% of the current SUE features are Solaris-specific)

Solaris: WP4 Status
Installation
– ASIS applications have been packaged with pkg
– SPMA and pkgt work but are still in test
– NCM in detailed design phase (collaboration with EDG, more focused on Linux)
– AII in design phase
Configuration
– Pan, database and cache manager ported and included in CVS
– Global schema worked on (design validated)

Solaris: WP4 Future
– 2003/end Q2: LCG-1 with WP4/Linux installation and configuration (already ~100 nodes)
– 2003/end Q3: SPMA and pkgt will be used for Solaris 9 certification at CERN
– 2003/end Q3: SPMA and pkgt deployed in the Computer Centre
– 2003/end Q3: CDB on Solaris
– 2003/end Q4: GUI for a CDB editor

Conclusion
Specific resources:
– J. Pelegrin (CERN – SPMA, pkgt, ASISdist)
– C. Ungil (SUN fellow – CDB, AII, NCM)
– S. Lopienski (CERN – CDB editor)
Proof of concept of WP4
Proof of portability (RedHat 386 and Solaris, but could also be ia64)