Presentation is loading. Please wait.

Presentation is loading. Please wait.

EDG WP4: installation task LSCCW/HEPiX hands-on, NIKHEF 5/03 German Cancio CERN IT/FIO

Similar presentations


Presentation on theme: "EDG WP4: installation task LSCCW/HEPiX hands-on, NIKHEF 5/03 German Cancio CERN IT/FIO"— Presentation transcript:

1 EDG WP4: installation task LSCCW/HEPiX hands-on, NIKHEF 5/03 German Cancio CERN IT/FIO http://cern.ch/wp4-install

2 HEPiX hands-on / Installation Task / German Cancio CERN - n° 2 Agenda Part 1: u General architectural overview u Components description and current status Part 2: u Exercises on software distribution Part 3: u Discussion: differences to other solutions (if time permits)

3 HEPiX hands-on / Installation Task / German Cancio CERN - n° 3 Disclaimer u This is not a repetition of the WP4 LCFGng tutorial given last year at CERN. I will describe the proposed replacement for LCFG, developed by EDG WP4-install. u This is a work in progress. Most of the subsystems presented here are currently under design/development, although some are already been deployed at CERN. u There are less practical exercises than theory slides ;-( u Your feedback is a most welcome source for improvements!

4 HEPiX hands-on / Installation Task / German Cancio CERN - n° 4 EDG WP4: reminder u WP4 is the ‘fabric management’ work package of the EU DataGrid project. u Objective: n To develop system management tools for enabling the deployment of very large computing fabrics […] with reduced sysadmin and operation costs. u Installation task: solutions for n automated from scratch node installation n node configuration/reconfiguration n software storage, distribution and installation u Configuration task: solutions for n storing, maintaining and retrieving configuration information.

5 HEPiX hands-on / Installation Task / German Cancio CERN - n° 5 WP4-install architecture Subsystems: u Base Installation: n AII (Automated Installation Infrastructure) u Node Configuration: n NCM (Node Configuration Manager) u Software Distribution: n Software Repository (SWRep) n Software Package Management Agent (SPMA)

6 HEPiX hands-on / Installation Task / German Cancio CERN - n° 6 WP4-install arch CCM SPMA NCM Components Cdispd NCM Registration Notification SPMA SPMA.cfg CDB nfs http ftp Mgmt API ACL’s Client Nodes SWRep Servers cache Packages (rpm, pkg) packages (RPM, PKG) PXE DHCP Mgmt API ACL’s Installation server DHCP handling KS/JS PXE handling KS/JS generator Node Install CCM Node (re)install? Automated Installation Infrastructure DHCP and Kickstart (or JumpStart) are re- generated according to CDB contents PXE can be set to reboot or reinstall by operator Software Repository Packages (in RPM or PKG format) can be uploaded into multiple Software Repositories Client access is using HTTP, NFS/AFS or FTP Management access subject to authentication/authorization Node Configuration Manager (NCM) Configuration Management on the node is done by NCM Components Each component is responsible for configuring a service (network, NFS, sendmail, PBS) Components are notified by the Cdispd whenever there was a change in their configuration Software Package Mgmt Agent (SPMA) SPMA manages the installed packages Runs on Linux (RPM) or Solaris (PKG) SPMA configuration done via an NCM component Can use a local cache for pre-fetching packages (simultaneous upgrades of large farms)

7 HEPiX hands-on / Installation Task / German Cancio CERN - n° 7 Base installation (AII)

8 HEPiX hands-on / Installation Task / German Cancio CERN - n° 8 AII (Automated Installation Infrastructure) u Subsystem to automate the node base installation via the network u Layer on top of existing technologies (base system installer, DHCP, PXE) u Modules: u AII-dhcp: n manage DHCP server for network installation information u AII-nbp (network bootstrap program): n manages the PXE configuration for each node (boot from HD/ start the installation via network) u AII-osinstall: n Manage OS configuration files required by the OS installation procedure (KickStart, JumpStart) u More details in AII design document: http://edms.cern.ch/document/374559 http://edms.cern.ch/document/374559

9 HEPiX hands-on / Installation Task / German Cancio CERN - n° 9 AII: current status u Architectural design finished u Detailed Design, implementation progressing u first alpha version expected mid July

10 HEPiX hands-on / Installation Task / German Cancio CERN - n° 10 Node Configuration (NCM)

11 HEPiX hands-on / Installation Task / German Cancio CERN - n° 11 Node Configuration Management (NCM) u Client software running on the node which takes care of “implementing” what is in the configuration profile u Modules: n “Components” n Invocation and notification framework n Component support libraries

12 HEPiX hands-on / Installation Task / German Cancio CERN - n° 12 NCM: Components u “Components” (like SUE “features” or LCFG ‘objects’) are responsible for updating local config files, and notifying services if needed u Components register their interest in configuration entries or subtrees, and get invoked in case of changes u Components do only configure the system Usually, this implies regenerating and/or updating local config files (eg. /etc/sshd_config ) u Use standard system facilities (SysV scripts) for managing services n Components can notify services using SysV scripts when their configuration changes. u Possible to define configuration dependencies between components n Eg. configure network before sendmail

13 HEPiX hands-on / Installation Task / German Cancio CERN - n° 13 Component example sub Configure { my ($self) = @_; # access configuration information my $config=NVA::Config->new(); my $arch=$config->getValue('/system/architecture’); # NVA API $self->Fail (“not supported") unless ($arch eq ‘i386’); # (re)generate and/or update local config file(s) open (myconfig,’/etc/myconfig’); … # notify affected (SysV) services if required if ($changed) { system(‘/sbin/service myservice reload’); … }

14 HEPiX hands-on / Installation Task / German Cancio CERN - n° 14 NCM (contd.)  cdispd (Configuration Dispatch Daemon) n Monitors the config profile, and invokes components via the ncd if there were changes  ncd (Node Configuration Deployer): n framework and front-end for executing components (via cron, cdispd, or manually) n Dependency ordering of components u Component support libraries: n For recurring system mgmt tasks (interfaces to system services, sysinfo), log handling, etc u More details in NCM design document http://edms.cern.ch/document/372643 http://edms.cern.ch/document/372643

15 HEPiX hands-on / Installation Task / German Cancio CERN - n° 15 NCM architecture (from design doc.)

16 HEPiX hands-on / Installation Task / German Cancio CERN - n° 16 NCM: Status u Architectural design finished u Detailed (class) design progressing u First version expected mid July u Porting/coding of base configuration components completed mid September n more than 60 components to be ported for having a complete EDG solution (configuring all EDG middleware services)! u Pilot deployment on CERN central interactive/batch facilities expected at the end of the year

17 HEPiX hands-on / Installation Task / German Cancio CERN - n° 17 Software Distribution (SWRep and SPMA)

18 HEPiX hands-on / Installation Task / German Cancio CERN - n° 18 SPM (Software Package Mgmt) (I) SWRep (Software Repository): u Client-server toolsuite for the management of software packages u Universal repository: n Extendable to multiple platforms and package formats (RHLinux/RPM, Solaris/PKG,… others like Debian dpkg) n Multiple package versions/releases u Management (“product maintainers”) interface: n ACL based mechanism to grant/deny modification rights (packages associated to “areas”) n Current implementation using SSH u Client access: via standard protocols n HTTP (scalability), but also AFS/NFS, FTP u Replication: using standard tools (eg. rsync) n Availability, load balancing

19 HEPiX hands-on / Installation Task / German Cancio CERN - n° 19 SPM (Software Package Mgmt) (II) Software Package Management Agent (SPMA): u Runs on every target node u Multiple repositories can be accessed (eg. division/experiment specific) u Plug-in framework allows for portability n System packager specific transactional interface (RPMT, PKGT) u Can manage either all or a subset of packages on the nodes n Useful for add-on installations, and also for desktops n Configurable policies (partial or full control, mandatory and unwanted packages, conflict resolution…) u Addresses scalability n Packages can be stored ahead in a local cache, avoiding peak loads on software repository servers (simultaneous upgrades of large farms) n HTTP protocol allows to use web proxy hierarchies

20 HEPiX hands-on / Installation Task / German Cancio CERN - n° 20 SPM (Software Package Mgmt) (III) u SPMA functionality: 1.Compares the packages currently installed on the local node with the packages listed in the configuration 2.Computes the necessary install/deinstall/upgrade operations 3.Invokes the packager (rpmt/pkgt) with the right operation transaction set u The SPM is driven via a local configuration file n For batch/servers: A NCM component generates/maintains this cf file out of CDB information n For desktops: Possible to write a GUI for locally editing the cf file

21 HEPiX hands-on / Installation Task / German Cancio CERN - n° 21 Software Package Manager (SPM) RPMT u RPMT (RPM transactions) is a small tool on top of the RPM libraries, which allows for multiple simultaneous package operations resolving dependencies (unlike RPM) n Example: ‘upgrade X, deinstall Y, downgrade Z, install T’ and verify/resolve appropriate dependencies u Does use basic RPM library calls, no added intelligence u Ports available for RPM 3 and 4.0.X u Will try to feedback to rpm user community after porting to RPM 4.2 u CERN IT/PS working on equivalent Solaris port (PKGT)

22 HEPiX hands-on / Installation Task / German Cancio CERN - n° 22 SWRep/SPMA architecture Packages Mgmt API Repository A packages Mgmt API CDB config Client nodes NCM/ GUI SPMA.cfg SPMA (RPM, PKG) GUI CLI cache Repository B inventory http afs nfs ftp (HTTP Proxy) rpmt

23 HEPiX hands-on / Installation Task / German Cancio CERN - n° 23 SPMA & SWRep: current status u First production version available u Being deployed in the CERN Computer Centre (next slide) u Enhanced functionality (package cache management) for mid- October u Solaris port progressing (cf. M. Guijarro’s talk)

24 HEPiX hands-on / Installation Task / German Cancio CERN - n° 24 SPMA/SWRep deployment @ CERN CC u Started phasing out legacy SW distribution systems (including ASIS) on the central batch/interactive servers (LXPLUS&LXBATCH) n Using HTTP as package access protocol (scalability) n > 400 nodes currently running it in production n Deployment page: http://cern.ch/wp4-install/CERN/deploy http://cern.ch/wp4-install/CERN/deploy u Server clustering solution n For CDB (XML profiles) and SWRep (RPM’s over HTTP) n Replication done with rsync n Load balancing done with simple DNS round-robin n Currently, 3 servers in production (800 MHz, 500MB RAM, FastEthernet) giving ~ 3*12Mbyte throughput n Future: may include usage of hierarchical web proxys (eg. using squid)

25 HEPiX hands-on / Installation Task / German Cancio CERN - n° 25 Questions / comments ?


Download ppt "EDG WP4: installation task LSCCW/HEPiX hands-on, NIKHEF 5/03 German Cancio CERN IT/FIO"

Similar presentations


Ads by Google