CERN IT Department CH-1211 Genève 23 Switzerland t Migration from ELFMs to Agile Infrastructure CERN, IT Department.

Slides:



Advertisements
Similar presentations
MIS 2000 Class 20 System Development Process Updated 2014.
Advertisements

Agile Testing with Testing Anywhere The road to automation need not be long.
VENDORS, CONSULTANTS AND USERS
Testing as a Service with HammerCloud Ramón Medrano Llamas CERN, IT-SDC
CERN IT Department CH-1211 Genève 23 Switzerland t Next generation of virtual infrastructure with Hyper-V Michal Kwiatek, Juraj Sucik, Rafal.
CERN IT Department CH-1211 Genève 23 Switzerland t Integrating Lemon Monitoring and Alarming System with the new CERN Agile Infrastructure.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 17 Slide 1 Extreme Programming.
CONTINUOUS INTEGRATION, DELIVERY & DEPLOYMENT ONE CLICK DELIVERY.
Continuous Delivery Ajey Gore Head of Technology ThoughtWorks India.
Software to Data model Lenos Vacanas, Stelios Sotiriadis, Euripides Petrakis Technical University of Crete (TUC), Greece Workshop.
Term 2, 2011 Week 3. CONTENTS The physical design of a network Network diagrams People who develop and support networks Developing a network Supporting.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
Status of WLCG Tier-0 Maite Barroso, CERN-IT With input from T0 service managers Grid Deployment Board 9 April Apr-2014 Maite Barroso Lopez (at)
AI project components: Facter and Hiera
Operating Systems & Infrastructure Services CERN IT Department CH-1211 Geneva 23 Switzerland t OIS OIS Feedback on Module Responsibilities.
InWEnt | Qualified to shape the future1 Internet based Human Resource Development Management Platform Human Resource Development Programme in Natural Disaster.
Extreme/Agile Programming Prabhaker Mateti. ACK These slides are collected from many authors along with a few of mine. Many thanks to all these authors.
CERN IT Department CH-1211 Genève 23 Switzerland t ITIL at CERN Tony Cass HEPiX LBL, 29 th October 2009.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
CERN IT Department CH-1211 Genève 23 Switzerland t Windows Desktop Applications Life-cycle Management Sebastien Dellabella, Rafal Otto Internet.
CERN IT Department CH-1211 Genève 23 Switzerland t Experiences running a production Puppet Ben Jones HEPiX Bologna Spring.
Configuration Management Evolution at CERN Gavin
Chapter 14 Part II: Architectural Adaptation BY: AARON MCKAY.
CERN IT Department CH-1211 Geneva 23 Switzerland t Daniel Gomez Ruben Gaspar Ignacio Coterillo * Dawid Wojcik *CERN/CSIC funded by Spanish.
Continuous Integration and Code Review: how IT can help Alex Lossent – IT/PES – Version Control Systems 29-Sep st Forum1.
Platform & Engineering Services CERN IT Department CH-1211 Geneva 23 Switzerland t PES AI’s user access, OpenStack security groups and firewall.
VENDORS, CONSULTANTS AND USERS. WHY CAN’T COMPANIES DEVELOP THEIR OWN ERP PACKAGES? To develop an ERP package is a complex & time consuming activity which.
CERN IT Department CH-1211 Genève 23 Switzerland t The Agile Infrastructure Project Part 1: Configuration Management Tim Bell Gavin McCance.
MIS 7003 MBA Core Course in MIS Professor Akhilesh Bajaj The University of Tulsa Introduction to S/W Engineering © All slides in this presentation Akhilesh.
What Is DevOps? DevOps is "a portmanteau of 'development' and 'operations'" and is "a software development method that stresses communications, collaboration,
CERN IT Department CH-1211 Geneva 23 Switzerland t CF Computing Facilities Agile Infrastructure Monitoring CERN IT/CF.
CERN IT Department CH-1211 Genève 23 Switzerland PES 1 Ermis service for DNS Load Balancer configuration HEPiX Fall 2014 Aris Angelogiannopoulos,
CERN IT Department CH-1211 Genève 23 Switzerland t IT Configuration Activities Gavin McCance Online Cross-experiment Meeting, 14 June 2012.
1 CERN IT Department CH-1211 Genève 23 Switzerland t Puppet in the CERN CC Tomas Karasek Steve Traylen Oct
Stairway to the cloud or can we take the highway? Taivo Liik.
| nectar.org.au NECTAR TRAINING Module 4 From PC To Cloud or HPC.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF Agile Infrastructure Monitoring HEPiX Spring th April.
CERN IT Department CH-1211 Geneva 23 Switzerland t A proposal for improving Job Reliability Monitoring GDB 2 nd April 2008.
Chapter 10 Information Systems Development. Learning Objectives Upon successful completion of this chapter, you will be able to: Explain the overall process.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES Andrea Sciabà Hammercloud and Nagios Dan Van Der Ster Nicolò Magini.
Platform & Engineering Services CERN IT Department CH-1211 Geneva 23 Switzerland t PES Development Workflow of the Configuration Management.
An Agile Requirements Approach 1. Step 1: Get Organized  Meet with your team and agree on the basic software processes you will employ.  Decide how.
Agenda: Overview of Agile testing Difference between Agile and traditional Methodology Agile Development Methodologies Extreme Programming Test Driven.
Operating Systems & Information Services CERN IT Department CH-1211 Geneva 23 Switzerland t OIS Drupal at CERN Juraj Sucik Jarosław Polok.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN IT Monitoring and Data Analytics Pedro Andrade (IT-GT) Openlab Workshop on Data Analytics.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF Alarming with GNI VOC WG meeting 12 th September.
CERN - IT Department CH-1211 Genève 23 Switzerland t Operating systems and Information Services OIS Proposed Drupal Service Definition IT-OIS.
1 Update at RAL and in the Quattor community Ian Collier - RAL Tier1 HEPiX FAll 2010, Cornell.
Cruise Training Introduction of Continuous Integration.
CERN IT Department CH-1211 Genève 23 Switzerland t SL(C) 5 Migration at CERN CHEP 2009, Prague Ulrich SCHWICKERATH Ricardo SILVA CERN, IT-FIO-FS.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN Agile Infrastructure Monitoring Pedro Andrade CERN – IT/GT HEPiX Spring 2012.
CERN AI Config Management 16/07/15 AI for INFN visit2 Overview for INFN visit.
©Ian Sommerville 2007COTS-based System Engineering Slide 1 COTS-based System Engineering.
Platform & Engineering Services CERN IT Department CH-1211 Geneva 23 Switzerland t PES Agile Infrastructure Project Overview : Status and.
Configuration Services at CERN HEPiX fall Ben Jones, HEPiX Fall 2014.
Founded by Big Five Consulting ex-employees Oracle Gold Partner Focus on PeopleSoft 15 years of PeopleSoft experience Worked in both technical and functional.
Software Development. The Software Life Cycle Encompasses all activities from initial analysis until obsolescence Analysis of problem or request Analysis.
CERN IT Department CH-1211 Genève 23 Switzerland t Load testing & benchmarks on Oracle RAC Romain Basset – IT PSS DP.
Platform & Engineering Services CERN IT Department CH-1211 Geneva 23 Switzerland t PES GIT Service in the Agile Infrastructure Project Vítor.
READ ME FIRST Use this template to create your Partner datasheet for Azure Stack Foundation. The intent is that this document can be saved to PDF and provided.
System Conversion.
Status and Plans for InCA
Software Development Life Cycle
Graduation Project Kick-off presentation - SET
X in [Integration, Delivery, Deployment]
JOINED AT THE HIP: DEVSECOPS AND CLOUD-BASED ASSETS
Extreme Programming.
The StarlingX Story Learn, Try, Get Involved!
OU BATTLECARD: Oracle Systems Learning Subscription
Presentation transcript:

CERN IT Department CH-1211 Genève 23 Switzerland t Migration from ELFMs to Agile Infrastructure CERN, IT Department HEPiX Fall 2013 Workshop

CERN IT Department CH-1211 Genève 23 Switzerland t Migration from ELFMs to Agile Infrastructure - 2 Agenda Configuration Management at CERN –ELFms –Agile Infrastructure (AI) Project Migration from ELFms to AI Migration activities –meetings –training –documentation AI Challenges Conclusions

CERN IT Department CH-1211 Genève 23 Switzerland t Configuration Management Service at CERN We aim to provide the tools, process and the infrastructure for the service managers and system administrators to manage the configuration of any machine hosted in the CERN’s Computer Centre Migration from ELFMs to Agile Infrastructure - 3

CERN IT Department CH-1211 Genève 23 Switzerland t ELFms Extremely Large Fabric management system (ELFms) –includes a set of tools Quattor for configuration management Lemon for performance and exception monitoring LEAF Hardware Management System and State Management System Migration from ELFMs to Agile Infrastructure - 4

CERN IT Department CH-1211 Genève 23 Switzerland t ELFms: Quattor Quattor current situation: –configuration management software –developed in-house –first release 2003 –successfully managing around 8000 servers –we have more that 260 persons managing configurations Including system administrators from CERN IT and experiments, computer centre operators and technicians Why changing?? –not scalable to the number of machines expected in the future Geneva Computer Centre and Wigner Computer Centre –the costs to develop and maintain our own configuration solution are not reducing –hire professionals trained in a tool developed in-house is not easy –small community –support problems Migration from ELFMs to Agile Infrastructure - 5

CERN IT Department CH-1211 Genève 23 Switzerland t Agile Infrastructure Project Agile Infrastructure (AI) Configuration Management System –Puppet for the configuration management Cloud infrastructure –new VM hosting service – cloud approach OpenStack as cloud software for virtual machines Monitoring –new alarm system (lemon/GNI) Migration from ELFMs to Agile Infrastructure - 6

CERN IT Department CH-1211 Genève 23 Switzerland t AI: Configuration Management System based on free and open source tools –Puppet and its suite of tools, Foreman, Git, etc. scalable to the increasing number of computer nodes –aim ≈ 300,000 virtual machines compatible with the cloud approach processes and workflows completely rethought Migration from ELFMs to Agile Infrastructure - 7

CERN IT Department CH-1211 Genève 23 Switzerland t AI: Configuration Management Benefits of the AI approach: –Puppet has a very active community: often problems are fixed and missing features implemented before you event report them –community support is working well until now –documentation: the Puppet documentation is very good and is continuously being improved –fast deployment tools already available and “ready to use” –easier to hire people who have skills in a widely- used tools than our internal tools Migration from ELFMs to Agile Infrastructure - 8

CERN IT Department CH-1211 Genève 23 Switzerland t Migration from ELFms to AI What is the best way to migrate to the new system? –it was decided to redo everything –no direct conversion from Quattor managed machines to Puppet managed machines… in other words: no direct conversion of the Quattor configuration files –Why? easier and faster to redo everything than try to convert Quattor configurations Migration from ELFMs to Agile Infrastructure - 9

CERN IT Department CH-1211 Genève 23 Switzerland t Migration from ELFms to AI ELFms and AI for the moment they live in parallel It is expected that over time less and less servers will be configured/managed with ELFms/Quattor –we aim to do most of the transition to AI/Puppet during the next two years Migration: It was decided to involve system administrators in early stages of the project –Why? collect their feedback promote collaboration between the system administrators and the AI architects help us to adapt AI to the system administrators requirements Migration from ELFMs to Agile Infrastructure - 10

CERN IT Department CH-1211 Genève 23 Switzerland t Migration activities (meetings) What have we been doing to introduce the new configuration management system to the system administrators? –one year after the starting of the project a series of regular meetings started to happen with system administrators and service managers introduce the new Configuration Management System provide information about project and future plans since the first meeting the system administrators have received early access to the Configuration Management System –deliberated decision –it was important for us their “user experience” with our system Migration from ELFMs to Agile Infrastructure - 11

CERN IT Department CH-1211 Genève 23 Switzerland t Migration activities (meetings - results) Example how the collaboration have been working… three months after the start of the regular meetings it was introduced the first development process of the Puppet modules –puppet modules are reusable set of files and Puppet manifests to configure a piece of software (e.g. apache, mysql, etc.) –puppet modules can be shared by several services it is necessary to coordinate their changes we don’t want people to step on each other Migration from ELFMs to Agile Infrastructure - 12

CERN IT Department CH-1211 Genève 23 Switzerland t Migration activities (meetings - results) Results of meetings: feedback on the process –we had some shortcomings with the first iteration issues in pushing changes into production at different speeds the test process was not easy the ownership and responsibility of the changes weren’t clear etc. –it was necessary to change the process several improvements were made… Migration from ELFMs to Agile Infrastructure - 13

CERN IT Department CH-1211 Genève 23 Switzerland t Migration activities (meetings - results) Nine months after the introduction of the first version development process… a new version was introduced… –the new version mitigates several problems encountered on the first version –feedback was very positive –still on-going discussion a positive collaboration between the AI team and the system administrators improved the development process of the Puppet modules Migration from ELFMs to Agile Infrastructure - 14

CERN IT Department CH-1211 Genève 23 Switzerland t Migration activities (training) Eight months after the start of regular meetings we wanted to increase the number of participants… A new strategy was necessary… –during the last summer five training sessions were organised to introduced the new system to the system administrators and service managers hands-on approach –helped to get even more system administrators involved in the on-going project –after one morning: the users were able to create the first Puppet managed virtual machine once again important feedback was collected during these training sessions Migration from ELFMs to Agile Infrastructure - 15

CERN IT Department CH-1211 Genève 23 Switzerland t AI Documentation Keep the documentation of our processes up- to-date has been a great challenge –the evolution of the project and constant changes make very difficult to have a up-to-date documentation even during the training sessions the things were changing –information spread across several places –several tries to mitigate the problem forum (later replaced by standard support lines) the regular meetings with system administrators mitigates the problem but doesn’t solve it –direct and fast communication of the changes –presentation slides become obsolete with the evolution of the project Migration from ELFMs to Agile Infrastructure - 16

CERN IT Department CH-1211 Genève 23 Switzerland t AI Challenges Documentation –keep up the constantly changing environment –define and maintain the documentation of all the processes Certain things had to be re-done differently –trial and error approach –time and effort consuming Maturity of Puppet and the other tools –e.g. it was necessary to rollback the migration from Puppet 2.7 to 3.0 until a bug fix was provided Puppet not completely adapted to an heterogeneous population of system administrators working on very different services –secrets isolation –code from independent services running on the same infrastructure Integration of the various tools we selected proved difficult sometimes Migration from ELFMs to Agile Infrastructure - 17

CERN IT Department CH-1211 Genève 23 Switzerland t Conclusions Approach taken was well appreciated –different from the past… –release early, release often feedback loop between developers end users/testers has been working quite well –the new developments are totally focused on the users needs –the feedback has been very positive The use of the off-the-shelf open source tools has some benefits –documentation you even have books on it –large and active community is supporting Puppet Migration from ELFMs to Agile Infrastructure - 18

CERN IT Department CH-1211 Genève 23 Switzerland t Conclusions The benefits have been great: for the first time during the development of a project in the IT department system administrators are participate closely in the development of a new project –The development process is entirely open… Puppet and friends model is moving forward –4000 hosts in production –150 persons managing configurations Migration from ELFMs to Agile Infrastructure - 19

CERN IT Department CH-1211 Genève 23 Switzerland t Thanks!! Questions?