NorthGrid status Alessandra Forti Gridpp13 Durham, 4 July 2005.

Slides:



Advertisements
Similar presentations
Northgrid Status Alessandra Forti Gridpp20 Dublin 12 March 2008.
Advertisements

NorthGrid status Alessandra Forti Gridpp15 RAL, 11 th January 2006.
Northgrid Status Alessandra Forti Gridpp22 UCL 2 April 2009.
Northgrid Status Alessandra Forti Gridpp24 RHUL 15 April 2010.
NorthGrid status Alessandra Forti Gridpp12 Brunel, 1 February 2005.
Report of Liverpool HEP Computing during 2007 Executive Summary. Substantial and significant improvements in the local computing facilities during the.
Southgrid Status Pete Gronbech: 27th June 2006 GridPP 16 QMUL.
SM3121 Software Technology Mark Green School of Creative Media.
London Tier 2 Status Report GridPP 13, Durham, 4 th July 2005 Owen Maroney, David Colling.
RSS. W HAT IS IT AND WHY IS IT USED ? B Y WHOM ? RSS stands for: Rich Site Summary or Really Simple Syndication It’s a technology that allows users to.
© 2009 GroundWork Open Source, Inc. PROPRIETARY INFORMATION: Information contained herein is not for use or disclosure outside of GroundWork Open Source,
Andrew McNab - Manchester HEP - 22 April 2002 UK Rollout and Support Plan Aim of this talk is to the answer question “As a site admin, what are the steps.
Summary of issues and questions raised. FTS workshop for experiment integrators Summary of use  Generally positive response on current state!  Now the.
1 Deployment of an LCG Infrastructure in Australia How-To Setup the LCG Grid Middleware – A beginner's perspective Marco La Rosa
London Tier 2 Status Report GridPP 12, Brunel, 1 st February 2005 Owen Maroney.
Southgrid Status Report Pete Gronbech: February 2005 GridPP 12 - Brunel.
26/4/2001VMware - HEPix - LAL 2001 Windows/Linux Coexistence : VMware Approach HEPix – LAL Apr Michel Jouvin
ScotGrid: a Prototype Tier-2 Centre – Steve Thorn, Edinburgh University SCOTGRID: A PROTOTYPE TIER-2 CENTRE Steve Thorn Authors: A. Earl, P. Clark, S.
Quarterly report SouthernTier-2 Quarter P.D. Gronbech.
27/04/05Sabah Salih Particle Physics Group The School of Physics and Astronomy The University of Manchester
Southgrid Technical Meeting Pete Gronbech: 16 th March 2006 Birmingham.
Robert Fourer, Jun Ma, Kipp Martin Copyright 2006 An Enterprise Computational System Built on the Optimization Services (OS) Framework and Standards Jun.
Issues in Milan Two main problems (details in the next slides): – Site excluded from analysis due to corrupted installation of some releases (mainly )
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
BINP/GCF Status Report BINP LCG Site Registration Oct 2009
Northgrid Alessandra Forti M. Doidge, S. Jones, A. McNab, E. Korolkova Gridpp26 Brighton 30 April 2011.
CERN Manual Installation of a UI – Oxford July - 1 LCG2 Administrator’s Course Oxford University, 19 th – 21 st July Developed.
12th November 2003LHCb Software Week1 UK Computing Glenn Patrick Rutherford Appleton Laboratory.
Deployment Issues David Kelsey GridPP13, Durham 5 Jul 2005
Manchester HEP Desktop/ Laptop 30 Desktop running RH Laptop Windows XP & RH OS X Home server AFS using openafs 3 DB servers Kerberos 4 we will move.
UMD TIER-3 EXPERIENCES Malina Kirn October 23, 2008 UMD T3 experiences 1.
London Tier 2 Status Report GridPP 11, Liverpool, 15 September 2004 Ben Waugh on behalf of Owen Maroney.
Southgrid Technical Meeting Pete Gronbech: May 2005 Birmingham.
INFSO-RI Enabling Grids for E-sciencE Enabling Grids for E-sciencE Pre-GDB Storage Classes summary of discussions Flavia Donno Pre-GDB.
Grid Security Vulnerability Group Linda Cornwall, GDB, CERN 7 th September 2005
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
ACHIEVEMENTS Spring 2013 Employee Development Mark Zocher.
HEP Computing Status Sheffield University Matt Robinson Paul Hodgson Andrew Beresford.
8 th CIC on Duty meeting Krakow /2006 Enabling Grids for E-sciencE Feedback from SEE first COD shift Emanoil Atanassov Todor Gurov.
2-Sep-02Steve Traylen, RAL WP6 Test Bed Report1 RAL and UK WP6 Test Bed Report Steve Traylen, WP6
Rutherford Appleton Lab, UK VOBox Considerations from GridPP. GridPP DTeam Meeting. Wed Sep 13 th 2005.
| nectar.org.au NECTAR TRAINING Module 5 The Research Cloud Lifecycle.
BNL Service Challenge 3 Status Report Xin Zhao, Zhenping Liu, Wensheng Deng, Razvan Popescu, Dantong Yu and Bruce Gibbard USATLAS Computing Facility Brookhaven.
Documentation (& User Support) Issues Stephen Burke RAL DB, Imperial, 12 th July 2007.
PERFORMANCE AND ANALYSIS WORKFLOW ISSUES US ATLAS Distributed Facility Workshop November 2012, Santa Cruz.
Cyber Safety Mohammad Abbas Alamdar Teacher of ICT STS Ajman – Boys School.
RAL PPD Tier 2 (and stuff) Site Report Rob Harper HEP SysMan 30 th June
1Maria Dimou- cern-it-gd LCG November 2007 GDB October 2007 VOM(R)S Workshop report Grid Deployment Board.
BaBar Cluster Had been unstable mainly because of failing disks Very few (
15-Feb-02Steve Traylen, RAL WP6 Test Bed Report1 RAL/UK WP6 Test Bed Report Steve Traylen, WP6 PPGRID/RAL, UK
1 Update at RAL and in the Quattor community Ian Collier - RAL Tier1 HEPiX FAll 2010, Cornell.
What you need to know.  Each TDI vessel is equipped with satellite communications that supplies a LOW BANDWIDTH internet connection. Even though the.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Mario Reale – GARR NetJobs: Network Monitoring Using Grid Jobs.
WLCG Service Report Jean-Philippe Baud ~~~ WLCG Management Board, 24 th August
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE Operations: Evolution of the Role of.
Status of gLite-3.0 deployment and uptake Ian Bird CERN IT LCG-LHCC Referees Meeting 29 th January 2007.
J Jensen/J Gordon RAL Storage Storage at RAL Service Challenge Meeting 27 Jan 2005.
VOMS chapter 1&1/2 Alessandra Forti Sergey Dolgodobrov HEP Sysman meeting 5 December 2005.
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
EGEE is a project funded by the European Union under contract IST Issues from current Experience SA1 Feedback to JRA1 A. Pacheco PIC Barcelona.
10/18/01Linux Reconstruction Farms at Fermilab 1 Steven C. Timm--Fermilab.
II EGEE conference Den Haag November, ROC-CIC status in Italy
SemiCorp Inc. Presented by Danu Hunskunatai GGU ID #
DB Questions and Answers open session (comments during session) WLCG Collaboration Workshop, CERN Geneva, 24 of April 2008.
IPEmotion License Management PM (V1.2).
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks CYFRONET site report Marcin Radecki CYFRONET.
2007/05/22 Integration of virtualization software Pierre Girard ATLAS 3T1 Meeting
London Tier-2 Quarter Owen Maroney
The Beijing Tier 2: status and plans
The Troubleshooting theory
Presentation transcript:

NorthGrid status Alessandra Forti Gridpp13 Durham, 4 July 2005

4 July 2005Alessandra Forti GridPP13 Durham Outline Sites Resource Summary Posts Sites by Site situation –Lancaster, Liverpool, Manchester, Sheffield Installation and configuration dcache Communication Security Conclusions

4 July 2005Alessandra Forti GridPP13 Durham Sites Resources Summary SitekSI2kTBNetworkStatus Lancaster MB ukLight online Liverpool001 GBoffline Manchester GBonline Sheffield24021GBonline

4 July 2005Alessandra Forti GridPP13 Durham Sites Resources Summary SiteOSLCGSRMVOs LancasterSL32_4_0Yes (dcache) 5 LiverpoolSL32_4_0No3 ManchesterSL32_4_0Yes (dcache) 7 SheffieldSL32_4_0No6

4 July 2005Alessandra Forti GridPP13 Durham Posts 4.5 FTE have been filled –Lancaster: Matt Doidge –Liverpool: Pawel Trepka –Manchester: Marc Kelly, Colin Morey –Sheffield: Andrew Beresford (0.5 FTE) Posts are working for each university and there is no common effort but they meet at the monthly technical meeting and report on sites situation and exchange information there.

4 July 2005Alessandra Forti GridPP13 Durham Lancaster New Farm has been installed in May –Few problems with RGMA and publishing the latter mostly due to a change in the name of the CE SC3 participation –Connected to ukLight and to the production network In the process of subnetting and dual homing –Installing two dcache storage elements To avoid overlapping between production network and ukLight traffic –Installing other required software like FTS, LFC

4 July 2005Alessandra Forti GridPP13 Durham Liverpool Still offline The new post hopefully will solve the manpower problem Last 10 days a peak of effort to install LCG –Didn’t quite make it for today but hopefully will continue also after GridPP Perhaps next quarter Liverpool will be online?

4 July 2005Alessandra Forti GridPP13 Durham Manchester At the moment –Still online with the old cluster: 40kSI –UI software being installed on department desktops –Dcache installed on 2 WNs The order for the new cluster has been placed –It is due to arrive at the beginning of august –Electricity bills sorted sharing part of the cpus with engineers (not discussed yet how) Main effort dedicated to prepare the structure –Setting up servers, networking, monitoring, security - Surveying assembly of the nodes, installation and testing –Establishing cooperative relations with MCC –Working on dcache configuration

4 July 2005Alessandra Forti GridPP13 Durham Sheffield Not much to say about Sheffield –It is the best site in NorthGrid Always on time with updates Cluster always full of jobs Totalised ~15300 kSI hours ~5 times hours than the second site. Really active Atlas user asking a lot of user questions –The only note I can make is that they tend not to answer s. They fix the problem and don’t close the ticket!

4 July 2005Alessandra Forti GridPP13 Durham Installation and configuration Installation easiness greatly improved. ☺ YAIM installs and configures a standard site very easily ☺ YAIM can be easily debugged and fixed when things go wrong ☺ YAIM can be extended for non standard sites ☺ Can be plugged in a kickstart ×YAIM doesn’t configure a site to be secure Security recipes should go in the installation notes or on a security WEB site not in the scripts. ☺ In general has made sys admins life much easier!

4 July 2005Alessandra Forti GridPP13 Durham Dcache Lancaster and Manchester have installed dcache –Different configurations: Lancaster has dedicated data servers Manchester is using (2) WNs disk space –YAIM merely installs the components and starts the services It doesn’t configure the dcache nor the Info System (waiting for new LCG release to see improvements –Dcache configuration documentation is lacking –Examples of hardware configuration and requirements are lacking –SRM available/space per VO cannot be easily calculated Matter of configuration again? Installation experience at different sites has been very different.

4 July 2005Alessandra Forti GridPP13 Durham Communication with the external world Users and experiments should open tickets not write directly to sys admin. –Users don’t have to know site contact addresses –Miscommunication is avoided –Tickets send automatic reminders – An escalation procedure is followed if the problem is not solved –Tickets are traceable Sys admin should write to the ROC about rogue users –The sys admin doesn’t have to know about user personal –Miscommunication is avoided –The user might not be a user but a hacker better have someone to investigate it

4 July 2005Alessandra Forti GridPP13 Durham Security Has it been said often enough? –Very simple things to check and block at OS configuration level that can improve security Portmap Ssh Inetd Cron acls Root password Switching off unused services (port table out of date and confusing) Monitoring ports for services Monitoring network traffic whe n something weird is noticed

4 July 2005Alessandra Forti GridPP13 Durham Conclusions NorthGrid is slowly coming together –Posts are in place –Expected equipment has been setup or is on its way –Technical information is exchanged during the monthly technical meetings It could be more often but one step at the time –Networking is being followed up in two sites –SRM/dcache has been installed at two sites –One of the sites has a very good running record