Deployment of IPv6-only CPU on WLCG - update from the HEPiX IPv6 WG

Slides:



Advertisements
Similar presentations
IPv6 testing plans 25 Jan Short term – next 6 weeks Add sites to testbed – Glasgow (DPM storage end point) – Fix DESY – Others? Is GridFTP mesh.
Advertisements

News from the HEPiX IPv6 Working Group David Kelsey (STFC-RAL) HEPiX, Oxford 24 Mar 2015.
HEPiX IPv6 Working Group David Kelsey (STFC-RAL, UK) 4 May 2011 HEPiX, GSI, Darmstadt david.kelsey at stfc.ac.uk.
HEPiX IPv6 Working Group David Kelsey (STFC-RAL) 1 July 2011 UK HEP Sysman meeting.
Network and Transfer WG Metrics Area Meeting Shawn McKee, Marian Babik Network and Transfer Metrics Kick-off Meeting 26 h November 2014.
News from the HEPiX IPv6 Working Group David Kelsey (STFC-RAL) WLCG GDB, CERN 8 July 2015.
News from the HEPiX IPv6 Working Group David Kelsey (STFC-RAL) GridPP35, Liverpool 11 Sep 2015.
The production deployment of IPv6 on WLCG David Kelsey (STFC-RAL) CHEP2015, OIST, Okinawa 16 Apr 2015.
The HEPiX IPv6 Working Group David Kelsey (STFC-RAL) HEPiX, Ann Arbor MI 30 Oct 2013.
The HEPiX IPv6 Working Group David Kelsey EGI TF, Prague 18 Sep 2012.
Status Report of WLCG Tier-1 candidate for KISTI-GSDC Sang-Un Ahn, for the GSDC Tier-1 Team GSDC Tier-1 Team 12 th CERN-Korea.
MW Readiness Verification Status Andrea Manzi IT/SDC 21/01/ /01/15 2.
The HEPiX IPv6 Working Group David Kelsey WLCG GDB, CERN 14 Nov 2012.
Network and Transfer Metrics WG Meeting Shawn McKee, Marian Babik perfSONAR Operations Sub-group 22 nd October 2014.
Status Report of WLCG Tier-1 candidate for KISTI-GSDC Sang-Un Ahn, for the GSDC Tier-1 Team GSDC Tier-1 Team ATHIC2012, Busan,
HEPiX IPv6 Working Group David Kelsey GDB, CERN 11 Jan 2012.
The HEPiX IPv6 working group David Kelsey (STFC-RAL) HEPiX meeting, Bologna 17 Apr 2013.
Report from GSSD Storage Workshop Flavia Donno CERN WLCG GDB 4 July 2007.
WLCG Operations Coordination Andrea Sciabà IT/SDC 10 th July 2013.
WLCG and IPv6 David Kelsey (STFC-RAL) LHCOPN/LHCONE, Rome 28 Apr 2014.
CMS: T1 Disk/Tape separation Nicolò Magini, CERN IT/SDC Oliver Gutsche, FNAL November 11 th 2013.
WLCG Operations Coordination report Maria Alandes, Andrea Sciabà IT-SDC On behalf of the WLCG Operations Coordination team GDB 9 th April 2014.
The HEPiX IPv6 Working Group David Kelsey HEPiX, Prague 26 April 2012.
WLCG: Are we ready for IPv6? David Kelsey (STFC-RAL) ISGC 2014, Taipei 26 Mar 2014.
HEPiX IPv6 Working Group David Kelsey david DOT kelsey AT stfc DOT ac DOT uk (STFC-RAL) HEPiX, Vancouver 26 Oct 2011.
News from the HEPiX IPv6 Working Group David Kelsey (STFC-RAL) HEPIX, BNL 13 Oct 2015.
The HEPiX IPv6 Working Group David Kelsey (STFC-RAL) EGI OMB 19 Dec 2013.
WLCG Operations Coordination report Maria Dimou Andrea Sciabà IT/SDC On behalf of the WLCG Operations Coordination team GDB 12 th November 2014.
Dissemination and User Feedback Castor deployment team Castor Readiness Review – June 2006.
UK Status and Plans Catalin Condurache – STFC RAL ALICE Tier-1/Tier-2 Workshop University of Torino, February 2015.
HEPiX IPv6 Working Group David Kelsey (STFC-RAL) GridPP33 Ambleside 22 Aug 2014.
WLCG Operations Coordination Andrea Sciabà IT/SDC GDB 11 th September 2013.
RAL Site Report HEP SYSMAN June 2016 – RAL Gareth Smith, STFC-RAL With thanks to Martin Bly, STFC-RAL.
HEPiX spring 2013 report HEPiX Spring 2013 CNAF Bologna / Italy Helge Meinhard, CERN-IT Contributions by Arne Wiebalck / CERN-IT Grid Deployment Board.
PerfSONAR operations meeting 3 rd October Agenda Propose changes to the current operations of perfSONAR Discuss current and future deployment model.
Dynamic Extension of the INFN Tier-1 on external resources
WLCG IPv6 deployment strategy
Jeremy Coles, STFC-RAL GDB April 4, 2007
LHC[OPN/ONE]  IPv6  status
LHCOPN/LHCONE status report pre-GDB on Networking CERN, Switzerland 10th January 2017
Ian Bird WLCG Workshop San Francisco, 8th October 2016
LCG Service Challenge: Planning and Milestones
Report from WLCG Workshop 2017: WLCG Network Requirements GDB - CERN 12th of July 2017
gLite->EMI2/UMD2 transition
HEPiX Spring 2014 Annecy-le Vieux May Martin Bly, STFC-RAL
Andrea Chierici On behalf of INFN-T1 staff
Plans to support IPv6-only CPU on WLCG
Service Challenge 3 CERN
WLCG Operations Coordination
How to enable computing
Update on Plan for KISTI-GSDC
Taming the protocol zoo
Support for IPv6-only CPU – an update from the HEPiX IPv6 WG
Deployment of IPv6-only CPU on WLCG – an update from the HEPiX IPv6 WG
Olof Bärring LCG-LHCC Review, 22nd September 2008
Update from the HEPiX IPv6 WG
Deployment of IPv6-only CPU on WLCG – an update from the HEPiX IPv6 WG
Venue and Participants
Summary from last MB “The MB agreed that a detailed deployment plan and a realistic time scale are required for deploying glexec with setuid mode at WLCG.
IPv6 deployment at CERN - status update -
Grid status ALICE Offline week March 30, Maarten Litmaath CERN-IT v1.1
Alerting/Notifications (MadAlert)
WLCG and support for IPv6-only CPU
WLCG Collaboration Workshop;
HEPiX IPv6 Working Group F2F Meeting
ETHZ, Zürich September 1st , 2016
New Types of Accounting Beyond CPU
CHIPP - CSCS F2F meeting CSCS, Lugano January 25th , 2018.
IPv6 update Duncan Rand Imperial College London
The LHCb Computing Data Challenge DC06
Presentation transcript:

Deployment of IPv6-only CPU on WLCG - update from the HEPiX IPv6 WG David Kelsey (STFC) on behalf of the HEPiX IPv6 WG HEPiX meeting 17 Oct 2017, KEK, Japan

Many thanks to my Colleagues Active in HEPiX IPv6 Working Group 2017 M Babik (CERN), M Bly (RAL), J Chudoba (Prague), C Condurache (RAL), A Dewhurst (RAL/ATLAS), D van Dok (Nikhef), T Finnern (DESY), T Froy (QMUL), C Grigoras (CERN/ALICE), K Hafeez (RAL), B Hoeft (KIT), D P Kelsey (RAL), F Lopez Munoz (PIC), E Martelli (CERN), R Nandakumar (RAL/LHCb), K Ohrenberg (DESY), F Prelz (INFN), D Rand (Imperial), A Sciaba (CERN/CMS), U Tigerstedt (CSC) & D Traynor (QMUL) many more in the past apologies to any I have missed And many more former colleagues too 17/10/2017 HEPiX IPv6 WG

Outline History WLCG – support for IPv6-only CPU – timeline Update from the Tier-0/1 sites Update from Tier-2 sites & LHC experiments CERN Tier-0 Storage(EOS) IPv6-only CPU - testing Transition monitoring Current issues Summary 17/10/2017 HEPiX IPv6 WG

Preparatory work during 2011-2016 17/10/2017 HEPiX IPv6 WG

HEPiX IPv6 Working Group Started in April 2011 Phase 1 – full analysis of work to be done Applications, system and network tools, security etc Create and operate a distributed test-bed No interference with WLCG production data analysis! Propose timetable and plan for transition 17/10/2017 HEPiX IPv6 WG

2012 CERN announces predicted shortage of routable IPv4 addresses explosion of virtualisation Active HEPiX IPv6 test-bed with ~ 12 sites engagement of 4 LHC experiments Testing regular GridFTP IPv6 data transfers across the testbed Testing dual-stack services (production) at Imperial College London Tier2 Concluded not able to support IPv6-only clients before 2014 17/10/2017 HEPiX IPv6 WG

At CHEP2013 conference > 2 PB data transferred over IPv6 in last 6 months Success rate > 87% Very High! GridFTP IPv6 data transfer mesh 17/10/2017 HEPiX IPv6 WG

2013 - Data Management Testing the important data transfer protocols, technology and data storage/file systems For IPv6-readiness GridFTP, DPM, dCache, xRootD, OpenAFS, FTS, CASTOR Found many problems needing work Worked closely with developer communities Concluded IPv6-only will be much later than 2014! 17/10/2017 HEPiX IPv6 WG

2015 At CHEP conference in April 2015 75% of Tier-1 sites are IPv6-ready but only 20% of Tier2 10% of sites now reporting lack of IPv4 addresses Most important IPv6-only use case Sites, Clouds providing CPU (virtual machines) Opportunistic resources may be IPv6-only Need dual-stack federated storage services And dual-stack central WLCG and Experiment services 17/10/2017 HEPiX IPv6 WG

The IPv6 transition 2016-2020 17/10/2017 HEPiX IPv6 WG

2016 Growing need for support of IPv6-only WN Continue to push for deployment of production dual-stack data services LHCOPN (Tier0-Tier1 private network) IPv6 peering everywhere perfSONAR – end to end network monitoring – dual-stack Move central services and central monitoring to IPv6 For CHEP2016 – October, San Francisco guidance on IPv6 security for WLCG sites Deployment timetable approved by WLCG Management Board From April 2017 17/10/2017 HEPiX IPv6 WG

WLCG deployment plan: timeline - approved WLCG MB Sep 2016 By April 1st 2017 Sites can provide IPv6-only CPUs if necessary Tier-1’s must provide dual-stack storage access with sufficient performance and reliability At least in a testbed setup Stratum-1 service at CERN must be dual-stack A dedicated ETF infrastructure to test IPv6 services must be available ATLAS and CMS must deploy all services interacting with WNs in dual-stack All the above, without disrupting normal WLCG operations By April 1st 2018 Tier-1’s must provide dual-stack storage access in production with increased performance and reliability Tier-1’s must upgrade their Stratum-1 and FTS to dual-stack The official ETF infrastructure must be migrated to dual-stack GOCDB, OIM, GGUS, BDII should be dual-stack By end of Run2 A large number of sites will have migrated their storage to IPv6 The recommendation to keep IPv4 as a backup will be dropped 17/10/2017 HEPiX IPv6 WG

Tier 0/1 status 17/10/2017 HEPiX IPv6 WG

Tier 1 status Part of table http://hepix-ipv6.web.cern.ch/sites-connectivity 17/10/2017 HEPiX IPv6 WG

Tier 1 (cont’d) All Tier 1 now have IPv6 peering with LHCOPN Except KR-KISTI-GSDC Should connect soon Dual-stack Tier1 storage slowly being deployed By 31st July 2017 - 11 Tier1’s claim some storage Seems on-track for April 2018 But we should continue to monitor 17/10/2017 HEPiX IPv6 WG

Tier 2 status & Experiments 17/10/2017 HEPiX IPv6 WG

ALICE Monitoring all sites – IPv6 readiness http://alimonitor.cern.ch/ipv6/ All SEs and CEs Site by site 71 SEs in 54 sites 9 have IPv6 DNS AAAA (6 sites) 8 are reachable over IPv6 Concern – not changing 17/10/2017 HEPiX IPv6 WG

ALICE (part of list) 17/10/2017 HEPiX IPv6 WG

CMS A storage test for xRootD is being prepared Adding a DNS IPv6 test to production ETF instance Can be done from an IPv4-only system! A storage test for xRootD is being prepared To automatically test all SEs (from IPv6 ETF/SAM3) Other experiments can request same ETF DNS test CMS also tracking all storage@sites with their IPv6 readiness 17 Sites “Tested” (11 OK, 4 problems, 2 not connected) 26 Sites “Not ready yet” 7 “Unknown status” Also updating the old WLCG survey https://www.gridpp.ac.uk/wiki/2014_IPv6_WLCG_Site_Survey 17/10/2017 HEPiX IPv6 WG

LHCb Agreed that they will monitor in same way as ALICE (using same table and columns) Analysis after the September 2017 meeting 21 SEs (6 IPv6 capable) 163 CEs at 76 Sites (15 IPv6 at 6 sites) Here just standard WLCG CE exclude VAC, Vcycle, DIRAC (none is IPv6 capable) 17/10/2017 HEPiX IPv6 WG

ATLAS Machines that need to be IPv6 (to allow jobs to run on IPv6-only WN) done for some time Very few operational problems A problem at one site with WebDav deletes IPv6 firewall problem - now fixed Starting to move the Frontier service to dual stack Plans to make more services dual stack but still to happen 17/10/2017 HEPiX IPv6 WG

CERN Storage EOS 17/10/2017 HEPiX IPv6 WG

17/10/2017 HEPiX IPv6 WG

Sep 2017 17/10/2017 HEPiX IPv6 WG

IPv6-only CPU testing By several members of the WG, including Tier 1: PIC Tier 2: Brunel, QMUL Test jobs run by ATLAS, CMS, LHCb at various times In general works, but still some issues to solve QMUL has been running some IPv6-only nodes behind NAT64 Good way of listing which services are contacted by a particular job 17/10/2017 HEPiX IPv6 WG

Transition monitoring 17/10/2017 HEPiX IPv6 WG

FTS transfers (During 6 hrs) total https://monit.cern.ch/ 17/10/2017 HEPiX IPv6 WG

FTS transfers (during 6 Hrs) IPv6 Filter: ipv6 = true IPv6: ~3% “successes”; ~5% “throughput”; failure spike is IPv4! 17/10/2017 HEPiX IPv6 WG

PerfSONAR Dual-stack mesh IPv6 Bandwidth test http://psmad.grid.iu.edu/maddash-webui/index.cgi?dashboard=Dual-Stack%20Mesh%20Config 17/10/2017 HEPiX IPv6 WG

ETF IPv6 ETF IPv6 instance provides dual-stack testing support for SAM Works for all experiments – they can now request their own IPv6 instance Using experiment production topologies it parses a list of CEs/SEs from the experiments feeds https://etf-ipv6.cern.ch/etf/check_mk/index.py 17/10/2017 HEPiX IPv6 WG

Current IPv6 issues IPv6 – ongoing intermittent problems between SARA and Imperial London – LHCONE link? Vendor? Now solved! GGUS #129946 X.509 CA CRLs (being chased with IGTF) 37 IPv6, 51 IPv4-only CERN Agile Infrastructure plan to turn on IPv6 on VMs by default delayed (until Jan 2018) because of a router bug Docker containers and IPv6-only support? (issue 25407) In general works, but some bridging problems (CERN EOS) IPv6-only WN tests at PIC HTcondor instability 17/10/2017 HEPiX IPv6 WG

HEPiX IPv6 WG meetings Meetings held monthly (and 3 F2F per year) Last F2F at CERN 11/12 Sep 2017 Next F2F at CERN 11/12 Jan 2018 Participation of all LHC experiments, Tier-0/1 sites and some Tier-2 sites Participation from more sites warmly welcome Write to ipv6@hepix.org to join Discuss technical issues, progress reports Best way to get involved and contribute 17/10/2017 HEPiX IPv6 WG

Summary Dual-stack storage at Tier-1 is coming slowly Some issues still to be solved for IPv6-only WN ~10% of services dual-stack; ~3 to 5% of FTS data throughput is IPv6 A number of Tier 2s run dual-stack storage But still a small minority WLCG Tier 2s please start planning for IPv6 now Automatic endpoint transition monitoring being worked on How best to track/urge/encourage/support the Tier 2’s? Documentation & training Responsible for deployment: WLCG Operations and its IPv6 Task Force Tracking/monitoring best done Experiment by Experiment given different timescales & requirements 17/10/2017 HEPiX IPv6 WG

Links HEPiX IPv6 web Working group meetings http://hepix-ipv6.web.cern.ch Working group meetings http://indico.cern.ch/categoryDisplay.py?categId=3538 WLCG Operations IPv6 Task Force http://hepix-ipv6.web.cern.ch/content/wlcg-ipv6-task-force-0 17/10/2017 HEPiX IPv6 WG

Questions? 17/10/2017 HEPiX IPv6 WG