Presentation is loading. Please wait.

Presentation is loading. Please wait.

RAL Tier1: 2001 to 2011 James Thorne GridPP 19 30 th August 2007.

Similar presentations


Presentation on theme: "RAL Tier1: 2001 to 2011 James Thorne GridPP 19 30 th August 2007."— Presentation transcript:

1 RAL Tier1: 2001 to 2011 James Thorne GridPP 19 30 th August 2007

2 30/08/2007 j.i.thorne@scitech.ac.uk 2001 to 2007 Sorry GridPP, Im afraid I cant do that!

3 30/08/2007 j.i.thorne@scitech.ac.uk Result of GridPP3 for Tier1 Good result: –Effort increases from 16.5 to 20.4 FTE –£6.8M hardware budget (cf £2.3M in GridPP2) Extra fault management/hardware staff as size of farm increases A good result but team remains thinly stretched; hardware is just sufficient to meet experiments requirements.

4 30/08/2007 j.i.thorne@scitech.ac.uk Planned Tier1 Storage Capacity (TiB)

5 30/08/2007 j.i.thorne@scitech.ac.uk Planned Tier1 CPU Capacity (KSI2K)

6 30/08/2007 j.i.thorne@scitech.ac.uk Estimated Rack Count

7 30/08/2007 j.i.thorne@scitech.ac.uk Estimated number of Disk Servers

8 30/08/2007 j.i.thorne@scitech.ac.uk Estimated number of Spinning Drives

9 30/08/2007 j.i.thorne@scitech.ac.uk Approximate H.W Value Allocated to Experiments in 2008

10 30/08/2007 j.i.thorne@scitech.ac.uk Hardware CPU Disk Tape Further procurements in FY08, FY09 and FY10

11 30/08/2007 j.i.thorne@scitech.ac.uk New Machine Room Order placed and contractor has started work 800m 2 can accommodate 300 racks + 5 robots 2.3MW Power/Cooling capacity (some UPS) Office accommodation for all E-Science staff Scheduled to be available for September 2008

12 30/08/2007 j.i.thorne@scitech.ac.uk Staffing Lex Holt left Tier1 James Adams is moving from hardware support to Fabric Team system admin Plan to recruit: –Replacement hardware repair position –Two experiment support posts; one ATLAS, one CMS. –Raja Nandakumar as honorary team member from LHCb –Will also shortly commences GridPP3 recruitments

13 30/08/2007 j.i.thorne@scitech.ac.uk CASTOR Operational issues mentioned at GridPP 18 were tip of iceberg and CASTOR 2.1.2 service was found to be inoperable. Massive amount of re-engineering carried out since March with much effort from CASTOR team. –Huge progress –Areas of concern We are optimistic that CASTOR will be a success

14 30/08/2007 j.i.thorne@scitech.ac.uk SL4 20% of batch farm now running SL4 Negotiating with LHC experiments to agree the move of their capacity from SL3 to SL4. Once LHC migration is completed, remaining capacity will follow within a few weeks. Depends on the experiments, but should expect termination of SL3 service in September

15 30/08/2007 j.i.thorne@scitech.ac.uk Reliability March: invested a lot of effort without much gain Continue to prioritise reliability and making progress Recently exceeded target, now must maintain Start Sysadmin On Duty in September Start on call later this year

16 30/08/2007 j.i.thorne@scitech.ac.uk RAL-LCG2 Availability/Reliability

17 30/08/2007 j.i.thorne@scitech.ac.uk CPU Efficiencies CPU efficiency much improved August fall still being investigated March minimum when CASTOR was broken

18 30/08/2007 j.i.thorne@scitech.ac.uk CPU Efficiencies

19 30/08/2007 j.i.thorne@scitech.ac.uk Termination of GridPP use of ADS Service GridPP funding and use of old legacy Atlas Datastore service scheduled to end at end of March 2008. RAL will continue to operate ADS service and experiments are free to purchase capacity directly from ADS Team.

20 30/08/2007 j.i.thorne@scitech.ac.uk dCache Closure dCache still supported and working We will give 6 months notice before terminating dCache service No notice of termination yet Aiming to end service by end of GRIDPP2 (March 2008). Also cannot terminate ADS service until dCache ceases.

21 30/08/2007 j.i.thorne@scitech.ac.uk Grid Only Move to Grid only access postponed until December 2007 No new local accounts In January 2008: –Batch job submission through RB/CE only (no qsub, some exceptions) –No local login to UIs (some exceptions) –AFS Service will end

22 30/08/2007 j.i.thorne@scitech.ac.uk Conclusions Positioning ourselves for LHC production. A lot of good progress with CASTOR and expect to meet the needs of the ATLAS M4 run and CMSs CSA07. Reliability has finally improved.


Download ppt "RAL Tier1: 2001 to 2011 James Thorne GridPP 19 30 th August 2007."

Similar presentations


Ads by Google