Presentation is loading. Please wait.

Presentation is loading. Please wait.

KIT – Universität des Landes Baden-Württemberg und nationales Forschungszentrum in der Helmholtz-Gemeinschaft Steinbuch Centre for Computing www.kit.edu.

Similar presentations


Presentation on theme: "KIT – Universität des Landes Baden-Württemberg und nationales Forschungszentrum in der Helmholtz-Gemeinschaft Steinbuch Centre for Computing www.kit.edu."— Presentation transcript:

1 KIT – Universität des Landes Baden-Württemberg und nationales Forschungszentrum in der Helmholtz-Gemeinschaft Steinbuch Centre for Computing www.kit.edu Site Report GridKa@SCC Dimitri Nilsen, Andreas Heiss, Manfred Alef, Andreas Petzold

2 2 Merger of the former Reserch Centre Karlsruhe and University of Karlsruhe since Oct. 1, 2009 9000 employees 22500 students GridKa@SCC

3 3 New Building 2012

4 4 One of 11 WLCG Tier1 centres, Share of the total WLCG Tier1 capacity ~ 14 %. Supported non-LHC experiments: - Resources for Compass, Babar, CDF, D0 until end of data analysis. - Grid test environment for Belle-II - Significant resources for Auger

5 5 Experiments Atlas, Alice, LHCB, CMS Auger, Compas, Babar, CDF, D0, bell D-Grid VOs Belle-II 333 Admin Server File Server Databases Grid and non-Grid Servers Rack Manager virtual Machines(VMware) GridKa Today

6 6 130kHS06 (128777) ~11600 pysical Cores ~19000 hyperthreaded cores 14450 job slots (at least 2 GB RAM pro job) Milestone 2012 New nodes have been installed (54000 HS06) 2x Intel Xeon E5-2665 (2.4 GHz 8-core Sandy Bridge) 48 GB RAM (3 GB per core) 4 nodes per 2 U (Intel S2600JF) Replace all old nodes with Intel Xeon E5345 (See separate talk by Manfred Alef for benchmark results.) Compute Resources

7 7 Storage Resources dCachexrootdNFS Server651712 Filesystems22911731 application server 21+12 GridFTP2 DisksCapacity SATA 750GB47303547,5 SATA 1TB3000 SATA 2TB573011460 SATA 3TB5801740 SAS 300GB12036 SAS 300GB600180 SAS 450GB870391,5 SSDs 200GB204 2035920,359PB

8 8 CPU Usage 2011 Farm usage: 95% 21.275.300 jobs (58290 jobs per day) Total Wall-Time: 103.512.000 hours CPU-Time: 71.742.000 hours CPU/Wall-Time: 69,3%

9 9 gLite/EMI Local services and gateways CREAM-CE, SiteBDII, VO-Boxes Central services TopBDII, WMS/LB, LFC, FTS Globus & Unicore Access for D-Grid VOs Central Services for NGI-DE Regional nagios monitoring Middleware Deployment

10 10 Middleware CF Puppet /files Puppet Dashboard Dynamic/Global Configuration automatix Static Configuration SVN Static Configuration SVN Puppet /etc External resources any Monitoring Unicore GT5 GT4 gLite Middleware Stack resources GridKA WNs GGUS reports

11 11 Service monitoring with Nagios/Icinga Internal ticketing system: Redmine

12 12 Two groups (storage, grid services) of 8-10 people each, weekly rotating – In addition: KIT wide network and infrastructure on-call service Alarms are sent to mobile phones by Nagions/Icinga – GGUS alarm tickets via Nagios Person on-call may or may not be an expert for the affected system (no experts on duty!). – will try 'standard recipes' to fix the problem – if necessary, will try to reach an expert 24/7 on-call service Approx. 85% of problems could be solved w/o calling (additional) experts – Documentation is improved continuously – People gain experience On-call services does not guarantee that problems can be fixed within few hours.

13 13 GridKa on-call service

14 14 National Grid Initiative NGI-DE Under the roof of EGI (European Grid Infrastructure) Within the Gauß Alliance of German academic computing centers As coordinator of DGI-2 (D-Grid Integration Project) As coordinator of bwGRiD (state Grid) Responsible for international tasks in EGI Central helpdesk for EGI, WLCG, EMI Coordination of development of operational tools Grid Infrastructures at KIT

15 15 Web portal and “single point of contact” for Grid users Central support platform for WLCG, EGI, EMI Ticket synchronization with more than 20 regional helpdesk systems in Europe Developed at and operated by SCC since 2003 Alignment according to ITIL services for Grid users ~80,000 Grid problems solved by 1,500 grid experts worldwide Basic element of the Grid Operations and Support Center (GOSC) Support for users and providers of the German e-infrastructure (NGI-DE) German operations uplink to the European e-infrastructure (EGI) Operation of core services for NGI-DE and EGI Global Grid User Support (GGUS)


Download ppt "KIT – Universität des Landes Baden-Württemberg und nationales Forschungszentrum in der Helmholtz-Gemeinschaft Steinbuch Centre for Computing www.kit.edu."

Similar presentations


Ads by Google