CEPC software & computing study group report

Slides:



Advertisements
Similar presentations
DATA PRESERVATION IN ALICE FEDERICO CARMINATI. MOTIVATION ALICE is a 150 M CHF investment by a large scientific community The ALICE data is unique and.
Advertisements

Large scale data flow in local and GRID environment V.Kolosov, I.Korolko, S.Makarychev ITEP Moscow.
Ian Bird WLCG Workshop Okinawa, 12 th April 2015.
GridPP Steve Lloyd, Chair of the GridPP Collaboration Board.
Ian Fisk and Maria Girone Improvements in the CMS Computing System from Run2 CHEP 2015 Ian Fisk and Maria Girone For CMS Collaboration.
Computing for ILC experiment Computing Research Center, KEK Hiroyuki Matsunaga.
YAN, Tian On behalf of distributed computing group Institute of High Energy Physics (IHEP), CAS, China CHEP-2015, Apr th, OIST, Okinawa.
Take on messages from Lecture 1 LHC Computing has been well sized to handle the production and analysis needs of LHC (very high data rates and throughputs)
BESIII Production with Distributed Computing Xiaomei Zhang, Tian Yan, Xianghu Zhao Institute of High Energy Physics, Chinese Academy of Sciences, Beijing.
Distributed Computing for CEPC YAN Tian On Behalf of Distributed Computing Group, CC, IHEP for 4 th CEPC Collaboration Meeting, Sep , 2014 Draft.
Next Generation Operating Systems Zeljko Susnjar, Cisco CTG June 2015.
VICOMTECH VISIT AT CERN CERN 2013, October 3 rd & 4 th O.COUET CERN/PH/SFT DATA VISUALIZATION IN HIGH ENERGY PHYSICS THE ROOT SYSTEM.
Claudio Grandi INFN Bologna CMS Computing Model Evolution Claudio Grandi INFN Bologna On behalf of the CMS Collaboration.
Slide David Britton, University of Glasgow IET, Oct 09 1 Prof. David Britton GridPP Project leader University of Glasgow UK-T0 Meeting 21 st Oct 2015 GridPP.
ATLAS Distributed Computing perspectives for Run-2 Simone Campana CERN-IT/SDC on behalf of ADC.
Ian Bird WLCG Networking workshop CERN, 10 th February February 2014
Ian Bird Overview Board; CERN, 8 th March 2013 March 6, 2013
Meeting with University of Malta| CERN, May 18, 2015 | Predrag Buncic ALICE Computing in Run 2+ P. Buncic 1.
Breaking the frontiers of the Grid R. Graciani EGI TF 2012.
Alessandro De Salvo CCR Workshop, ATLAS Computing Alessandro De Salvo CCR Workshop,
The Helmholtz Association Project „Large Scale Data Management and Analysis“ (LSDMA) Kilian Schwarz, GSI; Christopher Jung, KIT.
Scientific Computing at Fermilab Lothar Bauerdick, Deputy Head Scientific Computing Division 1 of 7 10k slot tape robots.
Computing infrastructures for the LHC: current status and challenges of the High Luminosity LHC future Worldwide LHC Computing Grid (WLCG): Distributed.
HEPiX IPv6 Working Group David Kelsey (STFC-RAL) GridPP33 Ambleside 22 Aug 2014.
EGI-Engage is co-funded by the Horizon 2020 Framework Programme of the European Union under grant number Federated Cloud Update.
LHCb Computing 2015 Q3 Report Stefan Roiser LHCC Referees Meeting 1 December 2015.
LHC collisions rate: Hz New PHYSICS rate: Hz Event selection: 1 in 10,000,000,000,000 Signal/Noise: Raw Data volumes produced.
Scientific Data Processing Portal and Heterogeneous Computing Resources at NRC “Kurchatov Institute” V. Aulov, D. Drizhuk, A. Klimentov, R. Mashinistov,
Ian Bird, CERN WLCG Project Leader Amsterdam, 24 th January 2012.
ATLAS – statements of interest (1) A degree of hierarchy between the different computing facilities, with distinct roles at each level –Event filter Online.
Canadian Bioinformatics Workshops
Bob Jones EGEE Technical Director
Review of the WLCG experiments compute plans
Status of WLCG FCPPL project
(Prague, March 2009) Andrey Y Shevel
The advances in IHEP Cloud facility
Distributed Computing in IHEP
Ian Bird WLCG Workshop San Francisco, 8th October 2016
AWS Integration in Distributed Computing
Elastic Computing Resource Management Based on HTCondor
Report from WLCG Workshop 2017: WLCG Network Requirements GDB - CERN 12th of July 2017
Computing models, facilities, distributed computing
Overview of the Belle II computing
SuperB and its computing requirements
Yaodong CHENG Computing Center, IHEP, CAS 2016 Fall HEPiX Workshop
Scientific Computing Strategy (for HEP)
LHC experiments Requirements and Concepts ALICE
FUTURE ICT CHALLENGES IN SCIENTIFIC COMPUTING
for the Offline and Computing groups
WLCG: TDR for HL-LHC Ian Bird LHCC Referees’ meting CERN, 9th May 2017.
Dagmar Adamova, NPI AS CR Prague/Rez
Dagmar Adamova (NPI AS CR Prague/Rez) and Maarten Litmaath (CERN)
Thoughts on Computing Upgrade Activities
Computing at CEPC Xiaomei Zhang Xianghu Zhao
Vision for CERN IT Department
CernVM Status Report Predrag Buncic (CERN/PH-SFT).
WLCG Collaboration Workshop;
Cloud Computing R&D Proposal
The latest developments in preparations of the LHC community for the computing challenges of the High Luminosity LHC Dagmar Adamova (NPI AS CR Prague/Rez)
New strategies of the LHC experiments to meet
Input on Sustainability
Xiaomei Zhang On behalf of CEPC software & computing group Nov 6, 2017
EGI Webinar - Introduction -
Simulation and Physics
Defining the Grid Fabrizio Gagliardi EMEA Director Technical Computing
Status and prospects of computing at IHEP
Computing at the HL-LHC
Backfilling the Grid with Containerized BOINC in the ATLAS computing
Exploring Multi-Core on
Workflow and HPC erhtjhtyhy Doug Benjamin Argonne National Lab.
Presentation transcript:

CEPC software & computing study group report Xiaomei Zhang On behalf of CEPC software & computing study group

Content Study group and organization Goal and Plan Status and progress Demand analysis Roadmap study, design considerations and Challenges Strengthen International cooperation

Study group and organization People joined Current CEPC software group IHEP computing center Shandong University Sun Yat-sen University JINR BESIII offline software group Have first meeting at the end of 2015 Plan to have a monthly meeting Welcome more people to join us!

Study goal and topics Goal Main topics involved Discuss, investigate and confirm roadmap of CEPC software and computing Advanced computing and software technology study and practice Main topics involved Software framework Event data model Software technology Parallel computing Status and future trends of GEANT, ROOT and DD4HEP Future computing technology Databases Deep demand analysis of CEPC software and computing Simulation, reconstruction, analysis software ……

Status and progress CEPC software CEPC computing resources Software framework adopted from ILC Simulation and other modules are working well on it Advantages Quite satisfied with functions for current developments Disadvantages Few communities use it Worry about future supports CEPC computing resources No direct funding supports Resources mainly shared with other experiments in the way of Local farm Distributed computing

Demand analysis R&D phase : data volume evaluation 1PB storage and 2000 CPU cores to support full simulation of 6 months each year Experiment phase : data volume evaluation Higgs工厂(0~10年) 原始产生300TB/年,10年积累大约3PB 假设取1*10^6事例,1IP,1MB/event BESIII数据量(每年)*3 Z工厂(10~11年) 原始产生至少100PB/年 假设取10^11事例,1IP,0.5MB/event BESIII数据量(每年)*1000 Other experiments BELLEII is going to have 100PB raw data/year (2017年) LHC is going to have ~500PB raw data/year (Run3, 2020年) Need more details with software design

Software framework consideration Use an existing one Develop from beginning vs. Consideration of the choice for CEPC Enough services and functionalities Easy to use Future supports Almost all widely used frameworks can satisfy our requirements Several potential candidates are investigated and compared

Framework candidates investigation Marlin: currently used by CEPC(with uncertain official support) Gaudi: very popular for collider physics experiments, most familiar to us, very comprehensive but a bit heavy ROOT: very flexible and powerful, but need more manpower for some service functionalities development ART: optimized for high intensity physics experiments and a little complex SNiper: lightweight and optimized for non-collider experiments Marlin Gaudi ROOT ART SNiPER User Interface XML Python, TXT Root script FHiCL Python Adoption ILC Atlas, BES3, DYB Phenix, Alice Mu2e, NOvA, LArSoft, LBNF JUNO, LHAASO

Computing considerations It is still far to confirm the computing technology now used for 30 years more But we believe the technology is evolving step by step Now the main computing task is to study and follow the latest computing technology to prepare for the future, including Cloud computing Distributed computing Multi-cores computing High performance computing Unified distributed data management and access “Smart” network, high bandwidth future network …….

Distributed computing(1) Organize heterogeneous and distributed worldwide computing resources for scientific computing purposes Distributed computing origins from grid computing In HEP, distributed computing is becoming main way of resource organization, taking place of one single data center The success of WLCG contributes a lot

Distributed computing(2) Current evolution of distributed computing Tiered computing model becomes more flexible To optimize resource usage and data access Various heterogeneous resources (opportunistic resources) are integrated into distributed computing framework Not only Grid, but also Cluster, Cloud, Volunteer computing…… opportunistic resources are significantly additional to the pledged one 50% Atlas production use opportunistic resources

Distributed computing(3) Distributed data management becomes more flexible and automatic Global data federation based on xrootd makes data accessible anywhere Automatic global data placement tools makes use of data storage more reasonable and efficiency

Distributed computing(4) Data volume of future CEPC is comparable to LHC Distributed computing is one of the ways we must think about The prototype of CEPC distributed computing has already set up It is working for current CEPC R&D phrase It is small scale, but it is a start for us to look into distributed computing technology

Virtualization and Cloud computing(1) It is a trend to use cloud for large scale resource management CERN computing center has already migrated into “virtualization” era 125000 cores have been managed by VMs in 2015, in 2016 will extend to 190000 and more Many other US and Europe communities use cloud Cloud federation used to integrate distributed cloud has been a hot study topic now Could take place of grid in future? Commercial cloud received more and more concerns from HEP It is estimated that the HEP data processing is only 1% of commercial cloud problems Using commercial cloud to meet the requirement peak has become popular

Virtualization and Cloud computing(2) IHEP computing center has already started cloud study and set up production system Cloud activities involved: Distributed computing integrate collaboration cloud and commercial cloud OpenStack based Cloud platform Virtual farms to flexibly use cloud resources

Challenges Software and computing is evolving very rapidly Lack of man power Lack of funding

Strengthen international cooperation Establish relationships with international organizations HEP software foundation DIRAC consortium Take part in conferences CHEP Hepix Cooperate with international HEP experiments LHC, BELLEII Involved in the development of advanced technology