Physics data management tools: computational evolutions and benchmarks Mincheol Han 1, Chan-Hyeung Kim 1, Lorenzo Moneta 2, Maria Grazia Pia 3, Hee Seo.

Slides:



Advertisements
Similar presentations
Maria Grazia Pia, INFN Genova and CERN1 An OO model for intra-nuclear transport Maria Grazia Pia L. Bellagamba, A. Brunengo, E. Di Salvo for the Geant4.
Advertisements

Maria Grazia Pia, INFN Genova Test & Analysis Project Maria Grazia Pia, INFN Genova on behalf of the T&A team
Maria Grazia Pia, INFN Genova Conceptual challenges and computational progress in X-ray simulation Maria Grazia Pia INFN Genova, Italy Maria Grazia Pia.
Maria Grazia Pia, INFN Genova New techniques in Monte Carlo simulation: experience with a prototype of generic programming application to Geant4 physics.
Precision validation of Geant4 electromagnetic physics Katsuya Amako, Susanna Guatelli, Vladimir Ivanchenko, Michel Maire, Barbara Mascialino, Koichi Murakami,
SLA-Oriented Resource Provisioning for Cloud Computing
Maria Grazia Pia, INFN Genova Atomic Relaxation Models A. Mantero, B. Mascialino, Maria Grazia Pia INFN Genova, Italy P. Nieminen ESA/ESTEC
Hee Seo, Chan-Hyeung Kim, Lorenzo Moneta, Maria Grazia Pia Hanyang Univ. (Korea), INFN Genova (Italy), CERN (Switzerland) 18 October 2010 Design, development.
Maria Grazia Pia, INFN Genova 1 Part V The lesson learned Summary and conclusions.
Low Energy Electromagnetic Physics
Geant4-Genova Group Validation of Susanna Guatelli, Alfonso Mantero, Barbara Mascialino, Maria Grazia Pia, Valentina Zampichelli INFN Genova, Italy IEEE.
Max-Planck-Institut für extraterrestrische Physik and Halbleiterlabor, Germany Space Sciences Lab., UC Berkeley, USA CNES, Toulouse, France INFN Genova.
20 February 2002Geant4 Users' Workhsop, SLAC1 Low-Energy Electromagnetic Processes in P. Nieminen (ESA-ESTEC)
1 Improving Hash Join Performance through Prefetching _________________________________________________By SHIMIN CHEN Intel Research Pittsburgh ANASTASSIA.
1 An Empirical Study on Large-Scale Content-Based Image Retrieval Group Meeting Presented by Wyman
J. Tinslay 1, B. Faddegon 2, J. Perl 1 and M. Asai 1 (1) Stanford Linear Accelerator Center, Menlo Park, CA, (2) UC San Francisco, San Francisco, CA Verification.
Maria Grazia Pia, INFN Genova CERN, 26 July 2004 Background of the Project.
1 M.G. Pia et al. The application of GEANT4 simulation code for brachytherapy treatment Maria Grazia Pia INFN Genova, Italy and CERN/IT
Maria Grazia Pia, INFN Genova Low Energy Electromagnetic Physics Maria Grazia Pia INFN Genova
Maria Grazia Pia, INFN Genova – CHEP 2001 ow Energy Electromagnetic Physics ow Energy Electromagnetic Physics S. Chauvie,G. Depaola, F. Longo, V. Ivanchenko,
Copyright Arshi Khan1 System Programming Instructor Arshi Khan.
Maria Grazia Pia Systematic validation of Geant4 electromagnetic and hadronic models against proton data Systematic validation of Geant4 electromagnetic.
Maria Grazia Pia, INFN Genova and CERN1 Geant4 Hadron Kinetic Model for intra-nuclear transport Maria Grazia Pia CERN/IT and INFN, Sezione di Genova L.Bellagamba.
Maria Grazia Pia, INFN Genova CHEP May 2012 New York City, NY, USA Maria Grazia Pia M. Batič, M. Begalli, M. Han, S. Hauf, G. Hoff, C. H. Kim,
Geant4 simulation of the attenuation properties of plastic shield for  - radionuclides employed in internal radiotherapy Domenico Lizio 1, Ernesto Amato.
University of Maryland Compiler-Assisted Binary Parsing Tugrul Ince PD Week – 27 March 2012.
Sergey Ananko Saint-Petersburg State University Department of Physics
Alfonso Mantero, INFN Genova Models for the Simulation of X-Ray Fluorescence and PIXE A. Mantero, S. Saliceti, B. Mascialino, Maria Grazia Pia INFN Genova,
Summary of Work Zhang Qiwei INFN - CIAE. Validation of Geant4 EM physics for gamma rays against the SANDIA, EPDL97 and NIST databases.
Evaluating Impact of Storage on Smartphone Energy Efficiency David T. Nguyen.
Yongzhi Wang, Jinpeng Wei VIAF: Verification-based Integrity Assurance Framework for MapReduce.
Maria Grazia Pia, INFN Genova Low Energy Electromagnetic Physics Maria Grazia Pia INFN Genova on behalf of the Low Energy Electromagnetic.
IEEE Nuclear Science Symposium and Medical Imaging Conference Short Course The Geant4 Simulation Toolkit Sunanda Banerjee (Saha Inst. Nucl. Phys., Kolkata,
OOAD… LowE Electrons From HEP computing to medical research and vice versa Bidirectional From HEP computing to medical research and vice versa Bidirectional.
Maria Grazia Pia, INFN Genova Test & Analysis Project aka “statistical testing” Maria Grazia Pia, INFN Genova on behalf of the T&A team
Performance Analysis Of Generics In Scientific Computing Laurentiu Dragan Stephen M. Watt Ontario Research Centre for Computer Algebra University of Western.
IEEE NSS October – 2 November 2013 Seoul, Korea T. Basaglia 1, M. Batic 2, M. C. Han 3, G. Hoff 4, C. H. Kim 3, H. S. Kim 3, M. G. Pia 5, P. Saracco.
Maria Grazia Pia, INFN Genova New Physics Data Libraries for Monte Carlo Transport Maria Grazia Pia 1, Lina Quintieri 2, Mauro Augelli 3, Steffen Hauf.
Susanna Guatelli & Barbara Mascialino G.A.P. Cirrone (INFN LNS), G. Cuttone (INFN LNS), S. Donadio (INFN,Genova), S. Guatelli (INFN Genova), M. Maire (LAPP),
Detector Simulation on Modern Processors Vectorization of Physics Models Philippe Canal, Soon Yung Jun (FNAL) John Apostolakis, Mihaly Novak, Sandro Wenzel.
Quest for omega mesons by their radiative decay mode in √s=200 GeV A+A collisions at RHIC-PHENIX ~Why is it “Quest”?~ Simulation Study Real Data Analysis.
ENDF/B-VI Coupled Photon-Electron Data for Use in Radiation Shielding Applications by Dermott E. Cullen Lawrence Livermore National Laboratory & Robert.
Maria Grazia Pia, INFN Genova 1 New models for PIXE simulation with Geant4 CHEP 2009 Prague, March 2009 Maria Grazia Pia INFN Genova G. Weidenspointner,
Validation of inner shell ionization cross sections for electron transport Sung Hun, Kim Nuclear Engineering, Hanyang University, Seoul, Republic of Korea.
IEEE Nuclear Science Symposium and Medical Imaging Conference Short Course The Geant4 Simulation Toolkit Sunanda Banerjee (Saha Inst. Nucl. Phys., Kolkata,
An Efficient Linear Time Triple Patterning Solver Haitong Tian Hongbo Zhang Zigang Xiao Martin D.F. Wong ASP-DAC’15.
Precision Validation of Geant4 Electromagnetic Physics Geant4 DNA Project Meeting 26 July 2004, CERN Michela.
4th Workshop on Geant4 Bio-medical Developments and Geant4 Physics Validation Riccardo Capra 1 Physics processes Software process and OOAD.
Maria Grazia Pia, INFN Genova Statistics Toolkit Project Maria Grazia Pia, INFN Genova AIDA Workshop.
Physics Data Libraries: Content and Algorithms for Improved Monte Carlo Simulation Physics data libraries play an important role in Monte Carlo simulation:
Upgrade of G4Penelope models Luciano Pandola INFN – LNGS for the Geant4 EM Working Groups 15 th Geant4 Workshop, ESTEC, October 4 th -8 th, 2010.
NANO5 – Geant4 related R&D for new particle transport methods M. Augelli, M. Begalli, T. Evans, E. Gargioni, B. Grosswendt, S. Hauf, C. H. Kim, M. Kuster,
Susanna Guatelli Geant4 in a Distributed Computing Environment S. Guatelli 1, P. Mendez Lorenzo 2, J. Moscicki 2, M.G. Pia 1 1. INFN Genova, Italy, 2.
Validation of Geant4 EM physics for gamma rays against the SANDIA, EPDL97 and NIST databases Zhang Qiwei INFN-LNS/CIAE 14th Geant4 Users and Collaboration.
Maria Grazia Pia, INFN Genova and CERN1 Geant4 highlights of relevance for medical physics applications Maria Grazia Pia INFN Genova and CERN.
Nawanol Theera-Ampornpunt, Seong Gon Kim, Asish Ghoshal, Saurabh Bagchi, Ananth Grama, and Somali Chaterji Fast Training on Large Genomics Data using Distributed.
Maria Grazia Pia, INFN Genova - G4 WG Coord. Meeting, 13/11/2001 ow Energy Electromagnetic Physics ow Energy Electromagnetic Physics New physics features.
POLITECNICO DI MILANO A SystemC-based methodology for the simulation of dynamically reconfigurable embedded systems Dynamic Reconfigurability in Embedded.
June 12, 2016CITALA'121 Cloud Computing Technology For Large Scale and Efficient Arabic Handwriting Recognition System HAMDI Hassen, KHEMAKHEM Maher
Introduction to Performance Tuning Chia-heng Tu PAS Lab Summer Workshop 2009 June 30,
Models for the Simulation of X-Ray Fluorescence and PIXE
Genomic Data Clustering on FPGAs for Compression
Data libraries as a collaborative tool across Monte Carlo codes
Simulation in a Distributed Computing Environment
Low-Energy Electromagnetic Processes in
Chapter 12 Pipelining and RISC
Precision validation of Geant4 electromagnetic physics
The Geant4 Hadrontherapy Advanced Example
OPERATING SYSTEMS MEMORY MANAGEMENT BY DR.V.R.ELANGOVAN.
Parallel Exact Stochastic Simulation in Biochemical Systems
Presentation transcript:

Physics data management tools: computational evolutions and benchmarks Mincheol Han 1, Chan-Hyeung Kim 1, Lorenzo Moneta 2, Maria Grazia Pia 3, Hee Seo 1 1 Hanyang University, Korea – 2 CERN, Switzerland – 3 INFN Sezione Di Genova, Italy SNA + MC 2010 Joint International Conference on Supercomputing in Nuclear Applications + Monte Carlo 2010

Physics data libraries Data libraries Collection of experimental or theoretical tabulations of physics quantities e.g. cross sections, form factors, nuclear and atomic parameters etc. Distributed by data centres: RSICC (ORNL), NEA, NIST… Essential ingredient of Monte Carlo simulation Use established data Speed up simulation w.r.t. using analytical formulae Common background for different Monte Carlo systems ENDF/B, ENSDF, JENDL, CENDL, BROND, EEDL, EPDL, EADL…

Dealing with physics data Data management Load (and store) data Retrieve data Use data: directly, by interpolation Loading Usually in the simulation initialization phase Loading on demand Retrieving In the course of the simulation (usually at each step) Can be source of significant overload

Original design in Geant4 Composite Pattern Handle different data collections transparently  Data for materials  Data for atoms  Data for shells Strategy Pattern Handle interchangeable interpolation algorithms transparently electromagnetic data (Livermore library)

Can we improve it? Geant4 physics on a diet Leaner software design Improve computational performance Enhance clarity and transparency Facilitate testing Ease maintenance physics models CHEP 2009 R&D: physics models M.G. Pia et al., Design and performance evaluations of generic programming techniques in a R&D prototype of Geant4 physics physics data Monte Carlo + CHEP 2010 R&D: physics data Prototype to evaluate candidate solutions quantitatively This talk: selection of preliminary results Final and complete results will be documented in a dedicated publication

Test set-up Test case: Livermore library data EEDL (Evaluate Electron Data Library): ionisation, Bremsstrahlung EPDL97 (Evaluated Photon Data Library): Compton and Rayleigh scattering, photoelectric effect, pair and triplet production EADL (Evaluated Atomic Data Library): atomic parameters Computing environment Geant4 9.4-beta + G4EMLOW 6.13 Intel® Core™ Duo CPU E8500 with 3.16 GHz processor, 4 GB RAM, Linux SLC5, gcc compiler Intel® CPU U4100 with 1.30GHz processor, 2 GB RAM, MS Windows XP SP3, MSVC++9 C++ compiler (with SP1) Load Load test loading data for a number of elements between 1 and 100 each experiment repeated 100 times, the whole series repeated 10 times Retrieve Retrieve test finding the data associated with a randomly chosen atomic number finding procedure repeated 10 6 times, whole experiment repeated 10 times

Data structure Improve the physical design of the data library itself Large tabulations split into individual files (one per element) time (ms) to load data vs. number of elements present in the experimental set-up Excitation data original data split data

Data structure Large physics tabulations require large memory allocation for storing the data, time to load them into memory and to search trough them Are all the data really necessary? Reduce the amount of data A B C A B C Suppress B if ● can be interpolated with the same accuracy based on A and C time (ms) to load data vs. number of elements present in the experimental set-up Compton scattering functions Number of data for each element reduced original original data reduced data

Use forthcoming C++ features Current implementation uses STL map for most data, STL vector for a few data types Evaluated unordered_map (AKA hash map) Included in C++0x TR1 gcc 4.3.x in MSVC Pair production cross sections time (ms) to load data vs. number of elements present in the experimental set-up STL map unordered_map

Caching pre-calculated data Recent modification in Geant4 low energy electromagnetic package: cache pre-calculated log 10 data Credit to current Geant4 low energy electromagnetic group Not to be credited to the authors of this talk The authors of this talk Quantified the time for loading/retrieving Quantified the memory consumption to store additional (cached) data Reviewed the modified software design and implementation: flaws ~10% time saving w.r.t. on-the-fly log 10 calculation loading retrieving original modified original modified time (ms) to load and retrieve data vs. number of elements present in the experimental set-up

Generic programming techniques polymorphic behavior of data sets and interpolation algorithms is not necessary at runtime through dynamic binding OOAD iteration Preliminary design Templates eliminate the overhead due to the virtual table associated with inheritance Contribution to to improving execution speed

Effect of prototype design: loading Rayleigh scattering form factors Bremsstrahlung cross sections The extent of the improvement depends on the characteristics of the data original prototype original prototype time (ms) to load data vs. number of elements present in the experimental set-up Original design: STL vectors, load all elements

Effect of prototype design: retrieving Pair production cross sections Bremsstrahlung spectrum data time (ms) to retrieve data vs. number of elements present in the experimental set-up Original design Prototype design Prototype design + unordered_map

Use vectors! Some data sets in the original design do not require the use of STL map Can be efficiently managed by using STL vectors Not worthwhile to move them to unordered_map Rayleigh scattering form factors time (ms) to retrieve data vs. number of elements present in the experimental set-up Original design Prototype design (map) Prototype design (unordered_map)

Conclusions Prototype R&D on Geant4 physics data management Investigated  Data structure  Software design  Use of C++0x TR1 features Results  Leaner software  Improved performance RD Geant4 R&D phase Cutting edge technology Rigorous software development process Geant4 would profit from reenacting a R&D phase to exploit new technology with the same spirit of scientific openness and rigorousness as RD44 Geant4 would profit from reenacting a R&D phase to exploit new technology with the same spirit of scientific openness and rigorousness as RD44 Same conclusions at CHEP 2009 regarding physics modeling Acknowledgment Thanks to CERN Directorate for support