P. Saracco, M.G. Pia, INFN Genova An exact framework for Uncertainty Quantification in Monte Carlo simulation CHEP 2013 Amsterdam, 14-18 October 2013 Paolo.

Slides:



Advertisements
Similar presentations
Design of Experiments Lecture I
Advertisements

Uncertainty Quantification & the PSUADE Software
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
Propagation of Error Ch En 475 Unit Operations. Quantifying variables (i.e. answering a question with a number) 1. Directly measure the variable. - referred.
Approaches to Data Acquisition The LCA depends upon data acquisition Qualitative vs. Quantitative –While some quantitative analysis is appropriate, inappropriate.
Understanding the Accuracy of Assembly Variation Analysis Methods ADCATS 2000 Robert Cvetko June 2000.
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Discrete-Event Simulation: A First Course Steve Park and Larry Leemis College of William and Mary.
Steps of a sound simulation study
Simulation Modeling and Analysis Session 12 Comparing Alternative System Designs.
Simulation.
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
Evaluating Hypotheses
1 Validation and Verification of Simulation Models.
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Lecture 10 Comparison and Evaluation of Alternative System Designs.
1 Stratified sampling This method involves reducing variance by forcing more order onto the random number stream used as input As the simplest example,
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
Maria Grazia Pia Systematic validation of Geant4 electromagnetic and hadronic models against proton data Systematic validation of Geant4 electromagnetic.
Statistical Analysis of Systematic Errors and Small Signals Reinhard Schwienhorst University of Minnesota 10/26/99.
Analysis of Simulation Results Andy Wang CIS Computer Systems Performance Analysis.
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
LINEAR REGRESSION Introduction Section 0 Lecture 1 Slide 1 Lecture 5 Slide 1 INTRODUCTION TO Modern Physics PHYX 2710 Fall 2004 Intermediate 3870 Fall.
Atmospheric Neutrino Oscillations in Soudan 2
Development of An ERROR ESTIMATE P M V Subbarao Professor Mechanical Engineering Department A Tolerance to Error Generates New Information….
1 Validation & Verification Chapter VALIDATION & VERIFICATION Very Difficult Very Important Conceptually distinct, but performed simultaneously.
 1  Outline  stages and topics in simulation  generation of random variates.
Modeling and simulation of systems Simulation optimization and example of its usage in flexible production system control.
Chapter 4 – Modeling Basic Operations and Inputs  Structural modeling: what we’ve done so far ◦ Logical aspects – entities, resources, paths, etc. 
Random Sampling, Point Estimation and Maximum Likelihood.
PARAMETRIC STATISTICAL INFERENCE
Stochastic Linear Programming by Series of Monte-Carlo Estimators Leonidas SAKALAUSKAS Institute of Mathematics&Informatics Vilnius, Lithuania
IEEE Nuclear Science Symposium and Medical Imaging Conference Short Course The Geant4 Simulation Toolkit Sunanda Banerjee (Saha Inst. Nucl. Phys., Kolkata,
Geo597 Geostatistics Ch9 Random Function Models.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
1 SMU EMIS 7364 NTU TO-570-N Inferences About Process Quality Updated: 2/3/04 Statistical Quality Control Dr. Jerrell T. Stracener, SAE Fellow.
Monte Carlo Methods Versatile methods for analyzing the behavior of some activity, plan or process that involves uncertainty.
Lab 3b: Distribution of the mean
Delivering Integrated, Sustainable, Water Resources Solutions Monte Carlo Simulation Robert C. Patev North Atlantic Division – Regional Technical Specialist.
1 Everyday Statistics in Monte Carlo Shielding Calculations  One Key Statistics: ERROR, and why it can’t tell the whole story  Biased Sampling vs. Random.
IEEE Nuclear Science Symposium and Medical Imaging Conference Short Course The Geant4 Simulation Toolkit Sunanda Banerjee (Saha Inst. Nucl. Phys., Kolkata,
MODES-650 Advanced System Simulation Presented by Olgun Karademirci VERIFICATION AND VALIDATION OF SIMULATION MODELS.
Example: Bioassay experiment Problem statement –Observations: At each level of dose, 5 animals are tested, and number of death are observed.
Lesson 4: Computer method overview
Precision analysis of Geant4 condensed transport effects on energy deposition in detectors M. Batič 1,2, G. Hoff 1,3, M. G. Pia 1 1 INFN Sezione di Genova,
1 1 Slide Simulation Professor Ahmadi. 2 2 Slide Simulation Chapter Outline n Computer Simulation n Simulation Modeling n Random Variables and Pseudo-Random.
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
- 1 - Computer model under uncertainty In previous lecture on accuracy assessment –We considered mostly deterministic models. – We did not distinguish.
Statistics Presentation Ch En 475 Unit Operations.
Regression Analysis1. 2 INTRODUCTION TO EMPIRICAL MODELS LEAST SQUARES ESTIMATION OF THE PARAMETERS PROPERTIES OF THE LEAST SQUARES ESTIMATORS AND ESTIMATION.
Introduction to Sampling Methods Qi Zhao Oct.27,2004.
G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 1 Statistical Data Analysis: Lecture 5 1Probability, Bayes’ theorem 2Random variables and.
1 Giuseppe G. Daquino 26 th January 2005 SoFTware Development for Experiments Group Physics Department, CERN Background radiation studies using Geant4.
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
Building Valid, Credible & Appropriately Detailed Simulation Models
CORRELATION-REGULATION ANALYSIS Томский политехнический университет.
MONTE CARLO TRANSPORT SIMULATION Panda Computing Week 2012, Torino.
Introduction to emulators Tony O’Hagan University of Sheffield.
A Short Course on Geant4 Simulation Toolkit Introduction
Uncertainty quantification in generic Monte Carlo Simulation: a mathematical framework How to do it? Abstract: Uncertainty Quantification (UQ) is the capability.
Chapter 7. Classification and Prediction
Prepared by Lloyd R. Jaisingh
Ch3: Model Building through Regression
A Brachytherapy Treatment Planning Software Based on Monte Carlo Simulations and Artificial Neural Network Algorithm Amir Moghadam.
Unfolding Problem: A Machine Learning Approach
Filtering and State Estimation: Basic Concepts
A Short Course on Geant4 Simulation Toolkit Introduction
Lecture # 2 MATHEMATICAL STATISTICS
Lesson 4: Application to transport distributions
Dr. Arslan Ornek MATHEMATICAL MODELS
Presentation transcript:

P. Saracco, M.G. Pia, INFN Genova An exact framework for Uncertainty Quantification in Monte Carlo simulation CHEP 2013 Amsterdam, October 2013 Paolo Saracco, Maria Grazia Pia INFN Genova, Italy

P. Saracco, M.G. Pia, INFN Genova Monte Carlo in HEP MC engine Event generator (Pythia, Herwig…) Particle transport (Geant4, MCNP, MARS…) input observable primary particles, deposited energy… 2 cross sections, branching ratios, physics models, physics parameters.. How much can we trust the observables produced by MC?

P. Saracco, M.G. Pia, INFN Genova Measure and compare 3 Courtesy CERN CDS, CERN-EX Test beam If the accuracy of observable A is assessed by comparison with experiment, what about observable B? And what about the simulation of detector concepts, which do not exist yet? And observables which cannot be measured in practice? Measured and simulated observable

P. Saracco, M.G. Pia, INFN Genova Uncertainties 4 Input Parameter uncertainties observable with uncertainties Monte Carlo method Statistical uncertainty Uncertainties deriving from input uncertainties Monte Carlo algorithm simulation model Uncertainty quantification is the ground for predictive Monte Carlo simulation Beware: input uncertainties can be hidden in models and algorithms in the code Validation of MC modeling ingredients

P. Saracco, M.G. Pia, INFN Genova Parameter uncertainties 5 They are the uncertainties of the “ingredients” of the simulation engine (particle transport system, event generator) Can they be disentangled from statistical uncertainties associated with the Monte Carlo method? Can we estimate their effect on the observables produced by the simulation? Sensitivity analysis Exact calculation

P. Saracco, M.G. Pia, INFN Genova Sensitivity analysis 6 Computational cost Mathematical methods Software toolkits (DAKOTA, PSUADE..) To reduce computational costs, one can reduce the sensitivity analysis to the search for the most probable output  this means giving up a full statistical characterization of the output

P. Saracco, M.G. Pia, INFN Genova Disentangling uncertainties I 7 Algorithmic (statistical) and parameter uncertainties How do statistical and parameter uncertainties mix in a simulation environment? EXAMPLE: a (very) simplified “transport code”, a random path generator ruled by two constant parameters describing the relative probability of absorption (  A ) and scattering processes (  S ), sampling an observable – track-length in this case. (by the way, this simulates the propagation of neutral particles in an uniform medium with constant scattering and absorption cross-sections and isotropic scattering) If  S is affected by some uncertainty, say  S, min <  S <  S, max we run many simulations varying its value with some known probability (for instance flat)

P. Saracco, M.G. Pia, INFN Genova Disentangling uncertainties 8 Algorithmic (statistical) and parameter uncertainties simulations (a)10 5 events (b) events (c)10 6 events (d)10 8 events As statistical errors decrease, the distribution of the observable is dominated by parameter uncertainties only Results for a track length observable scored in a volume near the source

P. Saracco, M.G. Pia, INFN Genova Uncertainty propagation 9

P. Saracco, M.G. Pia, INFN Genova 10 Uncertainty propagation: verification Example: evaluate the area of the circle with some flat uncertainty on the measure of its radius f(R)=  R-R min )  (R max -R)/(R max -R min ) A(R) =  R 2 Metropolis simulation, 50K samples Metropolis simulation, 1M samples analytical result the exact solution is known

P. Saracco, M.G. Pia, INFN Genova The task of UQ 11 The feasibility of UQ requires: 1)To know input uncertainties and their probability distributions Validation of MC modeling ingredients needed 1)To be able to solve explicitly for G(x): An exact mathematical context needed 2)To use MC simulations to determine parameters in G(x): (in the previous example, to find A max/min from simulation and to determine the proper behavior A -1/2 ) Possible with few simulations with predetermined accuracy 2) is independent from the features of the specific problem and can be solved within the scope of wide assumptions this is an exact mathematical frame for UQ

P. Saracco, M.G. Pia, INFN Genova UQ for a 1 parameter problem We must determine the relationship x 0  between the input parameter  and the (exact) result for the required physical observable x 0. This can be done with few (21 in the case) MC simulations at fixed values of  S within its range of variability. _________ An example of this procedure in the previous simple transport problem for observables at different distances from the source. Note the linear relationship x 0 (  )

P. Saracco, M.G. Pia, INFN Genova Many parameters problem 13 In the generic case we have many input parameter unknowns: We make two “reasonable” assumptions: -the  k are independent - is linear (if necessary subdivide the domain of variability of the unknowns in such a way to fulfill the condition) Under these hypothesis the evaluation of G(x) reduces to a well known problem in probability theory: the determination of the weighted sum of a certain number of independent stochastic variables. Unfortunately even this problem is not soluble in general Can we solve it?

P. Saracco, M.G. Pia, INFN Genova Some remarks 14 Under these assumptions with  k 2 the variances of the individual input unknowns. For M input unknowns we need a priori M+1 simulations to determine the values, a task that can be pursued reasonably if the number of input unknowns is not so large. So detailed physical knowledge of the problem at hand is required to select a proper set of physical parameters on which is meaningful to attempt a full Uncertainty Quantification. We then emphasize that a full UQ is PROBLEM SPECIFIC

P. Saracco, M.G. Pia, INFN Genova 15 BUT we are not sure that  x 2 is a proper measure of the output uncertainty, since we do not know the exact form of G(x). In some useful selected cases the form of G(x) is known: -all the input unknowns are normally distributed: in such case G(x) is normal with the quoted variance -all the input unknown are uniformly distributed: in such case a generalization of the Irwine-Hall distribution holds -all the input unknowns have  -stable distributions with the same  value: in such case G(x) is again a stable distribution with the same  value (e.g. the Lorentz distribution) We recently proved that a general form exists for the weighted sum of generic polynomial distributions over different intervals: this result can be used in principle to find an approximate form of G(x) with arbitrary predetermined accuracy in the general case.

P. Saracco, M.G. Pia, INFN Genova Current scope of applicability 16 Single parameter uncertainty (see [1]): - complete analysis of uncertainty propagation available - simulation is used solely to determine the values of the parameters defining the output probability density function -confidence intervals for the output are known with a statistical error that can be predetermined Many parameter uncertainty (see also [2]): -a complete UQ is possible only for independent input uncertainties. -calculability issues may exist, in practice, if the number of parameters considered is high and/or if linearity of x 0 (  k ) is questionable -confidence interval for the output are affected by the statistical errors in the determination of the required parameters AND by errors in the polynomial approximations required -in principle a predefined accuracy can be obtained -calculation issues must be studied [1] - P. Saracco, M. Batic, G. Hoff, M.G. Pia – “Theoretical ground for the propagation ofuncertainties in Monte Carlo particle transport”, submitted to IEEE Trans. Nucl. Phys., [2] – P. Saracco, M.G. Pia – “Uncertainty Quantification and the problem of determining the distribution of the sum of N independent stochastic variables: an exact solution for arbitrary polynomial distributions on different intervals”, submitted to Journ. Math. Phys., 2013.

P. Saracco, M.G. Pia, INFN Genova Validation of physics ingredients The validation of the physics “ingredients” of Monte Carlo codes is a (complex, slow) still ongoing process ‒ at least regarding Monte Carlo particle transport Quantitative estimates of input uncertainties (cross sections, angular distributions, BR etc.) are necessary for uncertainty propagation ‒ A qualitative plot is not enough… Epistemic uncertainties are often embedded in the code, without being documented ‒ See M. G. Pia, M. Begalli, A. Lechner, L. Quintieri, P. Saracco, Physics-related epistemic uncertainties of proton depth dose simulation, IEEE Trans. Nucl. Sci., vol. 57, no. 5, pp ,

P. Saracco, M.G. Pia, INFN Genova Conclusion and outlook A novel conceptual approach A mathematical framework Calculation methods for single and many parameter uncertainties A novel conceptual approach A mathematical framework Calculation methods for single and many parameter uncertainties 18 Collaboration with HEP experiments is welcome! to determine the intrinsic uncertainty of the results of Monte Carlo simulation (beyond statistical uncertainty) We have established  Verification in a realistic experimental scenarios  Application software system Outlook These developments are applicable to Monte Carlo simulation in general Particle transport Event generators In parallel, we pursue extensive Geant4 physics validation :

P. Saracco, M.G. Pia, INFN Genova 19 Collezione di immagini utili

P. Saracco, M.G. Pia, INFN Genova Don’t forget the chef 20 With the same ingredients we obtain very different tastes of the soup allowing the chef to modify the recipe In a simulation context the recipe is chosen by MC user, when he defines the experimental configuration: geometry, composition, external conditions, … For the purpose of UQ different experimental configurations – if not VERY similar – must be analyzed as different problems The “simulation engine” is the CODE together with the CHOSEN EXPERIMENTAL CONFIGURATION