G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 1 Statistical Data Analysis: Lecture 5 1Probability, Bayes’ theorem 2Random variables and.

Slides:



Advertisements
Similar presentations
Computing and Statistical Data Analysis / Stat 4
Advertisements

Generating Random Numbers
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #21.
Sampling Distributions (§ )
Random number generation Algorithms and Transforms to Univariate Distributions.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem, random variables, pdfs 2Functions.
G. Cowan Lectures on Statistical Data Analysis Lecture 2 page 1 Statistical Data Analysis: Lecture 2 1Probability, Bayes’ theorem 2Random variables and.
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
Statistics.
Random Number Generation
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
K. Desch – Statistical methods of data analysis SS10
G. Cowan 2011 CERN Summer Student Lectures on Statistics / Lecture 41 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability.
G. Cowan Lectures on Statistical Data Analysis Lecture 14 page 1 Statistical Data Analysis: Lecture 14 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis Lecture 13 page 1 Statistical Data Analysis: Lecture 13 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
Introduction to Statistics − Day 2
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 7 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
G. Cowan RHUL Physics Bayesian Higgs combination page 1 Bayesian Higgs combination using shapes ATLAS Statistics Meeting CERN, 19 December, 2007 Glen Cowan.
Lecture II-2: Probability Review
ETM 607 – Random Number and Random Variates
G. Cowan Lectures on Statistical Data Analysis Lecture 7 page 1 Statistical Data Analysis: Lecture 7 1Probability, Bayes’ theorem 2Random variables and.
880.P20 Winter 2006 Richard Kass 1 Confidence Intervals and Upper Limits Confidence intervals (CI) are related to confidence limits (CL). To calculate.
G. Cowan Lectures on Statistical Data Analysis Lecture 1 page 1 Lectures on Statistical Data Analysis YETI IPPP Durham Young Experimentalists and.
Uniovi1 Some distributions Distribution/pdfExample use in HEP BinomialBranching ratio MultinomialHistogram with fixed N PoissonNumber of events found UniformMonte.
Modeling and Simulation CS 313
G. Cowan 2009 CERN Summer Student Lectures on Statistics1 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability densities,
G. Cowan Lectures on Statistical Data Analysis Lecture 3 page 1 Lecture 3 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.
Irakli Chakaberia Final Examination April 28, 2014.
1 Statistical Distribution Fitting Dr. Jason Merrick.

Module 1: Statistical Issues in Micro simulation Paul Sousa.
Basic Concepts in Number Theory Background for Random Number Generation 1.For any pair of integers n and m, m  0, there exists a unique pair of integers.
Random Numbers and Simulation  Generating truly random numbers is not possible Programs have been developed to generate pseudo-random numbers Programs.
Experimental Method and Data Process: “Monte Carlo Method” Presentation # 1 Nafisa Tasneem CHEP,KNU
G. Cowan Statistical Methods in Particle Physics1 Statistical Methods in Particle Physics Day 1: Introduction 清华大学高能物理研究中心 2010 年 4 月 12—16 日 Glen Cowan.
G. Cowan Computing and Statistical Data Analysis / Stat 2 1 Computing and Statistical Data Analysis Stat 2: Catalogue of pdfs London Postgraduate Lectures.
1 Introduction to Statistical Methods for High Energy Physics Glen Cowan 2006 CERN Summer Student Lectures CERN Summer Student Lectures on Statistics Glen.
G. Cowan Lectures on Statistical Data Analysis Lecture 1 page 1 Lectures on Statistical Data Analysis London Postgraduate Lectures on Particle Physics;
Monte Carlo Methods So far we have discussed Monte Carlo methods based on a uniform distribution of random numbers on the interval [0,1] p(x) = 1 0  x.
Practical Statistics for Particle Physicists Lecture 3 Harrison B. Prosper Florida State University European School of High-Energy Physics Anjou, France.
1 A Bayesian statistical method for particle identification in shower counters IX International Workshop on Advanced Computing and Analysis Techniques.
G. Cowan Lectures on Statistical Data Analysis Lecture 1 page 1 Lectures on Statistical Data Analysis RWTH Aachen Graduiertenkolleg February, 2007.
Computer simulation Sep. 9, QUIZ 2 Determine whether the following experiments have discrete or continuous out comes A fair die is tossed and the.
G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 1 Statistical Data Analysis: Lecture 4 1Probability, Bayes’ theorem 2Random variables and.
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
G. Cowan Lectures on Statistical Data Analysis Lecture 8 page 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem 2Random variables and.
1 Introduction to Statistics − Day 3 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Brief catalogue of probability densities.
G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 1 Lecture 4 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.
R.Kass/F02 P416 Lecture 1 1 Lecture 1 Probability and Statistics Introduction: l The understanding of many physical phenomena depend on statistical and.
MAT 4830 Mathematical Modeling 04 Monte Carlo Integrations
1 Introduction to Statistics − Day 2 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Brief catalogue of probability densities.
G. Cowan Lectures on Statistical Data Analysis Lecture 9 page 1 Statistical Data Analysis: Lecture 9 1Probability, Bayes’ theorem 2Random variables and.
Gil McVean, Department of Statistics Thursday February 12 th 2009 Monte Carlo simulation.
G. Cowan Lectures on Statistical Data Analysis Lecture 6 page 1 Statistical Data Analysis: Lecture 6 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Aachen 2014 / Statistics for Particle Physics, Lecture 11 Statistical Methods for Particle Physics Lecture 1: probability, random variables, MC.
0 Simulation Modeling and Analysis: Input Analysis 7 Random Numbers Ref: Law & Kelton, Chapter 7.
MONTE CARLO METHOD DISCRETE SIMULATION RANDOM NUMBER GENERATION Chapter 3 : Random Number Generation.
Introduction to Statistics − Day 2
Ex1: Event Generation (Binomial Distribution)
Lectures on Statistical Data Analysis
Computing and Statistical Data Analysis / Stat 8
Computing and Statistical Data Analysis Stat 3: The Monte Carlo Method
Lecture 3 1 Probability Definition, Bayes’ theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests general.
Statistics for Particle Physics Lecture 1: Fundamentals
Sampling Distributions (§ )
Presentation transcript:

G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 1 Statistical Data Analysis: Lecture 5 1Probability, Bayes’ theorem 2Random variables and probability densities 3Expectation values, error propagation 4Catalogue of pdfs 5The Monte Carlo method 6Statistical tests: general concepts 7Test statistics, multivariate methods 8Goodness-of-fit tests 9Parameter estimation, maximum likelihood 10More maximum likelihood 11Method of least squares 12Interval estimation, setting limits 13Nuisance parameters, systematic uncertainties 14Examples of Bayesian approach

G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 2 What it is: a numerical technique for calculating probabilities and related quantities using sequences of random numbers. The usual steps: (1) Generate sequence r 1, r 2,..., r m uniform in [0, 1]. (2) Use this to produce another sequence x 1, x 2,..., x n distributed according to some pdf f (x) in which we’re interested (x can be a vector). (3) Use the x values to estimate some property of f (x), e.g., fraction of x values with a < x < b gives → MC calculation = integration (at least formally) MC generated values = ‘simulated data’ → use for testing statistical procedures The Monte Carlo method

G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 3 Random number generators Goal: generate uniformly distributed values in [0, 1]. Toss coin for e.g. 32 bit number... (too tiring). → ‘random number generator’ = computer algorithm to generate r 1, r 2,..., r n. Example: multiplicative linear congruential generator (MLCG) n i+1 = (a n i ) mod m, where n i = integer a = multiplier m = modulus n 0 = seed (initial value) N.B. mod = modulus (remainder), e.g. 27 mod 5 = 2. This rule produces a sequence of numbers n 0, n 1,...

G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 4 Random number generators (2) The sequence is (unfortunately) periodic! Example (see Brandt Ch 4): a = 3, m = 7, n 0 = 1 ← sequence repeats Choose a, m to obtain long period (maximum = m  1); m usually close to the largest integer that can represented in the computer. Only use a subset of a single period of the sequence.

G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 5 Random number generators (3) are in [0, 1] but are they ‘random’? Choose a, m so that the r i pass various tests of randomness: uniform distribution in [0, 1], all values independent (no correlations between pairs), e.g. L’Ecuyer, Commun. ACM 31 (1988) 742 suggests a = m = Far better generators available, e.g. TRandom3, based on Mersenne twister algorithm, period =  1 (a “Mersenne prime”). See F. James, Comp. Phys. Comm. 60 (1990) 111; Brandt Ch. 4

G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 6 The transformation method Given r 1, r 2,..., r n uniform in [0, 1], find x 1, x 2,..., x n that follow f (x) by finding a suitable transformation x (r). Require: i.e. That is, setand solve for x (r).

G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 7 Example of the transformation method Exponential pdf: Set and solve for x (r). → works too.)

G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 8 The acceptance-rejection method Enclose the pdf in a box: (1) Generate a random number x, uniform in [x min, x max ], i.e. r 1 is uniform in [0,1]. (2) Generate a 2nd independent random number u uniformly distributed between 0 and f max, i.e. (3) If u < f (x), then accept x. If not, reject x and repeat.

G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 9 Example with acceptance-rejection method If dot below curve, use x value in histogram.

G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 10 Improving efficiency of the acceptance-rejection method The fraction of accepted points is equal to the fraction of the box’s area under the curve. For very peaked distributions, this may be very low and thus the algorithm may be slow. Improve by enclosing the pdf f(x) in a curve C h(x) that conforms to f(x) more closely, where h(x) is a pdf from which we can generate random values and C is a constant. Generate points uniformly over C h(x). If point is below f(x), accept x.

G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 11 Monte Carlo event generators Simple example: e  e  →     Generate cos  and  : Less simple: ‘event generators’ for a variety of reactions: e + e - →    , hadrons,... pp → hadrons, D-Y, SUSY,... e.g. PYTHIA, HERWIG, ISAJET... Output = ‘events’, i.e., for each event we get a list of generated particles and their momentum vectors, types, etc.

12 A simulated event PYTHIA Monte Carlo pp → gluino-gluino

G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 13 Monte Carlo detector simulation Takes as input the particle list and momenta from generator. Simulates detector response: multiple Coulomb scattering (generate scattering angle), particle decays (generate lifetime), ionization energy loss (generate  ), electromagnetic, hadronic showers, production of signals, electronics response,... Output = simulated raw data → input to reconstruction software: track finding, fitting, etc. Predict what you should see at ‘detector level’ given a certain hypothesis for ‘generator level’. Compare with the real data. Estimate ‘efficiencies’ = #events found / # events generated. Programming package: GEANT

G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 14 Wrapping up lecture 5 We’ve now seen the Monte Carlo method: calculations based on sequences of random numbers, used to simulate particle collisions, detector response. So far, we’ve mainly been talking about probability. But suppose now we are faced with experimental data. We want to infer something about the (probabilistic) processes that produced the data. This is statistics, the main subject of the following lectures.

G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 15 Extra slides

G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 16 “True” random numbers 1955 the RAND Corporation published a book of random numbers generated with an “electronic roulette wheel”, based on random frequency electronic pulses. You can download all 1,000,000 of them (and buy the book) from