Practical Statistics for Particle Physicists Lecture 3 Harrison B. Prosper Florida State University European School of High-Energy Physics Anjou, France.

Slides:



Advertisements
Similar presentations
J. Daunizeau Institute of Empirical Research in Economics, Zurich, Switzerland Brain and Spine Institute, Paris, France Bayesian inference.
Advertisements

Week 11 Review: Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution.
CHAPTER 8 More About Estimation. 8.1 Bayesian Estimation In this chapter we introduce the concepts related to estimation and begin this by considering.
Bayesian inference “Very much lies in the posterior distribution” Bayesian definition of sufficiency: A statistic T (x 1, …, x n ) is sufficient for 
Practical Statistics for LHC Physicists Bayesian Inference Harrison B. Prosper Florida State University CERN Academic Training Lectures 9 April, 2015.
Psychology 290 Special Topics Study Course: Advanced Meta-analysis April 7, 2014.
Sampling Distributions (§ )
Bayesian inference Gil McVean, Department of Statistics Monday 17 th November 2008.
An Introduction to Bayesian Inference Michael Betancourt April 8,
G. Cowan Statistics for HEP / NIKHEF, December 2011 / Lecture 3 1 Statistical Methods for Particle Physics Lecture 3: Limits for Poisson mean: Bayesian.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 7 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
G. Cowan Discovery and limits / DESY, 4-7 October 2011 / Lecture 3 1 Statistical Methods for Discovery and Limits Lecture 3: Limits for Poisson mean: Bayesian.
Statistical Analysis of Systematic Errors and Small Signals Reinhard Schwienhorst University of Minnesota 10/26/99.
Review of normal distribution. Exercise Solution.
Statistical aspects of Higgs analyses W. Verkerke (NIKHEF)
CDF Joint Physics Group June 27, 2003 Rick FieldPage 1 PYTHIA Tune A versus Run 2 Data  Compare PYTHIA Tune A with Run 2 data on the “underlying event”.
1 Probability and Statistics  What is probability?  What is statistics?
Bayesian Inference, Basics Professor Wei Zhu 1. Bayes Theorem Bayesian statistics named after Thomas Bayes ( ) -- an English statistician, philosopher.
Harrison B. Prosper Workshop on Top Physics, Grenoble Bayesian Statistics in Analysis Harrison B. Prosper Florida State University Workshop on Top Physics:
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.
G. Cowan Lectures on Statistical Data Analysis Lecture 1 page 1 Lectures on Statistical Data Analysis London Postgraduate Lectures on Particle Physics;
Maximum Likelihood - "Frequentist" inference x 1,x 2,....,x n ~ iid N( ,  2 ) Joint pdf for the whole random sample Maximum likelihood estimates.
Bayesian vs. frequentist inference frequentist: 1) Deductive hypothesis testing of Popper--ruling out alternative explanations Falsification: can prove.
Statistics In HEP Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 1 How do we understand/interpret our measurements.
Frequentistic approaches in physics: Fisher, Neyman-Pearson and beyond Alessandro Palma Dottorato in Fisica XXII ciclo Corso di Probabilità e Incertezza.
1 TTU report “Dijet Resonances” Kazim Gumus and Nural Akchurin Texas Tech University Selda Esen and Robert M. Harris Fermilab Jan. 26, 2006.
Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University.
Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #25.
BAYES and FREQUENTISM: The Return of an Old Controversy 1 Louis Lyons Imperial College and Oxford University CERN Summer Students July 2014.
Bayes Theorem. Prior Probabilities On way to party, you ask “Has Karl already had too many beers?” Your prior probabilities are 20% yes, 80% no.
Experience from Searches at the Tevatron Harrison B. Prosper Florida State University 18 January, 2011 PHYSTAT 2011 CERN.
INTRODUCTION TO CLINICAL RESEARCH Introduction to Statistical Inference Karen Bandeen-Roche, Ph.D. July 12, 2010.
Practical Statistics for Particle Physicists Lecture 2 Harrison B. Prosper Florida State University European School of High-Energy Physics Parádfürdő,
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #24.
G. Cowan Lectures on Statistical Data Analysis Lecture 8 page 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem 2Random variables and.
Study of pair-produced doubly charged Higgs bosons with a four muon final state at the CMS detector (CMS NOTE 2006/081, Authors : T.Rommerskirchen and.
G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 1 Lecture 4 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.
G. Cowan, RHUL Physics Statistics for early physics page 1 Statistics jump-start for early physics ATLAS Statistics Forum EVO/Phone, 4 May, 2010 Glen Cowan.
1 27 March 2000CL Workshop, Fermilab, Harrison B. Prosper The Reverend Bayes and Solar Neutrinos Harrison B. Prosper Florida State University 27 March,
- 1 - Outline Introduction to the Bayesian theory –Bayesian Probability –Bayes’ Rule –Bayesian Inference –Historical Note Coin trials example Bayes rule.
Don LincolnExperimental QCD and W/Z+Jet Results 1 Recent Dijet Measurements at DØ Don Lincoln Fermi National Accelerator Laboratory for the DØ Collaboration.
G. Cowan Lectures on Statistical Data Analysis Lecture 6 page 1 Statistical Data Analysis: Lecture 6 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 1 Statistical Data Analysis: Lecture 5 1Probability, Bayes’ theorem 2Random variables and.
In Bayesian theory, a test statistics can be defined by taking the ratio of the Bayes factors for the two hypotheses: The ratio measures the probability.
1 CMS Sensitivity to Quark Contact Interactions with Dijets Selda Esen (Brown) Robert M. Harris (Fermilab) DPF Meeting Nov 1, 2006.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
Systematics in Hfitter. Reminder: profiling nuisance parameters Likelihood ratio is the most powerful discriminant between 2 hypotheses What if the hypotheses.
Bayesian Brain Probabilistic Approaches to Neural Coding 1.1 A Probability Primer Bayesian Brain Probabilistic Approaches to Neural Coding 1.1 A Probability.
From Small-N to Large Harrison B. Prosper SCMA IV, June Bayesian Methods in Particle Physics: From Small-N to Large Harrison B. Prosper Florida State.
Outline Historical note about Bayes’ rule Bayesian updating for probability density functions –Salary offer estimate Coin trials example Reading material:
Investigation on CDF Top Physics Group Ye Li Graduate Student UW - Madison.
Practical Statistics for LHC Physicists Frequentist Inference Harrison B. Prosper Florida State University CERN Academic Training Lectures 8 April, 2015.
Practical Statistics for Particle Physicists Lecture 3 Harrison B. Prosper Florida State University European School of High-Energy Physics Parádfürdő,
Confidence Intervals Lecture 2 First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University.
Status of the Higgs to tau tau
Lecture 1.31 Criteria for optimal reception of radio signals.
The expected confident intervals for triple gauge coupling parameter
BAYES and FREQUENTISM: The Return of an Old Controversy
Proposals for near-future BG determinations from control regions
Confidence Intervals and Limits
Bayesian Inference, Basics
Bayesian inference J. Daunizeau
CS639: Data Management for Data Science
Computing and Statistical Data Analysis / Stat 10
Presentation transcript:

Practical Statistics for Particle Physicists Lecture 3 Harrison B. Prosper Florida State University European School of High-Energy Physics Anjou, France 6 – 19 June, ESHEP2012 Practical Statistics Harrison B. Prosper

Outline  Lecture 1  Descriptive Statistics  Probability  Likelihood  The Frequentist Approach – 1  Lecture 2  The Frequentist Approach – 2  The Bayesian Approach  Lecture 3 – Analysis Example 2

Practicum Toy data and code at topdiscovery.tar contactinteractions.tar just download and unpack 3

Recap: The Bayesian Approach – 1 Definition: A method is Bayesian if 1.it is based on the subjective interpretation of probability and 2.it uses Bayes’ theorem for all inferences. Dobserved data θparameter of interest ωnuisance parameters πprior density ESHEP2012 Practical Statistics Harrison B. Prosper4

Recap: The Bayesian Approach – 2 Nuisance parameters are removed by marginalization: in contrast to profiling, which can be thought of as marginalization with a δ-function prior ESHEP2012 Practical Statistics Harrison B. Prosper5

Recap: The Bayesian Approach – 3 Bayes theorem can be used to compute the probability of a model Dobserved data θ M parameters of model M Mmodel ωnuisance parameters πprior density ESHEP2012 Practical Statistics Harrison B. Prosper6

Bayesian Methods: Theory & Practice. Harrison B. Prosper7 Recap: The Bayesian Approach – 5 1.Factorize the priors:  (  , ω, M) =  (θ M, ω|M)  (M) 2.Then, for each model, M, compute the function 3.Then, compute the probability of each model, M

8 Recap: The Bayesian Approach – 6 In order to compute p(M|D), however, two things are needed: 1.Proper priors over the parameter spaces 2.The priors  (M). Usually, we compute the Bayes factor: which is the ratio in the first bracket, B 10.

An Analysis Example Search for Contact Interactions

In our current theories, all interactions proceed via the exchange of particles: But,… Contact Interactions – 1

Contact Interactions – 2 … when the experimentally available energies are << the mass of the exchanged particles, the interactions can be approximated as contact interactions (CI), for example:

Contact Interactions – 3 Consider the model* L ~ λ Ψγ μ Ψ Ψγ μ Ψ. with λ = ξ / Λ 2, where ξ can be positive or negative. At leading order, the possible reactions are: The amplitude is linear in λ a = a SM + λ a CI Eichten, Hinchliffe, Lane, Quigg, Rev. Mod. Phys. 56, 579 (1984)

Contact Interactions – 4 Contact interactions calculated at leading order with Pythia 6. QCD calculated at next-to-leading order with FastNLO. Expect cross section in each bin to be of the form

Bayesian Analysis

Simulated Data – 1 Data M = 25 bins (362 ≤ pT ≤ 2000) D = 575,999 to 0 (large dynamic range!) Parameters λ =parameter of interest nuisance parameters c = QCD cross section per p T bin b, a = signal parameters for destructive interference, b < 0 ESHEP2012 Practical Statistics Harrison B. Prosper15

Simulated Data – 2 Choose b < 0 Assume integrated luminosity of 5 fb α = 5000 pb -1

Analysis – 1 Step 1. Assume the following probability model for the observations where ESHEP2012 Practical Statistics Harrison B. Prosper17

Analysis Issues 1.Sensitive to jet energy scale (JES) 2.Sensitive to the parton distribution functions (PDF) 3.Large dynamic range causes the limits on Λ to be very sensitive to the value of α. For example, changing α from 5000 to 5030 decreases the limit by 25%! Solution: 1., 2. Integrate likelihood over JES and PDF parameters 3. Integrate likelihood over the scale factor α ESHEP2012 Practical Statistics Harrison B. Prosper18

Analysis – 2 Step 2: We can re-write as where ESHEP2012 Practical Statistics Harrison B. Prosper19 Exercise 11: Show this

Analysis – 3 We now eliminate α by integrating with respect to α. But to do so, we need a prior density for α. In the absence of reliable information about this parameter, we shall use which is an example of a reference prior. ESHEP2012 Practical Statistics Harrison B. Prosper20

Analysis – 4 Step 3: The integration with respect to α yields Step 4: Randomly sample from: 1.the jet energy scale, 2.jet energy resolution, 3.the PDF parameter sets, 4.the factorization an renormalization scales 5.and any other nuisance parameters of the problem This generates an ensemble of points {ω i } ESHEP2012 Practical Statistics Harrison B. Prosper21

Analysis – 5 Step 5: We approximate the posterior density using where, again, we use a reference prior for π (λ|ω). It turns out that this prior can be calculated exactly. ESHEP2012 Practical Statistics Harrison B. Prosper22

Analysis – 6 Step 6: Finally, we can compute a 95% Bayesian interval by solving for λ UP, from which we compute Λ = 1/√λ UP. For the simulated data (and ignoring Step 4., i.e., systematic uncertainties), we obtain Λ > % CL ESHEP2012 Practical Statistics Harrison B. Prosper23 Exercise 12: Write a program to implement this analysis

24 Summary Probability Two main interpretations: 1.Degree of belief 2.Relative frequency Likelihood Function Main ingredient in any non-trivial statistical analysis. Frequentist Principle Construct statements such that a given (minimum) fraction of them will be true over a given ensemble of statements.

Summary Frequentist Approach 1.Use likelihood function only 2.Eliminate nuisance parameters by profiling 3.Fisher: Reject null if p-value is judged to be too small 4.Neyman: Decide on a fixed threshold for rejection and reject null if threshold has been breached, but only if the probability of the alternative is high enough Bayesian Approach 1)Model all uncertainty using probabilities and use Bayes’ theorem to make inferences. 2)Eliminate nuisance parameters through marginalization. ESHEP2012 Practical Statistics Harrison B. Prosper25