BAYES and FREQUENTISM: The Return of an Old Controversy 1 Louis Lyons Imperial College and Oxford University CERN Summer Students July 2014.

Slides:



Advertisements
Similar presentations
Bayes rule, priors and maximum a posteriori
Advertisements

DO’S AND DONT’S WITH LIKELIHOODS Louis Lyons Oxford (CDF)
1 BAYES and FREQUENTISM: The Return of an Old Controversy Louis Lyons Oxford University LBL, January 2008.
Week 11 Review: Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution.
1 Statistical Problems in Particle Physics Louis Lyons Oxford IPAM, November 2004.
BAYES and FREQUENTISM: The Return of an Old Controversy 1 Louis Lyons Imperial College and Oxford University CERN Latin American School March 2015.
Psychology 290 Special Topics Study Course: Advanced Meta-analysis April 7, 2014.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #21.
1 LIMITS Why limits? Methods for upper limits Desirable properties Dealing with systematics Feldman-Cousins Recommendations.
BAYES and FREQUENTISM: The Return of an Old Controversy 1 Louis Lyons Imperial College and Oxford University Heidelberg April 2013.
Bayesian inference Gil McVean, Department of Statistics Monday 17 th November 2008.
1 BAYES versus FREQUENTISM The Return of an Old Controversy The ideologies, with examples Upper limits Systematics Louis Lyons, Oxford University and CERN.
Statistics for Particle Physics: Intervals Roger Barlow Karlsruhe: 12 October 2009.
Slide 1 Statistics for HEP Roger Barlow Manchester University Lecture 1: Probability.
Probability theory Much inspired by the presentation of Kren and Samuelsson.
1 BAYES and FREQUENTISM: The Return of an Old Controversy Louis Lyons Oxford University CERN, October 2006 Lecture 4.
A/Prof Geraint Lewis A/Prof Peter Tuthill
1 BAYES and FREQUENTISM: The Return of an Old Controversy Louis Lyons Oxford University SLAC, January 2007.
Credibility of confidence intervals Dean Karlen / Carleton University Advanced Statistical Techniques in Particle Physics Durham, March 2002.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
I The meaning of chance Axiomatization. E Plurbus Unum.
1 BAYES and FREQUENTISM: The Return of an Old Controversy Louis Lyons Imperial College and Oxford University Vienna May 2011.
Lecture 9: p-value functions and intro to Bayesian thinking Matthew Fox Advanced Epidemiology.
Statistical Analysis of Systematic Errors and Small Signals Reinhard Schwienhorst University of Minnesota 10/26/99.
A quick intro to Bayesian thinking 104 Frequentist Approach 10/14 Probability of 1 head next: = X Probability of 2 heads next: = 0.51.
1 Probability and Statistics  What is probability?  What is statistics?
Statistical Decision Theory
Statistics for Data Miners: Part I (continued) S.T. Balke.
Dr. Gary Blau, Sean HanMonday, Aug 13, 2007 Statistical Design of Experiments SECTION I Probability Theory Review.
Theory of Probability Statistics for Business and Economics.
G. Cowan Lectures on Statistical Data Analysis Lecture 1 page 1 Lectures on Statistical Data Analysis London Postgraduate Lectures on Particle Physics;
Statistical Data Analysis and Simulation Jorge Andre Swieca School Campos do Jordão, January,2003 João R. T. de Mello Neto.
Statistics Lectures: Questions for discussion Louis Lyons Imperial College and Oxford CERN Summer Students July
Practical Statistics for Particle Physicists Lecture 3 Harrison B. Prosper Florida State University European School of High-Energy Physics Anjou, France.
Likelihood function and Bayes Theorem In simplest case P(B|A) = P(A|B) P(B)/P(A) and we consider the likelihood function in which we view the conditional.
Bayesian vs. frequentist inference frequentist: 1) Deductive hypothesis testing of Popper--ruling out alternative explanations Falsification: can prove.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Statistics In HEP Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 1 How do we understand/interpret our measurements.
Frequentistic approaches in physics: Fisher, Neyman-Pearson and beyond Alessandro Palma Dottorato in Fisica XXII ciclo Corso di Probabilità e Incertezza.
Making sense of randomness
Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University.
1 Practical Statistics for Physicists CERN Latin American School March 2015 Louis Lyons Imperial College and Oxford CMS expt at LHC
Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.
Conditional Probability Mass Function. Introduction P[A|B] is the probability of an event A, giving that we know that some other event B has occurred.
1 Introduction to Statistics − Day 3 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Brief catalogue of probability densities.
G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 1 Lecture 4 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.
1 27 March 2000CL Workshop, Fermilab, Harrison B. Prosper The Reverend Bayes and Solar Neutrinos Harrison B. Prosper Florida State University 27 March,
Statistical NLP: Lecture 4 Mathematical Foundations I: Probability Theory (Ch2)
In Bayesian theory, a test statistics can be defined by taking the ratio of the Bayes factors for the two hypotheses: The ratio measures the probability.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
1 Practical Statistics for Physicists CERN Summer Students July 2014 Louis Lyons Imperial College and Oxford CMS expt at LHC
1 BAYES and FREQUENTISM: The Return of an Old Controversy Louis Lyons Imperial College and Oxford University Amsterdam Oct 2009.
Bayesian Estimation and Confidence Intervals Lecture XXII.
BAYES and FREQUENTISM: The Return of an Old Controversy
Bayesian Estimation and Confidence Intervals
What is Probability? Bayes and Frequentism
Statistical Issues for Dark Matter Searches
BAYES and FREQUENTISM: The Return of an Old Controversy
Confidence Intervals and Limits
Bayes for Beginners Stephanie Azzopardi & Hrvoje Stojic
Lecture 4 1 Probability (90 min.)
Statistical NLP: Lecture 4
Lecture 4 1 Probability Definition, Bayes’ theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests general.
Wellcome Trust Centre for Neuroimaging
Bayes for Beginners Luca Chech and Jolanda Malamud
CS 594: Empirical Methods in HCC Introduction to Bayesian Analysis
Lecture 4 1 Probability Definition, Bayes’ theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests general.
Statistics for HEP Roger Barlow Manchester University
Mathematical Foundations of BME Reza Shadmehr
Presentation transcript:

BAYES and FREQUENTISM: The Return of an Old Controversy 1 Louis Lyons Imperial College and Oxford University CERN Summer Students July 2014

2

3 It is possible to spend a lifetime analysing data without realising that there are two very different fundamental approaches to statistics: Bayesianism and Frequentism.

4 How can textbooks not even mention Bayes / Frequentism? For simplest case with no constraint onthen at some probability, for both Bayes and Frequentist (but different interpretations) See Bob Cousins “Why isn’t every physicist a Bayesian?” Amer Jrnl Phys 63(1995)398

5 We need to make a statement about Parameters, Given Data The basic difference between the two: Bayesian : Probability (parameter, given data) (an anathema to a Frequentist!) Frequentist : Probability (data, given parameter) (a likelihood function)

6 PROBABILITY MATHEMATICAL Formal Based on Axioms FREQUENTIST Ratio of frequencies as n  infinity Repeated “identical” trials Not applicable to single event or physical constant BAYESIAN Degree of belief Can be applied to single event or physical constant (even though these have unique truth) Varies from person to person *** Quantified by “fair bet”

7 Bayesian versus Classical Bayesian P(A and B) = P(A;B) x P(B) = P(B;A) x P(A) e.g. A = event contains t quark B = event contains W boson or A = I am in CERN B = I am giving a lecture P(A;B) = P(B;A) x P(A) /P(B) Completely uncontroversial, provided….

8 Bayesian PosteriorLikelihoodPrior Problems: 1) p(param) Has particular value For Bayesian, “Degree of my belief” 2) Prior What functional form? Maybe OK if previous measurement More difficult to parametrise ignorance More troubles in many dimensions Bayes’ Theorem p(param | data) α p(data | param) * p(param)

9 Data overshadows prior Mass of Z boson (from LEP)

10 Prior Even more important for UPPER LIMITS L

11 Prior = zero in unphysical region Posterior for m 2 υe = L x Prior Mass-squared of neutrino

Bayesian posterior  intervals 12 Upper limit Lower limit Central interval Shortest

13 Example: Is coin fair ? Toss coin: 5 consecutive tails What is P(unbiased; data) ? i.e. p = ½ Depends on Prior(p) If village priest: prior ~ δ (p = 1/2 ) If stranger in pub: prior ~ 1 for 0 < p <1 (also needs cost function)

14 P (Data;Theory) P (Theory;Data)

15 P (Data;Theory) P (Theory;Data) Theory = male or female Data = pregnant or not pregnant P (pregnant ; female) ~ 3%

16 P (Data;Theory) P (Theory;Data) Theory = male or female Data = pregnant or not pregnant P (pregnant ; female) ~ 3% but P (female ; pregnant) >>>3%

17 P (Data;Theory) P (Theory;Data) HIGGS SEARCH at CERN Is data consistent with Standard Model? or with Standard Model + Higgs? End of Sept 2000: Data not very consistent with S.M. Prob (Data ; S.M.) < 1% valid frequentist statement Turned by the press into: Prob (S.M. ; Data) 99% i.e. “It is almost certain that the Higgs has been seen”

18 Classical Approach Neyman “confidence interval” avoids pdf for Uses only P( x; ) Confidence interval : P( contains ) = True for any Varying intervals from ensemble of experiments fixed Gives range of for which observed value was “likely” ( ) Contrast Bayes : Degree of belief = is in

19 μ≥0 No prior for μ Classical (Neyman) Confidence Intervals Uses only P(data|theory) Example: Param = Temp at centre of Sun Data = est. flux of solar neutrinos Theoretical Parameter µ Data x

20 μ≥0 No prior for μ Classical (Neyman) Confidence Intervals Uses only P(data|theory) Example: Param = Temp at centre of Sun Data = est. flux of solar neutrinos Theoretical Parameter µ Data x Data x µ range <1.5 Empty 1.5 – 2.2 Upper limit >2.2 2-sided

21 at 90% confidence and known, but random unknown, but fixed Probability statement about and Frequentist Bayesian and known, and fixed unknown, and random Probability/credible statement about

Basis of method Bayes Theorem  Posterior probability distribution Uses pdf for data, for fixed parameters Meaning of probability Degree of beliefFrequentist definition Prob of parameters? YesAnathema Needs prior?YesNo Choice of interval? YesYes (except F+C) Data considered Only data you have….+ other possible data Likelihood principle? YesNo 22 Bayesian versus Frequentism Bayesian Frequentist

Bayesian versus Frequentism Ensemble of experiment NoYes (but often not explicit) Final statement Posterior probability distribution Parameter values  Data is likely Unphysical/ empty ranges Excluded by priorCan occur SystematicsIntegrate over priorExtend dimensionality of frequentist construction CoverageUnimportantBuilt-in Decision making Yes (uses cost function)Not useful 23 Bayesian Frequentist

24 Bayesianism versus Frequentism “Bayesians address the question everyone is interested in, by using assumptions no-one believes” “Frequentists use impeccable logic to deal with an issue of no interest to anyone”

25 Recommended to use both Frequentist and Bayesian approaches If agree, that’s good If disagree, see whether it is just because of different approaches Approach used at LHC

26 Tomorrow: χ 2 and Goodness of Fit THE paradox L ikelihoods for parameter determination