Statistical Issues for Dark Matter Searches Louis Lyons Imperial College DMUK, Edinburgh July 2017
Theme: Using data to make judgements about H1 (Bgd + DM) versus H0 (just Bgd)
Why Statistics? Experiments are expensive and time-consuming, so it is worth investing effort in statistical analysis → better information from data
Possible topics:
- Blind analysis
- Why 5σ for discovery?
- Significance
- P(A|B) ≠ P(B|A)
- Meaning of p-values
- Wilks' theorem
- LEE = Look Elsewhere Effect
- Background systematics
- Coverage
- p0 v p1 plots
- Upper limits
- Conclusions
(N.B. Several of these topics have no unique solutions from statisticians)
Statistical Procedures
- Parameter determination / upper limits: e.g. MHiggs = 80 ± 2; flux of WIMPs < ? in given mass range
- Goodness of fit: is the data consistent with 'No WIMPs'?
- Hypothesis testing: which theory fits the data better? e.g. D.M. or no D.M. → discovery or exclusion (or cannot decide)
- Decision theory: what expt should I do next? Involves cost functions
Data
1) Counting expt = 1 bin: Nobs counts, with estimated bgd b (± σb)
2) On-off problem = 2 bins: N counts in signal region, M counts in bgd region
3) Distribution F(x) = many bins: fit with B(x) + μS(x)
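For case 1), the p-value of an observed count against the background-only hypothesis is just a Poisson tail probability. A minimal sketch (the function name and the example numbers are illustrative, not from the talk):

```python
import math

def poisson_p_value(n_obs, b):
    """p-value: probability of observing >= n_obs events
    when the expected background is b and there is no signal."""
    # P(N >= n_obs | b) = 1 - P(N <= n_obs - 1 | b)
    below = sum(math.exp(-b) * b**k / math.factorial(k) for k in range(n_obs))
    return 1.0 - below

# e.g. 10 observed counts on an expected background of 3.2
p = poisson_p_value(10, 3.2)
```

This is the one-bin ingredient that the on-off and many-bin cases generalise, once the background itself becomes uncertain.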
BAYES and FREQUENTISM: The Return of an Old Controversy
WHAT IS PROBABILITY?
MATHEMATICAL: formal; based on axioms
FREQUENTIST: ratio of frequencies as n → ∞; repeated "identical" trials; not applicable to a single event or a physical constant
BAYESIAN: degree of belief; can be applied to a single event or a physical constant (even though these have a unique truth); varies from person to person; quantified by a "fair bet"
LEGAL PROBABILITY
Bayesian
Bayes' Theorem: p(param | data) ∝ p(data | param) × p(param)
posterior ∝ likelihood × prior
Problems with the prior p(param): the parameter has a particular true value, yet the prior is a "degree of belief". What functional form? Uninformative prior: flat? In which variable? e.g. m, m², ln m, ...?
"Priors may be OK for parametrising prior knowledge, but not really for prior ignorance"
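The "flat in which variable?" point can be made numerically: with the same Poisson likelihood, a prior flat in μ and a prior flat in μ² give different posteriors. A sketch (the observed count and grid are assumed for illustration):

```python
import math

# Same Poisson likelihood (n = 3 observed counts), two 'uninformative' priors.
n = 3
grid = [0.01 * i for i in range(1, 2001)]   # mu from 0.01 to 20

def posterior_mean(prior):
    """Posterior mean on a grid: posterior ~ likelihood x prior."""
    w = [math.exp(-mu) * mu**n * prior(mu) for mu in grid]
    return sum(mu * wi for mu, wi in zip(grid, w)) / sum(w)

mean_flat_in_mu  = posterior_mean(lambda mu: 1.0)  # prior flat in mu
mean_flat_in_mu2 = posterior_mean(lambda mu: mu)   # prior flat in mu^2
```

The two posterior means differ (4 vs 5 for this likelihood), even though both priors claim to express "ignorance".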
Prior: even more important for UPPER LIMITS
Classical (Neyman) Confidence Intervals
Uses only P(data | theory)
Theoretical parameter μ; observation x
Specific example: μ = temperature at centre of Sun, x = measured solar ν flux
μ ≥ 0; no prior for μ
Methods for Upper Limits
- p-values
- Likelihoods
- χ² (Neyman or Pearson versions)
- Bayesian methods (sensitive to priors)
- Neyman construction for upper limits
- Feldman-Cousins
- CLs = p1/(1 - p0)
CLs = p1/(1 - p0): a conservative frequentist quantity. With 2 hypotheses, each with its own pdf for the test statistic t, the p-values are defined as tail areas pointing in towards each other: p0 is the tail of H0 beyond tobs, p1 is the tail of H1 beyond tobs.
[Figure: panels (a)-(c) show the pdfs of t under H0 and H1, with tobs and the tail areas p0 and p1.]
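A toy version of the CLs construction, assuming (purely for illustration) a Gaussian test statistic of unit width centred at 0 under H0 and at 3 under H1:

```python
import math

def phi(x):
    """Standard normal CDF."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

MU0, MU1 = 0.0, 3.0   # assumed pdf centres under H0 and H1

def cls(t_obs):
    p0 = 1.0 - phi(t_obs - MU0)   # H0 tail above t_obs
    p1 = phi(t_obs - MU1)         # H1 tail below t_obs
    return p1 / (1.0 - p0)

cls_bgd_like = cls(0.0)   # background-like data: H1 strongly disfavoured
```

Dividing p1 by (1 - p0) is what makes CLs conservative: it inflates the exclusion p-value when the data carry little discriminating power between the hypotheses.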
90% Classical 2-sided interval for Gaussian, σ = 1, μ ≥ 0 (e.g. m²(νe)):
xobs = 3 → two-sided range for μ
xobs = 1 → upper limit for μ = 2.6
xobs = -2 → no region for μ
90% Classical Upper Limit for Gaussian, σ = 1, μ ≥ 0 (e.g. m²(νe)):
xobs = 1 → upper limit = 2.3
Conclusion: Be very explicit about what your procedure is
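The slide's 2.3 can be reproduced directly: the 90% classical upper limit is the largest μ for which the observed x is not in the lower 10% tail. A sketch, solving by bisection (function name is illustrative):

```python
import math

def phi(x):
    """Standard normal CDF."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def upper_limit_90(x_obs, sigma=1.0):
    """90% classical UL: largest mu with P(X <= x_obs | mu) >= 10%."""
    lo, hi = x_obs, x_obs + 10.0 * sigma
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if phi((x_obs - mid) / sigma) > 0.10:
            lo = mid   # mu still consistent with x_obs: move up
        else:
            hi = mid
    return 0.5 * (lo + hi)

ul = upper_limit_90(1.0)   # ~2.3, as on the slide (1 + 1.28)
```

Equivalently, μ_up = x_obs + 1.28σ, since 1.28 is the one-sided 90% Gaussian quantile.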
Ilya Narsky, FNAL CLW 2000 (No systematics) Upper Limit is very sensitive to method when n < b
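Narsky's point is easy to see in code: for n < b, a classical upper limit and a Bayesian flat-prior upper limit diverge sharply. A sketch (the counts n = 2, b = 5 are assumed for illustration, not taken from Narsky's study):

```python
import math

def pois_cdf(n, lam):
    """P(N <= n) for a Poisson with mean lam."""
    return sum(math.exp(-lam) * lam**k / math.factorial(k)
               for k in range(n + 1))

def classical_ul(n, b, cl=0.90):
    """Classical UL: largest mu >= 0 with P(N <= n | mu + b) >= 1 - cl."""
    lo, hi = 0.0, 50.0
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if pois_cdf(n, mid + b) > 1.0 - cl:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def bayes_ul(n, b, cl=0.90):
    """Bayesian UL with a flat prior on mu >= 0, background b known."""
    grid = [0.001 * i for i in range(1, 50001)]
    w = [math.exp(-(mu + b)) * (mu + b)**n for mu in grid]
    total, running = sum(w), 0.0
    for mu, wi in zip(grid, w):
        running += wi
        if running >= cl * total:
            return mu
    return grid[-1]

# n < b: 2 observed on an expected background of 5
ul_classical = classical_ul(2, 5.0)   # ~0.3
ul_bayes = bayes_ul(2, 5.0)           # ~3.1
```

With a downward background fluctuation, the classical limit can become arbitrarily tight while the Bayesian limit stays of order a few events, which is why the method must be stated explicitly.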
Including systematics
- Bayes: uses priors to model uncertainties
- Cousins-Highland: Bayesian treatment of systematics for frequentist ULs
- Profile likelihood: L_profile(μ) = L(μ, ν̂(μ)), with the nuisance parameters ν set to their best values for each μ
- Dauncey, Kenzie, Wardle & Davies (IC, CMS): "Handling uncertainties in background shapes: the discrete profiling method", JINST 10 (2015) P04015. Has been used in the CMS analysis of H → γγ
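A minimal sketch of the profile likelihood for the on-off problem, where the background rate b is the nuisance parameter constrained by the off-region count (the toy counts and function names are assumptions for illustration):

```python
import math

def log_like(mu, b, n, m, tau):
    """On/off model: n ~ Poisson(mu + b) in the signal region,
    m ~ Poisson(tau * b) in the background-only region."""
    return (n * math.log(mu + b) - (mu + b)
            + m * math.log(tau * b) - tau * b)

def profile_log_like(mu, n, m, tau):
    """L_profile(mu): maximise the likelihood over the nuisance b
    (crude grid scan; a real analysis would use a proper minimiser)."""
    return max(log_like(mu, b, n, m, tau)
               for b in (0.01 * i for i in range(1, 5001)))

# Assumed toy counts: 12 on, 20 off, off region 4x the signal region,
# so b-hat = 20/4 = 5 and mu-hat = 12 - 5 = 7.
pll_at_mle = profile_log_like(7.0, n=12, m=20, tau=4.0)
```

The profiled curve as a function of μ is what Wilks' theorem then lets one convert into approximate intervals.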
Sensitivity = expected upper limit
Expected = median, mean, or Asimov (can also give 68% and/or 90% bands)
Useful for:
- planning stage of experiment
- optimising the search procedure
- seeing if the observed limit is plausible
- comparing different experiments
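The median-expected and Asimov versions can be compared in a toy: throw background-only pseudo-experiments, compute the limit for each, and take the median; the Asimov limit replaces the ensemble by the single "typical" dataset. A sketch for the Gaussian upper limit (toy count and seed are arbitrary):

```python
import random

random.seed(1)
Z90 = 1.2816   # one-sided 90% Gaussian quantile

def upper_limit(x_obs, sigma=1.0):
    """Classical 90% UL for a Gaussian measurement."""
    return x_obs + Z90 * sigma

# Median expected UL under H0: toys with x ~ N(0, 1).
toys = sorted(upper_limit(random.gauss(0.0, 1.0)) for _ in range(20001))
median_expected = toys[len(toys) // 2]

# Asimov: the single dataset with x at its H0 expectation, x = 0.
asimov = upper_limit(0.0)
```

For this symmetric toy the two agree; for low-count Poisson problems the median, mean and Asimov limits can differ noticeably.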
Upper and lower limits μ_l and μ_u at 90% confidence:
Frequentist: μ_l and μ_u known, but random; μ unknown, but fixed → probability statement about μ_l and μ_u
Bayesian: μ_l and μ_u known, and fixed; μ unknown, and random → probability/credible statement about μ
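The frequentist statement ("the interval is random, the parameter is fixed") is checkable by simulation: for any fixed true μ, the random 90% interval should cover it in ~90% of repeated experiments. A sketch (the true value 3.7, seed and toy count are arbitrary choices):

```python
import random

random.seed(2)
Z90 = 1.2816   # one-sided 90% Gaussian quantile

def covers(mu_true):
    """One toy experiment: is the fixed truth inside the
    random one-sided interval (-inf, x + Z90]?"""
    x = random.gauss(mu_true, 1.0)   # measurement with sigma = 1
    return mu_true <= x + Z90

coverage = sum(covers(3.7) for _ in range(20000)) / 20000.0   # ~0.90
```

Note what is random here: the truth 3.7 never changes, only the interval end does, which is exactly the frequentist reading of "90% confidence".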
Why 5σ for Discovery?
Statisticians ridicule our belief in extreme tails (esp. for systematics). Our reasons:
1) Past history (many 3σ and 4σ effects have gone away)
2) LEE = Look Elsewhere Effect
3) Worries about underestimated systematics
4) Subconscious Bayes calculation:
p(H1|x) / p(H0|x) = [p(x|H1) / p(x|H0)] × [π(H1) / π(H0)]
posterior ratio = likelihood ratio × prior ratio
"Extraordinary claims require extraordinary evidence"
N.B. Points 2), 3) and 4) are experiment-dependent.
Alternative suggestion: L.L., "Discovering the significance of 5σ", http://arxiv.org/abs/1310.1284
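For reference, the one-sided tail probabilities behind the σ thresholds are a one-liner (function name is illustrative):

```python
import math

def p_from_sigma(n_sigma):
    """One-sided Gaussian tail area beyond n_sigma."""
    return 0.5 * math.erfc(n_sigma / math.sqrt(2.0))

p5 = p_from_sigma(5.0)   # ~2.9e-7: the 5 sigma discovery threshold
p3 = p_from_sigma(3.0)   # ~1.3e-3: the 3 sigma level that has often gone away
```

The four orders of magnitude between 3σ and 5σ are the buffer being bought against the LEE, systematics and prior scepticism listed above.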
How many σ's for discovery?

SEARCH          | SURPRISE   | IMPACT       | LEE           | SYSTEMATICS | No. σ
Higgs search    | Medium     | Very high    | M             |             | 5
Single top      | No         | Low          |               |             | 3
SUSY            | Yes        |              | Very large    |             | 7
Bs oscillations | Medium/Low |              | Δm            |             | 4
Neutrino osc    |            | High         | sin²2ϑ, Δm²   |             |
Bs → μμ         |            | Low/Medium   |               |             |
Pentaquark      |            | High/V. high | M, decay mode |             |
(g-2)μ anom     |            |              |               |             |
H spin ≠ 0      |            |              |               |             |
4th gen q, l, ν |            |              | M, mode       |             | 6
Dark energy     |            |              | Strength      |             |
Grav waves      |            |              | Enormous      |             | 8

Suggestions to provoke discussion, rather than 'delivered on Mt. Sinai'.
Bob Cousins: "2 independent expts each with 3.5σ are better than one expt with 5σ"
Resources
Books by particle physicists: Barlow, Behnke, Cowan, James, Lista, Lyons, Roe, ...
PDG: sections on Probability, Statistics and Monte Carlo simulation
PHYSTAT meetings: CERN and FNAL 2000 for U.L.; PhyStat-ν, Japan and FNAL 2016; PhyStat-DM in 2018?
Statistics Committees: collider expts (BaBar, CDF, ATLAS, CMS); maybe for neutrino expts; perhaps for DM
RooStats: e.g. Lyons + Moneta at CERN (2016) and at IPMU (2017). "Too easy to use"
Conclusions
Do your homework: before re-inventing the wheel, try to see if statisticians have already found a solution to your statistics analysis problem. Don't use your own square wheel if a circular one already exists.
Try to achieve consensus.
Move from U.L. → Discovery and Measurements.
Good luck.