Sensitivity of searches for new signals and its optimization

Slides:



Advertisements
Similar presentations
27 th March CERN Higgs searches: CL s W. J. Murray RAL.
Advertisements

2 – In previous chapters: – We could design an optimal classifier if we knew the prior probabilities P(wi) and the class- conditional probabilities P(x|wi)
1 LIMITS Why limits? Methods for upper limits Desirable properties Dealing with systematics Feldman-Cousins Recommendations.
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 18, Slide 1 Chapter 18 Confidence Intervals for Proportions.
Visual Recognition Tutorial
Chapter 19: Confidence Intervals for Proportions
Statistics In HEP 2 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 1 How do we understand/interpret our measurements.
Parameter Estimation: Maximum Likelihood Estimation Chapter 3 (Duda et al.) – Sections CS479/679 Pattern Recognition Dr. George Bebis.
8. Statistical tests 8.1 Hypotheses K. Desch – Statistical methods of data analysis SS10 Frequent problem: Decision making based on statistical information.
PHYSTAT 05P. Catastini Bias-Free Estimation in Multicomponent Maximum Likelihood Fits with Component-Dependent Templates Pierluigi Catastini I.N.F.N. -
Basics of Statistical Estimation. Learning Probabilities: Classical Approach Simplest case: Flipping a thumbtack tails heads True probability  is unknown.
End of Chapter 8 Neil Weisenfeld March 28, 2005.
Maximum Likelihood (ML), Expectation Maximization (EM)
G. Cowan 2011 CERN Summer Student Lectures on Statistics / Lecture 41 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability.
Examining the balance between optimising an analysis for best limit setting and best discovery potential Gary C. Hill, Jessica Hodges, Brennan Hughey,
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
7-2 Estimating a Population Proportion
Inferences About Process Quality
Statistical Analysis of Systematic Errors and Small Signals Reinhard Schwienhorst University of Minnesota 10/26/99.
Statistical inference: confidence intervals and hypothesis testing.
880.P20 Winter 2006 Richard Kass 1 Confidence Intervals and Upper Limits Confidence intervals (CI) are related to confidence limits (CL). To calculate.
Statistical Decision Theory
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Prof. Dr. S. K. Bhattacharjee Department of Statistics University of Rajshahi.
G. Cowan 2009 CERN Summer Student Lectures on Statistics1 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability densities,
1 Lesson 3: Choosing from distributions Theory: LLN and Central Limit Theorem Theory: LLN and Central Limit Theorem Choosing from distributions Choosing.
+ Warm-Up4/8/13. + Warm-Up Solutions + Quiz You have 15 minutes to finish your quiz. When you finish, turn it in, pick up a guided notes sheet, and wait.
Practical Statistics for Particle Physicists Lecture 3 Harrison B. Prosper Florida State University European School of High-Energy Physics Anjou, France.
Statistics In HEP Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 1 How do we understand/interpret our measurements.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.2 Estimating a Population Proportion.
1 Chapter 8 Hypothesis Testing 8.2 Basics of Hypothesis Testing 8.3 Testing about a Proportion p 8.4 Testing about a Mean µ (σ known) 8.5 Testing about.
A statistical test for point source searches - Aart Heijboer - AWG - Cern june 2002 A statistical test for point source searches Aart Heijboer contents:
Probability and Measure September 2, Nonparametric Bayesian Fundamental Problem: Estimating Distribution from a collection of Data E. ( X a distribution-valued.
Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.
"Classical" Inference. Two simple inference scenarios Question 1: Are we in world A or world B?
G. Cowan, RHUL Physics Discussion on significance page 1 Discussion on significance ATLAS Statistics Forum CERN/Phone, 2 December, 2009 Glen Cowan Physics.
Experience from Searches at the Tevatron Harrison B. Prosper Florida State University 18 January, 2011 PHYSTAT 2011 CERN.
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #24.
1 Introduction to Statistics − Day 3 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Brief catalogue of probability densities.
A significance test or hypothesis test is a procedure for comparing our data with a hypothesis whose truth we want to assess. The hypothesis is usually.
Machine Learning 5. Parametric Methods.
ES 07 These slides can be found at optimized for Windows)
Javad Azimi, Ali Jalali, Xiaoli Fern Oregon State University University of Texas at Austin In NIPS 2011, Workshop in Bayesian optimization, experimental.
G. Cowan, RHUL Physics Statistics for early physics page 1 Statistics jump-start for early physics ATLAS Statistics Forum EVO/Phone, 4 May, 2010 Glen Cowan.
Copyright© 1998, Triola, Elementary Statistics by Addison Wesley Longman 1 Testing a Claim about a Mean: Large Samples Section 7-3 M A R I O F. T R I O.
In Bayesian theory, a test statistics can be defined by taking the ratio of the Bayes factors for the two hypotheses: The ratio measures the probability.
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
Statistics 19 Confidence Intervals for Proportions.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
G. Cowan SLAC Statistics Meeting / 4-6 June 2012 / Two Developments 1 Two developments in discovery tests: use of weighted Monte Carlo events and an improved.
Chapter 10 Confidence Intervals for Proportions © 2010 Pearson Education 1.
Giovanni Punzi SNS & INFN-Pisa
Discussion on significance
Confidence Intervals for Proportions
Proposals for near-future BG determinations from control regions
Confidence Intervals and Limits
Confidence Intervals for Proportions
Lecture 4 1 Probability (90 min.)
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Lecture 4 1 Probability Definition, Bayes’ theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests general.
Confidence Intervals for Proportions
Introduction to Statistics − Day 4
Lecture 4 1 Probability Definition, Bayes’ theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests general.
Lecture Slides Elementary Statistics Twelfth Edition
Section 11.1: Significance Tests: Basics
Confidence Intervals for Proportions
Presentation transcript:

Sensitivity of searches for new signals and its optimization Giovanni Punzi SNS & INFN - Pisa

Giovanni Punzi - PHYSTAT2003 Outline of this talk Unavoidable preamble: what is a ‘search’ Issues with the definition of “sensitivity” A better definition of sensitivity Application to counting experiments Optimizing sensitivity Real-life applications Giovanni Punzi - PHYSTAT2003 9/8/2003

Giovanni Punzi - PHYSTAT2003 What is a `search’ ? Search = Test of Hypotesis + Limits I have an observable x I have a default theory H0 (“no signal”) I have a set of alternatives Hm,depending on a parameter m (e.g. a mass, or x-section of a new process) I will choose a significance level  and define an appropriate critical region I don’t care how exactly you do it. If x in critical region, I have a discovery. I ignore what you do after (parameter estimation?) A useful classical concept: test power: 1-(m) (m) = probability of missing the discovery when Hm If I didn’t discover anything, I will set limits on m H0 Hm p(x) x critical accept.   1 -  m Giovanni Punzi - PHYSTAT2003 9/8/2003

Giovanni Punzi - PHYSTAT2003 Plan B: Setting limits I consider Neyman’s Confidence Limits, at a (previously) chosen CL Hm m2 m1 x Can use a variety of methods, in general no connection with the Test Making the two agree may be difficult (limits may exclude H0 while no discovery was claimed...). A difficult issue, but outside the scope of this talk I will make only a minimal assumption (wait for more) Giovanni Punzi - PHYSTAT2003 9/8/2003

How sensitive is a search experiment? Usually have many possibilities regarding the sample to use for the measurement (cuts, observables…) - how do you make the optimal choice ? Apart from optimization, you sometimes need to be able to quote a figure Several approaches, based on two alternative views: Assume no signal, and ask how tight your limits will be. Suppose a signal exists, and look at some measure of “significance” S/sqrt(B) , S/sqrt(S+B) , S/(sqrt(B)+sqrt(B+S))… Large power Giovanni Punzi - PHYSTAT2003 9/8/2003

Troubles with sensitivity definitions No unified view - lead to quote two separate figures (can adoption of a particular viewpoint produce a bias?) Limit optimization hard to define except for 1-D & 1-sided Use mean or median limit Problems with significances: They are “expected”, not actual, significances S/sqrt(B) diverges for small B (B=0.0001 with S=0.1 is called good) The others are not obviously what you need. Also, they depend on the cross section of the signal being sought (may be unknown) In principle, what you want is maximum power. But: In general you can’t maximize simultaneously 1-(m) for every m You don’t really care about turning 90 into 100, or 0.01 into 0.1 Giovanni Punzi - PHYSTAT2003 9/8/2003

A different definition of sensitivity of a search Sensitivity not given by a number, but a REGION in parameter space Define sensitivity region of the experiment the set of m values such that: 1 - (m) > CL The following two statements hold simultaneously: If the true value of m is anywhere inside the sensitivity region then you have a probability > CL of claiming discovery (@signif.  ) If you don’t get a discovery result then you will be able* to exclude (at least) the sensitivity region @CL C.L. (N.B. holds independently of the true value of m !) an exclusion region sensitivity region m2 m1 *green covers blue (1- CL > ) Yields tighter limits for free Giovanni Punzi - PHYSTAT2003 9/8/2003

Advantages of the proposed definition of sensitivity Makes your experiment predictable Good for planning: if one (or a set of experiments) cover the whole parameter space of a theory, you know you have it covered. “Unified” view of sensitivity Good candidate to optimization, whether or not you expect a signal: in both cases you want the sensitivity region to be as large as possible No dependence on metric or priors (purely frequentist) Works in any number of dimensions Independent of choice of a limit-setting algorithm Giovanni Punzi - PHYSTAT2003 9/8/2003

Application to Poisson (“counting experiment”) Sensitivity region takes the simple form S(m) > Smin(B) Smin B (0.95,0.95) (“3”,0.95) (“5”,0.90) (a, CL) Giovanni Punzi - PHYSTAT2003 9/8/2003

Use in optimization - compare to “significance” 1/Smin S depends on cuts t: S(m,t) = (t)*L*(m) > Smin => (m) > Smin/((t)*L) => maximize (t)/Smin Very useful feature: independent on cross section expected for signal (as S/√B, but it does not diverge ) 1/√B 1/√(S+B) B Giovanni Punzi - PHYSTAT2003 9/8/2003

Giovanni Punzi - PHYSTAT2003 Approximate formulas (a,b) = # of sigmas for (,) Tail-improved Gaussian approx. Simplest: approximate b~a Giovanni Punzi - PHYSTAT2003 9/8/2003

A real-life example Initial search for charmless B decays@CDF S/(a/2+√B) Optimal formula eliminates fake solution with tight cuts Signal found (summer 2002). Giovanni Punzi - PHYSTAT2003 9/8/2003

Another real-life example Search for the rare decay D0->µ+µ- @CDF Expect B=1.8 No signal found, improve on current best limit (hep-ex/0308059, recently submitted to PRD) Giovanni Punzi - PHYSTAT2003 9/8/2003

Giovanni Punzi - PHYSTAT2003 Conclusions S/(a/2 + √B) a pretty good idea Giovanni Punzi - PHYSTAT2003 9/8/2003