Program for evaluation of the significance, confidence intervals and limits by direct probabilities calculations S.Bityukov (IHEP,Protvino), S.Erofeeva(MSA.

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

The Maximum Likelihood Method
Estimation of Means and Proportions
CHAPTER 2 Building Empirical Model. Basic Statistical Concepts Consider this situation: The tension bond strength of portland cement mortar is an important.
Using the Profile Likelihood in Searches for New Physics / PHYSTAT 2011 G. Cowan 1 Using the Profile Likelihood in Searches for New Physics arXiv:
Bayesian inference “Very much lies in the posterior distribution” Bayesian definition of sufficiency: A statistic T (x 1, …, x n ) is sufficient for 
The Bayesian Effects in measurement of Asymmetry of Poisson Traffic Flows S.Bityukov (IHEP,Protvino),N.Krasnikov (INR RAS,Moscow), A.Kuznetsov(NEU,Boston),V.Smirnova.
N.D.GagunashviliUniversity of Akureyri, Iceland Pearson´s χ 2 Test Modifications for Comparison of Unweighted and Weighted Histograms and Two Weighted.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem, random variables, pdfs 2Functions.
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Evaluating Hypotheses
G. Cowan RHUL Physics Comment on use of LR for limits page 1 Comment on definition of likelihood ratio for limits ATLAS Statistics Forum CERN, 2 September,
G. Cowan 2011 CERN Summer Student Lectures on Statistics / Lecture 41 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability.
EEM332 Design of Experiments En. Mohd Nazri Mahmud
Inference about a Mean Part II
Experimental Evaluation
Lehrstuhl für Informatik 2 Gabriella Kókai: Maschine Learning 1 Evaluating Hypotheses.
Lecture II-2: Probability Review
TOPLHCWG. Introduction The ATLAS+CMS combination of single-top production cross-section measurements in the t channel was performed using the BLUE (Best.
The Neymann-Pearson Lemma Suppose that the data x 1, …, x n has joint density function f(x 1, …, x n ;  ) where  is either  1 or  2. Let g(x 1, …,
Chapter 9 Title and Outline 1 9 Tests of Hypotheses for a Single Sample 9-1 Hypothesis Testing Statistical Hypotheses Tests of Statistical.
Statistical aspects of Higgs analyses W. Verkerke (NIKHEF)
Chapter 6 The Normal Probability Distribution
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
Distinguishability of Hypotheses S.Bityukov (IHEP,Protvino; INR RAS, Moscow) N.Krasnikov (INR RAS, Moscow ) December 1, 2003 ACAT’2003 KEK, Japan S.Bityukov.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
G. Cowan 2009 CERN Summer Student Lectures on Statistics1 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability densities,
Estimation Bias, Standard Error and Sampling Distribution Estimation Bias, Standard Error and Sampling Distribution Topic 9.
Slide Slide 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim about a Proportion 8-4 Testing a Claim About.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
 Candidate events are selected by reconstructing a D, called a tag, in several hadronic modes  Then we reconstruct the semileptonic decay in the system.
Two approaches to Combining Significance S.Bityukov, N.Krasnikov, A.Nikitenko November 4, 2008 ACAT ’ 2008 Erice, Sicily, ItalyS.Bityukov.
1 SMU EMIS 7364 NTU TO-570-N Inferences About Process Quality Updated: 2/3/04 Statistical Quality Control Dr. Jerrell T. Stracener, SAE Fellow.
Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
26134 Business Statistics Tutorial 12: REVISION THRESHOLD CONCEPT 5 (TH5): Theoretical foundation of statistical inference:
G. Cowan, RHUL Physics Discussion on significance page 1 Discussion on significance ATLAS Statistics Forum CERN/Phone, 2 December, 2009 Glen Cowan Physics.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
© Copyright McGraw-Hill 2004
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
1 Introduction to Statistics − Day 3 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Brief catalogue of probability densities.
ES 07 These slides can be found at optimized for Windows)
G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 1 Lecture 4 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.
G. Cowan Computing and Statistical Data Analysis / Stat 9 1 Computing and Statistical Data Analysis Stat 9: Parameter Estimation, Limits London Postgraduate.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Statistical Significance Hypothesis Testing.
1 Section 8.2 Basics of Hypothesis Testing Objective For a population parameter (p, µ, σ) we wish to test whether a predicted value is close to the actual.
In Bayesian theory, a test statistics can be defined by taking the ratio of the Bayes factors for the two hypotheses: The ratio measures the probability.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
R. Kass/W03 P416 Lecture 5 l Suppose we are trying to measure the true value of some quantity (x T ). u We make repeated measurements of this quantity.
Chapter 6 Sampling and Sampling Distributions
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
1 Multivariate Histogram Comparison S.I. BITYUKOV, V.V. SMIRNOVA (IHEP, Protvino), N.V. KRASNIKOV (INR RAS, Moscow) IML LHC Machine Learning WG Meeting.
Discussion on significance
Virtual University of Pakistan
arXiv:physics/ v3 [physics.data-an]
LECTURE 33: STATISTICAL SIGNIFICANCE AND CONFIDENCE (CONT.)
The Maximum Likelihood Method
Lecture 4 1 Probability (90 min.)
Chapter 9 Hypothesis Testing.
Unfolding Problem: A Machine Learning Approach
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
Warmup To check the accuracy of a scale, a weight is weighed repeatedly. The scale readings are normally distributed with a standard deviation of
Lecture 4 1 Probability Definition, Bayes’ theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests general.
Confidence Intervals.
Introduction to Statistics − Day 4
Lecture 4 1 Probability Definition, Bayes’ theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests general.
Unfolding with system identification
Modeling with the normal distribution
Presentation transcript:

Program for evaluation of the significance, confidence intervals and limits by direct probabilities calculations S.Bityukov (IHEP,Protvino), S.Erofeeva(MSA IECS,Moscow), N.Krasnikov(INR RAS, Moscow), A.Nikitenko(IC, London) September, 2005 PhyStat 2005 Oxford, UK S.Bityukov

During planning or processing of experiment we often consider a statistical hypothesis H 0 : new physics is present in Nature against hypothesis H 1 : new physics is absent in Nature. The value of uncertainty in our conclusion is defined by the probabilities  = P(reject H 0 | H 0 is true) - Type I error and b = P(accept H 0 | H 0 is false) - Type II error There are many definitions of significance as a measure of excess of signal events above background. Many approaches exist also to methods of construction of intervals and limits: confidence, tolerant, fiducial and so on. During one of the CMS meetings Gunter Quast formulated the problem of practicians “the only remaining problem: make a choice … chosen method should be “as simple as possible, but not wrong!” Introduction Introduction

September, 2005 PhyStat 2005 Oxford, UK S.Bityukov 1.Gaussian limit gives the wrong answer for low value of β (tail of Poisson distribution is heavier than tail of Gaussian) 2. The statistics like S L (a likelihood-ratio-based test statistic) have poor statistical properties as estimator of significance (S L = √2(ln L1-ln L2)= √2(ln Q), where Q is the ratio of binned/unbinned likelihood fits for hypotheses H 0 and H 1 H 0 : signal present and H 1 : no signal present) The simplest significance is the significance S_cP described at the next slide. The S_cP is quite natural significance, which allows to take into account any uncertainties by direct calculation of probabilities. Motivation of the work (significance) Motivation of the work (significance)

Definition of the significance S_cP Definition of the significance S_cP September, 2005 PhyStat 2005 Oxford, UK S.Bityukov S_cP - probability from Poisson distribution with mean μ_b to observe equal or greater than Nobs events, converted to equivalent number of sigmas of a Gaussian distribution (see report (page 8) by G. Quast in CMS Physics analysis days, May 9-12, 2005, CERN also, see I.Narsky, NIM A450(2000)444). The presented program ScP allows to calculate this significance with taking into account experimental systematics with statistical properties (Gaussian approximation) and theoretical systematics without any statistical properties. Also, the program calculates (if option is on) the combining significance of several channels. As is assumed all channels are independent.

September, 2005 PhyStat 2005 Oxford, UK S.Bityukov Conception Conception Conception. The probability of making a Type II error (β) in hypotheses test about presence of signal in experiment (H0) is used for determination number of sigmas (of background distribution) between expected background and observed number of events Nobs (formula 8 in CMS CR 2002/05 ).CMS CR 2002/05 This probability is used for determination of signal significance, i.e. the significance S_cP will be found under resolving of equations, where It can be used in combining of results. Let us consider two possible approaches.

Combining of observed results: Combining of observed results: Approach 1 September, 2005 PhyStat 2005 Oxford, UK S.Bityukov Approach 1. Suppose that observed value is greater than expected background Let β_1 be Type II error for channel 1 (event A = background  Nobs_1 with P(A)= β_1) and β_2 be Type II error for channel 2 (event B = background  Nobs_2 with P(B)= β_2). Because event A is independent from event B then probability of simultaneous appearance of A and B equals β_12 = P(AB) = P(A)*P(B) = β_1* β_2. After determination of β_12 one can calculate the S_cP.

Combining of expected signals & backgrounds: Combining of expected signals & backgrounds: Approach 2 September, 2005 PhyStat 2005 Oxford, UK S.Bityukov Approach 2. Suppose that Nobs is expected sum of expected signal (μ_s) and expected background (μ_b), i.e. Nobs = μ_s+μ_b (the case of planned experiment). Then the sums of expected numbers of signal (μ_s_i ) and background (μ_b_i ) events in each channel are used as summary μ_s and μ_b for calculation of combined significance. Note that we take into account in this case as fluctuation of expected background and fluctuation of expected signal. After determination of corresponding β one can calculate the S_cP.

Uncertainties Uncertainties September, 2005 PhyStat 2005 Oxford, UK S.Bityukov The program takes into account two types of uncertainties: a)experimental systematics with statistical properties (we assume that this systematics has Gaussian distribution with known variance σ_b**2 in according with formula μ_b = expected background + N(0,σ _b)). Appr.2: In the case of the combining of channels the summary variance σ_b**2 is the sum of partial variance σ_b_i**2. b) theoretical systematics (δ_b) without any statistical properties (we assume: the worst case takes place when the background is maximal, i.e. μ_b*(1+ δ_b), but we take the signal plus the background as Nobs; more information can be found in S.Bityukov, N.Krasnikov, CMS CR 2002/05 orS.Bityukov, N.Krasnikov, CMS CR 2002/05 S.Bityukov, N.Krasnikov, Mod.Phys.Lett.A 13 (1998)3235) Appr.2: The combining δ_b is the sum of partial δ_b_i.

Main input and output parameters Main input and output parameters September, 2005 PhyStat 2005 Oxford, UK S.Bityukov Main input parameters: 1. expected background – μ_b 2. signal = observed value (Nobs) - expected background (μ_b) – μ_s 3. experimental uncertainty (r.m.s.) of background with statistical properties – σ_b 4. systematics of theoretical origin in background – δ_b Output parameters: 1. significance S_cP, calculated by formula - dsgnf 2. significance S_cP_MC, calculated by Monte Carlo - dsgnfm

Auxiliary input parameters Auxiliary input parameters September, 2005 PhyStat 2005 Oxford, UK S.Bityukov 1. switch for choosing of type calculations - iflag iflag = 1 calculations by formula (quick calculations) iflag = 2 Monte Carlo calculations iflag = 12 calculations by formula and by Monte Carlo 2. number of channels for calculations - nchan (from 1 up to 10) 3. number of channels for combined S_cP - ncombi (from 1 up to nchan) 4. parameter for Monte Carlo calculation - over over - parameter for Monte Carlo calculations. It is a number of Monte Carlo trials which will give value of number events over or equal Nobs. This parameter (and internal value dbeta) determines the number of trials for given μ_s, μ_b, σ_b and δ_b in routine SCPMC.

The structure of program The structure of program September, 2005 PhyStat 2005 Oxford, UK S.Bityukov Language: Fortran 77 iflag Three different types 1. SCPFOR - calculations by formula of calculations: 2. SCPMC - Monte Carlo calculations 12. SCPFOR + SCPMC Main program processes the user requirements (defined in operators DATA) and calls routines SCPFOR and/or SCPMC.

Problem and approximation Problem and approximation September, 2005 PhyStat 2005 Oxford, UK S.Bityukov The problem which takes place during calculations is the restricted range of applicability of standard procedure DGAUSN in CERNLIB. For values of S_cP>6.2-7 the procedure gives non correct result. In this case we use as a good approximation the significance (MPL A13 (1998)3235) S_c12 = 2 (  (μ_s+μ_b) -  μ_b). The account of the uncertainties is very simple: theoretical systematics (δ_b) S_c12t = 2 (  (μ_s+μ_b) -  (μ_b(1+δ_b)). experimental systematics ( σ_b**2 )  μ_b S_c12e = 2 (  (μ_s+μ_b) -  μ_b)  (μ_b+σ_b**2)

Simplest example of program S_cP output Simplest example of program S_cP output September, 2005 PhyStat 2005 Oxford, UK S.Bityukov Example of G.Quast: bkg=2 sig=5.4. Here S 1 =3.8 S 12 =2.6 S L =2.7 Significance S_cP and/or S_cP_MC: NN of channels = 1, Combining channels from 1 up to 1 calculation type = 12 types: (1) S_cP by formula and (2) S_cP_MC by Monte Carlo σ_b-experimental uncertainty, i.e. μ_b = background + N(0, σ_b) δ_b - systematics of theoretical origin without statistical properties #ch backgr. signal σ_b δ_b S_cP S_cP_MC S_c E

Program output with combining of channels Program output with combining of channels September, 2005 PhyStat 2005 Oxford, UK S.Bityukov NN of channels = 2, Combining channels from 1 up to 2 #ch backgr. signal σ_b δ_b S_cP S_cP_MC S_c E COMBINING of OBSERVED RESULTS Combined channels(1-2) without errors Combined channels(1-2) with stat. errors Combined channels(1-2) both types of err COMBINING for EXPECTED SIGNAL and BACKGROUND Sum

Range of applicability of the program ScP Range of applicability of the program ScP September, 2005 PhyStat 2005 Oxford, UK S.Bityukov NN of channels = 3, Combining channels from 1 up to 1 calculation type = 12 #ch backgr. signal σ_b δ_b S_cP S_cP_MC S_c

Motivation of the work (confidence intervals) Motivation of the work (confidence intervals) September, 2005 PhyStat 2005 Oxford, UK S.Bityukov Suppose f(n ; m ) describes the Poisson distribution of probabilities and g( m ;n) is the density of Gamma-distribution G 1,1+n then (Eq.1) where and n is the observed number of casual events appearing in Poisson flow for certain period of time. This identity shows that in our case the distribution of the probability of a true value of Poisson distribution parameter (the confidence density) for observed value n is the Gamma- distribution with mode n and mean value n+1, i.e. observed value n corresponds to the most probable value of parameter. The Poisson and Gamma distributions are statistically dual distributions. As shown, we for these distributions can reconstruct only single confidence density.

Program Limsb Program Limsb September, 2005 PhyStat 2005 Oxford, UK S.Bityukov The unique of confidence density allows to construct the confidence intervals by simplest (and correct) way: we reconstruct for observed value n the correspondent confidence density and by direct calculations of probabilities determine the confidence intervals and/or confidence limits. Now the program Limsb constructs the central confidence interval and the confidence interval of minimal length for observed value n. Input: values EPS, CL and array DLAMB. The testing set of observed values is given in data array DLAMB. The value EPS determines the precision of calculations. The value CL determines the confidence level of intervals.

Simplest example of program Limsb output September, 2005 PhyStat 2005 Oxford, UK S.Bityukov Confidence limits: eps, CL = E central and shortes confidence intervals NN ev left bound right bound left tail upper prob. lenght.10000E-01 Central Minimal E E Central Minimal E E Central Minimal Central Minimal Central Minimal

Conclusion Conclusion September, 2005 PhyStat 2005 Oxford, UK S.Bityukov Programs ScP and Limsb can be found in Web page We are ready to include in program Limsb the calculation of the confidence intervals of Poisson distribution parameter for signal events in presence of background (formula O.Helene, which appears in our approach by natural way, see hep-ex/ ). We are grateful to Vladimir Gavrilov, Vassili Katchanov, and Albert De Roeck for the interest and support of this work. We would like to thank Bob Cousins, Vladimir Obraztsov and Claudia Wulz for discussions and useful comments.