Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University.

Slides:



Advertisements
Similar presentations
Practical Statistics for LHC Physicists Bayesian Inference Harrison B. Prosper Florida State University CERN Academic Training Lectures 9 April, 2015.
Advertisements

Chapter 10: Estimating with Confidence
Statistics review of basic probability and statistics.
1 LIMITS Why limits? Methods for upper limits Desirable properties Dealing with systematics Feldman-Cousins Recommendations.
Sampling Distributions (§ )
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Introduction to Statistics: Chapter 8 Estimation.
Credibility of confidence intervals Dean Karlen / Carleton University Advanced Statistical Techniques in Particle Physics Durham, March 2002.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
Chapter 7 Estimation: Single Population
The Sampling Distribution of the Sample Mean AGAIN – with a new angle.
Chapter 10: Estimating with Confidence
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics, A First Course.
Statistical Analysis of Systematic Errors and Small Signals Reinhard Schwienhorst University of Minnesota 10/26/99.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Confidence Interval Estimation
The Standardized Normal Distribution Z is N( 0, 1 2 ) The standardized normal X is N( ,  2 ) 1.For comparison of several different normal distributions.
CHAPTER 8 Estimating with Confidence
1 Probability and Statistics  What is probability?  What is statistics?
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Statistics for Data Miners: Part I (continued) S.T. Balke.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
LECTURER PROF.Dr. DEMIR BAYKA AUTOMOTIVE ENGINEERING LABORATORY I.
Practical Statistics for Particle Physicists Lecture 3 Harrison B. Prosper Florida State University European School of High-Energy Physics Anjou, France.
Statistics In HEP Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 1 How do we understand/interpret our measurements.
Introduction  Populations are described by their probability distributions and parameters. For quantitative populations, the location and shape are described.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Section 10.1 Confidence Intervals
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
LECTURE 25 THURSDAY, 19 NOVEMBER STA291 Fall
+ DO NOW. + Chapter 8 Estimating with Confidence 8.1Confidence Intervals: The Basics 8.2Estimating a Population Proportion 8.3Estimating a Population.
Chapter 8: Confidence Intervals based on a Single Sample
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
11 Confidence Intervals – Introduction A point estimate provides no information about the precision and reliability of estimation. For example, the sample.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
BAYES and FREQUENTISM: The Return of an Old Controversy 1 Louis Lyons Imperial College and Oxford University CERN Summer Students July 2014.
Experience from Searches at the Tevatron Harrison B. Prosper Florida State University 18 January, 2011 PHYSTAT 2011 CERN.
Practical Statistics for Particle Physicists Lecture 2 Harrison B. Prosper Florida State University European School of High-Energy Physics Parádfürdő,
Confidence Intervals – Introduction
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #24.
G. Cowan Lectures on Statistical Data Analysis Lecture 8 page 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 1 Lecture 4 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.
G. Cowan Computing and Statistical Data Analysis / Stat 9 1 Computing and Statistical Data Analysis Stat 9: Parameter Estimation, Limits London Postgraduate.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Estimation by Intervals Confidence Interval. Suppose we wanted to estimate the proportion of blue candies in a VERY large bowl. We could take a sample.
In Bayesian theory, a test statistics can be defined by taking the ratio of the Bayes factors for the two hypotheses: The ratio measures the probability.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics: A First Course 5 th Edition.
LECTURE 26 TUESDAY, 24 NOVEMBER STA291 Fall
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Ivo van Vulpen Why does RooFit put asymmetric errors on data points ? 10 slides on a ‘too easy’ topic that I hope confuse, but do not irritate you
Practical Statistics for LHC Physicists Frequentist Inference Harrison B. Prosper Florida State University CERN Academic Training Lectures 8 April, 2015.
Confidence Intervals Lecture 2 First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University.
ESTIMATION.
Confidence Interval Estimation
BAYES and FREQUENTISM: The Return of an Old Controversy
Confidence Intervals and Limits
Lecture 4 1 Probability (90 min.)
Chapter 8: Estimating with Confidence
Confidence Interval Estimation
Counting Statistics and Error Prediction
Lecture 4 1 Probability Definition, Bayes’ theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests general.
Chapter 8: Estimating with Confidence
Sampling Distributions (§ )
Lecture 4 1 Probability Definition, Bayes’ theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests general.
Statistics for HEP Roger Barlow Manchester University
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Presentation transcript:

Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper2 Outline Lecture 1 Lecture 1 –Introduction –Confidence Intervals - Frequency Interpretation –Poisson Example –Summary Lecture 2 Lecture 2 –Deductive and Inductive Reasoning –Confidence Intervals - Bayesian Interpretation –Poisson Example –Summary

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper3 Introduction We physicists often talk about calculating “errors”, but what we really mean, of course, is We physicists often talk about calculating “errors”, but what we really mean, of course, is our –quantifying our uncertainty A measurement is not uncertain, but it has an error ε about which we are uncertain! A measurement is not uncertain, but it has an error ε about which we are uncertain!

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper4 Introduction - i One way to quantify uncertainty is the standard deviation or, even better, the root mean square deviation of the distribution of measurements. One way to quantify uncertainty is the standard deviation or, even better, the root mean square deviation of the distribution of measurements. In 1937 Jerzy Neyman invented another measure of uncertainty called a confidence interval. In 1937 Jerzy Neyman invented another measure of uncertainty called a confidence interval.

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper5 Introduction - ii Consider the following questions Consider the following questions –What is the mass of the top quark? –What is the mass of the tau neutrino? –What is the mass of the Higgs boson? Here are possible answers Here are possible answers –m t = ± 5.1 GeV –m ν < 18.2 MeV –m H > GeV

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper6 Introduction – iii These answers are unsatisfactory These answers are unsatisfactory confidence –because they do not specify how much confidence we should place in them. Here are better answers Here are better answers –m t = ± 5.1 GeV, with CL = –m ν < 18.2 MeV,with CL = –m H > GeV, with CL = Confidence Level –m H > GeV, with CL = CL = Confidence Level

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper7 Introduction - iv Note that the statements Note that the statements –m t = ± 5.1 GeV, CL = –m ν < 18.2 MeV,CL = –m H > GeV, CL = are just an asymmetric way of writing –m t lies in[169.2, 179.4]CL =0.683 –m t lies in [169.2, 179.4] GeV, CL = –m ν lies in[0, 18.2]CL =0.950 –m ν lies in [0, 18.2] MeV, CL = –m H lies in[114.3, ∞)CL =0.950 –m H lies in [114.3, ∞) GeV, CL = 0.950

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper8 Introduction - v The goal of these lectures is to explain the precise meaning of statements of the form The goal of these lectures is to explain the precise meaning of statements of the form θLUβ θ lies in [L, U], with CL = β L = lower limitL = lower limit U = upper limitU = upper limit For example For example m t m t lies in [169.2, 179.4] GeV, with CL = 0.683

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper9 What is a Confidence Level? A confidence level is a probability that quantifies in some way the reliability of a given statement A confidence level is a probability that quantifies in some way the reliability of a given statement But, what exactly is probability? But, what exactly is probability? –Bayesiandegree of beliefplausibility of –Bayesian: The degree of belief in, or plausibility of, a statement (Bayes, Laplace, Jeffreys, Jaynes) –Frequentistrelative frequency –Frequentist: The relative frequency with which something happens (Boole, Venn, Fisher, Neyman)

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper10 Probability: An Example Consider the statement Consider the statement –S –S = “It will rain in Morelia on Monday” And the probability assignment And the probability assignment S –Pr{S} = 0.01 Bayesian interpretation Bayesian interpretation plausibility ofS –The plausibility of the statement S is 0.01 Frequentist interpretation Frequentist interpretation relative frequency –The relative frequency with which it rains on Mondays in Morelia is 0.01

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper11 Confidence Level: Interpretation Since probability can be interpreted in (at least) two different ways, the interpretation of statements such as Since probability can be interpreted in (at least) two different ways, the interpretation of statements such as –m t –m t lies in [169.2, 179.4] GeV, with CL = depends on which interpretation of probability is being used. A great deal of confusion arises in our field because of our tendency to forget this fact A great deal of confusion arises in our field because of our tendency to forget this fact

Confidence Intervals Frequency Interpretation

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper13 Confidence Intervals The basic idea The basic idea set of ensembles θ –Imagine a set of ensembles of experiments, each member of which is associated with a fixed value of the parameter to be measured θ (for example, the top quark mass). E [l(E), u(E)] θ –Each experiment E, within an ensemble, yields an interval [l(E), u(E)], which either contains or does not contain θ.

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper14 Coverage Probability θ coverage probability –For a given ensemble, the fraction of experiments with intervals containing the θ value associated with that ensemble is called the coverage probability of the ensemble. –In general, the coverage probability will vary from one ensemble to another.

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper15 Example E1E1E1E1 E3E3 E5E5E5E5 E4E4 E2E2 E1E1E1E1 E3E3 E5E5E5E5 E4E4E4E4 E2E2E2E2 E1E1E1E1 E3E3 E5E5 E4E4E4E4 E2E2E2E2 Pr = 0.4 Ensemble with θ = θ 1 with Pr = 0.4 Pr = 0.8 Ensemble with θ = θ 2 with Pr = 0.8 Pr = 0.6 Ensemble with θ = θ 3 with Pr = 0.6

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper16 θ [l(E), u(E)] θ –If our experiment is selected at random from the ensemble to which it belongs (presumably the one associated with the true value of θ) then the probability that its interval [l(E), u(E)] contains θ is equal to the coverage probability of that ensemble. –The crucial point is this β confidence level –The crucial point is this: We try to construct the set of ensembles so that the coverage probability over the set is never less than some pre-specified value β, called the confidence level. Confidence Level – Frequency Interpretation

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper17 Confidence Level - ii Points to Note Points to Note confidence level set of ensembles –In the frequency interpretation, the confidence level is a property of the set of ensembles; In fact, it is the minimum coverage probability over the set. –Consequently, if the set of ensembles is unspecified or unknown the confidence level is undefined.

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper18 Any Any set of intervals β with a minimum coverage probability equal to β confidence intervals is a set of confidence intervals at 100 β % confidence level confidence level (CL). (Neyman,1937) Confidence intervals are defined not by how they are constructed, but by their frequency properties. Confidence Intervals – Formal Definition EExperiment l(E)Lower limit u(E)Upper limit

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper19 Confidence Intervals: An Example Experiment: Experiment: θ –To measure the mean rate θ of UHECRs above eV per unit solid angle. Assume the probability of N events to be a given by a Poisson distribution Assume the probability of N events to be a given by a Poisson distribution

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper20 Confidence Intervals – Example - ii Goal: Compute a set of intervals Goal: Compute a set of intervals for N = 0, 1, 2, … with CL = for a set of ensembles, each member of which is characterized by a different mean event count θ.

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper21 Why 68.3%? It is just a useful convention! It is just a useful convention! [x- σ, x+σ] 0.683x σ –It comes from the fact that for a Gaussian distribution the confidence intervals given by [x- σ, x+σ] are associated with a set of ensembles whose confidence level is (x = measurement, σ = std. dev.) The main reason for this convention is the Central Limit Theorem The main reason for this convention is the Central Limit Theorem – Most sensible distributions become more and more Gaussian as the data increase.

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper22 Count Parameter space Confidence Interval – General Algorithm )(Nl  For each value θ find an interval in N with probability content of at least β

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper23 Count Parameter space Confidence Interval – General Algorithm Probability to be here is ≥ β Probability to be here is ≥ β )(Nl  θ For each value θ find an interval in N with probability content ≥ β

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper24 Coverage Probability for θ = 10

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper25 Example: Interval in N for θ = 10

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper26 Confidence Intervals – Specific Algorithms Neyman Neyman –Region: fixed probabilities on either side Feldman – Cousins Feldman – Cousins P(n| θ)/ P(n|n) –Region: containing largest likelihood ratios P(n| θ)/ P(n|n) Mode – Centered Mode – Centered P(n| θ) –Region: containing largest probabilities P(n| θ)

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper27 Define Left cumulative distribution function Right cumulative distribution function Valid for both continuous and discrete distributions. Neyman Construction

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper28 Solve where Neyman Construction - ii Remember: Left is UP and Right is LOW!

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper29 Choose and solve for the interval Central Confidence Intervals

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper30 Central Confidence Intervals - ii Poisson Distribution

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper31 Comparison of Confidence Intervals θ Central Feldman-Cousins Mode-Centered N±√N

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper32 Comparison of Confidence Interval Widths Central Feldman-Cousins Mode-Centered N±√N

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper33 Comparison of Coverage Probabilities θ Central Feldman-Cousins Mode-Centered N±√N

November 21, 2002First ICFA Instrumentation School/Workshop Harrison B. Prosper34 Summary The interpretation of confidence intervals and confidence levels depends on which interpretation of probability one is using The interpretation of confidence intervals and confidence levels depends on which interpretation of probability one is using The coverage probability of an ensemble of experiments is the fraction of experiments that produce intervals containing the value of the parameter associated with that ensemble The coverage probability of an ensemble of experiments is the fraction of experiments that produce intervals containing the value of the parameter associated with that ensemble The confidence level is the minimum coverage probability over a set of ensembles. The confidence level is the minimum coverage probability over a set of ensembles. The confidence level is undefined if the set of ensembles is unspecified or unknown The confidence level is undefined if the set of ensembles is unspecified or unknown