Statistics In HEP Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 1 How do we understand/interpret our measurements.

Slides:



Advertisements
Similar presentations
Introduction to Hypothesis Testing
Advertisements

Lecture XXIII.  In general there are two kinds of hypotheses: one concerns the form of the probability distribution (i.e. is the random variable normally.
Probability and Statistics Basic concepts II (from a physicist point of view) Benoit CLEMENT – Université J. Fourier / LPSC
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #22.
Chap 9: Testing Hypotheses & Assessing Goodness of Fit Section 9.1: INTRODUCTION In section 8.2, we fitted a Poisson dist’n to counts. This chapter will.
Bayesian inference Gil McVean, Department of Statistics Monday 17 th November 2008.
Statistics In HEP 2 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 1 How do we understand/interpret our measurements.
8. Statistical tests 8.1 Hypotheses K. Desch – Statistical methods of data analysis SS10 Frequent problem: Decision making based on statistical information.
Hypothesis testing Some general concepts: Null hypothesisH 0 A statement we “wish” to refute Alternative hypotesisH 1 The whole or part of the complement.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan 2011 CERN Summer Student Lectures on Statistics / Lecture 41 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 7 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
The Neymann-Pearson Lemma Suppose that the data x 1, …, x n has joint density function f(x 1, …, x n ;  ) where  is either  1 or  2. Let g(x 1, …,
Statistical Analysis of Systematic Errors and Small Signals Reinhard Schwienhorst University of Minnesota 10/26/99.
Chapter 9 Title and Outline 1 9 Tests of Hypotheses for a Single Sample 9-1 Hypothesis Testing Statistical Hypotheses Tests of Statistical.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
Overview of Statistical Hypothesis Testing: The z-Test
Testing Hypotheses I Lesson 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics n Inferential Statistics.
Overview Definition Hypothesis
Hypothesis Testing – Introduction
Confidence Intervals and Hypothesis Testing - II
1 Dr. Jerrell T. Stracener EMIS 7370 STAT 5340 Probability and Statistics for Scientists and Engineers Department of Engineering Management, Information.
Presented by Mohammad Adil Khan
Fundamentals of Hypothesis Testing: One-Sample Tests
Chapter 8 Introduction to Hypothesis Testing
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Hypothesis Testing (Statistical Significance). Hypothesis Testing Goal: Make statement(s) regarding unknown population parameter values based on sample.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Statistical Decision Theory
Statistical Inference Decision Making (Hypothesis Testing) Decision Making (Hypothesis Testing) A formal method for decision making in the presence of.
G. Cowan 2009 CERN Summer Student Lectures on Statistics1 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability densities,
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Psy B07 Chapter 4Slide 1 SAMPLING DISTRIBUTIONS AND HYPOTHESIS TESTING.
IE241: Introduction to Hypothesis Testing. We said before that estimation of parameters was one of the two major areas of statistics. Now let’s turn to.
Practical Statistics for Particle Physicists Lecture 3 Harrison B. Prosper Florida State University European School of High-Energy Physics Anjou, France.
Bayesian vs. frequentist inference frequentist: 1) Deductive hypothesis testing of Popper--ruling out alternative explanations Falsification: can prove.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Correlation Assume you have two measurements, x and y, on a set of objects, and would like to know if x and y are related. If they are directly related,
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Overview.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
"Classical" Inference. Two simple inference scenarios Question 1: Are we in world A or world B?
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
© Copyright McGraw-Hill 2004
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #24.
G. Cowan Lectures on Statistical Data Analysis Lecture 8 page 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem 2Random variables and.
1 Introduction to Statistics − Day 3 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Brief catalogue of probability densities.
G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 1 Lecture 4 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.
G. Cowan Computing and Statistical Data Analysis / Stat 9 1 Computing and Statistical Data Analysis Stat 9: Parameter Estimation, Limits London Postgraduate.
One Sample Inf-1 In statistical testing, we use deductive reasoning to specify what should happen if the conjecture or null hypothesis is true. A study.
Hypothesis Testing Steps for the Rejection Region Method State H 1 and State H 0 State the Test Statistic and its sampling distribution (normal or t) Determine.
Hypothesis Testing. Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean μ = 120 and variance σ.
G. Cowan Lectures on Statistical Data Analysis Lecture 6 page 1 Statistical Data Analysis: Lecture 6 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan RHUL Physics Statistical Issues for Higgs Search page 1 Statistical Issues for Higgs Search ATLAS Statistics Forum CERN, 16 April, 2007 Glen Cowan.
Helge VossAdvanced Scientific Computing Workshop ETH Multivariate Methods of data analysis Helge Voss Advanced Scientific Computing Workshop ETH.
Lecture 1.31 Criteria for optimal reception of radio signals.
Hypothesis Testing – Introduction
Testing Hypotheses I Lesson 9.
Presentation transcript:

Statistics In HEP Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 1 How do we understand/interpret our measurements

Outline  What is Probability : frequentist / Bayesian review PDFs and some of their properties  Hypothesis testing test statistic power and size of a test error types Neyman-Pearson  What is the best test statistic concept of confidence level/p-value  Maximum Likelihood fit  strict frequentist Neyman – confidence intervals what “bothers” people with them  Feldmans/Cousins confidence belts/intervals  Yes, but what about systematic uncertainties? Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 2

Interpreting your measurement 3 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP  What do we REALLY mean by:  m W = / ;  M Higgs < 114.4GeV/c  these things are results of:  involved measurements  many “assumptions”  correct statistical interpretation:  most ‘honest’ presentation of the result  unless: provide all details/assumptions that went into obtaining the results  needed to correctly “combine” with others (unless we do a fully combined analysis)

What is Probability Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 4 A U niverse A B A∩B Venn-Diagram B A U niverse Venn-Diagram

What is Probability Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 5 1)a measure of how likely it is that some event will occur; a number expressing the ratio of favorable – to – all cases 2)the quality of being probable; a probable event or the most probable event (WordNet® Princeton)  Frequentist probability  Axioms of probability:  pure “set-theory”

Frequentist vs. Bayesian 6 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP Bayes’ Theorem  This follows simply from the “conditional probabilities”:

Frequentist vs. Bayesian Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 7  Some people like to spend hours talking about this… Bayes’ Theorem B.t.w.: Nobody doubts Bayes’ Theorem: discussion starts ONLY if it is used to turn frequentist statements: into Bayesian probability statements:

Frequentist vs. Bayesian 8 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP  Certainly: both have their “right-to-exist”  Some “probably” reasonable and interesting questions cannot even be ASKED in a frequentist framework :  “How much do I trust the simulation”  “How likely is it that it is raining tomorrow?”  “How likely is it that the LHC/Tevatron will also be operated next year?”  after all.. the “Bayesian” answer sounds much more like what you really want to know: i.e. “How likely is the “parameter value” to be correct/true ?”  BUT:  NO Bayesian interpretation w/o “prior probability” of the parameter  where do we get that from?  all the actual measurement can provide is “frequentist”!

Probability Distribution/Density of a Random Variable Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 9 random variable x : characteristic quantity of point in sample space discrete variables continuous variables normalisation (It has to be ‘somewhere’)

Cumulative Distribution Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 10 Cumulative PDF distribution: PDF (probability density function) we will come back to this..

Functions of Random Variables Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 11 Glen Cowan: Statistical data analysis

Conditioning and Marginalisation Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 12 ↔ consider some variable in the joint PDF(x,y) as constant (given): Glen Cowan: Statistical data analysis  marginalisation: If you are not interested in the dependence on “x”  project onto “y” (integrate “x out”)  conditional probability:

Hypothesis Testing Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 13

Why talking about “NULL Hypothesis” 14 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP  Statistical tests are most often formulated using a  “null”-hypothesis and its  “alternative”-hypotheisis  Why?  it is much easier to “exclude” something rather than to prove that something is true.  excluding: I need only ONE detail that clearly contradicts  assume you search for the “unknown” new physics. “null”-hypothesis :Standard Model (background) only “alternative”: everything else

Hypothesis Testing Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 15 y  n “feature space” y ( B )  0, y ( S )  1 > cut: signal = cut: decision boundary < cut: background y(x): Example: event classification Signal(H 1 ) or Background(H 0 )  choose cut value: i.e. a region where you “reject” the null- (background-) hypothesis (“size” of the region based on signal purity or efficiency needs)  You are bound to making the wrong decision, too…

Hypothesis Testing Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 16 Trying to select signal events: (i.e. try to disprove the null-hypothesis stating it were “only” a background event) Type-2 error: (false negative)  reject as background although it is signal Signal Back- ground Signal Type-2 error Back- ground Type-1 error accept as: truly is: Type-1 error: (false positive)  accept as signal although it is background

Hypothesis Testing Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 17 Try to exclude the null- hypothesis (as being unlikely to be at the basis of the observation): Type-2 error: (false negative) fail to reject the null-hypothesis/accept null hypothesis although it is false  reject alternative hypothesis although it would have been the correct/true one H1H1 H0H0 H1H1 Type-2 error H0H0 Type-1 error accept as: truly is: Type-1 error: (false positive) reject the null-hypothesis although it would have been the correct one  accept alternative hypothesis although it is false Significance α: Type-1 error rate: Size β: Type-2 error rate: Power: 1  β “C”: “critical” region: if data fall in there  REJECT the null-hypothesis should be small

Type 1 versus Type 2 Errors Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 18 y(x) y ( B )  0, y ( S )  1 Signal(H 1 ) /Background(H 0 ) discrimination:  backgr.  signal “limit” in ROC curve given by likelihood ratio which one of those two is the better?? y’(x) y’’(x) Type-1 error small Type-2 error large Type-1 error large Type-2 error small Signal(H 1 ) /Background(H 0 ) :  Type 1 error: reject H 0 although true  background contamination  Significance α: background sel. efficiency 1  : background rejection  Type 2 error: accept H 0 although false  loss of efficiency  Power: 1  β signal selection efficiency

Neyman Pearson Lemma 19 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP .  “limit” in ROC curve given by likelihood ratio Type-1 error small Type-2 error large Type-1 error large Type-2 error small Neyman-Peason: The Likelihood ratio used as “selection criterium” y(x) gives for each selection efficiency the best possible background rejection. i.e. it maximises the area under the “Receiver Operation Characteristics” (ROC) curve

Neyman Pearson Lemma 20 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP .  “limit” in ROC curve given by likelihood ratio Type-1 error small Type-2 error large Type-1 error large Type-2 error small

Neyman Pearson Lemma 21 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP Note: it is probably better to plot 1/α rather than 1- α as: α  α/2 means “HALF the background”

Neyman Pearson Lemma 22 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP Kyle Cranmer

Neyman Pearson Lemma 23 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP assume we want to modify/find another “critical” region with same size (α) i.e. same probability under H 0 Kyle Cranmer

Neyman Pearson Lemma 24 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP … as size (α) is fixed Kyle Cranmer

Neyman Pearson Lemma 25 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP outside “critical region” given by LL-ratio Kyle Cranmer

Neyman Pearson Lemma 26 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP outside “critical region” given by LL-ratio inside “critical region” given by LL-ratio Kyle Cranmer

Neyman Pearson Lemma 27 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP Kyle Cranmer

Neyman Pearson Lemma 28 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP The NEW “acceptance” region has less power! (i.e. probability under H 1 ) q.e.d Kyle Cranmer “critical” region (reject H 0 ) “acceptance” region (accept H 0 (reject H 1 )

Neyman Pearson Lemma Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 29

P-Value 30 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP translate to the common “sigma”  how many standard deviations “Z” for same p-value on one sided Gaussian  5σ = p-value of 2.87∙10 -7 Note:  p-value is property of the actual measurement  p-value is NOT a measure of how probably the hypothesis is

Distribution of P-Values Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP

Summary Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 32