Decision making as a model 2. Statistics and decision making.

Slides:



Advertisements
Similar presentations
Lecture XXIII.  In general there are two kinds of hypotheses: one concerns the form of the probability distribution (i.e. is the random variable normally.
Advertisements

CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Is it statistically significant?
Decision making as a model 3. Heavy stuff: derivation of two important theorems.
AP Statistics – Chapter 9 Test Review
Fundamentals of Forensic DNA Typing Slides prepared by John M. Butler June 2009 Appendix 3 Probability and Statistics.
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
Empirical Analysis Doing and interpreting empirical work.
Chapter Seventeen HYPOTHESIS TESTING
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Introduction to Hypothesis Testing CJ 526 Statistical Analysis in Criminal Justice.
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
Introduction to Hypothesis Testing CJ 526 Statistical Analysis in Criminal Justice.
PSY 307 – Statistics for the Behavioral Sciences
Today Concepts underlying inferential statistics
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Descriptive Statistics
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 11 Introduction to Hypothesis Testing.
Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.
Overview of Statistical Hypothesis Testing: The z-Test
Testing Hypotheses I Lesson 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics n Inferential Statistics.
Overview Definition Hypothesis
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Chapter 8 Hypothesis Testing (假设检验)
Fundamentals of Hypothesis Testing: One-Sample Tests
Tests of significance & hypothesis testing Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Week 8 Fundamentals of Hypothesis Testing: One-Sample Tests
Chapter 4 Introduction to Hypothesis Testing Introduction to Hypothesis Testing.
Chapter 8 Introduction to Hypothesis Testing
Hypothesis testing Chapter 9. Introduction to Statistical Tests.
STA Statistical Inference
Psy B07 Chapter 4Slide 1 SAMPLING DISTRIBUTIONS AND HYPOTHESIS TESTING.
1 Lecture 19: Hypothesis Tests Devore, Ch Topics I.Statistical Hypotheses (pl!) –Null and Alternative Hypotheses –Testing statistics and rejection.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. CHAPTER 12 Significance Tests About Hypotheses TESTING HYPOTHESES ABOUT PROPORTIONS.
Lecture 16 Dustin Lueker.  Charlie claims that the average commute of his coworkers is 15 miles. Stu believes it is greater than that so he decides to.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
IE241: Introduction to Hypothesis Testing. We said before that estimation of parameters was one of the two major areas of statistics. Now let’s turn to.
Errors & Power. 2 Results of Significant Test 1. P-value < alpha 2. P-value > alpha Reject H o & conclude H a in context Fail to reject H o & cannot conclude.
Statistics In HEP Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 1 How do we understand/interpret our measurements.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Unit 8 Section 8-1 & : Steps in Hypothesis Testing- Traditional Method  Hypothesis Testing – a decision making process for evaluating a claim.
Correlation Assume you have two measurements, x and y, on a set of objects, and would like to know if x and y are related. If they are directly related,
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Rejecting Chance – Testing Hypotheses in Research Thought Questions 1. Want to test a claim about the proportion of a population who have a certain trait.
"Classical" Inference. Two simple inference scenarios Question 1: Are we in world A or world B?
9.3/9.4 Hypothesis tests concerning a population mean when  is known- Goals Be able to state the test statistic. Be able to define, interpret and calculate.
© Copyright McGraw-Hill 2004
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
Chapter 13 Understanding research results: statistical inference.
SIGNAL DETECTION THEORY  A situation is described in terms of two states of the world: a signal is present ("Signal") a signal is absent ("Noise")  You.
Example The strength of concrete depends, to some extent on the method used for drying it. Two different drying methods were tested independently on specimens.
HYPOTHESIS TESTING E. Çiğdem Kaspar, Ph.D, Assist. Prof. Yeditepe University, Faculty of Medicine Biostatistics.
Hypothesis Testing.  Hypothesis is a claim or statement about a property of a population.  Hypothesis Testing is to test the claim or statement  Example.
Lecture 1.31 Criteria for optimal reception of radio signals.
Logic of Hypothesis Testing
FINAL EXAMINATION STUDY MATERIAL III
Dr.MUSTAQUE AHMED MBBS,MD(COMMUNITY MEDICINE), FELLOWSHIP IN HIV/AIDS
Hypothesis Testing: Hypotheses
CONCEPTS OF HYPOTHESIS TESTING
Statistical Tests - Power
P-value Approach for Test Conclusion
Chapter 9: Hypothesis Tests Based on a Single Sample
Introduction to Inductive Statistics
Testing Hypotheses I Lesson 9.
STA 291 Spring 2008 Lecture 17 Dustin Lueker.
Presentation transcript:

Decision making as a model 2. Statistics and decision making

Bayesian statistics: p(H|D)  p(H) ∙p(D|H) If H refers to possible values of θ: pdf(θ|D)  pdf(θ) ∙L (θ|D)NB: L: Likelihood function! From about 1925 Bayesian approach in inductive statistics was marginalised (now a come back)

In “classial ” statistics frequentist interpretation of probability is preferred Hypotheses are TRUE or FALSE (we don’t know which for certain - not a matter of probability) and are accepted of rejected based on data D and likelihood p(D|H) e.g. test of significance

Statistic S compute pdf(S|H 0 ) (for sample of n) If p is small, reject H 0, you could accept some alternative SxSx p Fisher Null hypothesis about some population parameter do experiment (  S x, p) probability density

Neyman & Pearson pdf(S|H 0 )pdf(S|H 1 ) do experiment, compute S x and choose between H 0 and H 1 Statistic S Specify H 0,H 1 and their pdf’s. Decide on a criterion based on β p(type II error) and α p(type I error)

Neyman & Pearson more suitable for decision making than for science! For completeness: Likelihood approach without priors: Fisher, Royall p(H|D)  p(H)∙p(D|H) Irrespective of p(H): how strong is D’s support for H ? Example: model selection: AkaikeAIC = -log(L) + k BIC = -log(L) + k log(n)/2

Military technology (WW2): Signal Detection Theory Application of Application of Neyman-Pearson to processing sonar or radar signals on noisy background

Hypothesis 0: there is no signal, only noise Hypothesis 1: there is a signal and noise NB.1 On the basis of some “evidence” I have to act, although I do not know which H is true! NB.2This is typically a “classic” approach, but at the end Bayes will creep in by the back door!

“Evidence”, e.g.…..???? 1.Effect (= a value of “Evidence”) of signal is variable (according to a probability distribution). 2. Effect of Noise is also variable. Probability density Problem: is this “Evidence” (= a point on x-axis) the effect of a signal (+ noise) or of noise only? fundamental assumptions of signal detection theory

3. If signal is weak, distributions overlap and errors are unavoidable, whichever criterion is adopted “No” “Yes”

Signal (+noise) (only) noise miss hit correct rejection false alarm Terminology:

The stronger the signal (or the better the detector) … the further the distributions lie apart

“No” “Yes” Given some sensitivity (= a distribution for noise and one for signal) several response criteria can be adopted Dependent on van personal preference or “pay off” in this situation: -How bad is a miss, how important is a hit? -How bad is a false alarm, how important is a correct rejection? -Hoe often do signals occur? (think of Bayes!)

Two types of applications: 1.Normative: distributions are known, try to find optimal criterion (for optimal behavior) -Is that a hostile plane? -Does this mammogram indicate a malignancy? -Is there a weapon in this suitcase? -Can we admit this student to this school? -What is the best cut-off score for this test?

Two types of application: 2.Descriptive: Behavior is known, try to reconstruct distributions and criterion as a rational model How good is this person in detecting a v among u’s? Is this person inclined to say “yes” in a recognition test? How well judges or juries are able to distinguish between the guilty and the innocent? Do judges and lay juries differ in their bias for convicting or acquitting? How good is this test?.

Hit rate = Proportion hits (of signal trials) False Alarm Rate = Proportion false alarms (of noise trials) “No” “Yes” An experiment with noise (blank) and and signal (target) trials: A strict (“high”) criterion results in few hits and few false alarms

false alarms hits “No” “Yes” A lax “low” criterion results in more hits and more false alarms -given the same sensitivity

connects points in a Hit/FA- plot, resulting from adopting several criteria given the same sensitivity (= same distributions) ROC-curve characterises detector sensitivity (or signal strength) independent of criterion important: sensitivity and criterion theoretically independent The ROC-(response operating characteristic) curve

ROC-curve Receiver Operating Characteristic Relative Operating Characteristic Isosensitivity Curve false alarms hits Same sensitivity (for this signal), several criteria

Greater sensitivity: ROC-curve further from diagonal false alarms hits (Perfection would be: all hits and no false alarms)

Suggests two types of measure for sensitivity (independent of criterion:) 2.Area under ROC-Curve: A 1.distance between signal and noise distributions (e.g. d ' )

No distinction between signal and noise: A =.50 (ROC-curve reflects only bias for saying “yes” or “no”)

Perfect distinction between signal and noise: A  1.

Types of measures for criterion: 2. Likelihood ratio p(x c |S)/p(x c |N) = h/f (e.g. β) h f 1. Position on op x-axis (e.g. c) 3. Position in ROC-plot (left down. vs right up) 4. Slope of tangent on ROC c

Signal Detection Theory is applied in many contexts! Breast cancer?

PSA-indices for screening prostate cancer FA rate Hit rate

Psychodiagnosis: 1.How good is this test distinguishing relevant categories? 2.What is good cut-off score (at which score should I hire the candidate/admit the student / send the cliënt to a psychiatrist or an asylum? Control group patients Test score

Comer & Kendall 2005: Children’s Depression Inventory detects depression in a sample of anxious and anxious + depressive children Several cut-off scores

What are the costs missing a weapon/explosive at an airport? What are the costs a false alarm? What are the costs of screening (apparatus, personnel, delay)?