Sequential Hypothesis Testing under Stochastic Deadlines Peter Frazier, Angela Yu Princeton University TexPoint fonts used in EMF. Read the TexPoint manual.

Slides:

Advertisements

Similar presentations

Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.

Advertisements

Lecture XXIII.  In general there are two kinds of hypotheses: one concerns the form of the probability distribution (i.e. is the random variable normally.

LECTURE 11: BAYESIAN PARAMETER ESTIMATION

Designing a behavioral experiment

Sampling Distributions (§ )

Quasi-Continuous Decision States in the Leaky Competing Accumulator Model Jay McClelland Stanford University With Joel Lachter, Greg Corrado, and Jim Johnston.

What is Statistical Modeling

Visual Recognition Tutorial

An Experimental Paradigm for Developing Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan March, 2004.

An Experimental Paradigm for Developing Adaptive Treatment Strategies S.A. Murphy Univ. of Michigan UNC: November, 2003.

Statistical Decision Theory, Bayes Classifier

Evaluating Hypotheses

Presenting: Assaf Tzabari

An Experimental Paradigm for Developing Adaptive Treatment Strategies S.A. Murphy Univ. of Michigan February, 2004.

4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.

Inference about a Mean Part II

Inferences About Process Quality

From T. McMillen & P. Holmes, J. Math. Psych. 50: 30-57, MURI Center for Human and Robot Decision Dynamics, Sept 13, Phil Holmes, Jonathan.

Introduction to Regression Analysis, Chapter 13,

Theory of Decision Time Dynamics, with Applications to Memory.

Statistical Hypothesis Testing. Suppose you have a random variable X ( number of vehicle accidents in a year, stock market returns, time between el nino.

Hypothesis Testing – Introduction

Chapter 8 Introduction to Hypothesis Testing

The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.

Statistical Decision Theory

Chapter 8 Introduction to Hypothesis Testing

Theory of Probability Statistics for Business and Economics.

Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.

ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.

1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.

Bayesian Classification. Bayesian Classification: Why? A statistical classifier: performs probabilistic prediction, i.e., predicts class membership probabilities.

Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,

Confidence intervals and hypothesis testing Petter Mostad

Decision Making Theories in Neuroscience Alexander Vostroknutov October 2008.

D ECIDING WHEN TO CUT YOUR LOSSES Matt Cieslak, Tobias Kluth, Maren Stiels & Daniel Wood.

Statistical Inference An introduction. Big picture Use a random sample to learn something about a larger population.

Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.

Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.

Chapter 10: Introduction to Statistical Inference.

Ch15: Decision Theory & Bayesian Inference 15.1: INTRO: We are back to some theoretical statistics: 1.Decision Theory –Make decisions in the presence of.

The Computing Brain: Focus on Decision-Making

Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.

© Copyright McGraw-Hill 2004

Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,

Machine Learning 5. Parametric Methods.

Chapter 8: Introduction to Hypothesis Testing. Hypothesis Testing A hypothesis test is a statistical method that uses sample data to evaluate a hypothesis.

G. Cowan Lectures on Statistical Data Analysis Lecture 9 page 1 Statistical Data Analysis: Lecture 9 1Probability, Bayes’ theorem 2Random variables and.

Hypothesis Testing Steps for the Rejection Region Method State H 1 and State H 0 State the Test Statistic and its sampling distribution (normal or t) Determine.

Statistical NLP: Lecture 4 Mathematical Foundations I: Probability Theory (Ch2)

Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.

Application of Dynamic Programming to Optimal Learning Problems Peter Frazier Warren Powell Savas Dayanik Department of Operations Research and Financial.

Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”

Sequential Off-line Learning with Knowledge Gradients Peter Frazier Warren Powell Savas Dayanik Department of Operations Research and Financial Engineering.

Markov-Chain-Monte-Carlo (MCMC) & The Metropolis-Hastings Algorithm P548: Intro Bayesian Stats with Psych Applications Instructor: John Miyamoto 01/19/2016:

Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,

Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.

Psychology and Neurobiology of Decision-Making under Uncertainty Angela Yu March 11, 2010.

Outline Historical note about Bayes’ rule Bayesian updating for probability density functions –Salary offer estimate Coin trials example Reading material:

Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.

Optimal Decision-Making in Humans & Animals Angela Yu March 05, 2009.

Math 6330: Statistical Consulting Class 11

Sample Mean Distributions

Hypothesis Testing – Introduction

CONCEPTS OF HYPOTHESIS TESTING

Discrete Event Simulation - 4

Dynamical Models of Decision Making Optimality, human performance, and principles of neural information processing Jay McClelland Department of Psychology.

Statistical NLP: Lecture 4

Sampling Distributions (§ )

CS639: Data Management for Data Science

Applied Statistics and Probability for Engineers

Presentation transcript:

Sequential Hypothesis Testing under Stochastic Deadlines Peter Frazier, Angela Yu Princeton University TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAAAAAA

Sequential Hypothesis Testing

under Stochastic Deadlines

Peter Frazier & Angela Yu Princeton University

We consider the sequential hypothesis testing problem and generalize the sequential probability ratio test (SPRT) to the case with stochastic deadlines. This causes reaction times for correct responses to be faster than for errors, as seen in behavioral studies. Summary

Both decreasing the deadline’s mean and increasing its variance causes more response urgency. Results extend to the general case with convex continuation cost.

1. Sequential Probability Ratio Test

Sequential Hypothesis Testing ABABAB wait At each time, the subject decides whether to act (A or B), or collect more information. This requires balancing speed vs. accuracy.

We observe a sequence of i.i.d. samples x 1,x 2,... from some density. The underlying density is unknown, but is known to equal either f 0 or f 1. We begin with a prior belief about whether f 0 or f 1 is the true density, which we update through time based on the samples. We want to maximize accuracy

Let  be the index of the true distribution. Let p 0 be the initial belief, P{  =1}. Let p t := P{  =1 | x 1,...,x t }. Let c be a cost paid per-sample. Let d be a cost paid to violate the deadline (used later) Let  be time-index of the last sample collected. Let  be the guessed hypothesis.

Posterior probabilities may be calculated via Bayes Rule: Time (t) Probability (p t )

Probability of Error Time Delay Penalty The objective function is: where we require that the decisions  and  are “non-anticipative”, that is, whether  <= t is entirely determined by the samples x 1,...,x t, and  is entirely determined by the samples x 1,...,x . Objective Function

Time (t) Probability (p t ) A B  Optimal Policy (SPRT) Wald & Wolfowitz (1948) showed that the optimal policy is to stop as soon as p exits an interval [A,B], and to choose the hypothesis that appears more likely at this time. This policy is called the Sequential Probability Ratio Test or SPRT.

2. Models for Behavior

A classic sequential hypothesis testing task is detecting coherent motion in random dots. One hypothesis is that monkeys and people behave optimally and according to the SPRT.

(Roitman & Shadlen, 2002) Broadly speaking, the model based on the classic SPRT fits experimental behavior well. Accuracy vs. CoherenceReaction Time vs. Coherence There is one caveat, however…

Accuracy Mean RT RT Distributions (Data from Roitman & Shadlen, 2002; analysis from Ditterich, 2007) SPRT fails to predict the difference in response time distributions between correct and error responses. Correct responses are more rapid in experiments. SPRT predicts they should be identically distributed.

3. Generalizing to Stochastic Deadlines

(Data from Roitman & Shadlen, 2002) (Analysis from Ditterich, 2006) Monkeys occasionally abort trials without responding, but it is always better to guess than to abort under the assumed objective function. To explain the discrepancy, we hypothesize a limit on the length of time that monkeys can fixate the target.

Error Penalty Deadline Penalty Hypothesizing a decision deadline D leads to a new objective function: Time Penalty We will assume that D has a non-decreasing failure rate, i.e. P{D=t+1 | D>t} is non-decreasing in t. This assumption is met by deterministic, normal, gamma, and exponential deadlines, and others. Objective Function

The resulting optimal policy is to stop as soon as p t exits a region that narrows with time. Probability (p t ) Generalized SPRT Classic SPRT Deadline Time (t) Optimal Policy

Reaction Time Frequency of Occurrence Correct Responses Error Responses Under this policy, correct responses are generally faster than error responses. Response Times

Influence of the Parameters Deadline Uncertainty Deadline Mean Time Penalty Deadline Penalty Plots of the continuation region C t (blue), and the probability of a correct response P{  =  |  =t} (red). D was gamma distributed, and the default settings were c=.001, d=2, mean(D)=40, std(D)=1. In each plot we varied one while keeping the others fixed.

Theorem : The continuation region at time t for the optimal policy, C t, is either empty or a closed interval, and it shrinks with time (C t+1 µ C t ). Proposition : If P{D<1} = 1 then there exists a T < 1 such that C T = ;. That is, the optimal reaction time is bounded above by T.

Proof Sketch Lemma 1: The continuation cost of the optimal policy, Q(t,p), is concave as a function of p. Lemmas 2 and 3: Wasting a time period incurs an opportunity cost in addition to its immediate cost c. Lemma 4: If we are certain which hypothesis is correct (p=0 or p=1), then the optimal policy is to stop as soon as possible. Its value is: Define Q(t,p t ) to be the conditional loss given p t of continuing once from time t and then behaving optimally.

C t+1 Proof Sketch CtCt p0 1 Expected Loss Q(t+1,p)-c Q(t,p) min(p,1-p)

References 1.Anderson, T W (1960). Ann. Math. Statist. 31: Bogacz, R et al. (2006). Pyschol. Rev. 113: Ditterich, J (2006). Neural Netw. 19(8): Luce, R D (1986). Response Times: Their Role in Inferring Elementary Mental Org. Oxford Univ. Press. 5.Mozer et al (2004). Proc. Twenty Sixth Annual Conference of the Cognitive Science Society Poor, H V (1994). An Introduction to Signal Detection and Estimation. Springer-Verlag. 7.Ratcliff, R & Rouder, J N (1998). Psychol. Sci. 9: Roitman J D, & Shadlen M N (2002). J. Neurosci. 22: Siegmund, D (1985). Sequential Analysis. Springer. 10.Wald, A & Wolfowitz, J (1948). Ann. Math. Statisti. 19: