1 Bayesianism Without Priors, Acts Without Consequences Robert Nau Fuqua School of Business Duke University ISIPTA 2005.

Slides:

Advertisements

Similar presentations

Reasons for (prior) belief in Bayesian epistemology

Advertisements

Economics of Information (ECON3016)

Introduction to Portfolio Selection and Capital Market Theory: Static Analysis

Utility theory U: O-> R (utility maps from outcomes to a real number) represents preferences over outcomes ~ means indifference We need a way to talk about.

6.896: Topics in Algorithmic Game Theory Lecture 20 Yang Cai.

(More on) characterizing dominant-strategy implementation in quasilinear environments (see, e.g., Nisan’s review: Chapter 9 of Algorithmic Game Theory.

CPS Bayesian games and their use in auctions Vincent Conitzer

BASICS OF GAME THEORY. Recap Decision Theory vs. Game Theory Rationality Completeness Transitivity What’s in a game? Players Actions Outcomes Preferences.

Consumption Basics Microeconomia III (Lecture 5) Tratto da Cowell F. (2004), Principles of Microeoconomics.

Overview... Consumption: Basics The setting

Week 11 Review: Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution.

Lecture XXIII.  In general there are two kinds of hypotheses: one concerns the form of the probability distribution (i.e. is the random variable normally.

Negotiation A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor SIUC.

Imperfect commitment Santiago Truffa. Agenda 1.“Authority and communication in Organizations” Wouter Dessein, RES “Contracting for information.

Games What is ‘Game Theory’? There are several tools and techniques used by applied modelers to generate testable hypotheses Modeling techniques widely.

Rational Expectations and the Aggregation of Diverse Information in Laboratory Security Markets Charles R. Plott, Shyam Sunder.

Rational choice: An introduction Political game theory reading group 3/ Carl Henrik Knutsen.

EC941 - Game Theory Prof. Francesco Squintani Lecture 8 1.

Utility Axioms Axiom: something obvious, cannot be proven Utility axioms (rules for clear thinking)

P.V. VISWANATH FOR A FIRST COURSE IN INVESTMENTS.

Judgment and Decision Making in Information Systems Utility Functions, Utility Elicitation, and Risk Attitudes Yuval Shahar, M.D., Ph.D.

Lecture 4 on Individual Optimization Risk Aversion

Rational Learning Leads to Nash Equilibrium Ehud Kalai and Ehud Lehrer Econometrica, Vol. 61 No. 5 (Sep 1993), Presented by Vincent Mak

The Identification Problem Question: Given a set of observed outcomes and covariates, can you estimate a unique set of parameters ? Answer: In general,

Notes – Theory of Choice

Job Market Signaling (Spence model)

Incomplete Contracts Renegotiation, Communications and Theory December 10, 2007.

EC941 - Game Theory Francesco Squintani Lecture 3 1.

Lecture 3: Arrow-Debreu Economy

De Finetti’s ultimate failure Krzysztof Burdzy University of Washington.

1 Intermediate Microeconomics Preferences. 2 Consumer Behavior Budget Set organizes information about possible choices available to a given consumer.

COMP14112: Artificial Intelligence Fundamentals L ecture 3 - Foundations of Probabilistic Reasoning Lecturer: Xiao-Jun Zeng

Risk Neutral Equilibria of Noncooperative Games Robert Nau Duke University April 12, 2013.

Chapter Four: Decision Making Under Uncertainty (Econ 512): Economics of Financial Markets Dr. Reyadh Faras.

The Development of Decision Analysis Jason R. W. Merrick Based on Smith and von Winterfeldt (2004). Decision Analysis in Management Science. Management.

Decision Analysis (cont)

STOCHASTIC DOMINANCE APPROACH TO PORTFOLIO OPTIMIZATION Nesrin Alptekin Anadolu University, TURKEY.

© 2009 Institute of Information Management National Chiao Tung University Lecture Note II-3 Static Games of Incomplete Information Static Bayesian Game.

Rational choice 23/ th reading group seminar on qualitative methods.

ECO290E: Game Theory Lecture 12 Static Games of Incomplete Information.

Chapter 3 Discrete Time and State Models. Discount Functions.

1 The Shape of Incomplete Preferences Robert Nau Duke University Presented at ISIPTA ‘03.

On the smooth ambiguity model and Machina’s reflection example Robert Nau Fuqua School of Business Duke University.

ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.

Making Simple Decisions

Introduction to Matching Theory E. Maskin Jerusalem Summer School in Economic Theory June 2014.

Axioms Let W be statements known to be true in a domain An axiom is a rule presumed to be true An axiomatic set is a collection of axioms Given an axiomatic.

Paul Milgrom and Nancy Stokey Journal of Economic Thoery,1982.

Ellsberg’s paradoxes: Problems for rank- dependent utility explanations Cherng-Horng Lan & Nigel Harvey Department of Psychology University College London.

Scoring Rules, Generalized Entropy, and Utility Maximization Robert Nau Fuqua School of Business Duke University (with Victor Jose and Robert Winkler)

Lecture 3 on Individual Optimization Uncertainty Up until now we have been treating bidders as expected wealth maximizers, and in that way treating their.

Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.

Risk Management with Coherent Measures of Risk IPAM Conference on Financial Mathematics: Risk Management, Modeling and Numerical Methods January 2001.

Decision theory under uncertainty

Axiomatic Theory of Probabilistic Decision Making under Risk Pavlo R. Blavatskyy University of Zurich April 21st, 2007.

6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 22.

1 Optimizing Decisions over the Long-term in the Presence of Uncertain Response Edward Kambour.

Scoring Rules, Generalized Entropy, and Utility Maximization Victor Jose, Robert Nau, & Robert Winkler Fuqua School of Business Duke University Presentation.

EC941 - Game Theory Prof. Francesco Squintani Lecture 6 1.

5.1.Static Games of Incomplete Information

Scoring Rules, Generalized Entropy and Utility Maximization Victor Richmond R. Jose Robert F. Nau Robert L. Winkler The Fuqua School of Business Duke University.

Statistical NLP: Lecture 4 Mathematical Foundations I: Probability Theory (Ch2)

Decision Theory Lecture 4. Decision Theory – the foundation of modern economics Individual decision making – under Certainty Choice functions Revelead.

1 Decision Analysis Without Consequences, Bayesianism Without Probabilities Robert Nau Fuqua School of Business Duke University Presentation for RUD 2005.

Security Markets V Miloslav S Vošvrda Theory of Capital Markets.

CHAPTER 1 FOUNDATIONS OF FINANCE I: EXPECTED UTILITY THEORY

Information Design: A unified Perspective Prior information

Strategic Information Transmission

Risk Chapter 11.

Statistical NLP: Lecture 4

Presentation transcript:

1 Bayesianism Without Priors, Acts Without Consequences Robert Nau Fuqua School of Business Duke University ISIPTA 2005

2 Possible sources of imprecision in subjective (epistemic) probabilities 1.Preferences are incomplete or only partially revealed, so beliefs are represented by convex sets of probabilities. 2.Preferences are complete but violate the independence axiom, so beliefs are represented by non-additive or second-order or otherwise generalized probabilities. 3.Preferences are complete and satisfy independence, but beliefs are entangled with state-dependent utilities, so they are represented by risk neutral probabilities (marginal betting rates for money).

3 Does source #3 matter? Yes! Inseparability of probabilities and utilities is potentially very problematic for game theory (common knowledge of utilities? common prior probabilities??) Also problematic for economic models which try to separate the effects of “information” and “tastes” Also problematic for characterizations of aversion to risk and/or uncertainty that refer to “true” expected values or “constant” acts as benchmarks And it’s also problematic for the measurement of previsions!

4 How are previsions measured? De Finetti’s method: the lower prevision of x is the quantity of money you are willing to pay to receive $x. –But what if utility for money is nonlinear?? Walley/De Cooman’s method: the lower prevision of x is the quantity of utility you are willing pay to receive x utiles. –But how are “utiles” measured, and what if utility is state-dependent? The “utile” problem cannot be finessed away, because...

5 How are utilities measured? The authority for a subjective scale of utility comes from Savage (1954) and Anscombe-Aumann (1963). Their models assume the existence of a set of primitive consequences (“states of the person”). Acts are arbitrary, often-counterfactual, mappings of states to consequences (or objective lotteries thereon). The utility of act x is defined as the number  such that x is indifferent to a lottery that yields the best and worst consequences with objective probabilities  and 1 . This definition depends on the conventional assumption that consequences have state-independent utility, but...

6 State-independence of utility is inherently untestable It is unreasonable to expect that, in all situations, it will be possible to define “consequences” or “prizes” so that their utilities will be a priori state-independent—particularly in situations that people care about. The usual axioms of state-independence (Savage’s P3 or Anscombe-Aumann’s monotonicity) anyway do not rule out the possibility of state-dependent utility scale factors. Aumann’s 1971 letter to Savage: what if the event in question is the survival of your beloved spouse or the debunking of the theory on which your life’s work is based? (or the performance of your retirement portfolio...)

7 This paper: Axiomatizes a simple additive representation of preferences without explicit consequences The only primitives are discrete “moves” of two players— the DM and nature—and monetary gambles Implicitly every combination of a move of the DM, a move of nature, and a monetary payoff, is treated as a distinct consequence Doesn’t yield “true” probabilities—but doesn’t need to! Provides a sufficient foundation for Bayesian methods of decision analysis and game theory based on no-arbitrage and risk neutral probabilities, which are marginal previsions (local betting rates measured in terms of money, not utiles)

8 Modeling framework Let A = {a 1, …, a I } denote a finite set of strategies (alternative courses of action) Let S = {s 1, …, s J } denote a finite set of states. Non-empty subsets of S are events. Let w = {w 1, …, w J }  W   J denote an observable allocation of money over states (i.e., a side-gamble) An act is a strategy-allocation pair (a, w) An outcome is a triple (a, s, w) This is essentially the Arrow-Debreu state-preference framework augmented by a finite set of strategies

9 Preference axioms Axiom 1: (Ordering)  is a weak order, and for every strategy the restricted preference relation over allocations is a continuous weak order. Axiom 2: (Strict monotonicity) w* j > w j  (a, w* j w  j )  (a, w) for all a, j, w, w* j. (More money preferred to less, and no null states)

10 Axiom 3: (Strategic generalized triple cancellation) For every pair of strategies a and b, allocations w, x, y, z, state j, and constants w j *, x j *: If (a, w)  (b, x) and (a, w* j w  j )  (b, x* j x -j ) and (a, w j y  j )  (b, x j,z  j ) then (a, w* j y  j )  (b, x* j z  j ). In words: if changing w j to w j * is no worse than changing x j to x j * when the background is a comparison of (a, w) vs. (b, x), then changing w j to w j * cannot be strictly worse than changing x j to x j * when different allocations y and z are received in the other states. Axiom 4: (Overlap among strategies) There exist allocations {w (a), a  A } in the interior of W such that (a, w (a) ) ~ (b, w (b) ) for all strategies a and b.

11 Main result: general additive model Theorem 1: Axioms 1-4 hold iff  is represented by a cardinal utility function U having the strategy-dependent additive-across-states form: U(a, w) =, in which {v as } are strictly increasing, continuous evaluation functions for money that are unique up to joint transformations of the form  v as +  s +  as where  >0 and, for every a,. Note: preferences among strategies without any side- gambles are represented by {  s v as (0)}.

12 Special cases of the general additive model 1.SEU with quasi-linear utility v as (w s ) = p s (u as +w s ) 2.SEU with state-dependent linear utility v as (w s ) = p s (u as +  s w s ) 3.SEU with state-independent utility and prior stakes v as (w s ) = p s u(w s +W s ) 4.General state-dependent SEU v as (w s ) = p s u as (w s )

13 Which SEU preference parameters are theoretically observable ? The DM’s “true” subjective probabilities for all events are observable only in special case 1 (quasi-linear utility). True probabilities are distorted by state-dependence and/or effects of prior stakes in cases 2 and 3. True probabilities are irrelevant in special case 4 (hopelessly entangled with utilities). But… so what ! True probabilities are not needed for analysis of decisions (or games).

14 What’s really observable via finite measurements? The decision maker’s preference order has an additive representation in terms of evaluation functions {v as }, which are infinite-dimensional vectors. What sort of finite-dimensional measurements of preferences can be performed to enable decision analysis & game-theoretic analysis? How can we tell if the decision maker is acting “rationally” based (only) on this information?

15 Analysis of a finite decision problem Henceforth, assume that only a finite number M of concrete acts are available, accompanied by “small” side-gambles. W.l.o.g. let each concrete act be considered as a unique strategy with zero allocation, e.g., “strategy m” means act (a m, 0) for m = 1, …, M An outcome of the decision problem is a pair (m, s). Let v ms (w) henceforth denote the evaluation function for money in outcome (m, s). How can we predict the decision maker’s choice of m via credible elicitation of information about her “beliefs” and “values” encoded in {v ms (w)} ?

16 De Finetti’s excellent idea Let beliefs (subjective probabilities) be revealed through offers to accept small gambles. Let rationality be defined as the avoidance of arbitrage (“no Dutch books”). Under-appreciated virtue: this approach not only defines beliefs and rationality, it renders them common knowledge in a practical sense. It can also be extended to the revelation of values (albeit with imperfect separation of beliefs/values). It also provides a natural bridge to game theory and asset pricing theory

17 Measurable parameters of belief: risk neutral probabilities The decision maker’s risk neutral (betting) probabilities given strategy m are the normalized derivatives of the evaluation functions {v ms }, evaluated at w=0: Note that they are, in general, act-dependent Conditional on the “event” that strategy m is chosen, for whatever reason, z is a utility non-decreasing “belief gamble” iff 0  z ·  m  E  [z]

18 Common knowledge of beliefs (CKB) Definition: vectors {z k } are acceptable gambles if the DM is willing to let an opponent choose small non-negative multipliers {  k } and receive a total payoff of  k  k z k from the opponent Axiom CKB: for each m, the vectors z = ±(1 s   ms ) are acceptable given strategy m, where 1 s is the indicator for state s and  ms is the risk neutral probability of state s given given strategy m. I.e., the DM is willing to buy/sell Arrow-Debreu securities on state s at price  ms in the event she chooses strategy m.

19 In what sense do risk neutral probabilities {  m } represent “beliefs”? They coincide with hypothetical “true” probabilities (only) in the special case of quasi-linear utility In all other cases, they are strictly more useful (e.g., they suffice to determine marginal prices and risk premia) They are updated via Bayes’ rule on receipt of new experimental information, exactly like “true” probabilities Individuals ought to eventually agree on risk neutral probabilities, not “true” probabilities (given the opportunity to gamble with each other)

20 Measurable parameters of value: “value gambles” In the event strategy m yields greater utility than strategy n, the gamble  mn defined by whose state-by-state increments of cardinal utility are proportional to the state-by-state differences in value between m and n, evaluated from the perspective of the local marginal values for money that apply under the chosen strategy m, is utility non-decreasing.

21 In what sense do {  mn } represent “values” They coincide exactly with vectors of utility differences in the special case of quasi-linear utility They are independent of beliefs (subjective probabilities) in the case of state-dependent SEU They are independent of information in the case of scientific experiments Axiom CKV: for every strategy m and alternative strategy n, the vector  mn is an acceptable gamble.

22 Decision analysis in terms of acceptable gambles By construction:  m  mn  U(a m, 0)  U(a n, 0). Hence the direction of preference between any two concrete acts can be determined from observations of acceptable gambles under CKB and CKV. This suffices to determine the optimal act, even though “true” probabilities and utilities have not been observed But there is also another way to determine the “rational” outcome of the decision problem...

23 Common knowledge of rationality Definition: There is ex post arbitrage in outcome (m, s) if there exists a non-negative linear combination of acceptable gambles whose total payoff to the decision maker[s] is non-positive in all outcomes and strictly negative in state (m, s) Axiom CKR: There is no ex post arbitrage. In other words, if there is ex post arbitrage in outcome (m, s), then it should not happen.

24 Solution of the decision problem “by arbitrage” Fundamental theorem of decision analysis [one DM vs. nature]: Given axioms 1-4, CKB, CKV, and CKR, the outcome will be one in which the DM chooses a utility- maximizing strategy and nature chooses a state that has positive risk neutral probability given that strategy Fundamental theorem of games [2 or more DM’s vs. each other]: Given axioms 1-4, CKV, and CKR, the outcome will be one that has positive probability in a subjective correlated equilibrium based on common prior risk neutral probabilities

25 Onward to Bayesian updating... The paper goes on to assume that the state space S can be partitioned as S = H  I where I is a set of “informational” events that are expected to be resolved before an act is chosen and H is a set of “hypotheses” (consequential events) expected to be resolved only after an act has been chosen. Conditional preferences are defined in terms of contingent strategies that are pegged (only) to events in I. The DM is assumed to have no intrinsic interest in I-events, given the hypotheses, implying an observable likelihood. Optimal posterior decisions are made by updating risk neutral probabilities according to Bayes’ rule, then applying them to the evaluation of the original value gambles.