Cooperating Intelligent Systems Utility theory Chapter 16, AIMA.

Slides:



Advertisements
Similar presentations
Economics of Information (ECON3016)
Advertisements

Utility theory U: O-> R (utility maps from outcomes to a real number) represents preferences over outcomes ~ means indifference We need a way to talk about.
Decision Analysis (Decision Tables, Utility)
Paradoxes in Decision Making With a Solution. Lottery 1 $3000 S1 $4000 $0 80% 20% R1 80%20%
Making Simple Decisions Chapter 16 Some material borrowed from Jean-Claude Latombe and Daphne Koller by way of Marie desJadines,
Making Simple Decisions
Utility Theory.
Decisions under Uncertainty
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 14 From Randomness to Probability.
Clear your desk for your quiz. Unit 2 Day 8 Expected Value Average expectation per game if the game is played many times Can be used to evaluate and.
Risk Attitude Dr. Yan Liu
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Agent Technology for e-Commerce Appendix A: Introduction to Decision Theory Maria Fasli
Economics 202: Intermediate Microeconomic Theory 1.HW #5 on website. Due Tuesday.
1 Utility Theory. 2 Option 1: bet that pays $5,000,000 if a coin flipped comes up tails you get $0 if the coin comes up heads. Option 2: get $2,000,000.

Making Simple Decisions Copyright, 1996 © Dale Carnegie & Associates, Inc. Chapter 16.
Decision. Decision Summary Descriptive, normative, prescriptive Expected Utility: normative theory of decision Psychology of decision Prospect theory:
PSY 5018H: Math Models Hum Behavior, Prof. Paul Schrater, Spring 2005 Normative Decision Theory A prescriptive theory for how decisions should be made.
Decision Making Under Uncertainty Russell and Norvig: ch 16 CMSC421 – Fall 2006.
Decision Trees and Utility Theory
Expected Value.  In gambling on an uncertain future, knowing the odds is only part of the story!  Example: I flip a fair coin. If it lands HEADS, you.
Decision. Decision Summary Search for alternatives Descriptive, normative, prescriptive Expected Utility: normative theory of decision Psychology of decision.
EXPECTED UTILITY AND RISK AVERSION
CMSC 671 Fall 2003 Class #26 – Wednesday, November 26 Russell & Norvig 16.1 – 16.5 Some material borrowed from Jean-Claude Latombe and Daphne Koller by.
Chapter 14 sec 4 Expected Value. Questions How many of you have car insurance? How many of you have health insurance? Do you wonder how the insurance.
Problems With Expected Utility or if the axioms fit, use them but...
Decision-making under uncertainty. Introduction Definition of risk Attitudes toward risk Avoiding risk: Diversification Insurance.
PGM 2003/04 Tirgul7 Foundations of Decision Theory (mostly from Pearl)
Decision Analysis (cont)
CPS 270: Artificial Intelligence Decision theory Instructor: Vincent Conitzer.
Decision making Making decisions Optimal decisions Violations of rationality.
Reminder Midterm Mar 7 Project 2 deadline Mar 18 midnight in-class
Chapter 8 Probability Section R Review. 2 Barnett/Ziegler/Byleen Finite Mathematics 12e Review for Chapter 8 Important Terms, Symbols, Concepts  8.1.
Making Simple Decisions
Lecture 6 Discrete Random Variables: Definition and Probability Mass Function Last Time Families of DRVs Cumulative Distribution Function (CDF) Averages.
Chapter 5 Uncertainty and Consumer Behavior. ©2005 Pearson Education, Inc.Chapter 52 Q: Value of Stock Investment in offshore drilling exploration: Two.
Axioms Let W be statements known to be true in a domain An axiom is a rule presumed to be true An axiomatic set is a collection of axioms Given an axiomatic.
Advanced AI Rob Lass July 26, Administrative WebCT Problems? Why aren’t homeworks graded!? Midterm next Wed (Aug. 2nd)
Making Simple Decisions Copyright, 1996 © Dale Carnegie & Associates, Inc. Chapter 16.
Choice under uncertainty Assistant professor Bojan Georgievski PhD 1.
What is Probability?. The Mathematics of Chance How many possible outcomes are there with a single 6-sided die? What are your “chances” of rolling a 6?
1 Chapter 18 UNCERTAINTY AND RISK AVERSION. 2 Probability The probability of a repetitive event happening is the relative frequency with which it will.
Expected Value.
Quantitative Decision Techniques 13/04/2009 Decision Trees and Utility Theory.
Decision theory under uncertainty
Making Simple Decisions Utility Theory MultiAttribute Utility Functions Decision Networks The Value of Information Summary.
Chapter 16: Making Simple Decision March 23, 2004.
Expected Value, Expected Utility & the Allais and Ellsberg Paradoxes
Web-Mining Agents Agents and Rational Behavior Decision-Making under Uncertainty Simple Decisions Ralf Möller Universität zu Lübeck Institut für Informationssysteme.
Making Simple Decisions Chapter 16 Some material borrowed from Jean-Claude Latombe and Daphne Koller by way of Marie desJadines,
Binomial Distributions Chapter 5.3 – Probability Distributions and Predictions Mathematics of Data Management (Nelson) MDM 4U Authors: Gary Greer (with.
Chapter 16 March 25, Probability Theory: What an agent should believe based on the evidence Utility Theory: What the agent wants Decision Theory:
Behavioral Finance Preferences Part I Feb 16 Behavioral Finance Economics 437.
DADSS Lecture 11: Decision Analysis with Utility Elicitation and Use.
Making Decisions Under Uncertainty. To judge what one must do to obtain a good or avoid an evil, it is necessary to consider not only the good and the.
L12 Uncertainty. Model with real endowments 1. Labor Supply (Labor-Leisure Choice) 2. Intertemporal Choice (Consumption-Savings Choice) 3. Uncertainty.
Ralf Möller Universität zu Lübeck Institut für Informationssysteme
Copyright © 2016, 2013, and 2010, Pearson Education, Inc.
CHAPTER 1 FOUNDATIONS OF FINANCE I: EXPECTED UTILITY THEORY
CS 4/527: Artificial Intelligence
L11 Uncertainty.
Risk Chapter 11.
Rational Decisions and
13. Acting under Uncertainty Wolfram Burgard and Bernhard Nebel
Behavioral Finance Economics 437.
From Randomness to Probability
Probability Mutually exclusive and exhaustive events
Utilities and Decision Theory
Making Simple Decisions
Presentation transcript:

Cooperating Intelligent Systems Utility theory Chapter 16, AIMA

The utility function U(S) An agent’s preferences between different states S in the world are captured by the Utility function U(S). If U(S i ) > U(S j ) then the agent prefers state S i before state S j If U(S i ) = U(S j ) then the agent is indifferent between the two states S i and S j

Maximize expected utility A rational agent should choose the action that maximizes the the agent’s expected utility ( EU ): Where Result i (A) enumerates all the possible resulting states after doing action A.

The basis of utility theory Notation: A ≻ B A is preferred to B A ∼ B The agent is indifferent between A and B A ≿ B The agent prefers A to B, or is indifferent between them. A lottery is described with

The six axioms of utility theory Orderability (A ≻ B) ∨ (B ≻ A) ∨ (A ∼ B) Transitivity (A ≻ B) ∧ (B ≻ C) ⇒ (A ≻ C) Continuity (A ≻ B ≻ C) ⇒ ∃ p [p,A; 1-p,C] ∼ B Substitutability (A ∼ B) ⇒ [p,A; 1-p,C] ∼ [p,B; 1-p,C] Monotonicity (A ≻ B) ⇒ (p  q ⇔ [p,A; 1-p,B] ≿ [q,A; 1-q,B]) Decomposability [p,A; 1-p,[q,B; 1-q,C]] ∼ [p,A; (1-p)q,B; (1-p)(1-q),C] It follows from these axioms that there exists a real-valued function U that operates on states such that U(A) > U(B) ⇔ A ≻ B U(A) = U(B) ⇔ A ∼ B You must make a decision

The St. Petersburg ”paradox” You are offered to play the following game (bet): You flip a coin repeatedly until you get your first heads. You will then be paid $2 to the power of every flip you made, including the final one (the price matrix is below). How much are you willing to pay to participate (participation is not free)? TossWinning H$2 TH$4 TTH$8 TTTH$16...

The St. Petersburg ”paradox” What is the expected winning in this betting game? A rational player should be willing to pay any sum of money to participate......if $ = Utility The students in previous years’ classes have offered $4 or less on average...

The St. Petersburg ”paradox” Bernoulli (1738): The utility of money is not money; it is more like log(Money).

Mr Beard’s utility curve General ”human nature” utility curve Risk averse Risk seeking

Lottery game 1 $1,000,000 $5,000,000 $1,000,000 $ A B You can choose between alternatives A and B: A)You get $1,000,000 for sure. B)You can participate in a lottery where you can win up to $5 mill.

Lottery game 2 $5,000,000 $1,000,000 $ C D You can choose between alternatives C and D: C)A lottery where you can win $5 mill. D)A lottery where you can win $1 mill. $

Lottery preferences People should select A and D, or B and C. Otherwise they are not being consistent... Allais paradox. Utility function does not capture a human’s fear of looking like a complete idiot. In last years classes, fewer than 50% have been consistent...

Form of U(S) If the value of one attribute does not influence one’s opinion about the preference for another attribute, then we have mutual preferential independence and can write: Where V(X) is a value function (expressing the [monetary] value)

Example: The party problem We are about to give a wedding party. It will be held during summer- time. Should we be outdoors or indoors? The party is such that we can’t change our minds on the day of the party (different locations for indoors and outdoors). What is the rational decision? In Out Rain ¬ Rain Rain ¬ Rain Example adapted from Breese & Kooler 1997 Disaster! Perfect! Relieved Regret

Example: The party problem The value function: Assign a numerical (monetary) value to each outcome. (We avoid the question on how this is done for the time being) In Out Rain ¬ Rain Rain ¬ Rain Example adapted from Breese & Kooler 1997 Disaster! Perfect! Relieved Regret LocationWeatherValue IndoorsSun$25 IndoorsRain$75 OutdoorsSun$100 OutdoorsRain$0 Let U(S) = log[V(S)+1] U = 1.88 U = 1.41 U = 0.00 U = 2.00

Example: The party problem Get weather statistics for your location in the summer (June). In Out Rain ¬ Rain Rain ¬ Rain Example adapted from Breese & Kooler 1997 Disaster! Perfect! Relieved Regret LocationP(Rain) Stockholm, Sweden18/30 Bergen, Norway19/30 San Fransisco, USA1/30 Seattle, USA9/30 Paris, France14/30 Haifa, Israel0/30 U = 1.88 U = 1.41 U = 0.00 U = 2.00 Rain probabilities from Weatherbase

Example: The party problem Example: Stockholm, Sweden In Out Rain ¬ Rain Rain ¬ Rain Example adapted from Breese & Kooler 1997 Disaster! Perfect! Relieved Regret U = 1.88 U = 1.41 U = 0.00 U = 2.00 Be indoors!

Example: The party problem Example: San Fransisco, California In Out Rain ¬ Rain Rain ¬ Rain Example adapted from Breese & Kooler 1997 Disaster! Perfect! Relieved Regret U = 1.88 U = 1.41 U = 0.00 U = 2.00 Be outdoors! The change from outdoors to indoors occurs at P(Rain) > 7/30

Decision network for the party problem Decision represented by a rectangle Chance (random variable) represented by an oval. Utility function represented by a diamond. Location Happy guests WeatherU

The value of information The value of a given piece of information is the difference in expected utility value between best actions before and after information is obtained. Information has value to the extent that it is likely to cause a change of plan and to the extent that the new plan will be significantly better than the old plan.