MIT and James Orlin © 2003 1 Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)

Slides:



Advertisements
Similar presentations
Thursday, March 7 Duality 2 – The dual problem, in general – illustrating duality with 2-person 0-sum game theory Handouts: Lecture Notes.
Advertisements

Introduction to Game Theory
GAME THEORY.
Nash’s Theorem Theorem (Nash, 1951): Every finite game (finite number of players, finite number of pure strategies) has at least one mixed-strategy Nash.
Module 4 Game Theory To accompany Quantitative Analysis for Management, Tenth Edition, by Render, Stair, and Hanna Power Point slides created by Jeff Heyl.
Two-Person, Zero-Sum Game: Advertising TV N TVN 0 TV N TVN Column Player: Row Player: Matrix of Payoffs to Row.
Game Theory Assignment For all of these games, P1 chooses between the columns, and P2 chooses between the rows.
Totally Unimodular Matrices
Mixed Strategies CMPT 882 Computational Game Theory Simon Fraser University Spring 2010 Instructor: Oliver Schulte.
Lecturer: Moni Naor Algorithmic Game Theory Uri Feige Robi Krauthgamer Moni Naor Lecture 8: Regret Minimization.
Game Theory S-1.
Mixed Strategies For Managers
Chapter Twenty-Eight Game Theory. u Game theory models strategic behavior by agents who understand that their actions affect the actions of other agents.
Simultaneous- Move Games with Mixed Strategies Zero-sum Games.
Two-Player Zero-Sum Games
Operations Research Assistant Professor Dr. Sana’a Wafa Al-Sayegh 2 nd Semester ITGD4207 University of Palestine.
Study Group Randomized Algorithms 21 st June 03. Topics Covered Game Tree Evaluation –its expected run time is better than the worst- case complexity.
Game theory.
© 2015 McGraw-Hill Education. All rights reserved. Chapter 15 Game Theory.
Shengyu Zhang The Chinese University of Hong Kong.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Zero-Sum Games (follow-up)
Network Theory and Dynamic Systems Game Theory: Mixed Strategies
1. 2 APPENDIX B A PROCEDURE FOR GENERATING AN EQUILIBRIUM POINT FOR 2-PERSON GAMES (That sometimes works!)
Part 3: The Minimax Theorem
An Introduction to Game Theory Part I: Strategic Games
Chapter 14 Game Theory to accompany Operations Research: Applications and Algorithms 4th edition by Wayne L. Winston Copyright (c) 2004 Brooks/Cole, a.
Chapter Twenty-Eight Game Theory. u Game theory models strategic behavior by agents who understand that their actions affect the actions of other agents.
Duality Lecture 10: Feb 9. Min-Max theorems In bipartite graph, Maximum matching = Minimum Vertex Cover In every graph, Maximum Flow = Minimum Cut Both.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
QR 38, 2/22/07 Strategic form: dominant strategies I.Strategic form II.Finding Nash equilibria III.Strategic form games in IR.
Games of Chance Introduction to Artificial Intelligence COS302 Michael L. Littman Fall 2001.
Finite Mathematics & Its Applications, 10/e by Goldstein/Schneider/SiegelCopyright © 2010 Pearson Education, Inc. 1 of 68 Chapter 9 The Theory of Games.
Game Theory Objectives:
To accompany Quantitative Analysis for Management, 8e by Render/Stair/Hanna S-1 © 2003 by Prentice Hall, Inc. Upper Saddle River, NJ Supplement 1.
Game Theory.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
MIT and James Orlin © Chapter 3. The simplex algorithm Putting Linear Programs into standard form Introduction to Simplex Algorithm.
An Intro to Game Theory Avrim Blum 12/07/04.
Game Theory Statistics 802. Lecture Agenda Overview of games 2 player games representations 2 player zero-sum games Render/Stair/Hanna text CD QM for.
Experts Learning and The Minimax Theorem for Zero-Sum Games Maria Florina Balcan December 8th 2011.
Minimax strategies, Nash equilibria, correlated equilibria Vincent Conitzer
Game Theory.
The Design & Analysis of the Algorithms Lecture by me M. Sakalli Download two pdf files..
Game theory & Linear Programming Steve Gu Mar 28, 2008.
Chapter 11 Game Theory Math Game Theory What is it? – a way to model conflict and competition – one or more "players" make simultaneous decisions.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Lecture 2: two-person non.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Introduction Giovanni Neglia.
Game theory is the study of the best play–safe strategy to adopt if 2 players are playing a game where decisions affect each other. Ex Two men are caught.
Game Theory, Part 2 Consider again the game that Sol and Tina were playing, but with a different payoff matrix: H T Tina H T Sol.
Part 3 Linear Programming
1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.
Lecture 12. Game theory So far we discussed: roulette and blackjack Roulette: – Outcomes completely independent and random – Very little strategy (even.
Shall we play a game? Game Theory and Computer Science Game Theory /06/05 - Zero-sum games - General-sum games.
1. 2 Some details on the Simplex Method approach 2x2 games 2xn and mx2 games Recall: First try pure strategies. If there are no saddle points use mixed.
1. 2 You should know by now… u The security level of a strategy for a player is the minimum payoff regardless of what strategy his opponent uses. u A.
1 a1a1 A1A1 a2a2 a3a3 A2A Mixed Strategies When there is no saddle point: We’ll think of playing the game repeatedly. We continue to assume that.
Zero-sum Games The Essentials of a Game Extensive Game Matrix Game Dominant Strategies Prudent Strategies Solving the Zero-sum Game The Minimax Theorem.
Statistics Overview of games 2 player games representations 2 player zero-sum games Render/Stair/Hanna text CD QM for Windows software Modeling.
GAME THEORY Day 5. Minimax and Maximin Step 1. Write down the minimum entry in each row. Which one is the largest? Maximin Step 2. Write down the maximum.
Lec 23 Chapter 28 Game Theory.
9.2 Mixed Strategy Games In this section, we look at non-strictly determined games. For these type of games the payoff matrix has no saddle points.
By: Donté Howell Game Theory in Sports. What is Game Theory? It is a tool used to analyze strategic behavior and trying to maximize his/her payoff of.
Mixed Strategies Keep ‘em guessing.
Game Theory M.Pajhouh Niya M.Ghotbi
The Duality Theorem Primal P: Maximize
Tools for Decision Analysis: Analysis of Risky Decisions
Communication Complexity as a Lower Bound for Learning in Games
Game Theory.
Lecture 20 Linear Program Duality
Presentation transcript:

MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)

MIT and James Orlin © person 0-sum game theory Person R chooses a row: either 1, 2, or 3 Person C chooses a column: either 1, 2, or 3 This matrix is the payoff matrix for player R. (And player C gets the negative.) e.g., R chooses row 3; C chooses column 1 1 R gets 1; C gets –1 (zero sum)

MIT and James Orlin © Some more examples of payoffs R chooses 2, C chooses R chooses row 3; C chooses column 3 0 R gets -2; C gets +2 (zero sum) R gets 0; C gets 0 (zero sum) 0 -2

MIT and James Orlin © Next: 2 volunteers Player R puts out 1, 2 or 3 fingers R tries to maximize his or her total C tries to minimize R’s total. Player C simultaneously puts out 1, 2, or 3 fingers We will run the game for 5 trials.

MIT and James Orlin © Next: Play the game with your partner (If you don’t have one, then watch) Player R puts out 1, 2 or 3 fingers R tries to maximize his or her total C tries to minimize R’s total. Player C simultaneously puts out 1, 2, or 3 fingers We will run the game for 5 trials.

MIT and James Orlin © Who has the advantage: R or C? Suppose that R and C are both brilliant players and they play a VERY LONG TIME. Will R’s payoff be positive in the long run, or will it be negative, or will it converge to 0? We will find a lower and upper bound on the payoff to R using linear programming.

MIT and James Orlin © Computing a lower bound Suppose that player R must announce his or her strategy in advance of C making a choice. A strategy that consists of selecting the same row over and over again is a “pure strategy.” R can guarantee a payoff of at least –1. If R is forced to announce a row, then what row will R select? 20

MIT and James Orlin © Computing a lower bound on R’s payoff Suppose we permit R to choose a random strategy. The column player makes the choice after hearing the strategy, but before seeing the flip of the coin. Suppose R will flip a coin, and choose row 1 if Heads, and choose row 3 if tails.

MIT and James Orlin © What is player’s C best response? What would C’s best response be? If C knows R’s random strategy, then C can determine the expected payoff for each column chosen. Prob Expected Payoff So, with a random strategy R can get at least -.5

MIT and James Orlin © Suppose that R randomizes between row 1 and row What would C’s best response be? Prob Expected Payoff So, with a random strategy R can get at least 0.

MIT and James Orlin © What is R’s best random strategy? Prob. x1x1 x2x2 x3x3 A: -2 x x 2 + x 3 A Expected Payoff B B: x 1 - x 2 C C: 2 x x 3 x 1 + x 2 + x 3 = 1 C will choose the column that is the minimum of A, B, and C.

MIT and James Orlin © Finding the min of 2 numbers as an optimization problem. Let z = min (x, y) Then z is the optimum solution value to the following LP: maximize z subject to z  x z  y

MIT and James Orlin © R’s best strategy, as an LP x1x1 x2x2 x3x3 A: P A = -2 x x 2 + x 3 AB B: P B = x 1 - x 2 C C: P C = 2 x x 3 Expected Payoff Maximize min (P A, P B, P C ) x 1 + x 2 + x 3 = 1 x 1, x 2, x 3  0

MIT and James Orlin © R’s best strategy, as an LP x1x1 x2x2 x3x3 A: z  -2 x x 2 + x 3 AB B: z  x 1 - x 2 C C: z  2 x x 3 Expected Payoff Maximize z (the payoff to x) x 1 + x 2 + x 3 = 1 x 1, x 2, x 3  0 2-person 0-sum game

MIT and James Orlin © The Row Player’s LP, in general a 11 a 12 a 13 x1x1 x2x2 xnxn a 1m a 21 a 22 a 23 a 2m a n1 a n2 a n3 a nm … … … … P j : z  a 1j x 1 + a 2j x 2 +… + a nj x n for all j P1P1 P2P2 P3P3 x 1 + x 2 + … + x n = 1 x j  0 for all j Expected Payoff Maximize z (the payoff to x) PmPm …

MIT and James Orlin © Here is the optimal random strategy for R The optimal payoff to R is 1/9. Prob. 7/18 5/18 1/3 1/9 Expected Payoff So, with a random strategy R guarantees obtaining at least 1/9. This strategy is a lower bound on what R can obtain if he or she takes into account what C is doing.

MIT and James Orlin © We can obtain a lower bound for C (and an upper bound for R) in the same manner If C announced this random strategy, R would select row 1. Exp. payoff 1/2 0 -1/4 1/41/21/4 Prob. So, with this random strategy C guarantees that R obtains at most 1/2. If C chooses a random strategy, it will give an upper bound on what R can obtain.

MIT and James Orlin © Exercise, what is the best bound for C? With your partner, write the LP whose solution gives an optimal strategy for the column player. Exp. payoff ? ? ? y1y1 y2y2 y3y3 Prob.

MIT and James Orlin © The best strategy for the column player Exp. payoff 1/9 1/35/91/9 Prob. So, with this random strategy R gets only 1/9.

MIT and James Orlin © Note: the payoff is the same, whether the Row Player goes first or the Column Player Exp. payoff 1/9 1/35/91/9 Prob. For 2-person 0-sum games, the maximum payoff that R can guarantee by choosing a random strategy is the minimum payoff to R that C can guarantee by choosing a random strategy. So, the optimal average payoff to the game is 1/9, assuming that both players play optimally.

MIT and James Orlin © Constant sum games Column Player C Row Player R 5 1 R chooses 2, C chooses 1, Payoff is 5 to R Payoff is 1 to C

MIT and James Orlin © Constant sum games Column Player C Row Player R 5 1 R chooses 2, C chooses 1, Payoff is 5 to R Payoff is 1 to C Here the total payoff is 6. For C, maximizing payoff is the same as minimizing payoff to R.

MIT and James Orlin © Constant sum games Column Player C Row Player R 5 1 This becomes equivalent to 0-sum games Here the total payoff is 6. For C, maximizing payoff is the same as minimizing payoff to R.

MIT and James Orlin © person 0-sum games in general. Let x denote a random strategy for R, with value z(x) and let y denote a random strategy for C with value v(y). z(x)  v(y) for all x, y The optimum x* can be obtained by solving an LP. So can the optimum y*. z(x*) = v(y*)

MIT and James Orlin © More on 2-person 0-sum games In principle, R can do as well with a random fixed strategy as by carefully varying a strategy over time Duality for game theory was discovered by Von Neumann and Morgenstern (predates LP duality). The idea of randomizing strategy permeates strategic gaming. Applications to games: football, baseball and more

MIT and James Orlin © Non-Constant sum 2-person games Prisoner 2 Prisoner 1 confess don’t confess confess don’t confess If Prisoner 1 announces a strategy, we assume that Prisoner 2 chooses optimally.

MIT and James Orlin © Remarks on strategies for Prisoner Prisoner 2 Prisoner 1 confess don’t confess confess don’t confess Suppose Prisoner 1 confesses Then the optimum strategy for Prisoner 2 is to confess.

MIT and James Orlin © Remarks on strategies for Prisoner Prisoner 2 Prisoner 1 confess don’t confess confess don’t confess Suppose Prisoner 1 does not confess Then the optimum strategy for Prisoner 2 is to confess.

MIT and James Orlin © Remarks on strategies for Prisoner Prisoner 2 Prisoner 1 confess don’t confess confess don’t confess Suppose Prisoner 1 announces that he will flip a coin to determine if he will confess or not. 1/2

MIT and James Orlin © Remarks on a mixed strategy Prisoner 2 Prisoner 1 confess don’t confess confess don’t confess Suppose Prisoner 1 announces that he will flip a coin to determine if he will confess or not. Then the optimum strategy for Prisoner 2 is to confess.

MIT and James Orlin © Consider the following pure strategy: Prisoner 1: confesses Prisoner 2: confesses A strategy is called a Nash equilibrium if neither player can benefit by a unilateral change in strategy. The above strategy is a Nash equilibrium

MIT and James Orlin © The optimal strategy for Prisoner 2 is to confess, regardless of what Prisoner 1 does. Similarly, the optimum strategy for Prisoner 1 is to confess, regardless of what Prisoner 2 does. It is rare that an optimal strategy does not depend on the strategy of the opponent.

MIT and James Orlin © Non-Constant sum 2-person games Prisoner 2 Prisoner 1 confess don’t confess confess don’t confess Suppose the game is repeated 100 times with participants. The Nash equilibrium is to always confess. But in practice …

MIT and James Orlin © Another 2-person game Column Player C Row Player R By a strategy for the row player, we mean a mixed strategy, such as choosing row 1 with probability 2/3 and choosing row 2 with probability 1/3.

MIT and James Orlin © A Nash Equilibrium Column Player C Row Player R 2/3 1/3 1/2 Let’s fix R’s strategy and look at C

MIT and James Orlin © A Nash Equilibrium Column Player C Row Player R 2/3 1/3 1/2 Let’s fix R’s strategy and look at C

MIT and James Orlin © C’s perspective 00 Column Player C Row Player R 1/2 Let’s fix C’s strategy and look at R Note: any strategy for C gives the same expected return, which is 0. So C cannot benefit by changing strategy

MIT and James Orlin © R’s Perspective Column Player C Row Player R 2/3 1/3 1/2 Let’s fix C’s strategy and look at R

MIT and James Orlin © R’s Perspective 0 0 Column Player C Row Player R 2/3 1/3 Let’s fix C’s strategy and look at R Note: any strategy for R gives the same expected return, which is 0. So R cannot benefit by changing strategy

MIT and James Orlin © Theorem. For two person game theory, there always exists a Nash equilibrium. To find a Nash equilibrium, one can use optimization theory, but it is not a linear program. It is more difficult.

MIT and James Orlin © Summary 2-person constant sum game theory –mini-max solution –randomized strategies 2-person game theory –Prisoner’s dilemma –Nash equilibrium

MIT and James Orlin © A 2-dimensional view of game theory The following slides show how to solve 2-person 0-sum game theory when there are two strategies per player If R goes first and decides on a strategy of choosing row 1 with probability p and row 2 with probability 1-p, then the strategy for C is easily determined. So R can determine the payoff as a function of p, and then choose p to maximize the payoff. These slides were not presented in lecture.

MIT and James Orlin © What is the optimal strategy for R? Here is the payoff if C picks column probability of selecting row 1 payoff Graph the payoff as a function of the probability p that R selects row 1

MIT and James Orlin © More on payoffs as a function of p probability of selecting row Here is the payoff if C picks column 2 payoff

MIT and James Orlin © What is the optimal strategy for C as a function of p? probability of selecting row C picks column 1 C picks column 2 payoff 1/5

MIT and James Orlin © Here is the payoff to R as a function of p probability of selecting row C picks column 1 C picks column 2 payoff 1/5

MIT and James Orlin © What is the optimal strategy for R? payoff to R is 2(1-p) if p > 1/5 payoff to R is 1 + 3p if p < 1/5 The maximum payoff is when p = 1/5, and the payoff is 8/5 to R.

MIT and James Orlin © What is the optimal strategy for C? Here is the payoff if R picks row probability of selecting col. 1 payoff Graph the payoff as a function of the probability q that C selects column 1

MIT and James Orlin © More on payoffs as a function of q probability of selecting col Here is the payoff if R picks row 2 payoff

MIT and James Orlin © What is the optimal strategy for C as a function of q? probability of selecting row R picks row 1 R picks row 2 payoff q = 3/5

MIT and James Orlin © Here is the payoff to R as a function of q, the probability that C chooses Column probability of selecting row payoff q = 3/5 R picks row 1 R picks row 2

MIT and James Orlin © What is the optimal strategy for C? payoff to R is 4(1-q) if q < 3/5 payoff to R is 2q + (1-q) if q > 3/5 The column player C can guarantee the minimum payoff to R when q = 3/5, and the payoff is 8/5 to R.

MIT and James Orlin © What is the optimal strategy for C? payoff to R is 2(1-p) if p > 1/5 payoff to R is 1 + 3p if p < 1/5 The maximum payoff is when p = 1/5, and the payoff is 8/5 to R.

MIT and James Orlin © What is the optimal strategy for R? Here is the payoff if C picks column probability of selecting row 1 payoff Graph the payoff as a function of the probability p that R selects row 1

MIT and James Orlin © More on payoffs as a function of p probability of selecting row Here is the payoff if C picks column 2 payoff 3-3 4

MIT and James Orlin © What is the optimal strategy for C as a function of p? probability of selecting row C picks column 1 C picks column 2 payoff p = 6/11

MIT and James Orlin © Here is the payoff to R as a function of p probability of selecting row C picks column 1 C picks column 2 payoff p = 6/11

MIT and James Orlin © What is the optimal strategy for R? payoff to R is 7p -3 if p < 6/11 payoff to R is 3 - 4p if p > 6/11 The maximum payoff is when p = 6/11, and the payoff is 9/11 to R.