Presentation is loading. Please wait.

Presentation is loading. Please wait.

Two-Player Zero-Sum Games Finding the saddle-point – Example #2.

Similar presentations


Presentation on theme: "Two-Player Zero-Sum Games Finding the saddle-point – Example #2."— Presentation transcript:

1 Two-Player Zero-Sum Games Finding the saddle-point – Example #2

2 Finding Saddle Points Consider the following 2-player zero-sum game… XY A-32 B4 Player 2 Player 1

3 Finding Saddle Points Consider the following 2-player zero-sum game… XY A-32 B4 Player 2 Player 1 First, determine if saddle-point is in pure strategies…. we determine minimax and maximin strategies.

4 Finding Saddle Points Consider the following 2-player zero-sum game… XY A-32 B P1 P2 maximum of minimum values minimum of maximum values

5 Finding Saddle Points Consider the following 2-player zero-sum game… XY A-32 B P1 P2 B is the maximin strategy for player 1 minimum of maximum values

6 Finding Saddle Points Consider the following 2-player zero-sum game… XY A-32 B P1 P2 B is the maximin strategy for player 1 Y is the minimax strategy for player 2

7 Finding Saddle Points Consider the following 2-player zero-sum game… XY A-32 B P1 P2 B is the maximin strategy for player 1 Y is the minimax strategy for player 2

8 Finding Saddle Points XY A-32 B P1 P2 Maximin strategy for player 1 Minimax strategy for player 2 If player 1 chooses strategy A, then player 2’s best strategy results in the payoff -3 If player 1 chooses strategy B, then player 2’s best strategy results in the payoff -1 Therefore, assuming player 2 is always playing his best strategy for any choice by player 1, the best strategy for player 1 is B… this is called the maximin strategy.

9 Finding Saddle Points XY A-32 B P1 P2 Maximin strategy for player 1 Minimax strategy for player 2 If player 2 chooses X then the best strategy for player 1 results in 4. If player 2 chooses Y then the best strategy for player 1 results in 2. Therefore, assuming player 1 is always playing his best strategy for any strategy choice of player 2, the best strategy for player 2 is Y. This is the minimax strategy for player 2.

10 Finding Saddle Points XY A-32 B P1 P2 These values are not the same and therefore the optimal strategies – the saddle point – lies in mixed strategies for each player. Assuming that player 2 is playing his best, player 1 could be sure that the worst that could happen would be to get a payoff of -1 if he continued with strategy B and it might turn out better, if player 2 mistakenly chose X.

11 Finding Saddle Points XY A-32 B P1 P2 Assuming that player 1 is playing his best, player 2 could choose strategy Y and the worst that could happen is to get a 2 (and it could turn out better for player 2, if player 1 mistakenly chose B.) The situation is that an optimal strategy for each will be mixed strategies because, the worst that could happen to each is not the same value… that is to say, based on the maximin and minimax strategies of each, each knows the worst that could happen assuming the other plays their best. Each player will try to do better than that by mixing strategies on repeated plays of the game.

12 XY A-32 B4 P1 P2 p 1-p q 1-q First, let’s determine P2’s optimal mixed strategy…To do this, we will consider the payoffs that P1 would receive with pure strategies corresponding to P2’s mixed strategy… E A = -3p + 2(1-p) = -3p+2-2p = -5p+2 E B = 4p + -1(1-p) = 4p -1+p = 5p-1 Finding Saddle Points Assign probabilities for repeated play for each strategy.

13 Graphing the Payoff Functions 1 p E B = 5p - 1 E A = -5p + 2 expected value Intersection at (3/10, 1/2) Find intersection by solving equations… 5p-1 = -5p+2 10p = 3 p=3/10 then plug p = 3/10 into either function to get payoff of 1/2 1/2 represents the expected payoff to the row player (player 1) and is therefore the referred to as the value of the game. Note that the expected payoff to the column player (player 2) would therefore by -1/2).

14 XY A-32 B4 P1 P2 p 1-p q 1-q Now, for player 1… we assume pure strategies for player 2 and find q and 1- q which represent the optimal mixed strategy for player 1… E X = -3q + 4(1-q) = -3q + 4 – 4q = -7q + 4 E Y = 2q -1(1-q) = 2q – 1 + q = 3q - 1 Finding Saddle Points

15 Graphing the Payoff Functions 1 q E Y = 3q -1 E X = -7q + 4 expected value Intersection at (1/2, 1/2) Find intersection by solving equations… -7q + 4 = 3q -1 10q = 5 q = 1/2 The goal for player 1 is to maximize his own payoff. Because it is a zero sum game, that’s the same as minimizing the other player’s payoff. For example, if player 1 chose a value of q = 1 which corresponds to playing strategy A, then the best strategy for player 2 is to pick strategy X. Player 1 finds that q=1/2 is optimal because player 2 can not get a lower payoff at this point.

16 Graphing the Payoff Functions 1 q E Y = 3q -1 E X = -7q + 4 expected value Intersection at (1/2, 1/2) Find intersection by solving equations… -7q + 4 = 3q -1 10q = 5 q = 1/2 The conclusion is that an optimal mixed strategy for player 1 is play strategy A with probability 1/2 and strategy B with probability 1/2.

17 Writing the Answer We were seeking an optimal strategy for each player in this 2-player zero-sum game. XY A-32 B4 Player 2 Player 1 The answer is that the optimal strategies for each player lie in mixed strategies. We’ve found that the optimal strategy for player 2 is play strategy X with probability 3/10 and therefore play strategy Y with probability 7/10. And we found that the optimal strategy for player 1 is to play strategy A with probability 1/2 and strategy B with probability 1/2. We’ve also found that the value of the same is 1/2. Note that the value of the game is the payoff to the row player (and thus negating the value of the game is the payoff to the column player). Also note that the value of the game is always the second coordinate in both calculations of an intersection point for the payoff functions.

18 Writing the Answer We were seeking an optimal strategy for each player in this 2-player zero-sum game. XY A-32 B4 Player 2 Player 1 Here is a succinct way to write the final answer…. Optimal strategies: Player 2: (3/10, 7/10) Player 1: (1/2, 1/2) Value of the game: 1/2 In writing it this way, it is understood that the first coordinate is the probability for playing the first strategy and the second coordinate is the probability for playing the other strategy when reading the strategies from the given table in the natural order – from left to right and top to bottom.


Download ppt "Two-Player Zero-Sum Games Finding the saddle-point – Example #2."

Similar presentations


Ads by Google