Presentation is loading. Please wait.

Presentation is loading. Please wait.

EC941 - Game Theory Prof. Francesco Squintani Lecture 2 1.

Similar presentations


Presentation on theme: "EC941 - Game Theory Prof. Francesco Squintani Lecture 2 1."— Presentation transcript:

1 EC941 - Game Theory Prof. Francesco Squintani Email: f.squintani@warwick.ac.uk Lecture 2 1

2 Structure of the Lecture Mixed Strategies Nash Equilibrium and Rationalizability Correlated Equilibrium 2

3 Mixed Strategies In a game in strategic form G=(I, S, u), for each player i, S i is the set of pure strategies. A mixed strategy  i is a probability distribution over S i. When playing  i, player i operates a randomizing device and chooses the strategy accordingly. For example, she flips a coin choosing one strategy if heads turns up, and another one if tails turns up. 3

4 In general, randomizing devices can be very complex and induce any probability distribution. Payoffs and best-response correspondences can be extended from pure strategies to mixed strategies. Solution concepts (Nash Equilibrium, Dominance and Rationalizability) can be extended to mixed strategies. To play a mixed strategy  i, player i must be indifferent among the possible pure strategies. This indifference principle is crucial to calculate solutions. 4

5 Example: Matching Pennies HeadTail Head Tail -1, 1 1, -1 A reasonable solution is that both players randomize between H and T with equal probability. The game results in a tie. -1, 1 1, -1 There is no sure way to win for either of the players. 5

6 Let p, 0<p<1, be player 1’s probability to play H, and q, 0<q<1, be player 2’s probability to play H. Player 1’s expected payoff for playing H is: = U 1 = -1. q + 1. (1-q). = Player 1’s payoff for T is U 1 = 1. q - 1. (1-q). 1 q 0 1 H T U  When player 2 mixes between T and H with equal probability, player 1 is indifferent between T and H. 6

7 We can write best response correspondences in terms of mixed strategy: B 1 (q) = 1 if q < ½B 2 (p) = 0 if p < ½ B 1 (q) = [0,1] if q = ½ B 2 (p) = [0,1] if p = ½ B 1 (q) = 0 if q > ½ B 2 (p) = 1 if p > ½. 0 ½ 1 p q1½q1½ The Nash Equilibrium is found by intersecting the Best Response correspondences. The Nash Equilibrium is (p, q) = (½, ½). B 1 (q) B 2 (p) 7

8 General Definitions Consider a game in strategic form G = (I, S, u). Definition A mixed strategy  i of a player i is a probability distribution over player i’s pure strategies S i. The set of all player i’s possible mixed strategies is  i ={(  i  (s i )) :  i  (s i ) > 0 for all s i, and  s  i  s i  }. 8

9 Definition When the players play the mixed strategy profile , player i’s expected payoff U i is: U i (  ) =  s  i  i  s i  u i  s  Definition The mixed strategy extension of G is a game in strategic form  = (I, , U). Note The mixed strategies  of the game G are the pure strategies of game  Best Response Correspondences, Nash Equilibrium, Dominance and Rationalizability can be defined for game  9

10 The mixed strategy profile  ∗ is a Nash equilibrium if, for each player i, U i (  ∗ ) ≥ U i (  i,  ∗ −i ) for every mixed strategy  i of player i. For every player i, the best response correspondence is B i (  −i ) = {  i : U i (  i,  −i ) ≥ U i (  ’ i,  −i ) for every  ’ i }. The profile  ∗ is a Nash equilibrium if and only if  ∗ i belongs to B i (  ∗ −i ), for every i. Player i’s mixed strategy  i strictly dominates her strategy s i if U i (  i, s −i ) > u i (s i, s −i ) for every s −i. 10

11 Proposition If player i’s strategy s i is strictly dominated, then any mixed strategy  i which assigns strictly positive probability to s i is also strictly dominated. The definition of rationalizability follows from the iterated deletion of strictly dominated strategies. Given a game G = (I, , U), let X i 1 =  i. For each t = 0,..., T − 1, X i t+1 is a subset of X i t such that every  i in X i t that is not in X i t+1 is strictly dominated in G = (I, (X i t ) I, U). The set X i T is the set of rationalizable strategies of player i. 11

12 The Indifference Principle Indifference Principle. If the opponents play  −i, player i is indifferent among all pure strategies in the best response set B i (  −i ). Definition For any  i, let C i (  i ) be the support of  i, the set of strategies s i to which  i assigns non-zero probability. Proposition The profile   is a Nash Equilibrium if and only if C i (   i ) is a subset of B i (   −i ), for every player i. 12

13 13 In games with 2 players and 2 strategies for each player (let A 1 = {U, D} and A 2 = {L, R}), we find the completely mixed strategy Nash equilibrium as follows. From the indifference condition for player 1:  L u 1 (U,L) + (1-  L ) u 1 (U,R) =  L u 1 (D,L) + (1-  L ) u 1 (D,R) we find the mixed strategy  L.  U u 2 (U,L) + (1-  U ) u 2 (D,L) =  U u 2 (U,R) + (1-  U ) u 2 (D,R) pins down the mixed strategy  U. If 0<  L <1 and 0<  U <1, then (  L,  U ) is Nash Equilibrium.

14 Definition For any player i, and (non-empty) subset B i of the strategy set S i, let  −i (B i ) be the set of mixed strategy profiles  −i such that B i = B i (  −i ). We can find all mixed strategy equilibria as follows. Consider each profile B of subsets B i, and calculate the profile (  −i (B i )) I. The mixed strategy  is a Nash Equilibrium if:    −i  belongs to  −i (B i ) for all players i. 2. 2. C i (  i ) = B i for all players i. 14

15 Let p, q be player 1’s and 2’s probability to play B. U 1 (B) = 2 ・ q + 0 ・ (1 − q) = 2q. U 1 (S) = 0 ・ q + 1 ・ (1 − q) = 1 − q. The condition U 1 (B) = U 1 (S), i.e. 2q = 1 − q yields q = 1/3. Battle of the Sexes B S BSBS 2, 1 0, 01, 2 0, 0 1 q 1 2 0 B S UU 1/3 15

16 U 2 (B) = 1 ・ p + 0 ・ (1 − p) = p. U 2 (S) = 0 ・ p + 2 ・ (1 − p) = 2 − 2p. The condition U 2 (B) = U 2 (S), yields p = 2/3. B 1 (q) = 0 if q < 1/3 B 1 (q) = [0, 1] if q = 1/3 B 1 (q) =1 if q > 1/3 B 2 (p) = 0 if p < 2/3 B 2 (p) = [0, 1] if p = 2/3 B 2 (p) =1 if p > 2/3 q 1 1/3 B 1 (q) B 2 (p) 0 2/3 1 p 16

17 The game has three mixed strategy Nash equilibria: (p, q) = (0, 0), (2/3, 1/3), and (1, 1). The mixed strategy equilibria (0, 0) and (1, 1) correspond to the two pure strategy equilibria. The expected payoff of the equilibrium (2/3, 1/3) is U 1 = 2 ・ 2/9 + 0 ・ 4/9 + 0 ・ 1/9 + 1 ・ 2/9 = 2/3. Likewise, U 2 = 2/3. So, it is Pareto dominated by both pure strategy equilibria, which yield payoffs (2, 1) and (1, 2). 17

18 Let p, q be player 1’s and 2’s probability to play E. U 1 (N) = 0, U 1 (E) = (1 – c)q - c(1 − q) = q - c. The condition U 1 (N) = U 1 (E), yields q = c. The case of player 2 is symmetric. N E NENE 0, 0 -c, 0 1-c, 1-c 0, -c 1 q 0 1-c -c E N UU c Coordination Game 18

19 B 1 (q) = 0 if q < c B 1 (q) = [0, 1] if q = c B 1 (q) =1 if q > c B 2 (p) = 0 if p < c B 2 (p) = [0, 1] if p = c B 2 (p) =1 if p > c The game has three mixed strategy Nash equilibria: (p, q) = (0, 0), (c, c), and (1, 1). The mixed strategy equilibria (0, 0) and (1, 1) correspond to the pure strategy equilibria (N, N) and (E, E). The expected payoff of the equilibrium (c, c) is U 1 = c 2 (1-c) - c(1-c)c + (1-c)0 = 0 = U 2. B 1 (q) B 2 (p) 0 c 1 p 19

20 Games with Dominated Strategies A B CA B C 2, 1 1, 3 1, 1 0, 2 3, 1 1, 1 4, 1 3, 0 2, 2 DEFDEF Strategy C is dominated by any mixed strategy  1 with 0 <  1 (A) < 2/3. Once C is deleted, strategy F is dominated by E. 1 2 20

21 The set of rationalizable strategies is {A, B} and {D, E}. Inspection of the best response correspondence shows that there are no pure-strategy Nash Equilibria. A B CA B C 2, 1 1, 3 1, 1 0, 2 3, 1 1, 1 4, 1 3, 0 2, 2 DEFDEF 1 2 21

22 With the indifference principle, we find completely mixed strategy N.E. in the game reduced to {A,B} X {D,E}. Player 1 is indifferent between A and B when 2  2 (D) =  2 (D) + 3[1-  2 (D)] i.e., when  2 (D) = 3/4. A B CA B C 2, 1 1, 3 1, 1 0, 2 3, 1 1, 1 4, 1 3, 0 2, 2 DEFDEF 1 2 22

23 A B CA B C 2, 1 1, 3 1, 1 0, 2 3, 1 1, 1 4, 1 3, 0 2, 2 DEFDEF 1 2 Player 2 is indifferent between D and E when  1 (A)+3[1-  1 (A)]=2  1 (A)+[1-  1 (A)] i.e., when  1 (A)=2/3. The unique Nash Equilibrium of the game has C(  )={A,B} X {D,E} with  1 (A)=2/3 and  2 (D) = 3/4. 23

24 A BC DA BC D 0, 3 3, 1 4, 0 1, 0 2, 2 1, 1 0, 2 1, 1 2, 2 EFGHEFGH First, we iteratively delete strictly dominated strategies. In the first round, only strategy A is deleted. 2, 51, 4 0, 2 0, 4 1, 1 0, 1 3, 1 1 24

25 A BC DA BC D 0, 3 3, 1 4, 0 1, 0 2, 2 1, 1 0, 2 1, 1 2, 2 EFGHEFGH After deleting A, strategy H is strictly dominated. H is the only strategy deleted in the second round. 2, 51, 4 0, 2 0, 4 1, 1 0, 1 3, 1 1 2 25

26 A BC DA BC D 0, 3 3, 1 4, 0 1, 0 2, 2 1, 1 0, 2 1, 1 2, 2 EFGHEFGH After deleting H, strategy D is strictly dominated. D is the only strategy deleted in the third round. 2, 51, 4 0, 2 0, 4 1, 1 0, 1 3, 1 1 2 3 26

27 A BC DA BC D 0, 3 3, 1 4, 0 1, 0 2, 2 1, 1 0, 2 1, 1 2, 2 EFGHEFGH After deleting D, strategy E is strictly dominated. E is the only strategy deleted in the third round. The remaining strategies {B,C}, {F,G} cannot be deleted. 2, 51, 4 0, 2 0, 4 1, 1 0, 1 3, 1 1 2 3 4 27

28 A BC DA BC D 0, 3 3, 1 4, 0 1, 0 2, 2 1, 1 0, 2 1, 1 2, 2 EFGHEFGH Inspection of best responses shows that (B, F) and (C, G) are pure-strategy Nash Equilibria. We find mixed strategy N.E. in the game on {B,C} X {F,G}. 2, 51, 4 0, 2 0, 4 1, 1 0, 1 3, 1 1 2 3 4 28

29 A BC DA BC D 0, 3 3, 1 4, 0 1, 0 2, 2 1, 1 0, 2 1, 1 2, 2 EFGHEFGH Player 1 is indifferent between B and C when 2  2 (F) + [1-  2 (F)] = 2[1-  2 (F)] +  2 (F), i.e.  2 (F)=1/2. By symmetry, player 2 is indifferent when  1 (A)=1/2. 2, 51, 4 0, 2 0, 4 1, 1 0, 1 3, 1 1 2 3 4 29

30 A Game with Many Players n people observe a crime. Each attaches the value v to the police being informed and bears the cost c if calling the police, where v > c > 0. There are n pure strategy equilibria. In each one of these equilibria, a player i reports, and the remaining n - 1 players do not. 30

31 31 Let p be the mixed strategy that a player calls the police. The symmetric mixed strategy equilibrium p is determined by the indifference condition: v − c = 0 ・ Pr{no one else calls} + v ・ Pr{someone else calls} v − c = v ・ (1- Pr{no one else calls}) v − c = v ・ (1 – (1 – p) n-1 ) or c = – v (1 – p) n-1 p = 1 − (c/v) 1/(n−1)

32 Correlated Equilibrium: Example DoveHawk Dove Hawk 0, 0 1, -2-3,-3 -2, 1 In the hawk-dove game, two States confront in a dispute. If a State plays Dove and the other Hawk, the Dove loses the dispute. But if both States are Hawks, a war takes place. 32

33 DoveHawk Dove Hawk 0, 0 1, -2-3,-3 -2, 1 There are two pure-strategy Nash Equilibria (H, D) and (D, H), with payoffs (1, -2) and (-2, 1). There is a mixed strategy Nash equilibrium  (D) = 1/2, which yields payoff of -1 to both players. 33

34 If the players can play (H, D) and (D, H) with equal probability, their expected payoff is -1/2. Playing the Nash equilibria (H, D) and (D, H) with equal probability, is a correlated equilibrium. The players coordinate on a different Nash equilibrium on the basis of a fair coin toss. The coin is the correlating device. Can the players achieve a higher payoff with a more complex correlating device? 34

35 Suppose that they resort to a mediator: 1.The mediator randomly chooses between one of the four outcomes (H, H), (H, D), (D, H), and (D, D) using the probability profile p = (p HH, p HD, p DH, p DD ). 2.The mediator does not report her choice. She makes a separate, private recommendation h or d, to each one of the players. For example, if her choice is (H, D), she recommends h to player 1 and d to player 2. The probability profile p is a correlated equilibrium if all the private recommendations are self-enforcing. 35

36 For player 1, the private recommendation d is self- enforcing if U 1 (D|d) = u 1 (D,D)p(DD|d) + u 1 (D,H)p(DH|d) > U 1 (H|d) = u 1 (H,D)p(DD|d) + u 1 (H,H)p(DH|d) and the private recommendation h is self-enforcing if: U 1 (H|h) = u 1 (H,D)p(HD|h) + u 1 (H,H)p(HH|h) > U 1 (D|h) = u 1 (D,D)p(HD|h) + u 1 (D,H)p(HH|h) and the conditions for player 2 are analogous. 36

37 To find the symmetric correlated equilibrium which maximizes the sum of payoffs, we solve the program: max [u 1 (D,D)+u 2 (D,D)]p(DD)+[u 1 (D,H)+u 2 (D,H)]p(DH) +[u 1 (H,D)+u 2 (H,D)]p(HD)+[u 1 (H,H)+u 2 (H,H)]p(HH) s.t. U i (D|d) > U i (H|d) U i (D|h) > U i (H|h) p(DH) = p(HD) for both players i = 1, 2. 37

38 38 Substituting in the payoffs, and using symmetry, we obtain: U 1 (D|d) = -2p(DH|d) > U 1 (H|d) = p(DD|d) - 3p(DH|d) U 1 (H|h) = p(HD|h) - 3p(HH|h) > U 1 (D|h) = -2p(HH|h). Expanding the conditional probabilities, and simplifying -2p(DH) > p(DD) - 3p(DH) p(HD) - 3p(HH) > -2p(HH). Note that this is a set of linear inequalities.

39 We solve for the optimal correlated equilibrium: max 0p(DD)+2(1-2)p(DH) - 6p(HH) s.t. p(DH) = p(HD) -2p(DH) > p(DD) -3p(DH) p(HD) -3p(HH) > -2p(HH) Simplifying, we obtain: p(DH) = p(HD) > p(DD) and p(HD) = p(DH) > p(HH). The solution is p(HH)=0, p(HD)=p(DH)=p(DD)=1/3. The expected payoff of both players i = 1, 2 is: U i (p) = 0p(DD) + p(HD) - 2p(DH) = -1/3. 39

40 Correlated Equilibrium Definition A correlated equilibrium in a game G=(I, S, u) is a probability distribution p over the set S such that, for any player i, and all her strategies s i  s -i [u i (s i, s -i ) p(s i, s -i )] >  s -i [u i (s’ i, s -i ) p(s’ i, s -i )], for all s’ i. Proposition Any correlated equilibrium is also a mixed strategy Nash Equilibrium. 40

41 Summary of the Lecture Mixed Strategies Nash Equilibrium and Rationalizability Correlated Equilibrium 41

42 Preview Next Lecture Bayesian Games and Nash Equilibrium Bayesian Games and Information Cournot Duopoly with Private Information Public Good Provision with Private Information 42


Download ppt "EC941 - Game Theory Prof. Francesco Squintani Lecture 2 1."

Similar presentations


Ads by Google