Infinitely Repeated Games

Slides:



Advertisements
Similar presentations
Game Theoretic Analysis of Oligopoly y n Y N 0000 Y N The unique dominant strategy Nash Equilibrium is (y,Y) A game of imperfect.
Advertisements

Vincent Conitzer CPS Repeated games Vincent Conitzer
Some Problems from Chapt 13
Pondering more Problems. Enriching the Alice-Bob story Go to AGo to B Go to A Alice Go to B Go to A Go to B Go shoot pool Alice.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 3.1.Dynamic Games of Complete but Imperfect Information Lecture
Crime, Punishment, and Forgiveness
Game Theory “Доверяй, Но Проверяй” - Russian Proverb (Trust, but Verify) - Ronald Reagan Mike Shor Lecture 6.
Game Theory “Доверяй, Но Проверяй” (“Trust, but Verify”) - Russian Proverb (Ronald Reagan) Topic 5 Repeated Games.
Dispute Settlement Mechanism The role of dispute settlement mechanism –information gathering and dispatching, not enforcement of trade arrangements Main.
Games With No Pure Strategy Nash Equilibrium Player 2 Player
Infinitely Repeated Games. In an infinitely repeated game, the application of subgame perfection is different - after any possible history, the continuation.
Non-Cooperative Game Theory To define a game, you need to know three things: –The set of players –The strategy sets of the players (i.e., the actions they.
Chapter 14 Infinite Horizon 1.Markov Games 2.Markov Solutions 3.Infinite Horizon Repeated Games 4.Trigger Strategy Solutions 5.Investing in Strategic Capital.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 2.5.Repeated Games Lecture
Infinitely Repeated Games Econ 171. Finitely Repeated Game Take any game play it, then play it again, for a specified number of times. The game that is.
EC941 - Game Theory Lecture 7 Prof. Francesco Squintani
Game Theory: Inside Oligopoly
Game Theory. Games Oligopolist Play ▫Each oligopolist realizes both that its profit depends on what its competitor does and that its competitor’s profit.
Game Theory Lecture 9.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 2.5.Repeated Games Lecture
Game Theory Lecture 8.
Repeated Prisoner’s Dilemma If the Prisoner’s Dilemma is repeated, cooperation can come from strategies including: “Grim Trigger” Strategy – one.
Games People Play. 8: The Prisoners’ Dilemma and repeated games In this section we shall learn How repeated play of a game opens up many new strategic.
Dynamic Games of Complete Information.. Repeated games Best understood class of dynamic games Past play cannot influence feasible actions or payoff functions.
Todd and Steven Divide the Estate Problem Bargaining over 100 pounds of gold Round 1: Todd makes offer of Division. Steven accepts or rejects. Round.
Yale Lectures 21 and Repeated Games: Cooperation vs the End Game.
More on Extensive Form Games. Histories and subhistories A terminal history is a listing of every play in a possible course of the game, all the way to.
Final Lecture. ``Life can only be understood backwards; but it must be lived forwards.” Søren Kierkegaard Thoughts on subgame perfection?
ECON6036 1st semester Format of final exam Same as the mid term
Game Theory: Key Concepts Zero Sum Games Zero Sum Games Non – Zero Sum Games Non – Zero Sum Games Strategic Form Games  Lay out strategies Strategic Form.
APEC 8205: Applied Game Theory Fall 2007
QR 38 3/20/07, More on repeated games in IR I.Folk theorem II.Other solutions to the PD III.Repeated PDs in practice.
TOPIC 6 REPEATED GAMES The same players play the same game G period after period. Before playing in one period they perfectly observe the actions chosen.
PRISONER’S DILEMMA By Ajul Shah, Hiten Morar, Pooja Hindocha, Amish Parekh & Daniel Castellino.
UNIT III: COMPETITIVE STRATEGY Monopoly Oligopoly Strategic Behavior 7/21.
0 MBA 299 – Section Notes 4/25/03 Haas School of Business, UC Berkeley Rawley.
On Bounded Rationality and Computational Complexity Christos Papadimitriou and Mihallis Yannakakis.
Beef or Horse? Cheap talk. Beef or horse? In a restaurant, some customers prefer beef, some prefer horse. Type s likes beef, type t likes horse Waitress.
Problems from Chapter 12. Problem 1, Chapter 12 Find a separating equilibrium Trial-and-error. Two possible separating strategies for Player 1: – Choose.
Social Choice Session 7 Carmen Pasca and John Hey.
Punishment and Forgiveness in Repeated Games. A review of present values.
Dynamic Games of complete information: Backward Induction and Subgame perfection - Repeated Games -
Todd and Steven Divide the Estate Problem Bargaining over 100 pounds of gold Round 1: Todd makes offer of Division. Steven accepts or rejects. Round.
Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham.
Chapters 29, 30 Game Theory A good time to talk about game theory since we have actually seen some types of equilibria last time. Game theory is concerned.
Section 2 – Ec1818 Jeremy Barofsky
Punishment, Detection, and Forgiveness in Repeated Games.
Final Lecture. Problem 2, Chapter 13 Exploring the problem Note that c, x yields the highest total payoff of 7 for each player. Is this a Nash equilibrium?
Mixed Strategies and Repeated Games
Subgames and Credible Threats (with perfect information) Econ 171.
Oligopoly Theory1 Oligopoly Theory (11) Collusion Aim of this lecture (1) To understand the idea of repeated game. (2) To understand the idea of the stability.
Subgames and Credible Threats
Subgames and Credible Threats. Nuclear threat USSR Don’t Invade Hungary 0101 Invade US Give in Bomb USSR
Punishment, Detection, and Forgiveness in Repeated Games.
Subgames and Credible Threats. Russian Tanks Quell Hungarian Revolution of 1956.
Lec 23 Chapter 28 Game Theory.
Entry Deterrence Players Two firms, entrant and incumbent Order of play Entrant decides to enter or stay out. If entrant enters, incumbent decides to fight.
ECO290E: Game Theory Lecture 10 Examples of Dynamic Games.
1 Strategic Thinking Lecture 7: Repeated Strategic Situations Suggested reading: Dixit and Skeath, ch. 9 University of East Anglia School of Economics.
Dynamic Games of Complete Information
Vincent Conitzer CPS Repeated games Vincent Conitzer
Multiagent Systems Repeated Games © Manfred Huber 2018.
Game Theory Fall Mike Shor Topic 5.
Vincent Conitzer Repeated games Vincent Conitzer
Chapter 14 & 15 Repeated Games.
Chapter 14 & 15 Repeated Games.
Molly W. Dahl Georgetown University Econ 101 – Spring 2009
Collaboration in Repeated Games
Game Theory Spring Mike Shor Topic 5.
Vincent Conitzer CPS Repeated games Vincent Conitzer
Presentation transcript:

Infinitely Repeated Games

Finitely Repeated Game Take any game play it, then play it again, for a specified number of times. A single play of the game that is repeated is known as the stage game. Let players observe all previous play. For every history that you have observed, you could have a different response.

Prisoners’ dilemma 3 times How many strategies that vary with other players’s actions on previous move. First move: C or D Second move: C always C if C, D if D D if C, C if D D always Third move: There are 4 possible histories of other guy’s moves. For each move by other guy there are two things you can do on your next move. That gives you 2x2x2x2=16 possible 3d move strategies. Then you have 2 possible first move strategies, 4 possible 2d move strategies and 16 possible third move strategies That is 2x4x16=128 strategies that depend on observed behavior of other guy

Prisoners’ Dilemma R, R S, T T, S P, P T > R > P > S Player 2 Cooperate Defect R, R S, T T, S P, P P LAyER 1 Cooperate Defect T > R > P > S “Temptation” “Reward” “Punishment” “Sucker”

Twice Repeated Prisoners’ Dilemma Two players play two rounds of Prisoners’ dilemma. Before second round, each knows what other did on the first round. Payoff is the sum of earnings on the two rounds.

Two-Stage Prisoners’ Dilemma Working back Player 1 Cooperate Defect Player 2 Cooperate Defect Cooperate Defect Player 1 Player 1 Player 1 Player 1 C C C D D D C D Player 1 Pl. 2 Pl 2 C Pl 2 C Pl 2 D C D D C D C C D D C D C D 2R R+S R+T R+T R+S R+P S+R T+R P+R P+S P+T Etc…etc T+P P+S 2P

Two-Stage Prisoners’ Dilemma Working back further Player 1 Cooperate Defect Player 2 Cooperate Defect Cooperate Defect Player 1 Player 1 Player 1 Player 1 C C C D D D C D Player 1 Pl. 2 Pl 2 C Pl 2 C Pl 2 D C D D C D C C D D C D C D 10 21 21 10 11 10 21 22 11 1 12 11 22 12 1 11 2 12 12 1

Longer Game What is the subgame perfect outcome if Prisoners’ dilemma is repeated 100 times? Work backwards: In last round, nothing you do affects future, so you play the dominant strategy for stage game: defect. Since last round is determined, nothing you do in next to last round affects future, so you play dominant strategy for stage game: defect Work your way back. Only subgame perfect outcome is “Defect always”.

In a repeated game that consists of four repetitions of a stage game that has a unique Nash equilibrium There are four subgame perfect Nash equilibria There are 24=16 subgame perfect Nash equilibria There is only one subgame perfect Nash equilibrium. The number of subgame perfect Nash equilibria varies, depending on the details of the game.

More generally In a subgame perfect equilibrium for a finitely repeated game where the stage game has a unique N.E, the moves in the last stage are determined for each person’s strategy. Given that the moves in the last stage don’t depend on anything that happened before, the Nash equilibrium in previous stage is uniquely determined to be the stage game equilibrium. And so it goes…All the way back to the beginning.

Games without a last round Two kinds of models Games that continue for ever Games that end at a random, unknown time

Infinitely repeated game Wouldn’t make sense to add payoffs. You would be comparing infinities. Usual trick. Discounted sums. Just like in calculating present values. We will see that cooperative outcomes can often be sustained as Nash equilibria in infinitely repeated games.

Why consider infinite games? We only have finite lives. Many games do not have known end time. Just like many human relationships. Simple example—A favorite of game theorists After each time the stage game is played there is some probability d<1 that it will be played again and probability 1-d that play will stop. Expected payoff “discounts” payoffs in later rounds, because game is less likely to last until then.

Cleaning house as a Repeated Prisoners’ Dilemma Maybe a finite game if you have a fixed lease and don’t expect to see roommate again after lease expires. Most relationships don’t have a known last time. Usually some room for “residual good will.”

In a repeated game, after each round of play, a fair coin is tossed In a repeated game, after each round of play, a fair coin is tossed. If it comes up heads, the game continues to another round. If it comes up tails, the game stops. What is the probability that the game is played for at least three rounds? 1/3 2/3 1 /4 1 /2 1/8

Calculating sums In a repeated game, with probability d of continuation after each round, the probability that the game is still going at round k is dk-1 Calculate expected winnings if you receive R so long as the game continues. R+dR+d2R+ d3R+ d4R + ….+ =R(1+d +d2 + d3 + d4 + ….+ ) What is this infinite sum?

Adding forever The series (1+d +d2 + d3 + d4 + ….+ ) Is known as a geometric series. When |d|<1, this series converges. That is, to say, the limit as n approaches infinity of 1+d +d2 + d3 + d4 + ….+ dn exists. Let S= 1+d +d2 + d3 + d4 + ….+ Then dS=d +d2 + d3 + d4 + ….+ And S-dS=1. So S(1-d)=1 S=1/(1-d).

What is the limit as n approaches infinity of 1+3/4+(3/4)2+…+(3/4)n 16

Infinitely repeated prisoners’ dilemma and the “Grim Trigger Strategy” Suppose 2 players play repeated prisoners dilemma, where the probability is d<1 that you will play another round after the end of each round. The grim trigger strategy is to play cooperate on the first round and play cooperate on every round so long as the other doesn’t defect. If the other defects, the grim trigger strategy plays defect on all future rounds.

When is there a symmetric SPNE where all play Grim Trigger? Suppose that the other player is playing Grim Trigger. If you play Grim Trigger as well, then you will cooperate as long as the game continues and and you will receive a payoff of R. Your expected payoff from playing Grim Trigger if the other guy is playing Grim Trigger is therefore R(1+d +d2 + d3 + d4 + ….+ )=R/(1-d)

What if you defect against Grim Trigger If you defect and the other guy is playing Grim Trigger, you will get a payoff of T>R the first time that you defect. But after this, the other guy will always play defect. The best you can do, then is to always defect as well. Your expected payoff from defecting is therefore T+ P(d +d2 + d3 + d4 + ….+ ) =T+Pd/1-d

Cooperate vs Defect If other guy is playing Grim trigger and nobody has yet defected, your expected payoff from playing cooperate is R/(1-d) If other guy is playing Grim trigger and nobody has yet defected, your expected payoff from playing defect is T+Pd/(1-d) Cooperate isR/(1-d) better for you if R/(1-d)>T+Pd/(1-d) which implies d>(T-R)/(T-P) Example If T=10, R=5, P=2, then condition is d>5/8. If d is too small, it pays to “take the money and run”

Other equilbria? Grim trigger is a SPNE if d is large enough. Are there other SPNEs? Yes, for example both play Always Defect is an equilibrium. If other guy is playing Always Defect, what is your best response in any subgame? Another is Play Defect the first 10 rounds, then play Grim Trigger.

Tit for Tat What is both players play the following strategy in infinitely repeated P.D? Cooperate on the first round. Then on any round do what the other guy did on the previous round. Suppose other guy plays tit for tat. If I play tit for tat too, what will happen?

Payoffs If you play tit for tat when other guy is playing tit for tat, you get expected payoff of R(1+d +d2 + d3 + d4 + ….+ )=R/(1-d) Suppose instead that you choose to play “Always defect” when other guy is tit for tat. You will get T+ P(d +d2 + d3 + d4 + ….+ ) =T+Pd/1-d Same comparison as with Grim Trigger. Tit for tat is a better response to tit for tat than always defect if d>(T-R)/(T-P)

Another try Sucker punch him and then get him to forgive you. If other guy is playing tit for tat and you play D on first round, then C ever after, you will get payoff of T on first round, S on second round, and then R for ever. Expected payoff is T+ Sd+d2R(1+d +d2 + d3 + d4 + ….+ )=T+ Sd+d2R/(1-d).

Which is better? Tit for tat and Cheat and ask forgiveness give same payoff from round 3 on. Cheat and ask for forgiveness gives T in round 1 and S in round 2. Tit for tat give R in all rounds. So tit for tat is better if R+dR>T+dS, which means d(R-S)>T-R or d>(T-R)(R-S) If T=10, R=6, and S=1, this would mean if d>4/5. But if T=10, R=5, and S=1, this would be the case only if d>5/4, which can’t happen. In this case, tit for tat could not be a Nash equilibrium.