3.2.2. Dynamic Games of complete information: Backward Induction and Subgame perfection - Repeated Games -

Slides:



Advertisements
Similar presentations
Some Problems from Chapt 13
Advertisements

Infinitely Repeated Games
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 3.1.Dynamic Games of Complete but Imperfect Information Lecture
Crime, Punishment, and Forgiveness
3. Basic Topics in Game Theory. Strategic Behavior in Business and Econ Outline 3.1 What is a Game ? The elements of a Game The Rules of the.
Game Theory “Доверяй, Но Проверяй” - Russian Proverb (Trust, but Verify) - Ronald Reagan Mike Shor Lecture 6.
Game Theory “Доверяй, Но Проверяй” (“Trust, but Verify”) - Russian Proverb (Ronald Reagan) Topic 5 Repeated Games.
Games With No Pure Strategy Nash Equilibrium Player 2 Player
Chapter Twenty-Eight Game Theory. u Game theory models strategic behavior by agents who understand that their actions affect the actions of other agents.
Infinitely Repeated Games. In an infinitely repeated game, the application of subgame perfection is different - after any possible history, the continuation.
Non-Cooperative Game Theory To define a game, you need to know three things: –The set of players –The strategy sets of the players (i.e., the actions they.
6-1 LECTURE 6: MULTIAGENT INTERACTIONS An Introduction to MultiAgent Systems
Chapter 14 Infinite Horizon 1.Markov Games 2.Markov Solutions 3.Infinite Horizon Repeated Games 4.Trigger Strategy Solutions 5.Investing in Strategic Capital.
The basics of Game Theory Understanding strategic behaviour.
Chapter 6 Game Theory © 2006 Thomson Learning/South-Western.
Chapter 6 Game Theory © 2006 Thomson Learning/South-Western.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 2.5.Repeated Games Lecture
Infinitely Repeated Games Econ 171. Finitely Repeated Game Take any game play it, then play it again, for a specified number of times. The game that is.
EC941 - Game Theory Lecture 7 Prof. Francesco Squintani
Game Theory: Inside Oligopoly
Game Theory. Games Oligopolist Play ▫Each oligopolist realizes both that its profit depends on what its competitor does and that its competitor’s profit.
Industrial Organization - Matilde Machado Tacit Collusion Tacit Collusion Matilde Machado.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 3.1.Dynamic Games of Complete but Imperfect Information Lecture
Chapter 11 Game Theory and the Tools of Strategic Business Analysis.
Repeated Prisoner’s Dilemma If the Prisoner’s Dilemma is repeated, cooperation can come from strategies including: “Grim Trigger” Strategy – one.
Games People Play. 8: The Prisoners’ Dilemma and repeated games In this section we shall learn How repeated play of a game opens up many new strategic.
Dynamic Games of Complete Information.. Repeated games Best understood class of dynamic games Past play cannot influence feasible actions or payoff functions.
A camper awakens to the growl of a hungry bear and sees his friend putting on a pair of running shoes, “You can’t outrun a bear,” scoffs the camper. His.
Chapter 12 Choices Involving Strategy McGraw-Hill/Irwin Copyright © 2008 by The McGraw-Hill Companies, Inc. All Rights Reserved.
Yale Lectures 21 and Repeated Games: Cooperation vs the End Game.
© 2008 Pearson Addison Wesley. All rights reserved Chapter Fourteen Game Theory.
APEC 8205: Applied Game Theory Fall 2007
QR 38 3/20/07, More on repeated games in IR I.Folk theorem II.Other solutions to the PD III.Repeated PDs in practice.
Repeated games - example This stage game is played 2 times Any SPNE where players behave differently than in a 1-time game? Player 2 LMR L1, 10, 05, 0.
TOPIC 6 REPEATED GAMES The same players play the same game G period after period. Before playing in one period they perfectly observe the actions chosen.
Game Theoretic Analysis of Oligopoly lr L R 0000 L R 1 22 The Lane Selection Game Rational Play is indicated by the black arrows.
Communication Networks A Second Course Jean Walrand Department of EECS University of California at Berkeley.
Social Choice Session 7 Carmen Pasca and John Hey.
Chapter 12 Choices Involving Strategy Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written.
1 Unless otherwise noted, the content of this course material is licensed under a Creative Commons Attribution – Non-Commercial 3.0 License.
Punishment and Forgiveness in Repeated Games. A review of present values.
Standard and Extended Form Games A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor, SIUC.
Dynamic Games & The Extensive Form
Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham.
Chapters 29, 30 Game Theory A good time to talk about game theory since we have actually seen some types of equilibria last time. Game theory is concerned.
Public Policy Analysis MPA 404 Lecture 24. Previous Lecture Graphical Analysis of Tariff and Quota Game Theory; The prisoner's Dilemma.
Lecture 5 Leadership and Reputation Reputations arise in situations where there is an element of repetition, and also where coordination between players.
3.1.4 Types of Games. Strategic Behavior in Business and Econ Outline 3.1. What is a Game ? The elements of a Game The Rules of the Game:
제 10 장 게임이론 Game Theory: Inside Oligopoly
Section 2 – Ec1818 Jeremy Barofsky
Final Lecture. Problem 2, Chapter 13 Exploring the problem Note that c, x yields the highest total payoff of 7 for each player. Is this a Nash equilibrium?
1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.
Strategic Behavior in Business and Econ Static Games of complete information: Dominant Strategies and Nash Equilibrium in pure and mixed strategies.
Subgames and Credible Threats (with perfect information) Econ 171.
Lecture 6 Reputation Reputations arise in situations where there is an element of repetition, and also where coordination between players is possible.
Lec 23 Chapter 28 Game Theory.
Entry Deterrence Players Two firms, entrant and incumbent Order of play Entrant decides to enter or stay out. If entrant enters, incumbent decides to fight.
By: Donté Howell Game Theory in Sports. What is Game Theory? It is a tool used to analyze strategic behavior and trying to maximize his/her payoff of.
ECO290E: Game Theory Lecture 10 Examples of Dynamic Games.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 2.1.Dynamic Games of Complete and Perfect Information Lecture
Dynamic Games of complete information: Backward Induction and Subgame perfection.
Yuan Deng Vincent Conitzer Duke University
Chapter 29 Game Theory Key Concept: Nash equilibrium and Subgame Perfect Nash equilibrium (SPNE)
Multiagent Systems Repeated Games © Manfred Huber 2018.
Game Theory Fall Mike Shor Topic 5.
Chapter 14 & 15 Repeated Games.
Chapter 14 & 15 Repeated Games.
Molly W. Dahl Georgetown University Econ 101 – Spring 2009
Lecture 10 Coordination and Reputation
Game Theory Spring Mike Shor Topic 5.
Presentation transcript:

Dynamic Games of complete information: Backward Induction and Subgame perfection - Repeated Games -

Strategic Behavior in Business and Econ Outline 3.1. What is a Game ? The elements of a Game The Rules of the Game: Example Examples of Game Situations Types of Games 3.2. Solution Concepts Static Games of complete information: Dominant Strategies and Nash Equilibrium in pure and mixed strategies Dynamic Games of complete information: Backward Induction and Subgame perfection

Strategic Behavior in Business and Econ Repeated Games A Repeated Game is a special case of a dynamic (sequential moves) game that consists of a (usually) static game being played several times, one after the other The game that is repeated is called the “stage game” The (stage) game can be played a given number of times (known to all the players) or an indefinite number of times.

Strategic Behavior in Business and Econ Repeated Games Thus, we can have a: Finitely Repeated Game. When the stage game is player a number T of rounds (1, 2, 3,..., T). T is known to all the players Infinitely Repeated Game When either After each round the game continues to the next round with probability p and ends with probability (1 – p) The game is played forever but at each round the value of the payoffs decreases by a factor of “p”

Strategic Behavior in Business and Econ Finitely Repeated Games Recall the Prisoners' Dilemma A “generic” version of the game is represented in the table Below Where, C stands for “cooperate” D stands for “defect”

Strategic Behavior in Business and Econ Finitely Repeated Games We saw that both players defecting is the unique equilibrium of the game. In fact, D is a Dominant Strategy for each player The apparently paradoxical behavior is that, although both Players would mutually benefit from Cooperation, self Interests leads to the worse outcome by Defecting

Strategic Behavior in Business and Econ Finitely Repeated Games We saw that both players defecting is the unique equilibrium of the game. In fact, D is a Dominant Strategy for each player Repeating the game opens interesting possibilities To “punish” egoistic (defect) behaviors To “reward” the right (cooperative) behavior

Strategic Behavior in Business and Econ Finitely Repeated Games We saw that both players defecting is the unique equilibrium of the game. In fact, D is a Dominant Strategy for each player Examples: “Stick and Carrot Strategies” (Trigger Strategies) 1. I will start with cooperation, and will mimic your behavior afterwards 2. I will start with cooperation and will keep doing so unless you defect. In such case I will defect forever

Strategic Behavior in Business and Econ The 2-times Repeated Prisoners' Dilemma The tree representation of the 2-times Repeated Prisoners' Dilemma is shown in the next slide: Notice: The “dotted” lines representing the simultaneous choice in each stage of the game The payoffs at the end of the game correspond to the sum of the payoffs in each stage Try to imagine the tree in a 3-times Repeated Prisoners' Dilemma

Strategic Behavior in Business and Econ C C C C C C C C C C C C C C C D D D D D D D D D D D D D D D 6, 6 3, 8 8, 3 4, 4 3, 8 0, 10 5, 5 1, 6 8, 3 5, 5 10, 0 6, 1 4, 4 1, 6 6, 1 2, 2 Stage 1Stage 2

Strategic Behavior in Business and Econ The 2-times Repeated Prisoners' Dilemma The game must be solved by Backward Induction using the Subgame Perfection technique (since there are “linked” nodes that indicate that the game is of Imperfect Information) Notice that his is always the case when we repeat a static game We must, therefore, “solve” each of the 4 “subgames” in the second stage of the game and then move backwards

Strategic Behavior in Business and Econ C C C C C C C C C C C C C C C D D D D D D D D D D D D D D D 6, 6 3, 8 8, 3 4, 4 3, 8 0, 10 5, 5 1, 6 8, 3 5, 5 10, 0 6, 1 4, 4 1, 6 6, 1 2, 2 Stage 1Stage 2 Subgame 1 Subgame 2 Subgame 3 Subgame 4

Strategic Behavior in Business and Econ Subgame 1Subgame 2 Subgame 4Subgame 3

Strategic Behavior in Business and Econ The 2-times Repeated Prisoners' Dilemma Notice that the solution in each subgame is always the same: Player 1: Defect Player 2:Defect And, again, Defect is a Dominant Strategy for each player in each subgame This is not a coincidence (as we will see shortly) Thus, proceeding backwards in the tree we get...

Strategic Behavior in Business and Econ C C C C C C C C C C C C C C C D D D D D D D D D D D D D D D 6, 6 3, 8 8, 3 4, 4 3, 8 0, 10 5, 5 1, 6 8, 3 5, 5 10, 0 6, 1 4, 4 1, 6 6, 1 2, 2 Stage 1Stage 2 Subgame 1 Subgame 2 Subgame 3 Subgame 4

Strategic Behavior in Business and Econ C C C D D D 4, 4 1, 6 6, 1 2, 2 Stage 1Stage 2 Again, what remains after we move backwards in the three is another “simultaneous move” game, the one that corresponds to the first stage of the game. We must “solve” in looking at the table representation

Strategic Behavior in Business and Econ C C C D D D 4, 4 1, 6 6, 1 2, 2 Stage 1Stage 2

Strategic Behavior in Business and Econ C C C D D D 4, 4 1, 6 6, 1 2, 2 Stage 1Stage 2 Thus, knowing what will be the outcome in the second stage of the game...

Strategic Behavior in Business and Econ C C C D D D 4, 4 1, 6 6, 1 2, 2 Stage 1Stage 2 Both players will also defect in the first round. (It's again a Dominant Strategy !)

Strategic Behavior in Business and Econ The T-times Repeated Prisoners' Dilemma Stage 1 (D, D) T=1 Stage 1 (D, D) T=2 Stage 2 (D, D) Stage 1 (D, D) T=3 Stage 2 (D, D) Stage 3 (D, D) Stage 1 (D, D) Any T Stage 2 (D, D) Stage 3 (D, D) Stage T (D, D) · ·

Strategic Behavior in Business and Econ The T-times Repeated Prisoners' Dilemma No matter how many times the Prisoners' Dilemma is repeated, the equilibrium is always the same: Defect in every round. Why don't punishments (rewards) work ? Intuition: At the last repetition, since the players know that there will not be a “new chance” (no punishment-reward is possible), the best thing to do is to Defect Knowing that, in the next-to-last round players know that in the next round the opponent will not cooperate. Then, why should I cooperate today if tomorrow my opponent is going to defect ? Again, the best thing to do is to Defect We can apply this argument “backwards” to conclude that the best thing to do is to Defect all the time.

Strategic Behavior in Business and Econ Finitely Repeated Games: General Facts Any Finitely Repeated Game that consists of the repetition of a stage game that is a static game produces a “Dynamic Game of Imperfect Information The tree representing such game is (usually) very large It should be solved by Backward Induction The following statements are always true in such games: If the stage game has a unique Nash Equilibrium, then the Finitely Repeated Game has unique Subgame Perfect Equilibrium consisting of the repetition of that Nash Equilibrium in every round of the game If the stage game has more than one Nash Equilibria, then any “reasonable” combination of those equilibria is a Subgame Perfect Equilibrium of the Finitely Repeated Game

Strategic Behavior in Business and Econ Infinitely Repeated Games We have seen that both players defecting is the unique subgame perfect equilibrium of the Finitely Repeated Prisoners' Dilemma Trigger Strategies do not lead to cooperation 1. I will start with cooperation, and will mimic your behavior afterwards (Tit-for-Tat) 2. I will start with cooperation and will keep doing so unless you defect. In such case I will defect forever (Grimm Trigger)

Strategic Behavior in Business and Econ There are two possible interpretations of games that are repeated but not a fixed number of rounds After each round the game continues to the next round with probability p and ends with probability (1 – p) Example: Two firms compete day after day, but there is certain probability that one of them goes bankrupt and then the game is over The game is played forever (an indefinite number of times) but at each round the value of the payoffs decreases by “p” Example: Two people negotiate with offers and counteroffers over an item. As time goes by, the item loses value. The game is over when they reach an agreement Infinitely Repeated Games

Strategic Behavior in Business and Econ The two different interpretations of a Infinitely Repeated Game are technically equivalent. Since there is no “last round”, there is no possibility of thinking backwards. This opens real opportunities to achieve cooperation in the Prisoners' Dilemma ! In general, Infinitely Repeated Games are very complex Infinitely Repeated Games

Strategic Behavior in Business and Econ Let x be any positive number (for instance, money) and p any positive number smaller than 1 (for instance, a probability). Then, x·p + x·p 2 + x·p 3 + x·p 4 + · · · = x x·p 2 + x·p 3 + x·p 4 + x·p 5 · · · = x and so on... Mathematical aside (infinite sums) p (1 - p) p2p2

Strategic Behavior in Business and Econ The Infinitely Repeated Prisoners' Dilemma How cooperation can be sustained in equilibrium ? Payoff Computation: Imagine that Player 2 plays Grimm Trigger What is the (expected) payoff for Player 1 if after each round the game continues with probability p (and ends with probability (1-p)) If Player 1 plays “cooperate” all the time Stage 1Stage 2Stage 3 Stage 4 · · · 33·p + 0·(1-p)3·p 2 + 0·(1-p 2 ) · · · ·= 3 +3p + 3p 2 + 3p 3 + · · · If Player 1 plays “defect” all the time Stage 1Stage 2Stage 3 Stage 4 · · · 51·p + 0·(1-p)1·p 2 + 0·(1-p 2 ) · · · ·= 5 + 1p + 1p 2 + 1p 3 + · · ·

Strategic Behavior in Business and Econ The Infinitely Repeated Prisoners' Dilemma How cooperation can be sustained in equilibrium ? Payoff Computation: Imagine that Player 2 plays Grimm Trigger What is the payoff for Player 1 if after each round the value of the money decreases by a factor of p (for instance, if the money decreases a 10% then p=0.9) Playing “cooperate” all the time Stage 1Stage 2Stage 3 Stage 4 · · · 33·p 3·p 2 · · · ·= 3 + 3p + 3p 2 + · · · 3 3·(0.9)3·(0.9)·(0.9)= 3 + 3·(0.9) + 3·(0.9) 2 + · ·.= * · · · Playing “defect” all the time Stage 1Stage 2Stage 3 Stage 4 · · · 51·p 1·p 2 · · · ·= 5 + 1p + 1p 2 + · · · 5 1·(0.9)1·(0.9)·(0.9)= 5 + 1·(0.9) + 1·(0.9) 2 + · · = · · ··

Strategic Behavior in Business and Econ The Infinitely Repeated Prisoners' Dilemma How cooperation can be sustained in equilibrium ? Imagine that Player 2 plays Grimm Trigger What is the best for Player 1, Cooperate or Defect ? Expected Payoff from “cooperate” all the time Stage 1Stage 2Stage 3 Stage 4 · · · 33·p + 0·(1-p)3·p 2 + 0·(1-p 2 ) · · · ·= 3 + 3p + 3p 2 + · · · Expected Payoff from “defect” all the time Stage 1Stage 2Stage 3 Stage 4 · · · 51·p + 0·(1-p)1·p 2 + 0·(1-p 2 ) · · · ·= 5 + 1p + 1p 2 + · · ·

Strategic Behavior in Business and Econ The Infinitely Repeated Prisoners' Dilemma How cooperation can be sustained in equilibrium ? Imagine that Player 2 plays Grimm Trigger What is the best for Player 1, Cooperate or Defect ? Expected Payoff from “cooperate” all the time E(Cooperate) = 3 + 3p + 3p 2 + 3p 3 + · · · = Expected Payoff from “defect” all the time E(Defect) = 5 + 1p + 1p 2 + 1p 3 + · · · = p (1 - p) p

Strategic Behavior in Business and Econ The Infinitely Repeated Prisoners' Dilemma How cooperation can be sustained in equilibrium ? Imagine that Player 2 plays Grimm Trigger What is the best for Player 1, Cooperate or Defect ? E(Cooperate) = E(Defect) = Cooperate will be better if E(Cooperate) > E(Defect), that is, if > p > ½ p (1 - p) p p p

Strategic Behavior in Business and Econ The Infinitely Repeated Prisoners' Dilemma How cooperation can be sustained in equilibrium ? Thus, cooperation can be sustained in equilibrium in the Infinitely Repeated Prisoners' Dilemma thanks to Trigger Strategies Depends on “p” With Tit-for-Tat it is also possible to sustain cooperation, but then p > 2/3 But, “cooperation” is not the unique equilibrium. There are equilibria with “defection” as well

Strategic Behavior in Business and Econ Infinitely Repeated Games: General Facts Any Infinitely Repeated Game that consists of the repetition of a stage game that is a static game produces a “Dynamic Game of Imperfect Information” The tree representing such game is (usually) very large It can not be solved by Backward Induction The following statement is always true in such games: No matter how many equilibria the stage game has, any “reasonable” combination of strategies can be a Subgame Perfect Equilibrium in such games (Folk Theorem)

Strategic Behavior in Business and Econ Infinitely Repeated Games: General Facts What does “reasonable” mean ? Notice that by playing Defect all the time any player can guarantee himself a payoff of at least 1 per each round. Thus, any “reasonable” outcome of the game should pay each player at least 1 per round

Strategic Behavior in Business and Econ Summary Any Repeated Game that consists of the repetition of a stage game that is a static game produces a “Dynamic Game of Imperfect Information” The tree representing such game is very large If the game is Finitely Repeated, it must be solved by Backward Induction If the game is Infinitely Repeated, it can not be solved by Backward Induction

Strategic Behavior in Business and Econ The following statements are always true in Finitely Repeated Games:: If the stage game has a unique Nash Equilibrium, then the Finitely Repeated Game has unique Subgame Perfect Equilibrium consisting of the repetition of that Nash Equilibrium in every round of the game If the stage game has more than one Nash Equilibria, then any “reasonable” combination of those equilibria is a Subgame Perfect Equilibrium of the Finitely Repeated Game The following statement is always true in Infinitely Repeated Games: No matter how many equilibria the stage game has, any “reasonable” combination of strategies can be a Subgame Perfect Equilibrium in such games (Folk Theorem) Summary

Strategic Behavior in Business and Econ Axelrod's Simulation R. Axelrod, The Evolution of Cooperation Prisoner’s Dilemma repeated 200 times Economists submitted strategies Pairs of strategies competed Winner: Tit-for-Tat Reasons: Forgiving, Nice, Clear

Strategic Behavior in Business and Econ Not necessarily tit-for-tat Doesn’t always work Don’t be envious Don’t be the first to cheat Reciprocate opponent’s behavior Cooperation and defection Don’t be too clever To be credible, incorporate a clear policy of punishment Lessons from Axelrod’s Simulation