Game Theory Lecture 8.

Slides:



Advertisements
Similar presentations
Game Theoretic Analysis of Oligopoly y n Y N 0000 Y N The unique dominant strategy Nash Equilibrium is (y,Y) A game of imperfect.
Advertisements

Vincent Conitzer CPS Repeated games Vincent Conitzer
Some Problems from Chapt 13
Infinitely Repeated Games
Crime, Punishment, and Forgiveness
This Segment: Computational game theory Lecture 1: Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie.
Evolution and Repeated Games D. Fudenberg (Harvard) E. Maskin (IAS, Princeton)
Game Theory “Доверяй, Но Проверяй” (“Trust, but Verify”) - Russian Proverb (Ronald Reagan) Topic 5 Repeated Games.
1 Evolution & Economics No Evolutionary Stability in Repeated Games Played by Finite Automata K. Binmore & L. Samuelson J.E.T Automata.
Infinitely Repeated Games. In an infinitely repeated game, the application of subgame perfection is different - after any possible history, the continuation.
Non-Cooperative Game Theory To define a game, you need to know three things: –The set of players –The strategy sets of the players (i.e., the actions they.
Calibrated Learning and Correlated Equilibrium By: Dean Foster and Rakesh Vohra Presented by: Jason Sorensen.
Eponine Lupo.  Questions from last time  3 player games  Games larger than 2x2—rock, paper, scissors  Review/explain Nash Equilibrium  Nash Equilibrium.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 2.5.Repeated Games Lecture
Infinitely Repeated Games Econ 171. Finitely Repeated Game Take any game play it, then play it again, for a specified number of times. The game that is.
LAUREN FRATAMICO Common knowledge of rationality and backward induction.
Repeated games with Costly Observations Eilon Solan, Tel Aviv University Ehud Lehrer Tel Aviv University with.
EC941 - Game Theory Lecture 7 Prof. Francesco Squintani
Game Theory Lecture 9.
Repeated Games and the Prisoner’s Dilemma. Prisoner’s dilemma What if the game is played “repeatedly” for several periods? DefectCooperate Defect10 yr,
Games People Play. 8: The Prisoners’ Dilemma and repeated games In this section we shall learn How repeated play of a game opens up many new strategic.
Equilibrium Concepts in Two Player Games Kevin Byrnes Department of Applied Mathematics & Statistics.
Dynamic Games of Complete Information.. Repeated games Best understood class of dynamic games Past play cannot influence feasible actions or payoff functions.
1 Game Theory Lecture 2 Game Theory Lecture 2. Spieltheorie- Übungen P. Kircher: Dienstag – 09: HS M S. Ludwig: Donnerstag Uhr.
Yale Lectures 21 and Repeated Games: Cooperation vs the End Game.
More on Extensive Form Games. Histories and subhistories A terminal history is a listing of every play in a possible course of the game, all the way to.
ECON6036 1st semester Format of final exam Same as the mid term
Harsanyi transformation Players have private information Each possibility is called a type. Nature chooses a type for each player. Probability distribution.
Game Theory: Key Concepts Zero Sum Games Zero Sum Games Non – Zero Sum Games Non – Zero Sum Games Strategic Form Games  Lay out strategies Strategic Form.
APEC 8205: Applied Game Theory Fall 2007
Nash Q-Learning for General-Sum Stochastic Games Hu & Wellman March 6 th, 2006 CS286r Presented by Ilan Lobel.
Repeated games - example This stage game is played 2 times Any SPNE where players behave differently than in a 1-time game? Player 2 LMR L1, 10, 05, 0.
TOPIC 6 REPEATED GAMES The same players play the same game G period after period. Before playing in one period they perfectly observe the actions chosen.
PRISONER’S DILEMMA By Ajul Shah, Hiten Morar, Pooja Hindocha, Amish Parekh & Daniel Castellino.
UNIT III: COMPETITIVE STRATEGY Monopoly Oligopoly Strategic Behavior 7/21.
On Bounded Rationality and Computational Complexity Christos Papadimitriou and Mihallis Yannakakis.
Game Theoretic Analysis of Oligopoly lr L R 0000 L R 1 22 The Lane Selection Game Rational Play is indicated by the black arrows.
Communication Networks A Second Course Jean Walrand Department of EECS University of California at Berkeley.
© 2009 Institute of Information Management National Chiao Tung University Lecture Notes II-2 Dynamic Games of Complete Information Extensive Form Representation.
1. problem set 6 from Osborne’s Introd. To G.T. p.210 Ex p.234 Ex p.337 Ex. 26,27 from Binmore’s Fun and Games.
1 Game Theory Sequential bargaining and Repeated Games Univ. Prof.dr. M.C.W. Janssen University of Vienna Winter semester Week 46 (November 14-15)
1 Unless otherwise noted, the content of this course material is licensed under a Creative Commons Attribution – Non-Commercial 3.0 License.
Punishment and Forgiveness in Repeated Games. A review of present values.
Dynamic Games of complete information: Backward Induction and Subgame perfection - Repeated Games -
Game-theoretic analysis tools Tuomas Sandholm Professor Computer Science Department Carnegie Mellon University.
Chapters 29, 30 Game Theory A good time to talk about game theory since we have actually seen some types of equilibria last time. Game theory is concerned.
Section 2 – Ec1818 Jeremy Barofsky
Punishment, Detection, and Forgiveness in Repeated Games.
Final Lecture. Problem 2, Chapter 13 Exploring the problem Note that c, x yields the highest total payoff of 7 for each player. Is this a Nash equilibrium?
Mixed Strategies and Repeated Games
Subgames and Credible Threats (with perfect information) Econ 171.
Game Theory (Microeconomic Theory (IV)) Instructor: Yongqin Wang School of Economics, Fudan University December, 2004.
Punishment, Detection, and Forgiveness in Repeated Games.
Extensive Form (Dynamic) Games With Perfect Information (Theory)
Lec 23 Chapter 28 Game Theory.
Entry Deterrence Players Two firms, entrant and incumbent Order of play Entrant decides to enter or stay out. If entrant enters, incumbent decides to fight.
Game theory basics A Game describes situations of strategic interaction, where the payoff for one agent depends on its own actions as well as on the actions.
Yuan Deng Vincent Conitzer Duke University
Dynamic Games of Complete Information
Vincent Conitzer CPS Repeated games Vincent Conitzer
Multiagent Systems Repeated Games © Manfred Huber 2018.
Game Theory Fall Mike Shor Topic 5.
Vincent Conitzer Repeated games Vincent Conitzer
Chapter 14 & 15 Repeated Games.
Chapter 14 & 15 Repeated Games.
Molly W. Dahl Georgetown University Econ 101 – Spring 2009
Collaboration in Repeated Games
Game Theory Spring Mike Shor Topic 5.
Vincent Conitzer CPS Repeated games Vincent Conitzer
Presentation transcript:

Game Theory Lecture 8

problem set 8 from Osborne’s Introd. To G.T. Ex. (426.1), 428.1, 429.1, 430.1, 431.1, (431.2)

Not in a finitely repeated Prisoners’ Dilemma. A reminder Repeated games The grim (trigger) strategy Begin by playing C and do not initiate a deviation from C If the other played D, play D for ever after. i.e. is the pair (grim , grim) a N.E. ?? Is the grim strategy a Nash equilibrium? Not in a finitely repeated Prisoners’ Dilemma. C D 2 , 2 0 , 3 3 , 0 1 , 1 Punishment does not seem to work in the finitely repeated game.

> > > Proof: player 1 … C D …. player 2 ? player 1 … D …. Every Nash equilibrium of the finitely repeated P.D. generates a path along which the players play only D Proof: Consider the last time that any of the players plays C along the Nash Equilibrium path. (assume it is player 1) After that period they both play D player 1 … C D …. player 2 ? > > > He is better off here If he switches to play D: player 1 … D …. player 2 ?

A reminder infinitely An infinitely repeated prisoners’ Dilemma sub-games 1 2 C D D 1 2 1 2 1 2 1 2 1 C D 1 C D

An infinitely repeated game A history at time t is: { a1, a2, ….. at } where ai is a vector of actions taken at time i ai is [C,C] or [DC] etc. A strategy is a function that assigns an action for each history.

An infinitely repeated game The payoff of player 1 following a history { a0, a1, ….. at,...… } is a stream { G1(a0), G1(a1), ….. G1(at)...… }

An infinitely repeated game If the payoff stream of a player is a cycle (of length n): w0,w1,w2,……wn-1,w0,w1,w2,……wn-1, w0,w1,w2,……wn-1, ……… his utility is:

An infinitely repeated game

Not in a finitely repeated Prisoners’ Dilemma. An infinitely repeated game Is the pair (grim , grim) a N.E. ?? Not in a finitely repeated Prisoners’ Dilemma. (grim,grim) is a N.E. in the infinitely repeated P.D. if the discount rate is sufficiently large i.e. if the future is sufficiently important

Assume that player 2 plays ‘grim’: If at some time t player 1 considers deviating from C (for the first time) time … t-2 t-1 t t+1 t+2 player 1 C D player 2 time … t-2 t-1 t t+1 t+2 player 1 C player 2 time … t-2 t-1 t t+1 t+2 player 1 C player 2 D …. ? D …. while if he did not deviate: time … t-2 t-1 t t+1 t+2 player 1 C …. player 2

time … t-2 t-1 t t+1 t+2 player 1 C D …. player 2 time … t-2 t-1 t t+1 2 , 2 0 , 3 3 , 0 1 , 1 The payoffs: time … t-2 t-1 t t+1 t+2 player 1 C D …. player 2 time … t-2 t-1 t t+1 t+2 player 1 C D …. player 2 time … t-2 t-1 t t+1 t+2 player 1 C …. player 2 time … t-2 t-1 t t+1 t+2 player 1 C …. player 2

(grim,grim) is a N.E. if the discount rate is sufficiently large 2 , 2 0 , 3 3 , 0 1 , 1 Player 1 will not deviate if: (grim,grim) is a N.E. if the discount rate is sufficiently large i.e. if the future is sufficiently important

(grim,grim) is not a Sub-game Perfect equilibrium of the game However, (grim,grim) is not a Sub-game Perfect equilibrium of the game Assume player 1 follows the grim strategy, and that in the last period C,D was played player 1 … C player 2 D D …. C D …. D .... Player 1’s (grim) reaction will be: If Player 1 follows grim: but he could do better with :

Strategies as Finite Automata A finite automaton has a finite xxxxnumber of states (+ initial state) Each state is characterized xxxxby an action Input changes the state of the xxxxautomaton

C C,D C D D The grim strategy A state and its action Inputs : The actions of the other player { C,D } The transition: How inputs change the state Initial State

Modified Grim 1 2 3 4 Some more strategies C D C,D C,D C C D D D D D

Some more strategies Tit for Tat C D D C D C

Axelrod’s Tournament (Nice !!!) Tit for Tat Robert Axelrod: The Evolution of Cooperation, 1984 C D Robert Axelrod Tit for Tat (Nice !!!) C C D D C D

Modified Tit for Tat Tit for Tat Can you still bite ??? D C C C D D C ‘simpler’ than D D D C

a strategy that exploits the weakness of Can you still bite ??? D C C C D C D C C D D a modification a strategy that exploits the weakness of C C C,D D D D C

What payoffs are N.E. payoffs of the infinitely repeated P.D. ?? The FEASIBLE payoffs as π1 π2 C D 2 , 2 0 , 3 3 , 0 1 , 1

The folk theorem ???? (R. Aumann, J. Friedman) What payoffs are N.E. payoffs of the infinitely repeated P.D. ?? Clearly, Nash Equilibria payoffs are ≥ (1,1) ???? All feasible payoffs above (1,1) can be obtained as Nash Equilibria payoffs π1 π2 The folk theorem (R. Aumann, J. Friedman) C D 2 , 2 0 , 3 3 , 0 1 , 1 (1,1)

All feasible payoffs above (1,1) can be obtained as Nash Equilibria payoffs Proof: choose a point in this region π1 π2 it can be represented as: C D 2 , 2 0 , 3 3 , 0 1 , 1 (1,1)

The coefficients αi can be approximated by rational numbers Proof: The coefficients αi can be approximated by rational numbers π1 π2 C D 2 , 2 0 , 3 3 , 0 1 , 1 (1,1)

Proof: { { { { π1 π2 If the players follow this cycle, their payoff will be approximately the chosen point when the discount rate is close to 1. C D 2 , 2 0 , 3 3 , 0 1 , 1 (1,1)

Follow the sequence of the cycle as long as the other player does. Proof: { { { { A strategy: Follow the sequence of the cycle as long as the other player does. If not, play D forever. π1 π2 C D 2 , 2 0 , 3 3 , 0 1 , 1 (1,1)

Follow the sequence of the cycle as long as the other player does. { { { { A strategy: Follow the sequence of the cycle as long as the other player does. If not, play D forever. π1 π2 This pair of strategies is a N.E. C D 2 , 2 0 , 3 3 , 0 1 , 1 (1,1)

One Deviation Property and Agent Equilibria A player cannot increase his payoff in a sub-game in which he is the first to move, by changing his action in that node only. A Theorem A strategy profile is a sub-game perfect equilibrium in an extensive game with perfect information iff both strategies have the one deviation property (no proof)