Zero-sum Games The Essentials of a Game Extensive Game Matrix Game Dominant Strategies Prudent Strategies Solving the Zero-sum Game The Minimax Theorem.

Slides:



Advertisements
Similar presentations
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 3.1.Dynamic Games of Complete but Imperfect Information Lecture
Advertisements

This Segment: Computational game theory Lecture 1: Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie.
Mixed Strategies CMPT 882 Computational Game Theory Simon Fraser University Spring 2010 Instructor: Oliver Schulte.
APPENDIX An Alternative View of the Payoff Matrix n Assume total maximum profits of all oligopolists is constant at 200 units. n Alternative policies.
Chapter Twenty-Eight Game Theory. u Game theory models strategic behavior by agents who understand that their actions affect the actions of other agents.
Simultaneous- Move Games with Mixed Strategies Zero-sum Games.
Two-Player Zero-Sum Games
Operations Research Assistant Professor Dr. Sana’a Wafa Al-Sayegh 2 nd Semester ITGD4207 University of Palestine.
1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang.
MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)
Copyright (c) 2003 Brooks/Cole, a division of Thomson Learning, Inc
Study Group Randomized Algorithms 21 st June 03. Topics Covered Game Tree Evaluation –its expected run time is better than the worst- case complexity.
Game theory.
EC941 - Game Theory Lecture 7 Prof. Francesco Squintani
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Zero-Sum Games (follow-up)
Part 3: The Minimax Theorem
Game-theoretic analysis tools Necessary for building nonmanipulable automated negotiation systems.
Extensive-form games. Extensive-form games with perfect information Player 1 Player 2 Player 1 2, 45, 33, 2 1, 00, 5 Players do not move simultaneously.
Basics on Game Theory For Industrial Economics (According to Shy’s Plan)
Chapter Twenty-Eight Game Theory. u Game theory models strategic behavior by agents who understand that their actions affect the actions of other agents.
1 Introduction APEC 8205: Applied Game Theory. 2 Objectives Distinguishing Characteristics of a Game Common Elements of a Game Distinction Between Cooperative.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Review Midterm3/19 3/12.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
UNIT III: MONOPOLY & OLIGOPOLY Monopoly Oligopoly Strategic Competition 7/20.
UNIT III: COMPETITIVE STRATEGY Monopoly Oligopoly Strategic Behavior 7/21.
Extensive Game with Imperfect Information Part I: Strategy and Nash equilibrium.
Games of Chance Introduction to Artificial Intelligence COS302 Michael L. Littman Fall 2001.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Review Midterm3/23 3/9.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
UNIT III: COMPETITIVE STRATEGY
Game Theory.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
Game Theory Statistics 802. Lecture Agenda Overview of games 2 player games representations 2 player zero-sum games Render/Stair/Hanna text CD QM for.
Experts Learning and The Minimax Theorem for Zero-Sum Games Maria Florina Balcan December 8th 2011.
Minimax strategies, Nash equilibria, correlated equilibria Vincent Conitzer
1. problem set 6 from Osborne’s Introd. To G.T. p.210 Ex p.234 Ex p.337 Ex. 26,27 from Binmore’s Fun and Games.
Game Theory.
Reading Osborne, Chapters 5, 6, 7.1., 7.2, 7.7 Learning outcomes
UNIT III: MONOPOLY & OLIGOPOLY Monopoly Oligopoly Strategic Competition 7/28.
Chapter 9 Games with Imperfect Information Bayesian Games.
ECO290E: Game Theory Lecture 12 Static Games of Incomplete Information.
Agents that can play multi-player games. Recall: Single-player, fully-observable, deterministic game agents An agent that plays Peg Solitaire involves.
Standard and Extended Form Games A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor, SIUC.
The Design & Analysis of the Algorithms Lecture by me M. Sakalli Download two pdf files..
Games with Imperfect Information Bayesian Games. Complete versus Incomplete Information So far we have assumed that players hold the correct belief about.
Game-theoretic analysis tools Tuomas Sandholm Professor Computer Science Department Carnegie Mellon University.
Chapter 11 Game Theory Math Game Theory What is it? – a way to model conflict and competition – one or more "players" make simultaneous decisions.
Dominance Since Player I is maximizing her security level, she prefers “large” payoffs. If one row is smaller (element- wise) than another,
Extensive Games with Imperfect Information
Part 3 Linear Programming
1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.
1. 2 You should know by now… u The security level of a strategy for a player is the minimum payoff regardless of what strategy his opponent uses. u A.
1 a1a1 A1A1 a2a2 a3a3 A2A Mixed Strategies When there is no saddle point: We’ll think of playing the game repeatedly. We continue to assume that.
Statistics Overview of games 2 player games representations 2 player zero-sum games Render/Stair/Hanna text CD QM for Windows software Modeling.
5.1.Static Games of Incomplete Information
MiniMax Principle in Game Theory Slides Made by Senjuti Basu Roy.
Lec 23 Chapter 28 Game Theory.
By: Donté Howell Game Theory in Sports. What is Game Theory? It is a tool used to analyze strategic behavior and trying to maximize his/her payoff of.
Introduction to Game Theory
Introduction to Game Theory
Simultaneous-Move Games: Mixed Strategies
PENGANTAR INTELIJENSIA BUATAN (64A614)
CPS Extensive-form games
Lecture 20 Linear Program Duality
Vincent Conitzer Extensive-form games Vincent Conitzer
CPS 173 Extensive-form games
Molly W. Dahl Georgetown University Econ 101 – Spring 2009
UNIT II: The Basic Theory
Lecture Game Theory.
Presentation transcript:

Zero-sum Games The Essentials of a Game Extensive Game Matrix Game Dominant Strategies Prudent Strategies Solving the Zero-sum Game The Minimax Theorem

The Essentials of a Game 1. Players: We require at least 2 players (Players choose actions and receive payoffs.) 2. Actions: Player i chooses from a finite set of actions, S = {s 1,s 2,…..,s n }. Player j chooses from a finite set of actions T = {t 1,t 2,……,t m }. 3. Payoffs: We define P i (s,t) as the payoff to player i, if Player i chooses s and player j chooses t. We require that P i (s,t) + P j (s,t) = 0 for all combinations of s and t. 4. Information: What players know (believe) when choosing actions. ZERO-SUM

The Essentials of a Game 4. Information: What players know (believe) when choosing actions. Perfect Information: Players know their own payoffs other player(s) payoffs the history of the game, including other(s) current action* *Actions are sequential (e.g., chess, tic-tac-toe). Common Knowledge

Extensive Game Player 1 chooses a = {1, 2 or 3} Player 2 b = {1 or 2} Player 1 c = {1, 2 or 3} Payoffs = a 2 + b 2 + c 2 if /4 leaves remainder of 0 or 1. -(a 2 + b 2 + c 2 ) if /4 leaves remainder of 2 or 3. Player1’s decision nodes GAME 1. “Square the Diagonal” (Rapoport: 48-9) Player 2’s decision nodes

Extensive Game How should the game be played? Solution: a set of “advisable” strategies, one for each player. Strategy: a complete plan of action for every possible decision node of the game, including nodes that could only be reached by a mistake at an earlier node. Player1‘s advisable Strategy in red GAME Start at the final decision nodes (in red) Backwards-induction

Extensive Game How should the game be played? Solution: a set of “advisable” strategies, one for each player. Strategy: a complete plan of action for every possible decision node of the game, including nodes that could only be reached by a mistake at an earlier node. Player1‘s advisable Strategy in red GAME 1. Player2’s advisable strategy in green Player1’s advisable strategy in red

Extensive Game How should the game be played? If both player’s choose their advisable (prudent) strategies, Player1 will start with 2, Player2 will choose 1, then Player1 will choose 2. The outcome will be 9 for Player1 (-9 for Player2). If a player makes a mistake, or deviates, her payoff will be less GAME

Extensive Game A Clarification: Rapoport (pp ) claims Player 1 has 27 strategies. However, if we consider inconsistent strategies, the actual number of strategies available to Player 1 is 3 7 = An inconsistent strategy includes actions at decision nodes that would not be reached by correct implementation at earlier nodes, i.e., could only be reached by mistake. Since we can think of a strategy as a set of instructions (or program) given to an agent or referee (or machine) to implement, a complete strategy must include instructions for what to do after a mistake is made. This greatly expands the number of strategies available, though the essence of Rapoport’s analysis is correct.

Extensive Game Complete Information: Players know their own payoffs; other player(s) payoffs; history of the game excluding other(s) current action* *Actions are simultaneous GAME Information Sets

Matrix Game T1T2 T1T2 Also called “Normal Form” or “Strategic Game” Solution = {S 22, T 1 } S 11 S 12 S 13 S 21 S 22 S 23 S 31 S 32 S 33

Dominant Strategies Definition Dominant Strategy: a strategy that is best no matter what the opponent(s) choose(s). T 1 T 2 T S1S2S3S1S2S3 S1S2S3S1S2S3

Dominant Strategies Definition Dominant Strategy: a strategy that is best no matter what the opponent(s) choose(s). T 1 T 2 T 3 Sure Thing Principle: If you have a dominant strategy, use it! S1S2S3S1S2S3 S1S2S3S1S2S3

Prudent Strategies T 1 T 2 T 3 Player 1’s worst payoffs for each strategy are in red S1S2S3S1S2S3 Definitions Prudent Strategy: A prudent strategy for player i maximizes the minimum payoff she can get from playing different strategies. Such a strategy is simply max s min t P(s,t) for player i.

Prudent Strategies T 1 T 2 T 3 Player 2’s worst payoffs for each strategy are in green S1S2S3S1S2S3 Definitions Prudent Strategy: A prudent strategy for player i maximizes the minimum payoff she can get from playing different strategies. Such a strategy is simply max s min t P(s,t) for player i.

Prudent Strategies T 1 T 2 T S1S2S3S1S2S3 Definitions Prudent Strategy: A prudent strategy for player i maximizes the minimum payoff she can get from playing different strategies. Such a strategy is simply max s min t P(s,t) for player i. Saddlepoint: A set of prudent strategies (one for each player), s. t. (s’, t’) is a saddlepoint, iff maxmin = minmax. We call the solution {S 2, T 1 } a saddlepoint

Prudent Strategies S1S2S3S1S2S3 Saddlepoint: A set of prudent strategies (one for each player), s. t. (s’, t’) is a saddlepoint, iff maxmin = minmax.

Mixed Strategies Left Right L R L R Player 1 Player 2 GAME 2: Button-Button Player 1 hides a button in his Left or Right hand. Player 2 observes Player 1’s choice and then picks either Left or Right. Draw the game in matrix form.

Mixed Strategies Left Right L R L R Player 1 Player 2 GAME 2: Button-Button Player 1 has 2 strategies; Player 2 has 4 strategies: LRLR LL RR LR RL

Mixed Strategies Left Right L R L R Player 1 Player 2 GAME 2: Button-Button The game can be solve by backwards-induction. Player 2 will … LRLR LL RR LR RL

Mixed Strategies Left Right L R L R Player 1 Player 2 GAME 2: Button-Button The game can be solve by backwards-induction. … therefore, Player 1 will: LRLR LL RR LR RL

Mixed Strategies Left Right L R L R Player 1 Player L R L R GAME 2: Button-Button What would happen if Player 2 cannot observe Player 1’s choice?

Solving the Zero-sum Game GAME Definition Mixed Strategy: A mixed strategy for player i is a probability distribution over all strategies available to player i. Let (p, 1-p) = prob. Player I chooses L, R. (q, 1-q) = prob. Player 2 chooses L, R. L R LRLR

Solving the Zero-sum Game GAME Then Player 1’s expected payoffs are: EP(L) = -2(p) + 2(1-p) = 2 – 4p EP(R) = 4(p) – 1(1-p) = 5p – 1 L R LRLR (p) (1-p) (q) (1-q) 01 p EP(L) = 2 – 4p EP(R) = 5p – 1 EP p*=1/

Solving the Zero-sum Game GAME Player 2’s expected payoffs are: EP(L) = 2(q) – 4(1-q) = 6q – 4 EP(R) = -2(q) + 1(1-q) = -3q + 1 EP(L) = EP(R) => q* = 5/9 L R LRLR (p) (1-p) (q) (1-q)

Solving the Zero-sum Game Player 1 EP(L) = -2(p) + 2(1-p) = 2 – 4p EP(R) = 4(p) – 1(1-p) = 5p – 1 0 p 1 q -EP 2 p*=1/ /3 = EP 1 * = - EP 2 * =-2/3 This is the Value of the game. EP q*= 5/9 Player 2 EP(L) = 2(q) – 4(1-q) = 6q – 4 EP(R) = -2(q) + 1(1-q) = -3q + 1

Solving the Zero-sum Game GAME Then Player 1’s expected payoffs are: EP(T 1 ) = -2(p) + 2(1-p) EP(T 2 ) = 4(p) – 1(1-p) EP(T 1 ) = EP(T 2 ) => p* = 1/3 And Player 2’s expected payoffs are: (V)alue = 2/3 L R LRLR (p) (1-p) (q) (1-q) (Security) Value: the expected payoff when both (all) players play prudent strategies. Any deviation by an opponent leads to an equal or greater payoff.

The Minimax Theorem Von Neumann (1928) Every zero sum game has a saddlepoint (in pure or mixed strategies), s.t., there exists a unique value, i.e., an outcome of the game where maxmin = minmax.