Matrix Games Mahesh Arumugam Borzoo Bonakdarpour Ali Ebnenasir CSE 960: Selected Topics in Algorithms and Complexity Instructor: Dr. Torng.

Slides:



Advertisements
Similar presentations
Thursday, March 7 Duality 2 – The dual problem, in general – illustrating duality with 2-person 0-sum game theory Handouts: Lecture Notes.
Advertisements

The Basics of Game Theory
Lecture 13. Poker Two basic concepts: Poker is a game of skill, not luck. If you want to win at poker, – make sure you are very skilled at the game, and.
Introduction to Game Theory
GAME THEORY.
Name: Trương Hoài Anh Facebook: Quasar Hoaianh
Totally Unimodular Matrices
Game Theory S-1.
15 THEORY OF GAMES CHAPTER.
Two-Player Zero-Sum Games
Operations Research Assistant Professor Dr. Sana’a Wafa Al-Sayegh 2 nd Semester ITGD4207 University of Palestine.
1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang.
MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)
Copyright (c) 2003 Brooks/Cole, a division of Thomson Learning, Inc
Study Group Randomized Algorithms 21 st June 03. Topics Covered Game Tree Evaluation –its expected run time is better than the worst- case complexity.
Game theory.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Zero-Sum Games (follow-up)
Part 3: The Minimax Theorem
Minimax and Alpha-Beta Reduction Borrows from Spring 2006 CS 440 Lecture Slides.
An Introduction to Game Theory Part I: Strategic Games
Chapter 14 Game Theory to accompany Operations Research: Applications and Algorithms 4th edition by Wayne L. Winston Copyright (c) 2004 Brooks/Cole, a.
Duality Lecture 10: Feb 9. Min-Max theorems In bipartite graph, Maximum matching = Minimum Vertex Cover In every graph, Maximum Flow = Minimum Cut Both.
Artificial Intelligence for Games and Puzzles1 Games in the real world Many real-world situations and problems.
An Introduction to Game Theory Part III: Strictly Competitive Games Bernhard Nebel.
Lectures in Microeconomics-Charles W. Upton Minimax Strategies.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
Games of Chance Introduction to Artificial Intelligence COS302 Michael L. Littman Fall 2001.
Finite Mathematics & Its Applications, 10/e by Goldstein/Schneider/SiegelCopyright © 2010 Pearson Education, Inc. 1 of 68 Chapter 9 The Theory of Games.
Artificial Intelligence for Games and Puzzles1 Games in the real world Many real-world situations and.
Game Theory Objectives:
Game Theory.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
Minimax Strategies. Everyone who has studied a game like poker knows the importance of mixing strategies. –With a bad hand, you often fold –But you must.
Minimax strategies, Nash equilibria, correlated equilibria Vincent Conitzer
Brian Duddy.  Two players, X and Y, are playing a card game- goal is to find optimal strategy for X  X has red ace (A), black ace (A), and red two (2)
To accompany Quantitative Analysis for Management,9e by Render/Stair/Hanna M4-1 © 2006 by Prentice Hall, Inc. Upper Saddle River, NJ Module 4 Game.
Game Theory Part 2: Zero Sum Games. Zero Sum Games The following matrix defines a zero-sum game. Notice the sum of the payoffs to each player, at every.
The Design & Analysis of the Algorithms Lecture by me M. Sakalli Download two pdf files..
Chapter 14 Game Theory to accompany Operations Research: Applications and Algorithms 4th edition by Wayne L. Winston Copyright (c) 2004 Brooks/Cole, a.
1 CRP 834: Decision Analysis Week Four Notes. 2 Review Game Theory –Classification and Description –Two Person Zero-Sum Game Game w/ Dominating Strategy.
Introduction to Game Theory application to networks Joy Ghosh CSE 716, 25 th April, 2003.
Chapter 11 Game Theory Math Game Theory What is it? – a way to model conflict and competition – one or more "players" make simultaneous decisions.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Lecture 2: two-person non.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Introduction Giovanni Neglia.
1 1 Slide © 2006 Thomson South-Western. All Rights Reserved. Slides prepared by JOHN LOUCKS St. Edward’s University.
Part 3 Linear Programming
1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.
Lecture 12. Game theory So far we discussed: roulette and blackjack Roulette: – Outcomes completely independent and random – Very little strategy (even.
Hon Wai Leong, NUS (CS6234, Spring 2009) Page 1 Copyright © 2009 by Leong Hon Wai CS6234: Lecture 4  Linear Programming  LP and Simplex Algorithm [PS82]-Ch2.
1. 2 Some details on the Simplex Method approach 2x2 games 2xn and mx2 games Recall: First try pure strategies. If there are no saddle points use mixed.
1. 2 You should know by now… u The security level of a strategy for a player is the minimum payoff regardless of what strategy his opponent uses. u A.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 a1a1 A1A1 a2a2 a3a3 A2A Mixed Strategies When there is no saddle point: We’ll think of playing the game repeatedly. We continue to assume that.
Zero-sum Games The Essentials of a Game Extensive Game Matrix Game Dominant Strategies Prudent Strategies Solving the Zero-sum Game The Minimax Theorem.
INDEX Introduction of game theory Introduction of game theory Significance of game theory Significance of game theory Essential features of game theory.
GAME THEORY Day 5. Minimax and Maximin Step 1. Write down the minimum entry in each row. Which one is the largest? Maximin Step 2. Write down the maximum.
9.2 Mixed Strategy Games In this section, we look at non-strictly determined games. For these type of games the payoff matrix has no saddle points.
Chapter 5 Adversarial Search. 5.1 Games Why Study Game Playing? Games allow us to experiment with easier versions of real-world situations Hostile agents.
1 Fahiem Bacchus, University of Toronto CSC384: Intro to Artificial Intelligence Game Tree Search I ● Readings ■ Chapter 6.1, 6.2, 6.3, 6.6 ● Announcements:
Game Theory [geym theer-ee] : a mathematical theory that deals with the general features of competitive situations in a formal abstract way.
Contents of the Talk Preliminary Materials Motivation and Contribution
Lecture 13.
The Duality Theorem Primal P: Maximize
Game Theory Just last week:
Tools for Decision Analysis: Analysis of Risky Decisions
PENGANTAR INTELIJENSIA BUATAN (64A614)
Lecture 12.
Game Theory.
Lecture 20 Linear Program Duality
Presentation transcript:

Matrix Games Mahesh Arumugam Borzoo Bonakdarpour Ali Ebnenasir CSE 960: Selected Topics in Algorithms and Complexity Instructor: Dr. Torng

2 Outline Basic concepts Problem statement LP Formulation of Matrix Games Minimax Theorem Gambling Bluffing and Underbidding

3 Basic Concepts Game: A description of strategic interaction between rationale parties based on a set of rules Rules: Constraints on the set of actions that each party can take and the players’ interest Finite Game: Set of actions of each player is finite Two-Player Game: There exist only two players [OR94] Osborne and Rubinstein, A Course in Game Theory, MIT press, 1994.

4 Example: The Game of Morra Rule: –Each player hides one or two francs, and –Tries to guess how many francs the other player has hidden Payoff: –If only one player guesses correctly he wins the total amount of hidden money –Otherwise, the result is a draw

5 The Game of Morra: Pure Strategies Possible courses of action for each player –Hide one, guess one  [1, 1] –Hide one, guess two  [1, 2] –Hide two, guess one  [2, 1] –Hide two, guess two  [2, 2] Pure strategy: a course of action –Denoted [x,y]; i.e., hide x, guess y

6 The Game of Morra: Payoff Matrix [1,1] [1,2] [2,1] [2,2] [1,1][1,2][2,1][2,2] AB x i – probability that row i is selected by row player y j – relative frequency with which column j is selected by column player – X and Y are stochastic vectors y 1 y 2 y 3 y 4 y = [] x1x2x3x4x1x2x3x4 x =

7 The Game of Morra - Cont’d A only plays [1,2] or [2,1] with probability 0.5 B plays –[1,1], [1,2], [2,1], [2,2] in c 1, c 2, c 3, c 4 rounds c 1 + c 2 +c 3 +c 4 = N, where N is total number of rounds Record of the game –In c 1 /2 rounds, A played [1,2] and B played [1,1]: A losing 2 francs –In c 1 /2 rounds, A played [2,1] and B played [1,1]: A winning 3 francs –In c 4 /2 rounds, A played [1,2] and B played [2,2]: A winning 3 francs –In c 4 /2 rounds, A played [2,1] and B played [2,2]: A losing 4 francs –Other rounds, result in a draw Total winning of A : ( c 1 – c 4 )/2 francs What if the roles of A and B are swapped?

8 Basic Concepts - Cont’d Round: a course of actions in which each player moves once Payoff: the value gained by a player in a round The Payoff Matrix defines a game for two players Zero-sum game: The sum of the average payoffs of the two players is 0 a 11 a ij ……. a mn Possible moves of the row player 12i..m12i..m Possible moves of the column player 1 2 … j … n The resulting payoff of the row player

9 Problem Statement Given the payoff matrix A = [a ij ], –identify a mixture of moves of the row player where the average payoff per round is optimal no matter what moves the column player takes

10 LP Formulation of Matrix Games x i – probability that row i is selected by row player y j – relative frequency with which column j is selected by column player – X and Y are stochastic vectors Average payoff to the row player in each round or

11 LP Formulation of Matrix Games - Cont’d If row player adopts the strategy specified by stochastic vector x, he is assured to win = The objective is to maximize this payoff s.t., or

12 LP Formulation of Matrix Games - Cont’d What is the dual of this problem? What does this problem formalize? Column player’s optimal strategy and the value he is assured to win if he adopts such a strategy! s.t., P D

13 Minimax Theorem For every m  n matrix A there is a stochastic row vector x* of length m and a stochastic column vector y* of length n such that min x*Ay = max xAy* with the minimum taken over all stochastic column vectors y of length n and maximum taken over all stochastic row vectors x of length m. Value of game In a game, v = min x*Ay = max xAy* is called the value of that game. What are the implications of this theorem?

14 Ready for Gambling?!! As long as a player adopts an optimal strategy, the player can reveal it to the opponent Example: (The Game of Morra) –column player announces his/her guess –row player announces his/her guess either independent of the opponent or adjust his/her guess based on the extra information –Additional pure strategies for row player Hide 1, make the same guess  [1, S] Hide 1, make a different guess  [1, D] Hide 2, make the same guess  [2, S] Hide 2, make a different guess  [2, D]

15 Gambling: Payoff Matrix and LP Solution Consider the optimal solution x=[0, 56/99, 40/99, 0, 0, 2/99, 0, 1/99] y=[28/99, 30/99, 21/99, 20/99] Game value = 4/99 -row player is assured to win at least this amount on the average -column player is assured to lose no more than this amount on the average Do you think this game is fair? What does this suggest? [1,1] [1,2] [2,1] [2,2] Revealing the guess does not hurt the prospects for the column player!!

16 How about Bluffing or Underbidding? Are bluffing or underbidding rational strategies? Example: (Game invented by H. W. Kuhn) –2 players, deck of cards numbered 1, 2, or 3 –Each player bets or passes in every round –Play terminates when Bet is answered by bet; payoff 2 to player holding higher card Pass is answered by pass; payoff 1 to player holding higher card Bet is answered by pass; payoff 1 to the player who bets

17 Bluffing, Underbidding: Pure Strategies A’s strategies 1.Pass; if B bets, pass again 2.Pass; if B bets, bet again 3.Bet 3x3x3 pure strategies x 1 x 2 x 3 – strategy for A instructing him to follow line x j when holding j B’s strategies 1.Pass no matter what A did 2.If A passes, pass; if A bets, bet 3.If A passes, bet; if A bets, pass 4.Bet no matter what A did 4x4x4 pure strategies y 1 y 2 y 3 – strategy for B Payoff matrix size: 27x64! Holding 1: A – refrain line 2; B – refrain lines 2 and 4; Holding 3: A – refrain line 1; B – refrain lines 1, 2 and 3; Holding 2: choose to pass in the first round; lines 1 or 2 Payoff matrix size: 8x4!

18 Bluffing, Underbidding: Payoff Matrix and LP Solution /6 -1/ /6 -1/3 -1/ /6 -1/6 1/6 1/ / / /6 -1/3 0 -1/ /6 -1/6 -1/6 -1/ /2 1/3 -1/ /3 1/6 -1/6 Consider the optimal solution A: [1/3, 0, 0, 1/2, 1/6, 0, 0, 0] B: [2/3, 0, 0, 1/3] Game Value = -1/18 Holding 1:BLUFF A is allowed to bet 1/6 th times! B is allowed to bet 1/3 rd times! A is allowed to pass 1/2 times! Holding 3:UNDERBID

Thank U!

20 LP Formulation of Matrix Games: Identity (15.1) min y xAy = min j  i m a ij x i –It is trivial that min y xAy <= min j  i m a ij x i –Now, we show min y xAy >= min j  i m a ij x i –Let t = min j  i m a ij x i, thus we have xAy =  j n y j ( i m a ij x i ) >=  j n y j t = t