Mixed Strategies CMPT 882 Computational Game Theory Simon Fraser University Spring 2010 Instructor: Oliver Schulte.

Slides:



Advertisements
Similar presentations
Chapter 17: Making Complex Decisions April 1, 2004.
Advertisements

Mixed Strategies.
Variants of Nash Equilibrium CMPT 882 Computational Game Theory Simon Fraser University Spring 2010 Instructor: Oliver Schulte 1.
Vincent Conitzer CPS Repeated games Vincent Conitzer
Nash’s Theorem Theorem (Nash, 1951): Every finite game (finite number of players, finite number of pure strategies) has at least one mixed-strategy Nash.
Game Theory Assignment For all of these games, P1 chooses between the columns, and P2 chooses between the rows.
This Segment: Computational game theory Lecture 1: Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie.
Totally Unimodular Matrices
C&O 355 Mathematical Programming Fall 2010 Lecture 12 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A.
6.896: Topics in Algorithmic Game Theory Lecture 11 Constantinos Daskalakis.
Introduction to Game theory Presented by: George Fortetsanakis.
Bayesian Games Yasuhiro Kirihata University of Illinois at Chicago.
ECO290E: Game Theory Lecture 5 Mixed Strategy Equilibrium.
Chapter 6 Game Theory © 2006 Thomson Learning/South-Western.
MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)
EC3224 Autumn Lecture #04 Mixed-Strategy Equilibrium
Study Group Randomized Algorithms 21 st June 03. Topics Covered Game Tree Evaluation –its expected run time is better than the worst- case complexity.
Repeated games with Costly Observations Eilon Solan, Tel Aviv University Ehud Lehrer Tel Aviv University with.
EC941 - Game Theory Lecture 7 Prof. Francesco Squintani
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Zero-Sum Games (follow-up)
Network Theory and Dynamic Systems Game Theory: Mixed Strategies
Game-theoretic analysis tools Necessary for building nonmanipulable automated negotiation systems.
Dynamic Games of Complete Information.. Repeated games Best understood class of dynamic games Past play cannot influence feasible actions or payoff functions.
Chapter 6 © 2006 Thomson Learning/South-Western Game Theory.
by Vincent Conitzer of Duke
Non-cooperative Game Theory Notes by Alberto Bressan.
An Introduction to Game Theory Part II: Mixed and Correlated Strategies Bernhard Nebel.
Harsanyi transformation Players have private information Each possibility is called a type. Nature chooses a type for each player. Probability distribution.
Duality Lecture 10: Feb 9. Min-Max theorems In bipartite graph, Maximum matching = Minimum Vertex Cover In every graph, Maximum Flow = Minimum Cut Both.
1 Computing Nash Equilibrium Presenter: Yishay Mansour.
An Introduction to Game Theory Part III: Strictly Competitive Games Bernhard Nebel.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
Quantal Response Equilibrium APEC 8205: Applied Game Theory Fall 2007.
Nash Equilibrium - definition A mixed-strategy profile σ * is a Nash equilibrium (NE) if for every player i we have u i (σ * i, σ * -i ) ≥ u i (s i, σ.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
Experts Learning and The Minimax Theorem for Zero-Sum Games Maria Florina Balcan December 8th 2011.
Minimax strategies, Nash equilibria, correlated equilibria Vincent Conitzer
1. problem set 6 from Osborne’s Introd. To G.T. p.210 Ex p.234 Ex p.337 Ex. 26,27 from Binmore’s Fun and Games.
Mechanism Design without Money Lecture 2 1. Let’s start another way… Everyone choose a number between zero and a hundred, and write it on the piece of.
Assoc. Prof. Yeşim Kuştepeli1 GAME THEORY AND APPLICATIONS UNCERTAINTY AND EXPECTED UTILITY.
6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 11.
Standard and Extended Form Games A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor, SIUC.
Worlds with many intelligent agents An important consideration in AI, as well as games, distributed systems and networking, economics, sociology, political.
Game-theoretic analysis tools Tuomas Sandholm Professor Computer Science Department Carnegie Mellon University.
Game theory & Linear Programming Steve Gu Mar 28, 2008.
1 Topics covered in the course Games in extensive and strategic (normal form) Games in extensive and strategic (normal form) Backwards induction (zermelo’s.
Mixed Strategies and Repeated Games
1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.
Shall we play a game? Game Theory and Computer Science Game Theory /06/05 - Zero-sum games - General-sum games.
1 a1a1 A1A1 a2a2 a3a3 A2A Mixed Strategies When there is no saddle point: We’ll think of playing the game repeatedly. We continue to assume that.
Zero-sum Games The Essentials of a Game Extensive Game Matrix Game Dominant Strategies Prudent Strategies Solving the Zero-sum Game The Minimax Theorem.
5.1.Static Games of Incomplete Information
MiniMax Principle in Game Theory Slides Made by Senjuti Basu Roy.
9.2 Mixed Strategy Games In this section, we look at non-strictly determined games. For these type of games the payoff matrix has no saddle points.
By: Donté Howell Game Theory in Sports. What is Game Theory? It is a tool used to analyze strategic behavior and trying to maximize his/her payoff of.
Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.
Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.
Hypothesis testing and statistical decision theory
Game Theory M.Pajhouh Niya M.Ghotbi
Vincent Conitzer CPS Repeated games Vincent Conitzer
LECTURE 2 MIXED STRATEGY GAME
Multiagent Systems Game Theory © Manfred Huber 2018.
Vincent Conitzer Normal-form games Vincent Conitzer
Multiagent Systems Repeated Games © Manfred Huber 2018.
Lecture 20 Linear Program Duality
MinMax Theorem John von Neumann
Instructor: Vincent Conitzer
Normal Form (Matrix) Games
Vincent Conitzer CPS Repeated games Vincent Conitzer
Presentation transcript:

Mixed Strategies CMPT 882 Computational Game Theory Simon Fraser University Spring 2010 Instructor: Oliver Schulte

Motivation Some games like Rock, Paper, Scissor don’t have a Nash equilibrium as defined so far. Intuitively, the reason is that there is no steady state where players have perfect knowledge of each other’s actions: knowing exactly what the other player will do allows me to achieve an optimal payoff at their expense. Von Neumann and Morgenstern observed that this changes if we allow players to be unpredictable by choosing randomized strategies.

Definition Consider a 2-player game (A 1,A 2,u 1,u 2 ). The members of A i are the pure strategies for player i. A mixed strategy is a probability distribution over pure strategies. Or, if we have k pure strategies, a mixed strategy is a k-dimensional vector whose nonnegative entries sum to 1. Note: this is the notation from the text. More usual are Greek letters for mixed strategies, e.g. σ.

Example: Matching Pennies Heads: 1/2Tails: 1/2 Heads: 1/31,-1-1,1 Tails: 2/3-1,11,-1

Mixed Strategies ctd. Suppose that each player chooses a mixed strategy s i. The expected utility of pure strategy a 1 is given by EU 1 (a 1,s 2 ) = Σ k u 1 (a 1,a k ) x s 2 (a k ). where a j ranges over the strategies of player 2. The expected utility of mixed strategy s 1 is the expectation over the pure strategies: EU 1 (s 1,s 2 ) = Σ j EU 1 (a j,s 2 ) x s 1 (a j ) = Σ j Σ k u 1 (a j,a k ) x s 1 (a j ) x s 2 (a k ). We also write u 1 (s 1,s 2 ) for EU 1 (s 1,s 2 ). Then we have in effect a new game whose strategy sets are the sets of mixed strategies, and whose utility functions are the expected payoffs.

Nash equilibrium in mixed strategies The definition of best response is as before: a mixed strategy s 1 is a best response to s 2 if and only if there is no other mixed strategy s’ 1 with u 1 (s’ 1,s 2 ) > u 1 (s 1,s 2 ). Similarly, two mixed strategies (s 1,s 2 ) are a Nash equilibrium if each is a best response to the other.

Examples and Exercises Find a mixed Nash equilibrium for the following games. – Matching Pennies. – Coordination Game. – Prisoner’s Dilemma. – Battle of the Sexes or Chicken. Can you find all the Nash equilibria?

Visualization of Equilibria In a 2x2 game, we can graph player 2’s utility as a function of p, the probability that 1 chooses strategy 1. Similarly for player 1’s utility.

Strategies and Topology Consider mixed strategy profiles as k- dimensional vectors, where k is the total number of pure strategies for each player. This set is convex and compact (why?). convexcompact

Visualization of Matching Pennies

Visualization of M.P. equilibrium

Computation of Equlibria Definition The support of a probability measure p is the set of all points x s.t. p(x)>0. Proposition. A mixed strategy pair (s 1,s 2 ) is an N.E. if and only if for all pure strategies a i in the support of s i, the strategy s i is a best reply so s -i. Corollary. If (s 1,s 2 ) is an N.E. and a 1,b 1 are in the support of s 1, then u 1 (a 1,s 2 ) = u 1 (b 1,s 2 ). That is, player 1 is indifferent between a 1 and b 1. Ditto for a 2,b 2 in the support of s 2.

A general NP procedure for finding a Nash Equilibrium 1.Choose support for player 1, support for player 2. 2.Check if there is a Nash equilibrium with those supports.

Existence of Nash Equilibrium Theorem (Nash 1950). In any finite game (any number of players) there exists a Nash equilibrium. Short proof by Nash.proof by Nash

Existence Proof (1) Transform a given pair of mixed strategies (s 1,s 2 ) as follows. For each pure strategy a 1 of player 1, set c(a 1 ) := max(0,[u 1 (a 1,s 2 ) – u 1 (s 1,s 2 )]). s’ 1 (a 1 ) := [s 1 (a 1 ) + c(a 1 )]/ [1+ Σ b c(b)]. Ditto for player 2. This defines a continuous operator f(s 1,s 2 ) = (s’ 1,s’ 2 ) on the space of mixed strategy pairs. A strategy a is a best reply if and only if c(a) = 0. So if (s 1,s 2 ) is an N.E., then c(a) = 0 for all a, so f(s 1,s 2 ) = (s 1,s 2 ).

Existence Proof (2) The generalized Brouwer fixed point theorem states that if K is convex and compact, and f: K  K is continuous, then f has a fixed point f(x) = x. If (s 1,s 2 ) is a fixed point of the mapping on the previous slide, then (s 1,s 2 ) is an N.E. Proof outline: Some action a 1 with s 1 (a) > 0 must be a best reply against any s 2. Therefore c(a 1 ) = 0. Since we have a fixed point, 1+ Σ b c(b) = 1. This implies that c(b) = 0 for each b.

Illustration in Excel Illustrate construction in Excel. Note that the update operation can be viewed as a local computation method, and even as a learning method!

Maxmin and Minmax Consider a 2-player game. A maxmin strategy for player 1 solves max s 1 min s 2 u 1 (s 1,s 2 ). Ditto for player 2. Interpretation: Conservatively choose strategy against worst-case adversary. The value max s 1 min s 2 u 1 (s 1,s 2 ) is called the security level of player 1. A minmax strategy for player 1 solves min s 1 max s 2 u 2 (s 1,s 2 ). Ditto for player 2. Interpreation: Punish the other player by minimizing the best payoff she can get.

N.E. in Zero-Sum Games: The Minimax Theorem (von Neumann 1928).. Consider a 2-player zero-sum game. 1.s i is a maxmin strategy if and only if s i is a minmax strategy for i = 1,2. 2.For both players, the maxmin value = minmax value. 3.If s 1,s 2 are each maxmin (minmax) strategies, then (s 1,s 2 ) is a Nash equilibrium. 4.If (s 1,s 2 ) is an N.E., then s 1 and s 2 are maxmin (minmax) strategies.

Interpretation of mixed N.E.

N.E. in Population Games