Normal Form (Matrix) Games

Slides:



Advertisements
Similar presentations
Chapter 17: Making Complex Decisions April 1, 2004.
Advertisements

Vincent Conitzer CPS Repeated games Vincent Conitzer
Game Theory Assignment For all of these games, P1 chooses between the columns, and P2 chooses between the rows.
Continuation Methods for Structured Games Ben Blum Christian Shelton Daphne Koller Stanford University.
This Segment: Computational game theory Lecture 1: Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie.
Mixed Strategies CMPT 882 Computational Game Theory Simon Fraser University Spring 2010 Instructor: Oliver Schulte.
Computation of Nash Equilibrium Jugal Garg Georgios Piliouras.
Joint Strategy Fictitious Play Sherwin Doroudi. “Adapted” from J. R. Marden, G. Arslan, J. S. Shamma, “Joint strategy fictitious play with inertia for.
Calibrated Learning and Correlated Equilibrium By: Dean Foster and Rakesh Vohra Presented by: Jason Sorensen.
 1. Introduction to game theory and its solutions.  2. Relate Cryptography with game theory problem by introducing an example.  3. Open questions and.
Nash equilibria in Electricity Markets: A comparison of different approaches Seminar in Electric Power Networks, 12/5/12 Magdalena Klemun Master Student,
Regret Minimization and the Price of Total Anarchy Paper by A. Blum, M. Hajiaghayi, K. Ligett, A.Roth Presented by Michael Wunder.
Algorithms and Economics of Networks Abraham Flaxman and Vahab Mirrokni, Microsoft Research.
Network Theory and Dynamic Systems Game Theory: Mixed Strategies
Part 3: The Minimax Theorem
Game-theoretic analysis tools Necessary for building nonmanipulable automated negotiation systems.
ECO290E: Game Theory Lecture 4 Applications in Industrial Organization.
Non-cooperative Game Theory Notes by Alberto Bressan.
Complexity Results about Nash Equilibria
Temporal Action-Graph Games: A New Representation for Dynamic Games Albert Xin Jiang University of British Columbia Kevin Leyton-Brown University of British.
An Introduction to Game Theory Part II: Mixed and Correlated Strategies Bernhard Nebel.
Lecture 1 - Introduction 1.  Introduction to Game Theory  Basic Game Theory Examples  Strategic Games  More Game Theory Examples  Equilibrium  Mixed.
1 Computing Nash Equilibrium Presenter: Yishay Mansour.
An Introduction to Game Theory Part III: Strictly Competitive Games Bernhard Nebel.
1 Algorithms for Computing Approximate Nash Equilibria Vangelis Markakis Athens University of Economics and Business.
CS121 Heuristic Search Planning CSPs Adversarial Search Probabilistic Reasoning Probabilistic Belief Learning.
Game Theory.
Simple search methods for finding a Nash equilibrium Ryan Porter, Eugene Nudelman, and Yoav Shoham Games and Economic Behavior, Vol. 63, Issue 2. pp ,
Minimax strategies, Nash equilibria, correlated equilibria Vincent Conitzer
EC941 - Game Theory Prof. Francesco Squintani Lecture 2 1.
MAKING COMPLEX DEClSlONS
Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie Mellon University.
林偉楷 Taiwan Evolutionary Intelligence Laboratory.
Standard and Extended Form Games A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor, SIUC.
Game-theoretic analysis tools Tuomas Sandholm Professor Computer Science Department Carnegie Mellon University.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Lecture 2: two-person non.
Ásbjörn H Kristbjörnsson1 The complexity of Finding Nash Equilibria Ásbjörn H Kristbjörnsson Algorithms, Logic and Complexity.
Price of Anarchy Georgios Piliouras. Games (i.e. Multi-Body Interactions) Interacting entities Pursuing their own goals Lack of centralized control Prediction?
1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.
Algorithms for solving two-player normal form games
1 Algorithms for Computing Approximate Nash Equilibria Vangelis Markakis Athens University of Economics and Business.
MAIN RESULT: We assume utility exhibits strategic complementarities. We show: Membership in larger k-core implies higher actions in equilibrium Higher.
Day 9 GAME THEORY. 3 Solution Methods for Non-Zero Sum Games Dominant Strategy Iterated Dominant Strategy Nash Equilibrium NON- ZERO SUM GAMES HOW TO.
Game theory basics A Game describes situations of strategic interaction, where the payoff for one agent depends on its own actions as well as on the actions.
Chapter 28 Game Theory.
Microeconomics Course E
Nash Equilibrium: P or NP?
Chapter 7. Classification and Prediction
Non-additive Security Games
Lecture 6: Other Game Models and Solution Concepts
The minimum cost flow problem
Communication Complexity as a Lower Bound for Learning in Games
Vincent Conitzer CPS Repeated games Vincent Conitzer
Structured Models for Multi-Agent Interactions
Game Theory in Wireless and Communication Networks: Theory, Models, and Applications Lecture 2 Bayesian Games Zhu Han, Dusit Niyato, Walid Saad, Tamer.
Multiagent Systems Extensive Form Games © Manfred Huber 2018.
Multiagent Systems Game Theory © Manfred Huber 2018.
Rational Decisions and
Vincent Conitzer Normal-form games Vincent Conitzer
Presented By Aaron Roth
Computing Nash Equilibrium
Multiagent Systems Repeated Games © Manfred Huber 2018.
Enumerating All Nash Equilibria for Two-person Extensive Games
Vincent Conitzer Repeated games Vincent Conitzer
9.3 Linear programming and 2 x 2 games : A geometric approach
CS 416 Artificial Intelligence
Computing a Nash Equilibrium of a Congestion Game: PLS-completeness
A Technique for Reducing Normal Form Games to Compute a Nash Equilibrium Vincent Conitzer and Tuomas Sandholm Carnegie Mellon University, Computer Science.
Vincent Conitzer CPS Repeated games Vincent Conitzer
Presentation transcript:

Normal Form (Matrix) Games Multiagent Systems Solution Concepts for Normal Form (Matrix) Games © Manfred Huber 2018

Solution Concepts Solution concepts define “interesting” outcomes in a multiagent game, e.g.: Pareto-optimal outcomes Pure-strategy Nash equilibrium Mixed-strategy Nash equilibrium Maxmin strategy profile Minmax strategy profile Important questions Do they exist ? How can they be computed ? © Manfred Huber 2018

Zero-Sum Games Zero-Sum games are games where all the sum of the payoffs for all agents for all strategies is equal to 0. Any game in which a transformation of the utility function of the form u’ = k*u + l, k>0, l≥0 leads to a zero-sum game can itself be considered a zer-sum game. a 2-payer zero-sum game is completely antagonistic u1 = -u2 TCP backoff game -1/-1 , -4/0, 0/-4, -3/-3 © Manfred Huber 2018

Nash Equilibria in 2-Player Zero-Sum Games In 2-player zero-sum games Nash equilibria are minmax and maxmin strategies Solution can be found by searching through the possible sets of support for mixed strategies Exponential complexity Maxmin solution can be formulated as a linear program Maxmin for agent 1 is equal to Minmax for agent 2 © Manfred Huber 2018

Linear Programming Linear programs are optimization problems under linear, closed inequality constraints xi are the variables to solve for © Manfred Huber 2018

2-Player Zero-Sum Game In 2-player zero-sum games, maxmin and minmax can be expressed as linear programs maxmin for agent 1 or minmax for agent 2: © Manfred Huber 2018

2-Player Zero-Sum Game minmax for agent 1 or maxmin for agent 2: © Manfred Huber 2018

Maxmin Strategies for General Games maxmin strategies can be computed for general sum games maxmin strategy determines the agent’s strategy only based on its utility function Computation by re-designing the game Construct a zero-sum game G’ by maintaining the agent’s payoffs and changing the other agent’s payoff function The maxmin solution for the agent in the modified game G’ is the same as for the original game maxmin strategy in a general sum game is not necessarily a Nash equilibrium © Manfred Huber 2018

Nash Equilibria for 2-Player Non Zero-Sum Games For non zero-sum games, linear programming does no longer work because there is no single objective function The complexity of computing Nash equilibria in general sum games is unknown. It is assumed that it is worst case exponential 2-Player general sum games can be fomulated as Linear Complementarity Problems (LCP) LCP is a constraint satisfaction problem without an objective function Formulation considers both players’ utility functions © Manfred Huber 2018

LCP Formulation for 2-Player Non Zero-Sum Games LCP combines the constraints of both agents and adds complementarity constraints © Manfred Huber 2018

Nash Equilibria for 2-Player Non Zero-Sum Games LCP for 2-player games can be solved, e.g., using the Lemke-Howson algorithm Moves along the edges of a labeled graph in strategy space Worst-case exponential Other solution is to search the space of supports Determine which sets of actions could yield mixed Nash equilibria Compute the corresponding probabilities and verify that it is an equilibrium © Manfred Huber 2018

Domination and Strategy Removal Strategy profiles can be related si strictly dominates si’ for player i if si weakly dominates si’ for player i if si very weakly dominates si’ for player i if TCP backoff game -1/-1 , -4/0, 0/-4, -3/-3 © Manfred Huber 2018

Domination and Nash Equilibria A dominant strategy is a strategy that dominates all others A strategy profile consisting of dominant strategies for all players must be a Nash equilibrium © Manfred Huber 2018

Iterated Removal of Dominated Strategies No equilibrium can be strictly dominated by another strategy All strictly dominated strategies can be removed while still maintaining the solution to the game Iterated removal of dominated strategies repeats the removal process until no further dominated strategies are available © Manfred Huber 2018

Iterated Removal of Dominated Strategies R is dominated by L for player 2 M is dominated by [0.5:U; 0.5:D] for player 1 L C R U 3, 1 0, 1 0, 0 M 1, 1 5, 0 D 4, 1 L C R U 3, 1 0, 1 0, 0 M 1, 1 5, 0 D 4, 1 L C R U 3, 1 0, 1 0, 0 M 1, 1 5, 0 D 4, 1 © Manfred Huber 2018

Iterated Removal of Dominated Strategies Iterated removal preserves Nash equilibria Strict dominance preserves all equilibria Weak or very weak dominance preserves at least one equilibrium Removal order can influence which equilibria are preserved Iterated removal can be used as a preprocessing step for Nash equilibrium calculation Game: 90,90 0,0 0,40 / 400,40 0,0 180,180 0,40 © Manfred Huber 2018

Correlated Equilibria In particular in coordination games the Nash equilibrium does not achieve a very good performance Equilibrium results in strategies that often end up in low payoff outcomes What could solve this ? go wait -100, -100 10, 0 0, 10 -10, -10 © Manfred Huber 2018

Correlated Equilibria A solution would be to coordinate the random picks by the players Has to be outside the control and insight of each player or they could change their strategy Central random variable with a common, known distribution and a private signal to each of the players Signal is correlated to other signals but does not determine the other players’ signals Game: 90,90 0,0 0,40 / 400,40 0,0 180,180 0,40 © Manfred Huber 2018

Correlated Equilibria A correlated equilibrium is a tuple (v, π, σ) v is a vector of random variables with domains D π is the joint distribution of v σ is a vector of mappings from D to actions in A for every mapping σ’ : Game: 90,90 0,0 0,40 / 400,40 0,0 180,180 0,40 © Manfred Huber 2018

Correlated Equilibria For every Nash equilibrium there exists a corresponding correlated equilibrium. If the mapping is replaced by the decoupled probabilistic choices and the mapping is reduced to the action choice indicated by the domain, the correlated equilibrium reduces to a Nash equilibrium Not every correlated equilibrium is a Nash equilibrium Game: 90,90 0,0 0,40 / 400,40 0,0 180,180 0,40 © Manfred Huber 2018

Computing Correlated Equilibria Linear programming constraints for CE Objective function: e.g. social-welfare Game: 90,90 0,0 0,40 / 400,40 0,0 180,180 0,40 Not necessarily a Nash Equilibrium © Manfred Huber 2018

Computing Correlated Equilibria CE are easier to compute than Nash Equilibria Only one randomization in CE and product of independent probabilities in NE Constraints for NE: Game: 90,90 0,0 0,40 / 400,40 0,0 180,180 0,40 This is a nonlinear constraint – No linear program © Manfred Huber 2018

Computing Nash Equilibria No algorithm is known to compute Nash equilibria for n-player general sum games in polynomial time A number of iterated algorithms provide good approximations Multiagent learning can lead to efficient approximate solutions Game: 90,90 0,0 0,40 / 400,40 0,0 180,180 0,40 © Manfred Huber 2018