Lirong Xia Friday, April 25, 2014 Introduction to Game Theory.

Slides:



Advertisements
Similar presentations
CPS Extensive-form games Vincent Conitzer
Advertisements

M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 3.1.Dynamic Games of Complete but Imperfect Information Lecture
Basics on Game Theory Class 2 Microeconomics. Introduction Why, What, What for Why Any human activity has some competition Human activities involve actors,
BASICS OF GAME THEORY. Recap Decision Theory vs. Game Theory Rationality Completeness Transitivity What’s in a game? Players Actions Outcomes Preferences.
Mechanism Design without Money Lecture 1 Avinatan Hassidim.
This Segment: Computational game theory Lecture 1: Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie.
3. Basic Topics in Game Theory. Strategic Behavior in Business and Econ Outline 3.1 What is a Game ? The elements of a Game The Rules of the.
Sep. 8, 2014 Lirong Xia Introduction to MD (mooncake design or mechanism design)
Chapter Twenty-Eight Game Theory. u Game theory models strategic behavior by agents who understand that their actions affect the actions of other agents.
Non-Cooperative Game Theory To define a game, you need to know three things: –The set of players –The strategy sets of the players (i.e., the actions they.
1 Game Theory. By the end of this section, you should be able to…. ► In a simultaneous game played only once, find and define:  the Nash equilibrium.
Chapter 6 Game Theory © 2006 Thomson Learning/South-Western.
Chapter 6 Game Theory © 2006 Thomson Learning/South-Western.
Game-theoretic analysis tools Necessary for building nonmanipulable automated negotiation systems.
Lecture 2A Strategic form games
Extensive-form games. Extensive-form games with perfect information Player 1 Player 2 Player 1 2, 45, 33, 2 1, 00, 5 Players do not move simultaneously.
Games What is ‘Game Theory’? There are several tools and techniques used by applied modelers to generate testable hypotheses Modeling techniques widely.
An Introduction to Game Theory Part I: Strategic Games
Chapter 6 © 2006 Thomson Learning/South-Western Game Theory.
Sep. 5, 2013 Lirong Xia Introduction to Game Theory.
A camper awakens to the growl of a hungry bear and sees his friend putting on a pair of running shoes, “You can’t outrun a bear,” scoffs the camper. His.
Basics on Game Theory For Industrial Economics (According to Shy’s Plan)
Chapter Twenty-Eight Game Theory. u Game theory models strategic behavior by agents who understand that their actions affect the actions of other agents.
APEC 8205: Applied Game Theory Fall 2007
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Review Midterm3/23 3/9.
PLANNING Partial order regression planning Temporal representation 1 Deductive planning in Logic Temporal representation 2.
CPS 170: Artificial Intelligence Game Theory Instructor: Vincent Conitzer.
Reading Osborne, Chapters 5, 6, 7.1., 7.2, 7.7 Learning outcomes
Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie Mellon University.
Dynamic Games of complete information: Backward Induction and Subgame perfection - Repeated Games -
EC941 - Game Theory Prof. Francesco Squintani Lecture 5 1.
Extensive-form games Vincent Conitzer
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 4.1.Dynamic Games of Incomplete Information Lecture
Dynamic Games & The Extensive Form
Game-theoretic analysis tools Tuomas Sandholm Professor Computer Science Department Carnegie Mellon University.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Lecture 2: two-person non.
Chapters 29, 30 Game Theory A good time to talk about game theory since we have actually seen some types of equilibria last time. Game theory is concerned.
CPS 270: Artificial Intelligence Planning Instructor: Vincent Conitzer.
Topic 3 Games in Extensive Form 1. A. Perfect Information Games in Extensive Form. 1 RaiseFold Raise (0,0) (-1,1) Raise (1,-1) (-1,1)(2,-2) 2.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 2.1.Dynamic Games of Complete and Perfect Information Lecture
CPS 570: Artificial Intelligence Planning Instructor: Vincent Conitzer.
3.1.4 Types of Games. Strategic Behavior in Business and Econ Outline 3.1. What is a Game ? The elements of a Game The Rules of the Game:
1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.
Game theory 2 Lukáš Lehotský
AI Lecture 17 Planning Noémie Elhadad (substituting for Prof. McKeown)
Strategic Behavior in Business and Econ Static Games of complete information: Dominant Strategies and Nash Equilibrium in pure and mixed strategies.
Intermediate Microeconomics Game Theory. So far we have only studied situations that were not “strategic”. The optimal behavior of any given individual.
Lec 23 Chapter 28 Game Theory.
ECO290E: Game Theory Lecture 8 Games in Extensive-Form.
ECO290E: Game Theory Lecture 10 Examples of Dynamic Games.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 2.1.Dynamic Games of Complete and Perfect Information Lecture
Game theory basics A Game describes situations of strategic interaction, where the payoff for one agent depends on its own actions as well as on the actions.
Yuan Deng Vincent Conitzer Duke University
Instructor: Vincent Conitzer
Introduction to Game Theory
Multiagent Systems Game Theory © Manfred Huber 2018.
Introduction to Game Theory
Chapter 29 Game Theory Key Concept: Nash equilibrium and Subgame Perfect Nash equilibrium (SPNE)
CPS Extensive-form games
CPS 570: Artificial Intelligence Planning
Multiagent Systems Repeated Games © Manfred Huber 2018.
Chapter 14 & 15 Repeated Games.
Vincent Conitzer Extensive-form games Vincent Conitzer
CPS 173 Extensive-form games
Molly W. Dahl Georgetown University Econ 101 – Spring 2009
Artificial Intelligence Planning
Introduction to Game Theory
Introduction to Game Theory
Presentation transcript:

Lirong Xia Friday, April 25, 2014 Introduction to Game Theory

STRIPS language –state of the world: conjunction of positive, ground, function-free literals Action –Preconditions: a set of activating literals –Effects: updates of active literals 1 Planning problems

Blocks world A B C D

Start: On(B, A), On(A, Table), On(D, C), On(C, Table), Clear(B), Clear(D) Move(x,y,z) –Preconditions: On(x,y), Clear(x), Clear(z) –Effects: On(x,z), Clear(y), NOT(On(x,y)), NOT(Clear(z)) MoveToTable(x,y) –Preconditions: On(x,y), Clear(x) –Effects: On(x,Table), Clear(y), NOT(On(x,y)) A B C D

Blocks world example Goal: On(A,B) AND Clear(A) AND On(C,D) AND Clear(C) A plan: MoveToTable(B, A), MoveToTable(D, C), Move(C, Table, D), Move(A, Table, B) A B C D

Forward search –c.f. standard BFS, DFS, UCS Backward search –c.f. backward BFS, DFS, UCS Heuristics –Heuristic 1: plan for each subgoal (literal) separately, sum costs of plans –Heuristic 2: solve a relaxed planning problem in which actions never delete literals (empty-delete-list heuristic) 5 Search algorithms

Not everything decomposes into multiple problems: Sussman Anomaly Goal: On(A,B) AND On(B,C) Sequentially pursuing goals does not work Optimal plan: MoveToTable(C,A), Move(B,Table,C), Move(A,Table,B) AB C

Related grad-level courses at RPI –Algorithmic Game Theory (Prof. Elliot Anshelevich) –Computational Social Choice (me, Fall 2014) 7 Today: Game theory

plurality rule (ties are broken in favor of ) > > > Plurality rule YOU Bob Carol

What if everyone is incentivized to lie? > > Plurality rule YOU Bob Carol > > > >

What? –Agents may have incentives to lie Why? –Hard to predict the outcome when agents lie How? –A general framework for games Solution concept: Nash equilibrium –Modeling preferences and behavior: utility theory –Special games Normal form games: mixed Nash equilibrium Extensive form games: subgame-perfect equilibrium 10 High-level questions

11 A game R1*R1* s1s1 Outcome R2*R2* s2s2 Rn*Rn* snsn Mechanism … … Strategy Profile D Players: N ={ 1,…,n } Strategies (actions):  S j for agent j, s j ∈ S j  ( s 1,…, s n ) is called a strategy profile. Outcomes: O Preferences: total preorders (full rankings with ties) over O Mechanism f : Π j S j → O

Players: { YOU, Bob, Carol } Outcomes: O = {,, } Strategies: S j = Rankings( O ) Preferences: See above Mechanism: the plurality rule 12 A game of plurality elections > > Plurality rule YOU Bob Carol > > > >

Players: Strategies: { Cooperate, Defect } Outcomes: {( -2, -2 ), ( -3, 0 ), ( 0, -3 ), ( -1, -1 )} Preferences: self-interested 0 > -1 > -2 > -3 – : ( 0, -3 ) > ( -1, -1 ) > ( -2, -2 ) > ( -3, 0 ) – : ( -3, 0 ) > ( -1, -1 ) > ( -2, -2 ) > ( 0, -3 ) Mechanism: the table 13 A game of two prisoners CooperateDefect Cooperate ( -1, -1 )( -3, 0 ) Defect ( 0, -3 )( -2, -2 ) Column player Row player

Suppose –every player wants to make the outcome as preferable (to her) as possible by controlling her own strategic (but not the other players’) What is the outcome? –No one knows for sure –A “stable” situation seems reasonable A Nash Equilibrium (NE) is a strategy profile ( s 1,…, s n ) such that –For every player j and every s j ' ∈ S j, f ( s j, s -j ) ≥ j f ( s j ', s -j ) – s -j = ( s 1,…, s j-1, s j+1,…, s n ) –no single player can be better off by deviating 14 Solving the game

15 Prisoner’s dilemma CooperateDefect Cooperate ( -1, -1 )( -3, 0 ) Defect ( 0, -3 )( -2, -2 ) Column player Row player

“If everyone competes for the blond, we block each other and no one gets her. So then we all go for her friends. But they give us the cold shoulder, because no one likes to be second choice. Again, no winner. But what if none of us go for the blond. We don’t get in each other’s way, we don’t insult the other girls. That’s the only way we win. That’s the only way we all get [a girl.]” 16 A beautiful mind

Players: { Nash, Hansen } Strategies: { Blond, another girl } Outcomes: {( 0, 0 ), ( 5, 1 ), ( 1, 5 ), ( 2, 2 )} Preferences: high value for self only Mechanism: the table 17 A beautiful mind: the bar game BlondAnother girl Blond ( 0, 0 )( 5, 1 ) Another girl ( 1, 5 )( 2, 2 ) Column player Row player Nash Hansen

Lesson 1: scientific skepticism (critical thinking) –default of any argument is “wrong” –it is the authors’ responsibility to prove the correctness –once you find an error, try to correct and improve it Example: Nash equilibrium in “A beautiful mind” 18 Research 101

Try to apply research 101 to the example in the last slide –Example: Nash equilibrium in “A beautiful mind” –Is it really an example of misinterpreting Nash equilibrium? 19 Really?

Not always But an NE exists when every player has a dominant strategy – s j is a dominant strategy for player j, if for every s j ' ∈ S j, 1.for every s -j, f ( s j, s -j ) ≥ j f ( s j ', s -j ) 2.the preference is strict for some s -j 20 Does an NE always exists? LR U ( 0, 1 )( 1, 0 ) D ( 0, 1 ) Column player Row player

How to evaluate this solution concept? –Does it really model real-world situations? –What if an NE does not exist? approximate NE (beyond this course) –What if there are too many NE? Equilibrium selection refinement: a “good” NE Cases where an NE always exists –Normal form games –Strategy space: lotteries over pure strategies –Outcome space: lotteries over atomic outcomes 21 End of story?

Given pure strategies: S j for agent j Normal form games Players: N ={ 1,…,n } Strategies: lotteries (distributions) over S j – L j ∈ Lot( S j ) is called a mixed strategy –( L 1,…, L n ) is a mixed-strategy profile Outcomes: Π j Lot( S j ) Mechanism: f ( L 1,…, L n ) = p – p ( s 1,…, s n ) = Π j L j ( s j ) Preferences: –Soon 22 Normal form games LR U ( 0, 1 )( 1, 0 ) D ( 0, 1 ) Column player Row player

Option 1 vs. Option 2 –Option 1: –Option 2: $5 for sure Option 3 vs. Option 4 –Option 3: –Option 4: $5M for sure 23 Preferences over lotteries

There are m objects. Obj={ o 1,…, o m } Lot(Obj): all lotteries (distributions) over Obj In general, an agent’s preferences can be modeled by a preorder (ranking with ties) over Lot(Obj) –But there are infinitely many outcomes 24 Lotteries

Utility function: u : Obj → ℝ For any p ∈ Lot(Obj) – u ( p ) = Σ o ∈ Obj p ( o ) u ( o ) u represents a total preorder over Lot(Obj) – p 1 > p 2 if and only if u ( p 1 )> u ( p 2 ) Utility is virtual, preferences are real –Preferences represented by utility theory have a neat characterization 25 Utility theory

u (Option 1) = u ( 0 ) ×50% + u ( 30 ) ×50%=5.5 u (Option 2) = u ( 5 ) ×100%=3 u (Option 3) = u ( 0 ) ×50% + u ( 30M ) ×50%=75.5 u (Option 4) = u ( 5M ) ×100%= Example Money05305M30M Utility utility Money

Given pure strategies: S j for agent j Players: N ={ 1,…,n } Strategies: lotteries (distributions) over S j – L j ∈ Lot( S j ) is called a mixed strategy –( L 1,…, L n ) is a mixed-strategy profile Outcomes: Π j Lot( S j ) Mechanism: f ( L 1,…, L n ) = p, such that – p ( s 1,…, s n ) = Π j L j ( s j ) Preferences: represented by utility functions u 1,…, u n 27 Normal form games

Mixed-strategy Nash Equilibrium is a mixed strategy profile ( L 1,…, L n ) s.t. for every j and every L j ' ∈ Lot( S j ) u j ( L j, L -j ) ≥ u j ( L j ', L -j ) Any normal form game has at least one mixed- strategy NE [Nash 1950] Any L j with L j ( s j )=1 for some s j ∈ S j is called a pure strategy Pure Nash Equilibrium –a special mixed-strategy NE ( L 1,…, L n ) where all strategies are pure strategy 28 Solution concepts for normal form games

29 Example: mixed-strategy NE LR U ( 0, 1 )( 1, 0 ) D ( 0, 1 ) Column player Row player Row player’s strategy Column player’s strategy }}

In normal-form games –Mixed-strategy NE = NE in the general framework –pure NE = a refinement of (mixed-strategy) NE How good is mixed-strategy NE as a solution concept? –At least one –Maybe many –Can use pure NE as a refinement (may not exist) 30 A different angle

Players move sequentially Outcomes: leaves Preferences are represented by utilities A strategy of player j is a combination of all actions at her nodes All players know the game tree (complete information) At player j ’s node, she knows all previous moves (perfect information) 31 Extensive-form games Nash Hansen Nash (5,1)(5,1) (1,5)(1,5) (2,2)(2,2) (0,0)(0,0) (-1,5) B A B ABA BA leaves: utilities (Nash,Hansen)

32 Convert to normal-form Nash Hansen Nash (5,1)(5,1) (1,5)(1,5) (2,2)(2,2) (0,0)(0,0) (-1,5) B A B ABA BA (B,B)(B,A)(A,B)(A,A) (B,B)(0,0)(0,0)(0,0)(0,0)(5,1)(5,1)(5,1)(5,1) (B,A)(-1,5) (5,1)(5,1)(5,1)(5,1) (A,B)(1,5)(1,5)(2,2)(2,2)(1,5)(1,5)(2,2)(2,2) (A,A)(1,5)(1,5)(2,2)(2,2)(1,5)(1,5)(2,2)(2,2) Hansen Nash Nash: (Up node action, Down node action) Hansen: (Left node action, Right node action)

Usually too many NE (pure) SPNE –a refinement (special NE) –also an NE of any subgame (subtree) 33 Subgame perfect equilibrium Nash Hansen Nash (5,1)(5,1) (1,5)(1,5) (2,2)(2,2) (0,0)(0,0) (-1,5) B A B ABA BA

Determine the strategies bottom-up Unique if no ties in the process All SPNE can be obtained, if –the game is finite –complete information –perfect information 34 Backward induction Nash Hansen Nash (5,1)(5,1) (1,5)(1,5) (2,2)(2,2) (0,0)(0,0) (-1,5) B A B ABA BA (0,0)(0,0) (1,5)(1,5) (5,1)(5,1) (5,1)(5,1)

How good is SPNE as a solution concept? –At least one –In many cases unique –is a refinement of NE (always exists) 35 A different angle

36 Wrap up Preferences Solution concept How manyComputation General gametotal preordersNE0-many Normal form game utilities mixed-strategy NE pure NE mixed: 1-many pure: 0-many Extensive form game utilities Subgame perfect NE 1 (no ties) many (ties) Backward induction

What is the problem? –agents may have incentive to lie Why we want to study this problem? How general it is? –The outcome is hard to predict when agents lie –It is very general and important How was problem addressed? –by modeling the situation as a game and focus on solution concepts, e.g. Nash Equilibrium Appreciate the work: what makes the work nontrivial? –It is by far the most sensible solution concept. Existence of (mixed-strategy) NE for normal form games Critical thinking: anything you are not very satisfied with? –Hard to justify NE in real-life –How to obtain the utility function? 37 The reading questions

So far we have been using game theory for prediction How to design the mechanism? –when every agent is self-interested –as a whole, works as we want The next class: mechanism design 38 Looking forward