Design of Multi-Agent Systems Teacher Bart Verheij Student assistants Albert Hankel Elske van der Vaart Web site

Slides:



Advertisements
Similar presentations
Concepts of Game Theory I
Advertisements

Concepts of Game Theory II. 2 The prisioners reasoning… Put yourself in the place of prisoner i (or j)… Reason as follows: –Suppose I cooperate… If j.
Game Theory Assignment For all of these games, P1 chooses between the columns, and P2 chooses between the rows.
Game Theory S-1.
Game Theory Overheads. A game is a formal representation of a situation in which a number of decision makers (players) interact in a setting of strategic.
SIF8072 Distributed Artificial Intelligence and Intelligent Agents
Multiagent Interactions
Game Theory “Доверяй, Но Проверяй” - Russian Proverb (Trust, but Verify) - Ronald Reagan Mike Shor Lecture 6.
Infinitely Repeated Games. In an infinitely repeated game, the application of subgame perfection is different - after any possible history, the continuation.
6-1 LECTURE 6: MULTIAGENT INTERACTIONS An Introduction to MultiAgent Systems
1 Chapter 14 – Game Theory 14.1 Nash Equilibrium 14.2 Repeated Prisoners’ Dilemma 14.3 Sequential-Move Games and Strategic Moves.
The basics of Game Theory Understanding strategic behaviour.
1 Game Theory. By the end of this section, you should be able to…. ► In a simultaneous game played only once, find and define:  the Nash equilibrium.
An Introduction to... Evolutionary Game Theory
Game Theory Advertising Example 1. Game Theory What is the optimal strategy for Firm A if Firm B chooses to advertise? 2.
Coye Cheshire & Andrew Fiore March 21, 2012 // Computer-Mediated Communication Collective Action and CMC: Game Theory Approaches and Applications.
Multi-player, non-zero-sum games
17. (A very brief) Introduction to game theory Varian, Chapters 28, 29.
Games What is ‘Game Theory’? There are several tools and techniques used by applied modelers to generate testable hypotheses Modeling techniques widely.
An Introduction to Game Theory Part I: Strategic Games
Game Theory Part 5: Nash’s Theorem.
2008/02/06Lecture 21 ECO290E: Game Theory Lecture 2 Static Games and Nash Equilibrium.
Nash Equilibrium: Theory. Strategic or Simultaneous-move Games Definition: A simultaneous-move game consists of: A set of players For each player, a set.
5/16/20151 Game Theory Game theory was developed by John Von Neumann and Oscar Morgenstern in Economists! One of the fundamental principles of.
Slide 1 of 13 So... What’s Game Theory? Game theory refers to a branch of applied math that deals with the strategic interactions between various ‘agents’,
Arguments for Recovering Cooperation Conclusions that some have drawn from analysis of prisoner’s dilemma: – the game theory notion of rational action.
Game-Theoretic Approaches to Multi-Agent Systems Bernhard Nebel.
Design of Multi-Agent Systems Teacher Bart Verheij Student assistants Albert Hankel Elske van der Vaart Web site
A Memetic Framework for Describing and Simulating Spatial Prisoner’s Dilemma with Coalition Formation Sneak Review by Udara Weerakoon.
APEC 8205: Applied Game Theory Fall 2007
1 Section 2d Game theory Game theory is a way of thinking about situations where there is interaction between individuals or institutions. The parties.
TOPIC 6 REPEATED GAMES The same players play the same game G period after period. Before playing in one period they perfectly observe the actions chosen.
Introduction to Game Theory and Behavior Networked Life CIS 112 Spring 2009 Prof. Michael Kearns.
Coye Cheshire & Andrew Fiore June 28, 2015 // Computer-Mediated Communication Game Theory, Games, and CMC.
Introduction to Game Theory Yale Braunstein Spring 2007.
Game Theory.
Game Theory, Strategic Decision Making, and Behavioral Economics 11 Game Theory, Strategic Decision Making, and Behavioral Economics All men can see the.
A Game-Theoretic Approach to Strategic Behavior. Chapter Outline ©2015 McGraw-Hill Education. All Rights Reserved. 2 The Prisoner’s Dilemma: An Introduction.
Game Theory Formalizing the Hobbesian Dilemma. A. Assumptions 1. Assumptions a. Rational choice – People act according to their preferences (desires,
Dynamic Games of complete information: Backward Induction and Subgame perfection - Repeated Games -
Standard and Extended Form Games A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor, SIUC.
Game Theory Robin Burke GAM 224 Spring Outline Admin Game Theory Utility theory Zero-sum and non-zero sum games Decision Trees Degenerate strategies.
Strategic Decisions in Noncooperative Games Introduction to Game Theory.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Lecture 2: two-person non.
Chapter 12 - Imperfect Competition: A Game-Theoretic Approach Copyright © 2015 The McGraw-Hill Companies, Inc. All rights reserved.
LECTURE 6: MULTIAGENT INTERACTIONS
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Introduction Giovanni Neglia.
The Science of Networks 6.1 Today’s topics Game Theory Normal-form games Dominating strategies Nash equilibria Acknowledgements Vincent Conitzer, Michael.
I THINK THAT YOU THINK THAT I THINK - GAME THEORY AND VIDEO GAMES Jonas Heide Smith Game Analysis ITU, “If I had my way, a solid.
Lecture 5 Introduction to Game theory. What is game theory? Game theory studies situations where players have strategic interactions; the payoff that.
Section 2 – Ec1818 Jeremy Barofsky
1 GAME THEORY AND OLIGOPOLY l Principles of Microeconomic Theory, ECO 284 l John Eastwood l CBA 247 l l address:
1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.
How to Analyse Social Network? : Part 2 Game Theory Thank you for all referred contexts and figures.
Mohsen Afsharchi Multiagent Interaction. What are Multiagent Systems?
1 Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. Franz J. Kurfess CPE/CSC 580: Intelligent Agents 1.
Oligopoly CHAPTER 13B. Oligopoly IRL In some markets there are only two firms. Computer chips are an example. The chips that drive most PCs are made by.
Games, Strategies, and Decision Making By Joseph Harrington, Jr. First Edition Chapter 4: Stable Play: Nash Equilibria in Discrete Games with Two or Three.
Arguments for Recovering Cooperation Conclusions that some have drawn from analysis of prisoner’s dilemma: – the game theory notion of rational action.
전략적 선택모델: 게임이론 동시적 게임 죄수의 딜레마 비겁자 게임 전개형(순차적) 게임 농부와 호랑이 오사카 성의 해자
Advanced Subjects in GT Outline of the tutorials Static Games of Complete Information Introduction to games Normal-form (strategic-form) representation.
Competitive Safety Analysis: Robust Decision-Making in Multi- Agent Systems Moshe Tennenholtz Kyle Rokos.
1 Fahiem Bacchus, University of Toronto CSC384: Intro to Artificial Intelligence Game Tree Search I ● Readings ■ Chapter 6.1, 6.2, 6.3, 6.6 ● Announcements:
Introduction to Game Theory
Decision Theory and Game Theory
Multiagent Systems Game Theory © Manfred Huber 2018.
Computer-Mediated Communication
LECTURE 6: MULTIAGENT INTERACTIONS
Game Theory: The Nash Equilibrium
Presentation transcript:

Design of Multi-Agent Systems Teacher Bart Verheij Student assistants Albert Hankel Elske van der Vaart Web site (Nestor contains a link)

Student presentations Week 37 * C. Jonker et al. (2002). BDI-Modelling of Intracellular Dynamics. Joris Ijsselmuiden * R. Wulfhorst et al. (2003). A Multiagent Approach for Musical Interactive Systems. Rosemarijn Looije * M. Dastani, J. Hulstijn, F. Dignum, J.-J.Ch. Meyer (2004). Issues in Multiagent System Development. Sander van Dijk

Week 38 * W. C. Stirling, M. A. Goodrich and D. J. Packard (2002). Satisficing Equilibria: A Non- Classical Theory of Games and Decisions. Dimitri Vrehen * A. Bazzan and R.H. Bordini (2001). A framework for the simulation of agents with emotions. Report on Experiments with the Iterated Prisoner's Dilemma. Stijn Colen * I. Dickinson and M. Wooldridge (2003). Towards Practical Reasoning Agents for the Semantic Web. * E. Norling (2004). Folk Psychology for Human Modelling: Extending the BDI Paradigm. Student presentations

Some practical matters  Please submit exercises to  Please use naming conventions for file names and message subjects.  Please read your student mail.

Overview Introduction Evaluation criteria & equilibria Social welfare Pareto efficiency Nash equilibria The Prisoner’s Dilemma Loose end: dominant strategies Not or different in the book

Typical structure of a multi-agent system

Interactions  Communication  Influence on environment (‘spheres of influence’)  Organizations, communities, coalitions  Hierarchical relations  Cooperation, competition

Utilities & preferences How to measure the results of a multi-agent systems? In terms of preferences and utilities. Some notation:  ={  1,  2, … }‘outcomes’, future environmental states group preferences (assumes cooperation) individual preferences

Preferences Strict preferences Properties Reflexive: Transitive: Comparable:

Utilities According to utility theory, preferences can be measured in terms of real numbers Example: money But money isn’t always the right measure: think of the subjective value of a million dollars when you have nothing or when you are Bill Gates.

Utility & money

Zero-sum & constant-sum games Simplification: two agents Constant sum games The sum of all players' payoffs is the same for any outcome. u i (  ) + u j (  ) = C for all    Zero-sum games All outcomes involve a sum of the players’ payoffs of 0: u i (  ) + u j (  ) = 0 for all    Chess 0, ½, 1 -½, 0, ½

Zero-sum & constant-sum games One agent’s gain is another agent’s loss. Zero-sum games are necessarily always competitive. But there are many non-zero sum situations.

Overview Introduction Evaluation criteria & equilibria Social welfare Pareto efficiency Nash equilibria The Prisoner’s Dilemma Loose end: dominant strategies

Kinds of evaluation criteria & equilibria Social welfare Pareto efficiency Nash equilibrium

Social welfare Social welfare measures the sum of all individual outcomes. Optimal social welfare may not be achievable when individuals are self-interested Individual agents follow their own (different) utility function.

Example 1 Agenta2a2 Strategys 2,1 s 2,2 s 1,1 (5,6)(4,3) a1a1 s 1,2 (1,2)(6,4) highest social welfare

Overview Introduction Evaluation criteria & equilibria Social welfare Pareto efficiency Nash equilibria The Prisoner’s Dilemma Loose end: dominant strategies

Pareto efficiency or optimality An outcome is Pareto optimal if a better outcome for one agent always results in a worse outcome for some other agent When all agents pursue social welfare, highest social welfare is Pareto optimal. However, a Pareto optimal outcome need not be desirable. E.g., dictatorship Pareto improvement: change that is an improvement for someone without hurting anyone

Example 1 Agenta2a2 Strategys 2,1 s 2,2 s 1,1 (5,6)(4,3) a1a1 s 1,2 (1,2)(6,4) Pareto efficient Pareto improvements

Overview Introduction Evaluation criteria & equilibria Social welfare Pareto efficiency Nash equilibria The Prisoner’s Dilemma Loose end: dominant strategies

Nash equilibrium Two strategies s 1 and s 2 are in Nash equilibrium if: 1.under the assumption that agent i plays s 1, agent j can do no better than play s 2 ; and 2.under the assumption that agent j plays s 2, agent i can do no better than play s 1. No individual has the incentive to unilaterally change strategy Example: driving on the right side of the road Nash equilibria do not always exist and are not always unique

Example 1 Agenta2a2 Strategys 2,1 s 2,2 s 1,1 (5,6)  (4,3) a1a1  s 1,2 (1,2)  (6,4) Nash equilibria ‘Nash incentives’

Example 1 Agenta2a2 Strategys 2,1 s 2,2 s 1,1 (5,6)  (4,3) a1a1  s 1,2 (1,2)  (6,4) outcomes corresponding to strategies in Nash equilibrium

Example 2 Agenta2a2 Strategys 2,1 s 2,2 s 1,1 (3,6)  (5,3) a1a1  s 1,2 (6,2)  (2,5) no Nash equilibrium

Example 3 unique Nash equilibrium Agenta2a2 Strategys 2,1 s 2,2 s 1,1 (1,1)  (5,0) a1a1  s 1,2 (0,5)  (3,3)

Example 3 unique Nash equilibrium Agenta2a2 Strategys 2,1 s 2,2 s 1,1 (1,1)  (5,0) a1a1  s 1,2 (0,5)  (3,3) highest social welfare & Pareto efficient

Overview Introduction Evaluation criteria & equilibria Social welfare Pareto efficiency Nash equilibria The Prisoner’s Dilemma Loose end: dominant strategies

The Prisoner’s Dilemma Two men are collectively charged with a crime and held in separate cells, with no way of meeting or communicating. They are told that: – if one confesses and the other does not, the confessor will be freed, and the other will be jailed for three years – if both confess, then each will be jailed for two years Both prisoners know that if neither confesses, then they will each be jailed for one year

The Prisoner’s Dilemma The prisoners can either defect or cooperate. The rational action for each individual prisoner is to defect. Example 3 is a prisoner’s dilemma (but note that it tables utilities, not prison years: less years in prison has a higher utility). Real life: nuclear arms reduction, free riders

The Prisoner’s Dilemma The Prisoner’s Dilemma is the fundamental problem of multi-agent interactions. It appears to imply that cooperation will not occur in societies of self-interested agents.

Recovering cooperation... Conclusions that some have drawn from this analysis: – the game theory notion of rational action is wrong! – somehow the dilemma is being formulated wrongly Arguments to recover cooperation: – We are not all Machiavelli! – The other prisoner is my twin! – The shadow of the future…

The Iterated Prisoner’s Dilemma One answer: play the game more than once If you know you will be meeting your opponent again, then the incentive to defect appears to evaporate When you now how many times you’ll meet your opponent, defection is again rational

Axelrod’s tournament Suppose you play iterated prisoner’s dilemma against a range of opponents… What strategy should you choose, so as to maximize your overall payoff? Axelrod (1984) investigated this problem, with a computer tournament for programs playing the prisoner’s dilemma

Strategies in Axelrod’s tournament ALL-D: Always defect TIT-FOR-TAT: At the first meeting of an opponent: cooperate. Then do what your opponent did on the previous meeting TESTER: First: defect. If the opponent retaliates, play TIT-FOR-TAT. Otherwise intersperse cooperation and defection. JOSS: As TIT-FOR-TAT, except periodically defect

Reasons for TIT-FOR-TAT’s success – Don’t be envious: Don’t play as if it were zero sum! – Be nice: Start by cooperating, and reciprocate cooperation – Retaliate appropriately: Always punish defection immediately, but use “measured” force — don’t overdo it – Don’t hold grudges: Always reciprocate cooperation immediately

Overview Introduction Evaluation criteria & equilibria Social welfare Pareto efficiency Nash equilibria The Prisoner’s Dilemma Loose end: dominant strategies

Dominant strategy A strategy is dominant for an agent if it is the best under all circumstances Dominant strategy equilibrium: each agent uses a dominant strategy A dominant strategy equilibrium is always a Nash equilibrium (but there are ‘more’ of the latter).

Example 4 (2,3)  (1,2)s 1,2  a1a1 (4,5)  (2,3)s 1,1 s 2,2 s 2,1 Strategy a2a2 Agent Dominant for a 1 Dominant for a 2

Just to play with: new roads - There are 6 cars going from A to D each day. - (A,B) and (C,D) are highways time(c) = 5 + 2c, where c is the number of cars -(B,D) and (A,C) are local roads time(c) = 20 + c A B C D What will happen when a new highway is made between B and C?