Heterogeneous Payoffs and Social Diversity in the Spatial Prisoner’s Dilemma game Dept Computer Science and Software Engineering Golriz Rezaei Dr. Michael.

Slides:

Advertisements

Similar presentations

Game Theory. I What is Game theory? The Theory of Games and Economic Behaviour by John von Neumann and Oskar Morgenstern (1944). Especially one institution:

Advertisements

Concepts of Game Theory II. 2 The prisioners reasoning… Put yourself in the place of prisoner i (or j)… Reason as follows: –Suppose I cooperate… If j.

1 Small Worlds and Phase Transition in Agent Based Models with Binary Choices. Denis Phan ENST de Bretagne, Département Économie et Sciences Humaines &

Complex Cooperative Networks from Evolutionary Preferential Attachment Complex Cooperative Networks from Evolutionary Preferential Attachment Jesús Gómez.

Evolving Cooperation in the N-player Prisoner's Dilemma: A Social Network Model Dept Computer Science and Software Engineering Golriz Rezaei Michael Kirley.

Distributed Advice-Seeking on an Evolving Social Network Dept Computer Science and Software Engineering The University of Melbourne - Australia Golriz.

PhD Completion Seminar Golriz Rezaei Supervisors: Dr. Michael Kirley

Tutorial 1 Ata Kaban School of Computer Science University of Birmingham.

News and Notes 4/13 HW 3 due now HW 4 distributed today, due Thu 4/22 Final exam is Mon May 3 11 AM Levine 101 Today: –intro to evolutionary game theory.

Evolution and Repeated Games D. Fudenberg (Harvard) E. Maskin (IAS, Princeton)

Evolution of Cooperation The importance of being suspicious.

6-1 LECTURE 6: MULTIAGENT INTERACTIONS An Introduction to MultiAgent Systems

An Introduction to... Evolutionary Game Theory

Dynamics of Cooperation in Spatial Prisoner’s Dilemma of Memory- Based Players Chenna Reddy Cotla Department of Computational Social Science George Mason.

Game Theory Eduardo Costa. Contents What is game theory? Representation of games Types of games Applications of game theory Interesting Examples.

Automata-based adaptive behavior for economic modeling using game theory Rawan Ghnemat, Khalaf Khatatneh, Saleh Oqeili Al-Balqa’ Applied University, Al-Salt,

The Evolution of Cooperation within the Iterated Prisoner’s dilemma on a Social Network.

Maynard Smith Revisited: Spatial Mobility and Limited Resources Shaping Population Dynamics and Evolutionary Stable Strategies Pedro Ribeiro de Andrade.

Cooperation in Anonymous Dynamic Social Networks Brendan Lucier University of Toronto Brian Rogers Northwestern University Nicole Immorlica Northwestern.

On Quantum Walks and Iterated Quantum Games G. Abal, R. Donangelo, H. Fort Universidad de la República, Montevideo, Uruguay UFRJ, RJ, Brazil.

EC – Tutorial / Case study Iterated Prisoner's Dilemma Ata Kaban University of Birmingham.

Institutions and the Evolution of Collective Action Mark Lubell UC Davis.

Satisfaction Equilibrium Stéphane Ross. Canadian AI / 21 Problem In real life multiagent systems :  Agents generally do not know the preferences.

1 Economics & Evolution Number 2. 2 Reading List.

A Memetic Framework for Describing and Simulating Spatial Prisoner’s Dilemma with Coalition Formation Sneak Review by Udara Weerakoon.

6/2/2001 Cooperative Agent Systems: Artificial Agents Play the Ultimatum Game Steven O. Kimbrough Presented at FMEC 2001, Oslo Joint work with Fang Zhong.

1 Pendahuluan Pertemuan 9 Matakuliah: H0062/Teori Sistem Tahun: 2006.

Promotion of cooperation on networks? The best response case Carlos P. Roca (1,2) José A. Cuesta (1) Anxo Sánchez (1,3,4) The.

Peter B. Henderson Butler University

Conference title 1 A Few Bad Apples Are Enough. An Agent-Based Peer Review Game. Juan Bautista Cabotà, Francisco Grimaldo (U. València) Lorena Cadavid.

Agent Based Modeling and Simulation

Zhiyong Wang In cooperation with Sisi Zlatanova

Standard and Extended Form Games A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor, SIUC.

Example Department of Computer Science University of Bologna Italy ( Decentralised, Evolving, Large-scale Information Systems (DELIS)

Rationality meets the tribe: Some models of cultural group selection David Hales, The Open University Hales, D., (2010) Rationality.

SLAC and SLACER: Simple copy & rewire algorithms for trust and cooperation in P2P David Hales, Stefano Arteconi, Ozalp Babaoglu University of Bologna,

Presenter: Chih-Yuan Chou GA-BASED ALGORITHMS FOR FINDING EQUILIBRIUM 1.

Aemen Lodhi (Georgia Tech) Amogh Dhamdhere (CAIDA)

Daniel Ariosa Ecole Polytechnique Fédérale de Lausanne (EPFL) Institut de Physique de la Matière Complexe CH-1015 Lausanne, Switzerland and Hugo Fort Instituto.

Cognitive Modeling / University of Groningen / / Artificial Intelligence |RENSSELAER| Cognitive Science CogWorks Laboratories › Christian P. Janssen ›

Evolving cooperation in one-time interactions with strangers Tags produce cooperation in the single round prisoner’s dilemma and it’s.

Evolving the goal priorities of autonomous agents Adam Campbell* Advisor: Dr. Annie S. Wu* Collaborator: Dr. Randall Shumaker** School of Electrical Engineering.

Evolving Social Rationality for MAS using “Tags” Trying to “make things work” by applying results gained from Agent-Based Social Simulation.

Game Theory by James Crissey Luis Mendez James Reid.

Iterated Prisoner’s Dilemma Game in Evolutionary Computation Seung-Ryong Yang.

IJCAI’07 Emergence of Norms through Social Learning Partha Mukherjee, Sandip Sen and Stéphane Airiau Mathematical and Computer Sciences Department University.

Alternating-offers Bargaining problems A Co-evolutionary Approach Nanlin Jin, Professor Edward Tsang, Professor Abhinay Muthoo, Tim Gosling, Dr Maria Fasli,

Socially Inspired Computing Engineering with Social Metaphors.

The Role of Altruistic Punishment in Promoting Cooperation

Evolving Strategies for the Prisoner’s Dilemma Jennifer Golbeck University of Maryland, College Park Department of Computer Science July 23, 2002.

Simple Rewire Protocols for Cooperation in Dynamic Networks David Hales, Stefano Arteconi, Ozalp Babaoglu University of Bologna, Italy Bio-Inspired Workshop,

Evolution of Cooperation in Mobile Ad Hoc Networks Jeff Hudack (working with some Italian guy)

Ec1818 Economics of Discontinuous Change Section 1 [Lectures 1-4] Wei Huang Harvard University (Preliminary and subject to revisions)

UNIVERSITA’ DEGLI STUDI NAPOLI FEDERICO II DOTTORATO IN INGEGNERIA DEI MATERIALI E DELLE STRUTTURE Brunella Corrado Filomena Gioiella Bernadette Lombardi.

The highly intelligent virtual agents for modeling financial markets G. Yang 1, Y. Chen 2 and J. P. Huang 1 1 Department of Physics, Fudan University.

Social Norm, Costly Punishment and the Evolution to Cooperation : Theory, Experiment and Simulation Tongkui Yu 1, 2, Shu-Heng Chen 2, Honggang Li 1* 1.

Indirect Reciprocity in the Selective Play Environment Nobuyuki Takahashi and Rie Mashima Department of Behavioral Science Hokkaido University 08/07/2003.

Pengyuan Du, Mario Gerla Department of Computer Science, UCLA, USA

Game Theory and Cooperation

Evolution for Cooperation

LECTURE 6: MULTIAGENT INTERACTIONS

The outbreak of cooperation among success-driven individuals under noisy conditions Success-driven migration and imitation as a driver for cooperative.

Self-Organising, Open and Cooperative P2P Societies – From Tags to Networks David Hales Department of Computer Science University of.

Evolution for Cooperation

Evolving cooperation in one-time interactions with strangers

Evolution of human cooperation without reciprocity

When fairness bends rationality: Ernst Fehr meets John Nash

Introduction to RePast and Tutorial I

Mutual support in agent networks

Phase transitions to cooperation in the prisoner‘s dilemma

Presentation transcript:

Heterogeneous Payoffs and Social Diversity in the Spatial Prisoner’s Dilemma game Dept Computer Science and Software Engineering Golriz Rezaei Dr. Michael Kirley SEAL08 Conference 8 Dec 2008

Evolution of cooperation Open ended question in many areas Evolutionary Computation (IEEE Trans, CEC, GECCO) Autonomous agents and multi agent systems (AAMAS) Distributed Artificial Intelligence (DAI) Physics (Statistical Physics) Biology (Theoretical biology, Nature) Prisoner’s Dilemma (PD game)  Different individual conditions (Heterogeneity) have impact In this paper we investigate this idea on a version of the Spatial Prisoner’s Dilemma (SPD) game. Good abstract Game theoretic approach Mathematical model Applied in many areas (biology, economics, and sociology)

Today’s Agenda Brief overview of Prisoner’s Dilemma game and different variations The challenge and related works Proposed model Evaluation by experiments Conclusion Questions

Prisoner’s Dilemma C cooperate D Defect C cooperate R=3 T=5 S=0 D Defect S=0 T=5 P=1 2 players / agents 2 choices (C or D) Actual values  order Order change  game change (D,D)  Nash Equilibrium But i) T > R > P > S ii) 2R >= (T + S) Iteration  reciprocal interaction Spatial  local neighbourhood

Spatial Prisoner’s Dilemma Limited to local neighbourhood interaction only Accumulates received payoffs from games  fitness At the end of each round  selection process imitation of the most successful neighbour (MSN) Clusters of cooperators  outweigh losses against defectors

The Challenge Typically  “Universal fixed payoff matrix” Hypothesis  Introducing “social diversity” alters trajectory of the population.

Related work Few studies  investigated the impact of varying the magnitude of the payoff matrix values 1. Tomochi and Kono [ Physical Review E 2002 ]: Payoff matrix evolved based on the ratio of defectors (considered R and P only) - Universal payoff matrix 2. Perc and Szolnoki [ Physical Review E 2008 ]: Random noise added to the individual payoff matrix at the beginning of the game - Fixed matrix till the end 3. Fort [ Physica A 2007 ]: The payoff matrix was correlated with a spatial and temporal zones (considered only T) - The Prisoner’s Dilemma inequality was relaxed.

Proposed model Idea  Associated payoffs evolve based on individual experience. Each agent Dynamic payoffs  each agent has its own version of payoff matrix and it gets updated at each time step based on the level of the agent’s experience Age  increases at each time step α i (t+1) = α i (t) + 1 Life-span  expected life time (λ i ) randomly drawn from a uniform distribution αi(t) == λi  die and replaced by a new random agent

Proposed model Update  Where is the payoff values for agent i at time t is the default payoff matrix values T, R, P, S is the magnitude of the rescaled values is the age of agent i at time t is the expected life time of agent i is limiting factor and characterises the uncertainty related to the environment 1) 2)

Three scenarios 1.Standard PD  universal fixed Payoffs no Age 2.Homogeneous model  universal fixed Payoffs Age 3.Heterogeneous model  individual Payoffs Age What is the equilibrium state?

Experimental Setup Implemented in Netlogo4.0 [ Wilensky 1999 ] Underlying framework  Standard Spatial Iterated Prisoner’s Dilemma. Agents mapped on 2-D regular lattice (32*32 torus) Population initialized  20% cooperators Each trial  1000 iterations All configurations  30 times Statistical results are reported

Experiment 1  sensitivity to the base payoff values Two different base level payoff values T, R, P, S and K = 0.2 a) Big  5, 3, 1, 0 b) Small  1, 1, 0, 0

Experiment 2  sensitivity to the magnitude of K base level payoff values T, R, P, S  5, 3, 1, 0 K was changed systematically K represents environmental constraint on social diversity

Snapshots Evolving population for homogeneous and heterogeneous model K = 0.1 and initial cooperation 20% Varying size clusters of cooperators (black) Homogeneous  Heterogeneous 

Conclusion Results  heterogeneous social diversity, promotes cooperation. Differences to previous work  each agent is equipped with their own evolving payoff matrix. The evolving payoff matrix  agents’ age or experience level. More realistic approach  real world scenarios. Future work  extend the model to distributed multiagent systems (P2P, MANET)

Questions? Thank you

References H. Fort, On evolutionary spatial heterogeneous games, Physica A (2007). M. Perc and A. Szolnoki, Social diversity and promotion of cooperation in spatial prisoner's dilemma game, Physical Review E 77 (2008). M. Tomochi and M. Kono, Spatial prisoner's dilemma games with dynamic payoff matrices, Physical Review E 65 (2002), no Wilensky, U.: NetLogo is a cross-platform multi-agent programmable modeling environment. In: Modeling Nature’s Emergent Patterns with Multi-agent Languages. Proceedings of EuroLogo 2002 (2002),

Experiment 3  sensitivity to the life span (λ) base level payoff values T, R, P, S  5, 3, 1, 0 K = 0.2 λ from different range

Experiment 4  sensitivity to the replacement strategy base level payoff values T, R, P, S  5, 3, 1, 0 K = 0.2 Replacement with random generated agent and defector agent

Related work Few studies have examined the impact of varying the magnitude of the payoff matrix values in PD Tomochi and Kono: Payoff matrix was designed to evolve based on the ratio of defectors (cooperators) to the whole population. (considered R and P only) Universal payoff matrix applicable to all agents at time t. The level of cooperation within population was directly related to the payoff matrix values

Related work … Perc and Szolnoki: Random noise drawn from alternative statistical distributions was added to the payoff matrix at the beginning of the game. (fixed matrix till the end) They concluded that this correlated “social diversity mechanism” promoted higher-levels of cooperation in the spatial game examined. It was suggested that variable social status might play a crucial role in the evolution of cooperation.

Related work … Fort: The payoff matrix was correlated with a spatial and temporal zones. (considered only T) It was possible that the payoffs for an agent and their opponent were not equal – reminiscent of what happens in general in real life. The results reported suggested that the effect of asymmetries in the interactions between agents, which takes into account the effect of asymmetries in the costs and benefits on the evolution of cooperation, had a direct impact on the proportion of agents cooperating in the population. The Prisoner’s Dilemma inequality was relaxed, and when the payoff matrix values changed, the game oscillated between the Prisoner’s Dilemma game and Chicken game or the game becomes Stag Hunt game.

What is the idea? Ex./ You and your friend, colleague Ex./ 2 countries  punishment system for the same crime. Different individual conditions (Heterogeneity) have impact on the behaviour of two people/agents and may alter their interaction and their cooperation. In this paper we investigate this idea on a version of the Spatial Prisoner’s Dilemma (SPD) game. Why? Good abstract  many real world scenarios. Famous game theoretic approach  capture agents interaction Mathematical model  study the evolution of cooperation Applied in many areas  biology, economics, and sociology