An Architectural View of Game Theoretic Control Raga Gopalakrishnan and Adam Wierman California Institute of Technology Jason R. Marden University of Colorado.

Presentation transcript:

An Architectural View of Game Theoretic Control
Raga Gopalakrishnan and Adam Wierman (California Institute of Technology), Jason R. Marden (University of Colorado at Boulder)
Hotmetrics 2010, 6/18/2010

Distributed Resource Allocation
- Sensor Coverage
- Wireless Access Point Selection
- Wireless Channel Selection
- Power Control (sensor networks)

Resource Allocation Problem: A Simple Model
- Set of (distributed) agents, $N = \{1, 2, \ldots, n\}$
- Set of resources, $R$
- Action sets, $A_i \subseteq 2^R$ for agents $i \in N$
  - Set of action profiles, $A = A_1 \times A_2 \times \cdots \times A_n$
  - Set of agents choosing resource $r$ in action profile $a$, $\{a\}_r$
- Objective function, $W : A \to \mathbb{R}$
  - Linearly separable, i.e., $W(a) = \sum_{r \in R} W_r(\{a\}_r)$
- Goal: Find an allocation $a \in A$ that maximizes $W(a)$
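
The model above can be made concrete with a small centralized baseline. The following is a minimal sketch, not part of the talk: it enumerates every action profile and picks a welfare maximizer. The coverage-style welfare, the resource names, and the values in `VALUE` are assumptions chosen only for illustration.

```python
from itertools import product

def agents_on(profile, r):
    """Return {a}_r: the set of agents whose chosen action contains resource r."""
    return frozenset(i for i, action in enumerate(profile) if r in action)

def welfare(profile, resources, W_r):
    """Linearly separable welfare: W(a) = sum over r in R of W_r({a}_r)."""
    return sum(W_r(r, agents_on(profile, r)) for r in resources)

def best_allocation(action_sets, resources, W_r):
    """Exhaustively search A = A_1 x ... x A_n for a welfare-maximizing profile."""
    return max(product(*action_sets), key=lambda a: welfare(a, resources, W_r))

# Hypothetical instance: a resource yields its value once at least one agent covers it.
VALUE = {"r1": 3.0, "r2": 2.0}

def coverage(r, agents):
    return VALUE[r] if agents else 0.0

RESOURCES = ["r1", "r2"]
ACTION_SETS = [
    [frozenset({"r1"}), frozenset({"r2"})],  # A_1: agent 0 picks one resource
    [frozenset({"r1"}), frozenset({"r2"})],  # A_2: agent 1 picks one resource
]

best = best_allocation(ACTION_SETS, RESOURCES, coverage)
print(best, welfare(best, RESOURCES, coverage))
```

Exhaustive search grows exponentially with the number of agents and needs global information, which is the motivation for the distributed approaches on the following slides.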

Distributed Approaches
- Distributed Optimization
- Lyapunov-based Control
- Physics-inspired Control
- Game-theoretic Control
  - Promising new approach
  - Still being explored
  - Model the agents as "self-interested" players in a non-cooperative game
  - The solution to the problem emerges as the equilibrium of the game

Modeling the problem as a game
Resource Allocation Problem:
- Set of agents, $N = \{1, 2, \ldots, n\}$
- Set of resources, $R$
- Action sets, $A_i \subseteq 2^R$ for agents $i \in N$
  - Set of action profiles, $A = A_1 \times A_2 \times \cdots \times A_n$
  - Set of agents choosing resource $r$ in action profile $a$, $\{a\}_r$
- Objective function, $W : A \to \mathbb{R}$
  - Linearly separable, i.e., $W(a) = \sum_{r \in R} W_r(\{a\}_r)$
Resource Allocation Game:
- Set of players, $N = \{1, 2, \ldots, n\}$
- Action sets, $A_i \subseteq 2^R$ for players $i \in N$
  - Set of action profiles, $A = A_1 \times A_2 \times \cdots \times A_n$
  - Set of players choosing resource $r$ in action profile $a$, $\{a\}_r$
- Utility functions, $U_i : A \to \mathbb{R}$ for players $i \in N$
  - Linearly separable, i.e., $U_i(a) = \sum_{r \in R} f_r(i, \{a\}_r)$
- Welfare function, $W : A \to \mathbb{R}$
  - Linearly separable, i.e., $W(a) = \sum_{r \in R} W_r(\{a\}_r)$

Game Theoretic Control (GTC)
1. Set up the game: decision makers/players, action sets, utility functions
2. Design the players: agent decision rules (learning rules)
Goal: Desirable global behavior emerges as the equilibrium of the game

Game Theoretic Control (GTC)
1. Set up the game: decision makers/players, action sets, utility functions
2. Design the players: agent decision rules (learning rules)
Utility Design, desirable properties:
- Existence of an equilibrium
- Efficiency of an equilibrium
- Tractability
- Locality of information
- Budget balance
- …
Learning Design, desirable properties:
- Locality of information
- Fast convergence
- Equilibrium selection
- Robust convergence
- …
Diagram labels: Inherited, Designed

Applications of GTC (each combines a Utility Design with a Learning Design)
- Sensor Coverage [Marden, Wierman 2008]
- Power Control (sensor networks) [Campos-Nanez, Garcia, Li 2008]
- Many other applications: [Akella et al. 2002, Kauffmann et al. 2007, Marden et al. 2007, 2008, Mhatre et al. 2007, Komali and MacKenzie 2007, Zou and Chakrabarty 2004, Campos-Nanez 2008, Marden & Effros 2009]
Is there a way to view Game Theoretic Control from an application-independent perspective?

Architectural View for GTC
- Layers: Utility Design / Class of Games ("virtualization" layer) / Learning Design
- Analogies: Apps / IP / Network hardware (network); software / OS / hardware
- Potential Games are games for which there exists a potential function $\Phi : A \to \mathbb{R}$ such that $\forall i \in N$, $\forall a_{-i} \in A_{-i}$, $\forall a_i, a_i' \in A_i$, it holds that $\Phi(a_i, a_{-i}) - \Phi(a_i', a_{-i}) = U_i(a_i, a_{-i}) - U_i(a_i', a_{-i})$
- Key Property: Local maxima of $\Phi$ are Nash equilibria
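
The potential-game condition and the key property can be checked by brute force on a small game. The sketch below is illustrative only: the two-player game, its utilities `U`, and the candidate potential `phi` are hypothetical, and "local maximum" is taken to mean a profile that no unilateral deviation improves.

```python
from itertools import product

def deviations(profile, i, action_sets):
    """Yield every profile reachable when only player i changes its action."""
    for alt in action_sets[i]:
        yield profile[:i] + (alt,) + profile[i + 1:]

def is_exact_potential(action_sets, utilities, phi, tol=1e-9):
    """Check phi(a_i, a_-i) - phi(a_i', a_-i) == U_i(a_i, a_-i) - U_i(a_i', a_-i)."""
    for a in product(*action_sets):
        for i, U_i in enumerate(utilities):
            for b in deviations(a, i, action_sets):
                if abs((phi(a) - phi(b)) - (U_i(a) - U_i(b))) > tol:
                    return False
    return True

def pure_nash_equilibria(action_sets, utilities):
    """Profiles where no player gains by deviating unilaterally."""
    return [a for a in product(*action_sets)
            if all(U_i(a) >= U_i(b)
                   for i, U_i in enumerate(utilities)
                   for b in deviations(a, i, action_sets))]

def local_maxima(action_sets, phi):
    """Profiles where no unilateral deviation increases phi."""
    n = len(action_sets)
    return [a for a in product(*action_sets)
            if all(phi(a) >= phi(b)
                   for i in range(n)
                   for b in deviations(a, i, action_sets))]

# Hypothetical coordination game: both players gain 2 for matching,
# and player 0 gets a bonus equal to its own action.
ACTIONS = [[0, 1], [0, 1]]

def match(a):
    return 2.0 if a[0] == a[1] else 0.0

U = [lambda a: match(a) + a[0], lambda a: match(a)]
phi = lambda a: match(a) + a[0]

print(is_exact_potential(ACTIONS, U, phi))
print(pure_nash_equilibria(ACTIONS, U), local_maxima(ACTIONS, phi))
```

In this instance the pure Nash equilibria and the local maxima of `phi` coincide, and one of them, `(0, 0)`, is not the global maximizer of `phi`.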

Potential Games-based Architecture
Layers: Utility Design / Potential Games / Learning Design
Unifying view of several existing designs: [Akella et al. 2002] [Kauffmann et al. 2007] [Marden et al. 2007, 2008] [Mhatre et al. 2007] [Komali and MacKenzie 2007] [Zou and Chakrabarty 2004] [Campos-Nanez 2008] [Marden & Wierman 2008] [Marden & Effros 2009] and many others…

Utility Design (examples)
- Wonderful Life Utility (WLU) [Wolpert et al. 1999]
  - Potential game with $\Phi = W$ (hence, price of stability = 1)
  - Price of anarchy = ½ for sub-modular games
- Shapley Value Utility (SVU) [Shapley 1953]
  - Potential game
  - Price of anarchy = Price of stability = ½ for sub-modular games
- Weighted SVU [Shapley 1953]
  - Similar properties as SVU
Adapted from cost-sharing literature in economic theory [Marden, Wierman]
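
The two main utility designs can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' code: WLU is taken as an agent's marginal contribution to the welfare at each resource it selects, SVU as its Shapley value share of $W_r$ at each selected resource, and the coverage instance is hypothetical (with $W_r(\emptyset) = 0$).

```python
from itertools import combinations, product
from math import factorial

def agents_on(profile, r):
    """{a}_r: agents whose action contains resource r."""
    return frozenset(i for i, action in enumerate(profile) if r in action)

def welfare(profile, resources, W_r):
    return sum(W_r(r, agents_on(profile, r)) for r in resources)

def wlu(i, profile, W_r):
    """Wonderful Life Utility: W_r({a}_r) - W_r({a}_r without i), summed over i's resources."""
    total = 0.0
    for r in profile[i]:
        S = agents_on(profile, r)
        total += W_r(r, S) - W_r(r, S - {i})
    return total

def shapley_share(i, r, S, W_r):
    """Shapley value of agent i in the 'game' T -> W_r(T) restricted to the agents S on r."""
    others = sorted(S - {i})
    n = len(S)
    share = 0.0
    for k in range(len(others) + 1):
        for T in combinations(others, k):
            T = frozenset(T)
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            share += weight * (W_r(r, T | {i}) - W_r(r, T))
    return share

def svu(i, profile, W_r):
    """Shapley Value Utility: sum of i's Shapley shares over the resources it selects."""
    return sum(shapley_share(i, r, agents_on(profile, r), W_r) for r in profile[i])

# Hypothetical coverage instance (values are illustrative).
VALUE = {"r1": 3.0, "r2": 2.0}

def coverage(r, agents):
    return VALUE[r] if agents else 0.0

RESOURCES = ["r1", "r2"]
ACTION_SETS = [[frozenset({"r1"}), frozenset({"r2"})] for _ in range(2)]

both_on_r1 = (frozenset({"r1"}), frozenset({"r1"}))
print(wlu(0, both_on_r1, coverage))  # redundant agent: zero marginal contribution
print(svu(0, both_on_r1, coverage))  # the two agents split W_r1 equally
```

On this instance the WLU differences under unilateral deviations match the differences in $W$, consistent with $\Phi = W$, while the SVU shares at each profile add up to $W(a)$ (budget balance). WLU does not have the latter property here: both agents receive zero when they pick the same resource.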

Learning Design (examples)
- Gradient Play [Ermoliev et al. 1997, Shamma et al. 2005]
  - Convergence to a Nash equilibrium
- Joint Strategy Fictitious Play (JSFP) [Marden et al. 2009]
  - Convergence to a Nash equilibrium
- Log-Linear Learning [Blume 1993, Marden et al.]
  - Convergence to the best Nash equilibrium
- Many others... [Ozdaglar et al. 2009, Shah et al. 2010]
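
As one example of a learning rule, log-linear learning can be sketched as follows. This is a minimal version under assumptions not stated on the slide: one uniformly random player updates per step, and it picks each action with probability proportional to $\exp(U_i / \tau)$ for a temperature $\tau > 0$. The game, the temperature values, and all function names are hypothetical. `gibbs_distribution` computes the distribution proportional to $\exp(\Phi(a)/\tau)$, which is the known stationary distribution of this rule in exact potential games.

```python
import math
import random
from itertools import product

def logit_probabilities(values, tau):
    """Softmax of values / tau, shifted by the maximum for numerical stability."""
    m = max(values)
    weights = [math.exp((v - m) / tau) for v in values]
    total = sum(weights)
    return [w / total for w in weights]

def log_linear_learning(action_sets, utilities, start, tau, steps, rng):
    """Asynchronous log-linear learning: one random player revises per step."""
    a = tuple(start)
    for _ in range(steps):
        i = rng.randrange(len(action_sets))
        candidates = [a[:i] + (x,) + a[i + 1:] for x in action_sets[i]]
        probs = logit_probabilities([utilities[i](c) for c in candidates], tau)
        a = rng.choices(candidates, weights=probs, k=1)[0]
    return a

def gibbs_distribution(action_sets, phi, tau):
    """Distribution over profiles proportional to exp(phi(a) / tau)."""
    profiles = list(product(*action_sets))
    probs = logit_probabilities([phi(a) for a in profiles], tau)
    return dict(zip(profiles, probs))

# Hypothetical two-player potential game (potential phi, maximized at (1, 1)).
ACTIONS = [[0, 1], [0, 1]]

def match(a):
    return 2.0 if a[0] == a[1] else 0.0

U = [lambda a: match(a) + a[0], lambda a: match(a)]
phi = lambda a: match(a) + a[0]

final = log_linear_learning(ACTIONS, U, (0, 1), tau=0.01, steps=50, rng=random.Random(0))
dist = gibbs_distribution(ACTIONS, phi, tau=0.1)
print(final, max(dist, key=dist.get))
```

At low temperature the Gibbs distribution concentrates on the maximizer of `phi`, which is the sense in which the rule selects the best equilibrium. A short run at very low temperature behaves almost like best response and may sit at a different equilibrium for a long time, so the temperature trades selection quality against mixing speed.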

Potential Games-based Architecture
Utility Design: SVU, Wonderful Life, WSVU
Class of Games: Potential Games
Learning Design: Gradient Play, Log-Linear Learning, JSFP
+ Modularity / Decoupling
+ Flexibility
? Relationships to other approaches
? Limitations

Relationships to Other Approaches
Distributed Approaches:
- Distributed Optimization
- Lyapunov-based Control
- Physics-inspired Control
- Game-theoretic Control (Potential Games: Utility Design, Learning Design)

Distributed Optimization
- Distributed Constraint Optimization Problem (DCOP)
  - Utility Design: WLU
  - Learning Design: Variety
- Chapman, Rogers, Jennings: Benchmarking hybrid algorithms for distributed constraint optimization games [OptMAS '08]
- In the architecture: WLU / Potential Games / Variety

Physics-inspired Control
- Gibbs-sampler-based control
  - Utility Design: WLU
  - Learning Design: Log-Linear Learning
- Applications: Access Point Selection, Channel Selection
- Kauffmann, Baccelli, Chaintreau, Mhatre, Papagiannaki, Diot: Measurement-based self organization of interfering wireless access networks [INFOCOM '07]
- In the architecture: WLU / Potential Games / Log-Linear Learning
We prove that

Potential Games-based Architecture
Utility Design: SVU, Wonderful Life, WSVU
Class of Games: Potential Games
Learning Design: Gradient Play, Log-Linear Learning, JSFP
+ Modularity / Decoupling
+ Flexibility
Relationships to other approaches
? Limitations

Do Potential Games Suffice? Not always!
- Desirable properties of a utility design: Existence of an equilibrium, Efficiency of an equilibrium, Budget balance, Tractability, Locality of information, …
- No utility design with all the desirable properties
- Any linearly separable, budget-balanced utility design that guarantees equilibrium existence has PoS $\le$ ½ [Marden, Wierman 2009]
- Open Question: What other limitations are there?
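
The tension between budget balance and efficiency can be seen on a toy instance. The sketch below is an illustration only: it computes the price of anarchy (worst pure Nash welfare over optimal welfare) and the price of stability (best pure Nash welfare over optimal welfare) by enumeration. The two-resource coverage game and its values are hypothetical, and the instance shows a loss of efficiency, not the worst-case bound quoted above.

```python
from itertools import product

# Hypothetical coverage instance: each of two agents picks exactly one resource.
VALUE = {"r1": 4.0, "r2": 1.0}
RESOURCES = list(VALUE)
ACTION_SETS = [[frozenset({r}) for r in RESOURCES] for _ in range(2)]

def agents_on(profile, r):
    return frozenset(i for i, action in enumerate(profile) if r in action)

def W_r(r, agents):
    return VALUE[r] if agents else 0.0

def welfare(profile):
    return sum(W_r(r, agents_on(profile, r)) for r in RESOURCES)

def equal_share(i, profile):
    """Budget-balanced design: agents on r split W_r equally."""
    return sum(W_r(r, agents_on(profile, r)) / len(agents_on(profile, r))
               for r in profile[i])

def wonderful_life(i, profile):
    """WLU: marginal contribution of agent i (not budget balanced)."""
    return sum(W_r(r, agents_on(profile, r)) - W_r(r, agents_on(profile, r) - {i})
               for r in profile[i])

def pure_nash(utility):
    """Profiles where no agent gains by a unilateral deviation."""
    equilibria = []
    for a in product(*ACTION_SETS):
        stable = all(utility(i, a) >= utility(i, a[:i] + (alt,) + a[i + 1:])
                     for i in range(len(ACTION_SETS))
                     for alt in ACTION_SETS[i])
        if stable:
            equilibria.append(a)
    return equilibria

def poa_pos(utility):
    """Return (price of anarchy, price of stability) over pure Nash equilibria."""
    optimum = max(welfare(a) for a in product(*ACTION_SETS))
    values = [welfare(a) for a in pure_nash(utility)]
    return min(values) / optimum, max(values) / optimum

print(poa_pos(equal_share))
print(poa_pos(wonderful_life))
```

With equal sharing, both agents crowd onto the valuable resource and the only equilibrium loses the value of the other one. WLU recovers the optimum on this instance, but the utilities it hands out no longer sum to the welfare.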

Summary
Utility Design: SVU, Wonderful Life, WSVU
Class of Games: Potential Games
Learning Design: Gradient Play, Log-Linear Learning, JSFP
+ Modularity / Decoupling
+ Flexibility
Relationships to other approaches
- Not all desirable properties can be achieved
? Beyond Potential Games

Conclusion
Utility Design: SVU, Wonderful Life, WSVU
Class of Games: Potential Games
Learning Design: Gradient Play, Log-Linear Learning, JSFP
+ Modularity / Decoupling
+ Flexibility
Relationships to other approaches
- Not all desirable properties can be achieved
? Other choices for virtualization layer [MW'09, AJWG'09, Sv'09]
Strengths and Limitations
A library of architectures

Thank You