A Game-Theoretic Approach to Determining Efficient Patrolling Strategies for Mobile Robots Francesco Amigoni, Nicola Gatti, Antonio Ippedico.

Slides:



Advertisements
Similar presentations
N. Basilico, N. Gatti, F. Amigoni DEI, Politecnico di Milano Leader-Follower Strategies for Robotic Patrolling in Environments with Arbitrary Topology.
Advertisements

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Computing Bayes-Nash Equilibria through Support Enumeration Methods in Bayesian Two-Player.
F. Amigoni, N. Basilico, N. Gatti DEI, Politecnico di Milano Finding the Optimal Strategies in Robotic Patrolling with Adversaries in Topologically-Represented.
Multiagent Technology Solutions for Planning in Ambient Intelligence Nicola Gatti, Francesco Amigoni, Marco Rolando {ngatti,
Autonomic Scaling of Cloud Computing Resources
Jose-Luis Blanco, Javier González, Juan-Antonio Fernández-Madrigal University of Málaga (Spain) Dpt. of System Engineering and Automation May Pasadena,
Continuation Methods for Structured Games Ben Blum Christian Shelton Daphne Koller Stanford University.
1 University of Southern California Keep the Adversary Guessing: Agent Security by Policy Randomization Praveen Paruchuri University of Southern California.
Randomized Sensing in Adversarial Environments Andreas Krause Joint work with Daniel Golovin and Alex Roper International Joint Conference on Artificial.
Game Theoretical Insights in Strategic Patrolling: Model and Analysis Nicola Gatti – DEI, Politecnico di Milano, Piazza Leonardo.
Bargaining in Markets with One-Sided Competition: Model and Analysis Nicola Gatti DEI, Politecnico di Milano, Piazza Leonardo da.
© 2015 McGraw-Hill Education. All rights reserved. Chapter 15 Game Theory.
Playing Games for Security: An Efficient Exact Algorithm for Solving Bayesian Stackelberg Games Praveen Paruchuri, Jonathan P. Pearce, Sarit Kraus Catherine.
Maynard Smith Revisited: Spatial Mobility and Limited Resources Shaping Population Dynamics and Evolutionary Stable Strategies Pedro Ribeiro de Andrade.
EC102: Class 9 Christina Ammon.
Convergent Learning in Unknown Graphical Games Dr Archie Chapman, Dr David Leslie, Dr Alex Rogers and Prof Nick Jennings School of Mathematics, University.
Zach Ramaekers Computer Science University of Nebraska at Omaha Advisor: Dr. Raj Dasgupta 1.
Algorithmic Game Theory Nicola Gatti and Marcello Restelli {ngatti, DEI, Politecnico di Milano, Piazza Leonardo da Vinci 32, Milano,
Autonomous Target Assignment: A Game Theoretical Formulation Gurdal Arslan & Jeff Shamma Mechanical and Aerospace Engineering UCLA AFOSR / MURI.
Approximate Solutions for Partially Observable Stochastic Games with Common Payoffs Rosemary Emery-Montemerlo joint work with Geoff Gordon, Jeff Schneider.
XYZ 6/18/2015 MIT Brain and Cognitive Sciences Convergence Analysis of Reinforcement Learning Agents Srinivas Turaga th March, 2004.
Bargaining in-Bundle over Multiple Issues in Finite- Horizon Alternating-Offers Protocol Francesco Di Giunta and Nicola Gatti Politecnico di Milano Milan,
On Spectrum Selection Games in Cognitive Radio Networks
Developing a Deterministic Patrolling Strategy for Security Agents Nicola Basilico, Nicola Gatti, Francesco Amigoni.
Complexity of Mechanism Design Vincent Conitzer and Tuomas Sandholm Carnegie Mellon University Computer Science Department.
Better automated abstraction techniques for imperfect information games, with application to Texas Hold’em poker * Andrew Gilpin and Tuomas Sandholm, CMU,
1 A Game Theoretic Formulation of the Dynamic Sensor Coverage Problem Jason Marden ( UCLA ) Gürdal Arslan ( University of Hawaii ) Jeff Shamma ( UCLA )
CS121 Heuristic Search Planning CSPs Adversarial Search Probabilistic Reasoning Probabilistic Belief Learning.
Reinforcement Learning Game playing: So far, we have told the agent the value of a given board position. How can agent learn which positions are important?
An Environmental Multiagent Architecture for Health Management Francesco Amigoni Nicola Gatti.
Game Theory.
1 Combinatorial Problems in Cooperative Control: Complexity and Scalability Carla Gomes and Bart Selman Cornell University Muri Meeting March 2002.
Alternating-Offers Bargaining under One-Sided Uncertainty on Deadlines Francesco Di Giunta and Nicola Gatti Dipartimento di Elettronica e Informazione.
Hardness-Aware Restart Policies Yongshao Ruan, Eric Horvitz, & Henry Kautz IJCAI 2003 Workshop on Stochastic Search.
Simple search methods for finding a Nash equilibrium Ryan Porter, Eugene Nudelman, and Yoav Shoham Games and Economic Behavior, Vol. 63, Issue 2. pp ,
6-1 Solving Systems by Graphing
History-Dependent Graphical Multiagent Models Quang Duong Michael P. Wellman Satinder Singh Computer Science and Engineering University of Michigan, USA.
A quantum protocol for sampling correlated equilibria unconditionally and without a mediator Iordanis Kerenidis, LIAFA, Univ Paris 7, and CNRS Shengyu.
1 S ystems Analysis Laboratory Helsinki University of Technology Kai Virtanen, Janne Karelahti, Tuomas Raivio, and Raimo P. Hämäläinen Systems Analysis.
CPS LP and IP in Game theory (Normal-form Games, Nash Equilibria and Stackelberg Games) Joshua Letchford.
Influence Maximization in Dynamic Social Networks Honglei Zhuang, Yihan Sun, Jie Tang, Jialin Zhang, Xiaoming Sun.
Patrolling Games Katerina Papadaki London School of Economics with Alec Morton and Steven Alpern.
1 S ystems Analysis Laboratory Helsinki University of Technology Kai Virtanen, Tuomas Raivio and Raimo P. Hämäläinen Systems Analysis Laboratory Helsinki.
Playing GWAP with strategies - using ESP as an example Wen-Yuan Zhu CSIE, NTNU.
A Study of Central Auction Based Wholesale Electricity Markets S. Ceppi and N. Gatti.
Optimal Resource Allocation for Protecting System Availability against Random Cyber Attack International Conference Computer Research and Development(ICCRD),
Adversarial Search Chapter Games vs. search problems "Unpredictable" opponent  specifying a move for every possible opponent reply Time limits.
1 - CS7701 – Fall 2004 Review of: Detecting Network Intrusions via Sampling: A Game Theoretic Approach Paper by: – Murali Kodialam (Bell Labs) – T.V. Lakshman.
Practical LFU implementation for Web Caching George KarakostasTelcordia Dimitrios N. Serpanos University of Patras.
Linear Systems of Equations Section 3.1. What is a “system” of equations?
1 Random Disambiguation Paths Al Aksakalli In Collaboration with Carey Priebe & Donniell Fishkind Department of Applied Mathematics and Statistics Johns.
Engineering Systems Analysis for Design Massachusetts Institute of Technology Richard de Neufville © Decision Analysis Basics Slide 1 of 21 Calculations.
Sporadic model building for efficiency enhancement of the hierarchical BOA Genetic Programming and Evolvable Machines (2008) 9: Martin Pelikan, Kumara.
Slide 1/20 Defending Against Strategic Adversaries in Dynamic Pricing Markets for Smart Grids Paul Wood, Saurabh Bagchi Purdue University
John D. Hey LUISS & University of York Julia A. Knoll University of Düsseldorf Strategies in Dynamic Decision Making An Experimental Investigation on the.
Mean Field Methods for Computer and Communication Systems Jean-Yves Le Boudec EPFL Network Science Workshop Hong Kong July
Commitment to Correlated Strategies Vince Conitzer and Dima Korzhyk.
Keep the Adversary Guessing: Agent Security by Policy Randomization
C. Kiekintveld et al., AAMAS 2009 (USC) Presented by Neal Gupta
A Game Theoretic Study of Attack and Defense in Cyber-Physical Systems
Coordination with Linear Equations
C.-S. Shieh, EC, KUAS, Taiwan
Extensive-form games and how to solve them
Solutions Sample Games 1
Game Theory Solutions 1 Find the saddle point for the game having the following payoff table. Use the minimax criterion to find the best strategy for.
Maximum Lifetime of Sensor Networks with Adjustable Sensing Range
[# ] A Game Theoretical Approach to Model Decision Making for Merging Maneuvers at Freeway On-ramps Authors: Kyungwon K. and Hesham A. R. Session.
Kai Virtanen, Janne Karelahti, Tuomas Raivio, and Raimo P. Hämäläinen
Game Theory Solutions 1 Find the saddle point for the game having the following payoff table. Use the minimax criterion to find the best strategy for.
Classifier-Based Approximate Policy Iteration
Presentation transcript:

A Game-Theoretic Approach to Determining Efficient Patrolling Strategies for Mobile Robots Francesco Amigoni, Nicola Gatti, Antonio Ippedico

F. Amigoni, N. Gatti, A, Ippedico Scenario 2

F. Amigoni, N. Gatti, A, Ippedico Summary of Contributions Problem: determination of an efficient patrolling strategy for a mobile robot Idea: 1. model the scenario as an extensive-form game played by the patroller and the intruder 2. solve the game to find the strategy for the patroller 3

F. Amigoni, N. Gatti, A, Ippedico The Proposed Model: Assumptions Time is discrete, players play in turns Environment with n places Patroller detects the presence of the intruder (captures the intruder) when it is in the patroller’s current place Intruder knows the strategy of the patroller Patroller’s actions: move from one place to another one (incurring in different costs), movements can be between any pairs of places Intruder’s actions: wait or attempt to enter a place Entering a place takes d turns The game ends either when the intruder is captured or has entered a place Players payoffs are defined according to values attributed to places, to costs for moving between places, and to rewards for capturing the intruder Intruder can be of different types, each one with different values for places 4

F. Amigoni, N. Gatti, A, Ippedico The Proposed Model: Extensive-form Game The intruder knows the patroller’s strategy and the patroller knows it  commitment-based strategy for the patroller Finding an optimal solution is not easy, basically because the environment can dynamically change and because the game is infinite-horizon  approximate solution 5 intruder’s action patroller’s action ……………………

F. Amigoni, N. Gatti, A, Ippedico Solving the Game: Finding a Patrolling Strategy Greedy approach: we consider a slice of the extensive-form game as an independent strategic-form game Solving each slice means finding the next optimal action for the patrolling robot A slice can be solved by resorting to a multi-LP [Conitzer and Sandholm, EC 2006] or to a MILP [Paruchuri et al., AAMAS 2008] mathematical programming formulation Solution: mixed strategy for the patrolling robot: {γ 1,γ 2,…,γ n } 6 intruder’s action patroller’s action ……………………

F. Amigoni, N. Gatti, A, Ippedico Experimental Results The approach scales reasonably well with the number n of places (using the multi-LP formulation) and with the number of intruder’s types (using the MILP formulation) The approach can be applied to different environments Linear environment Ring and star environments The approach adapts to dynamic changes in the environment 7 methodutility our approach (multi-LP)79.0 uniform52.1 proportional69.5 random nodes, 10 runs, 500 steps each

F. Amigoni, N. Gatti, A, Ippedico Conclusions We proposed a game-theoretic approach to determining strategies for patrolling robots Modeling a patrolling situation as an extensive-form game Finding an (approximate) solution of the game Patrolling strategies found with our approach are efficient Ongoing work Optimal solutions for the extensive-form game More realistic scenarios Implementation on real robots 8

F. Amigoni, N. Gatti, A, Ippedico State of the Art Patrolling strategies for mobile robots are based on random movements to be unpredictable for an observing intruder Some approaches do not consider any model for the intruder [Paruchuri et al., AAMAS 2006] [Agmon et al., AAMAS 2008] Few approaches do consider a model for the intruder, in a strategic- form game formulation [Paruchuri et al., AAMAS 2007] Some game theoretical inconsistencies, as shown in [Gatti, ECAI 2008] 9