Weighted Voting Game Based Multi-robot Team Formation for Distributed Area Coverage Ke Cheng and Prithviraj (Raj) Dasgupta Computer Science Department.

Presentation transcript:

Weighted Voting Game Based Multi-robot Team Formation for Distributed Area Coverage Ke Cheng and Prithviraj (Raj) Dasgupta Computer Science Department University of Nebraska, Omaha

Research Objective: Multi-robot Coverage
– Use a set of robots to perform complete coverage of an initially unknown environment in an efficient manner
– Efficiency is measured in time and space
  – Time: reduce the time required to cover the environment
  – Space: avoid repeated coverage of regions that have already been covered
– Tradeoff in achieving both simultaneously

Major Challenges
– Distributed: no shared memory or map of the environment that the robots can use to know which portion of the environment is covered
– Each robot has limited storage and computation capabilities
  – Can’t store a map of the entire environment
– Other challenges: sensor and encoder noise, communication overhead, localizing robots

How does a robot do area coverage?
– Using an actuator (e.g., vacuum) or a sensor (e.g., camera or sonar) as the robot’s coverage tool
– The region of the environment that passes under the swathe of the robot’s coverage tool is considered covered
Source: Manuel Mazo Jr. and Karl Henrik Johansson, “Robust area coverage using hybrid control,” TELEC'04, Santiago de Cuba, Cuba, 2004

E-puck Mini Robot
– IR sensors (8); range ~4 cm
– Camera; 640 × 480 VGA
– Bluetooth wireless communication
– LEDs
– Mic + speaker
– 144 KB RAM
– dsPIC
– Dimension labels on photo: 7 cm, 4.1 cm
Photo courtesy: Mobots

Multi-robot coverage: Individually coordinated robots using swarming
– Global objective: complete coverage of environment
– Local coverage rule of each robot
– Local interactions between robots
– How well do the results of the local interactions translate to achieving the global objective? Done empirically
References:
1. K. Cheng and P. Dasgupta, "Dynamic Area Coverage using Faulty Multi-agent Swarms," Proc. IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT 2007), Fremont, CA, 2007, pp
2. P. Dasgupta, K. Cheng, "Distributed Coverage of Unknown Environments using Multi-robot Swarms with Memory and Communication Constraints," UNO CS Technical Report (cst ).

Multi-robot coverage: Team-based robots using swarming
– Global objective: complete coverage of environment
– Local coverage rule of each robot team
– Flocking technique to maintain team formation
– Local interactions between robot teams
– How well do the results of the local interactions translate to achieving the global objective? Done empirically
Relevant publications:
1. K. Cheng, P. Dasgupta, Yi Wang, ”Distributed Area Coverage Using Robot Flocks,” Nature and Biologically Inspired Computing (NaBIC’09),
2. P. Dasgupta, K. Cheng, and L. Fan, ”Flocking-based Distributed Terrain Coverage with Mobile Mini-robots,” Swarm Intelligence Symposium 2009.

Multi-robot teams for area coverage
– Theoretical analysis: forming teams gives a significant speed-up in terms of coverage efficiency
– Simulation results: the speed-up is smaller than in the theoretical case, but there is still some speed-up compared to not forming teams
– Team formation is based on Reynolds’ flocking model
  – Leader referenced
  – Follower robots are designated specific positions within the team

Coverage with Multi-robot Teams
Environments shown: Square, Corridor, Office

Dynamic Reconfigurations of Robot Teams
– Having teams of robots is efficient for coverage
– Having large teams of robots doing frequent reformations is inefficient for coverage
– Can we make the teams change their configurations dynamically?
  – Based on their recent performance: if a team of robots is doing frequent reformations (and getting bad coverage efficiency), split the team into smaller teams and see if coverage improves

Robot Team Formation for Coverage: Agent Utility-based Approach
– Each robot/agent tries to get into a configuration that maximizes its utility
– Large team…inefficient coverage: low individual utility
Diagram labels:
– Utility function of each robot in a team
– Flocking-based Controller
– Mediator
– A team needs to reconfigure
– Calculate the configuration that gives the highest utility
– Check inconsistencies
Reference: P. Dasgupta and K. Cheng, “Coalition game-based distributed coverage of unknown environments using robot swarms,” AAMAS 2008.

Coalition game-based team formation
– We used coalition games to solve the multi-robot team formation problem
  – Coalition games provide a theory to divide a set of players into smaller subsets or teams
  – We used a form of coalition games called weighted voting games (WVG)

Robot Team Formation for Coverage: Weighted Voting Game
Diagram labels:
– Coalition Game Layer
– Flocking-based Controller
– Mediator
– A team needs to reconfigure
– Calculate the best partition of a team
– Maintain consistency between coalition game result and team formations

Coalitional Games: Weighted Voting Game (WVG) Definitions
– N: set of players
– v: characteristic function; assigns a real-valued utility to each subset of players
– Each player i is assigned a weight $w_i$
  – $W_{max} = \sum_{i \in N} w_i$
– q: quota, a fixed positive real number with $q \le W_{max}$
– If there is a subset of players C whose weights taken together equal or exceed the quota, C is called a winning coalition and v(C) = 1
  – Players not part of a winning coalition get v = 0

Weighted Voting Game: Definitions
– Minimal winning coalition: a winning coalition that no longer reaches the quota if any one of its players is removed
– Veto player: a player that appears in all winning coalitions; without that player, the other players can’t reach the quota
  – A game may not have a veto player

WVG Example
N = {A, B, C, D}; $w_A = 45$, $w_B = 25$, $w_C = 15$, $w_D = 15$
– quota = 51
  – Winning coalitions are {A, B}, {A, C}, {A, D}, {A, B, C}, {A, B, D}, {A, C, D}, {B, C, D}, {A, B, C, D}
  – No veto player
– Same weights, quota = 56
  – Winning coalitions are {A, B}, {A, C}, {A, D}, {A, B, C}, {A, B, D}, {A, C, D}, {A, B, C, D}
  – A is a veto player
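The example above can be checked by brute-force enumeration. The following is a minimal Python sketch, not code from the presentation; the function names `winning_coalitions`, `minimal_winning_coalitions`, and `veto_players` are hypothetical, and exhaustive enumeration is only practical for small player sets.

```python
from itertools import combinations


def winning_coalitions(weights, quota):
    """Return every coalition whose total weight reaches the quota."""
    players = sorted(weights)
    winners = []
    for size in range(1, len(players) + 1):
        for coalition in combinations(players, size):
            if sum(weights[p] for p in coalition) >= quota:
                winners.append(frozenset(coalition))
    return winners


def minimal_winning_coalitions(weights, quota):
    """Winning coalitions that contain no smaller winning coalition."""
    winners = winning_coalitions(weights, quota)
    return [c for c in winners if not any(other < c for other in winners)]


def veto_players(weights, quota):
    """Players that belong to every winning coalition (empty set if none)."""
    winners = winning_coalitions(weights, quota)
    if not winners:
        return set()
    return set(frozenset.intersection(*winners))


# Weights taken from the slide's example.
weights = {"A": 45, "B": 25, "C": 15, "D": 15}

for quota in (51, 56):
    print(quota, len(winning_coalitions(weights, quota)), veto_players(weights, quota))
```

With quota 51 the coalition {B, C, D} (weight 55) wins without A, so no player is in every winning coalition; raising the quota to 56 removes that coalition and makes A a veto player.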

Robot Coverage as WVG
– Determining weights of players (robots)
  – Modeled as coverage capability
– Environment considered as a 2-D grid
– Coverage map: region covered by robot in last T timesteps
– Coverage efficiency:
  – Time: what fraction of the coverage map has been covered at least once?
  – Space: what fraction of the coverage map has been covered more than once?
– Coverage capability of robot i: $C_i = a \times (\text{time fraction})_i - b \times (\text{space fraction})_i + C_0$, with a = 2, b = 1 and constant $C_0$
– Example values shown: $C_i = 1.96$ and $C_i = 0.96$
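One way to read the coverage-capability weight is sketched below in Python. This is an illustrative reading, not the authors' implementation: it assumes the coverage map is a 2-D grid of per-cell visit counts over the last T timesteps, that the first term is the fraction of cells covered at least once, and that the second is the fraction covered more than once. The value of C_0 did not survive in the transcript, so the default `c0=0.0` is a placeholder assumption, and the function names are hypothetical.

```python
def coverage_fractions(visit_counts):
    """Return (fraction covered at least once, fraction covered more than once).

    visit_counts is assumed to be a 2-D list of integers: how many times the
    robot covered each grid cell of its coverage map in the last T timesteps.
    """
    cells = [count for row in visit_counts for count in row]
    total = len(cells)
    covered_once = sum(1 for count in cells if count >= 1) / total
    covered_repeat = sum(1 for count in cells if count > 1) / total
    return covered_once, covered_repeat


def coverage_capability(visit_counts, a=2.0, b=1.0, c0=0.0):
    """Weight of a robot: reward fresh coverage, penalize repeated coverage.

    a and b follow the slide (a = 2, b = 1); c0 is an assumed placeholder.
    """
    covered_once, covered_repeat = coverage_fractions(visit_counts)
    return a * covered_once - b * covered_repeat + c0


# Hypothetical 2 x 2 coverage map: three cells covered, one of them twice.
example_map = [[1, 1],
               [2, 0]]
print(coverage_capability(example_map))
```

Under these assumptions a robot that covers its whole map exactly once scores highest, and every repeated cell lowers the weight it brings to a coalition.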

Breaking Ties Between Multiple Minimal Winning Coalitions Tie breaking using heuristic

Stability of Coalitions
– Is the partition of players imposed by the minimal winning coalition (MWC) going to be stable?
  – Yes, if it’s in the core of the game
  – Core: the sum of the payoffs of all the players in a team is at least as great as the payoff of the whole team
– Theorem 1: The core of a WVG is non-empty iff it has a veto player
– Theorem 2: The best minimal winning coalition (BMWC) is in the core
– Theorem 3: The best minimal winning coalition is unique
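Theorem 1 can be illustrated on the four-player example with a brute-force core check. The sketch below uses the standard textbook definition of the core (the payoffs sum to the value of the grand coalition, and no coalition receives less in total than its own value), which is assumed to match the informal definition on the slide; the function names `value` and `in_core` are hypothetical.

```python
from itertools import combinations


def value(coalition, weights, quota):
    """Characteristic function of a WVG: 1 for a winning coalition, else 0."""
    return 1 if sum(weights[p] for p in coalition) >= quota else 0


def in_core(payoff, weights, quota, tol=1e-9):
    """Check whether a payoff vector lies in the core by testing every coalition."""
    players = list(weights)
    # Efficiency: the payoffs must add up to the value of the grand coalition.
    if abs(sum(payoff[p] for p in players) - value(players, weights, quota)) > tol:
        return False
    # Stability: no coalition may be paid less in total than it can earn alone.
    for size in range(1, len(players) + 1):
        for coalition in combinations(players, size):
            if sum(payoff[p] for p in coalition) + tol < value(coalition, weights, quota):
                return False
    return True


weights = {"A": 45, "B": 25, "C": 15, "D": 15}
all_to_a = {"A": 1.0, "B": 0.0, "C": 0.0, "D": 0.0}
equal_split = {p: 0.25 for p in weights}

print(in_core(all_to_a, weights, 56))     # A is a veto player at quota 56
print(in_core(all_to_a, weights, 51))     # {B, C, D} wins without A at quota 51
print(in_core(equal_split, weights, 56))  # {A, B} would earn more on its own
```

Giving the whole payoff to the veto player is stable at quota 56 because every winning coalition contains A; at quota 51 the coalition {B, C, D} can win on its own, so the same payoff vector is not in the core.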

Outline of Algorithm for Team Reformation
When a team needs to reconfigure:
– For all robots that are within communication range of a leader robot
  – Find the veto players, set MWC = veto players
    – If there are no veto players, don’t form a team and move individually
  – If the veto players’ weights are enough to reach the quota, then stop *
  – Else add players from the non-veto set to the MWC, one at a time, until the sum of the players’ weights reaches the quota
*: If there are multiple MWCs, apply the heuristic to find the BMWC
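The outline above can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the authors' algorithm: the function name `form_team` is hypothetical, the quota is assumed not to exceed the total weight, and because the transcript does not describe the tie-breaking heuristic, non-veto robots are simply added in descending order of weight (ties broken by name). A robot is treated as a veto player when the remaining robots cannot reach the quota without it.

```python
def form_team(weights, quota):
    """Build one minimal winning coalition around the veto players.

    weights maps each robot in communication range to its weight.
    Returns the list of team members, or None when there is no veto player
    (in which case the robots move individually).
    """
    total = sum(weights.values())
    # A robot is a veto player if all the others together miss the quota.
    veto = [p for p, w in weights.items() if total - w < quota]
    if not veto:
        return None

    team = list(veto)
    team_weight = sum(weights[p] for p in team)

    # Assumed ordering: heaviest non-veto robots first, ties broken by name.
    others = sorted((p for p in weights if p not in veto),
                    key=lambda p: (-weights[p], p))
    for p in others:
        if team_weight >= quota:
            break
        team.append(p)
        team_weight += weights[p]
    return team


weights = {"A": 45, "B": 25, "C": 15, "D": 15}
print(form_team(weights, 56))  # A is the veto player and needs one partner
print(form_team(weights, 51))  # no veto player, so no team is formed
```

Adding the non-veto robots in descending weight order keeps the result minimal: the last robot added is needed to reach the quota, and every robot added before it weighs at least as much, so removing any one of them also drops the team below the quota.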

Experimental Results on Webots
Experimental settings:
– E-puck robots
– Wheel speed: 2.8 cm/sec
– On-board GPS
– Arena size: 4 m × 4 m
– Robot size = grid cell size = 7 cm × 7 cm
– Results averaged over 10 runs
Results shown:
– Percentage of environment covered after 2 hours of clock-time simulations
– Repeated coverage after 2 hours of clock-time simulations

Effect of Environment (Obstacles)
20 robots, quota = 0.7 × $W_{max}$

Effect of Communication Range 20 robots, 10% of environment occupied by obstacles

Video Demo 1

Conclusions, Ongoing and Future Work
– Coalition games (WVGs) provide a suitable, structured mechanism to dynamically reconfigure multi-robot teams
– Ongoing work: reduce the computational complexity of generating winning coalitions in a WVG
– Future work:
  – Dynamically changing the quota value based on performance
  – Learning from long-term coverage histories
  – Tests with physical robots

Acknowledgements
We are grateful to the sponsors of our projects:
– COMRADES project, Office of Naval Research
– NASA Nebraska EPSCoR Mini-grant
Thank You!
For more information C-MANTIC Lab:

Video Demo 2

Video Demo 3