Spatio-Temporal Case-Based Reasoning for Behavioral Selection Maxim Likhachev and Ronald Arkin Mobile Robot Laboratory Georgia Tech.

Slides:



Advertisements
Similar presentations
CMPUT 615 Applications of Machine Learning in Image Analysis
Advertisements

Kinematic Synthesis of Robotic Manipulators from Task Descriptions June 2003 By: Tarek Sobh, Daniel Toundykov.
Georgia Tech / Mobile Intelligence 1 Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems DARPA MARS Review.
Fault-Tolerant Target Detection in Sensor Networks Min Ding +, Dechang Chen *, Andrew Thaeler +, and Xiuzhen Cheng + + Department of Computer Science,
NUS CS5247 Motion Planning for Camera Movements in Virtual Environments By Dennis Nieuwenhuisen and Mark H. Overmars In Proc. IEEE Int. Conf. on Robotics.
1. Algorithms for Inverse Reinforcement Learning 2
DARPA Mobile Autonomous Robot SoftwareMay Adaptive Intelligent Mobile Robotics William D. Smart, Presenter Leslie Pack Kaelbling, PI Artificial.
Patch to the Future: Unsupervised Visual Prediction
Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems Georgia Tech College of Computing Georgia Tech Research.
CENTRE Cellular Network’s Positioning Data Generator Fosca GiannottiKDD-Lab Andrea MazzoniKKD-Lab Puntoni SimoneKDD-Lab Chiara RensoKDD-Lab.
Using Inaccurate Models in Reinforcement Learning Pieter Abbeel, Morgan Quigley and Andrew Y. Ng Stanford University.
1 Simulation Modeling and Analysis Session 13 Simulation Optimization.
Data Mining Cluster Analysis: Advanced Concepts and Algorithms Lecture Notes for Chapter 9 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
Paper Title Your Name CMSC 838 Presentation. CMSC 838T – Presentation Motivation u Problem paper is trying to solve  Characteristics of problem  … u.
Data classification based on tolerant rough set reporter: yanan yean.
Soft Computing and Its Applications in SE Shafay Shamail Malik Jahan Khan.
CS 326 A: Motion Planning 2 Dynamic Constraints and Optimal Planning.
Motor Schema Based Navigation for a Mobile Robot: An Approach to Programming by Behavior Ronald C. Arkin Reviewed By: Chris Miles.
LYU0503 Document Image Reconstruction on Mobile Using Onboard Camera Supervisor: Professor Michael R.Lyu Group Members: Leung Man Kin, Stephen Ng Ying.
7. Experiments 6. Theoretical Guarantees Let the local policy improvement algorithm be policy gradient. Notes: These assumptions are insufficient to give.
High Speed Obstacle Avoidance using Monocular Vision and Reinforcement Learning Jeff Michels Ashutosh Saxena Andrew Y. Ng Stanford University ICML 2005.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
DOG I : an Annotation System for Images of Dog Breeds Antonis Dimas Pyrros Koletsis Euripides Petrakis Intelligent Systems Laboratory Technical University.
Georgia Tech / Mobile Intelligence 1 Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems DARPA MARS Review.
IE 594 : Research Methodology – Discrete Event Simulation David S. Kim Spring 2009.
IMPLEMENTATION ISSUES REGARDING A 3D ROBOT – BASED LASER SCANNING SYSTEM Theodor Borangiu, Anamaria Dogar, Alexandru Dumitrache University Politehnica.
Enhancing Learning Through Technology b Choosing the technology b Meeting the needs of students with exceptionalities b Specific Technologies.
Soft Computing Colloquium 2 Selection of neural network, Hybrid neural networks.
Computer Implementation of Genetic Algorithm
Accuracy-Aware Aquatic Diffusion Process Profiling Using Robotic Sensor Networks Yu Wang, Rui Tan, Guoliang Xing, Jianxun Wang, Xiaobo Tan Michigan State.
Modeling and simulation of systems Simulation optimization and example of its usage in flexible production system control.
DARPA Mobile Autonomous Robot SoftwareLeslie Pack Kaelbling; March Adaptive Intelligent Mobile Robotics Leslie Pack Kaelbling Artificial Intelligence.
Indiana GIS Conference, March 7-8, URBAN GROWTH MODELING USING MULTI-TEMPORAL IMAGES AND CELLULAR AUTOMATA – A CASE STUDY OF INDIANAPOLIS SHARAF.
Knowledge Discovery and Delivery Lab (ISTI-CNR & Univ. Pisa)‏ www-kdd.isti.cnr.it Anna Monreale Fabio Pinelli Roberto Trasarti Fosca Giannotti A. Monreale,
COMMON EVALUATION FINAL PROJECT Vira Oleksyuk ECE 8110: Introduction to machine Learning and Pattern Recognition.
Zhiphone: A Mobile Phone that Learns Context and User Preferences Jing Michelle Liu Raphael Hoffmann CSE567 class project, AUT/05.
Study on Genetic Network Programming (GNP) with Learning and Evolution Hirasawa laboratory, Artificial Intelligence section Information architecture field.
Energy-Aware Scheduling with Quality of Surveillance Guarantee in Wireless Sensor Networks Jaehoon Jeong, Sarah Sharafkandi and David H.C. Du Dept. of.
1 CSE 980: Data Mining Lecture 17: Density-based and Other Clustering Algorithms.
CENTRE CEllular Network Trajectories Reconstruction Environment F. Giannotti, A. Mazzoni, S. Puntoni, C. Renso KDDLab, Pisa.
Generic Approaches to Model Validation Presented at Growth Model User’s Group August 10, 2005 David K. Walters.
Design & Analysis of Algorithms Combinatory optimization SCHOOL OF COMPUTING Pasi Fränti
1 Mobile-Assisted Localization in Wireless Sensor Networks Nissanka B.Priyantha, Hari Balakrishnan, Eric D. Demaine, Seth Teller IEEE INFOCOM 2005 March.
Achieving High Software Reliability Using a Faster, Easier and Cheaper Method NASA OSMA SAS '01 September 5-7, 2001 Taghi M. Khoshgoftaar The Software.
Ibrahim Fathy, Mostafa Aref, Omar Enayet, and Abdelrahman Al-Ogail Faculty of Computer and Information Sciences Ain-Shams University ; Cairo ; Egypt.
I Robot.
1 Distributed and Optimal Motion Planning for Multiple Mobile Robots Yi Guo and Lynne Parker Center for Engineering Science Advanced Research Computer.
Evolving the goal priorities of autonomous agents Adam Campbell* Advisor: Dr. Annie S. Wu* Collaborator: Dr. Randall Shumaker** School of Electrical Engineering.
A DISTRIBUTION BASED VIDEO REPRESENTATION FOR HUMAN ACTION RECOGNITION Yan Song, Sheng Tang, Yan-Tao Zheng, Tat-Seng Chua, Yongdong Zhang, Shouxun Lin.
Georgia Tech / Mobile Intelligence 1 Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems DARPA MARS Kickoff.
DARPA Mobile Autonomous Robot SoftwareLeslie Pack Kaelbling; January Adaptive Intelligent Mobile Robotics Leslie Pack Kaelbling Artificial Intelligence.
Written by Changhyun, SON Chapter 5. Introduction to Design Optimization - 1 PART II Design Optimization.
Selection of Behavioral Parameters: Integration of Case-Based Reasoning with Learning Momentum Brian Lee, Maxim Likhachev, and Ronald C. Arkin Mobile Robot.
Learning Momentum: Integration and Experimentation Brian Lee and Ronald C. Arkin Mobile Robot Laboratory Georgia Tech Atlanta, GA.
1/17/2016CS225B Kurt Konolige Probabilistic Models of Sensing and Movement Move to probability models of sensing and movement Project 2 is about complex.
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
Extracting stay regions with uncertain boundaries from GPS trajectories a case study in animal ecology Haidong Wang.
Chapter 14. Active Vision for Goal-Oriented Humanoid Robot Walking (2/2) in Creating Brain-Like Intelligence, Sendhoff et al. Course: Robots Learning from.
Learning Behavioral Parameterization Using Spatio-Temporal Case-Based Reasoning Maxim Likhachev, Michael Kaess, and Ronald C. Arkin Mobile Robot Laboratory.
Optimization of a Dynamic Vaulting Behavior for LittleDog Alex Grubb and Nathan Ratliff ACRL 04/28/09.
A Case-based Reasoning Approach to Imitating RoboCup Players Michael W. Floyd, Babak Esfandiari and Kevin Lam FLAIRS 21 May 15, 2008.
University of Pennsylvania 1 GRASP Control of Multiple Autonomous Robot Systems Vijay Kumar Camillo Taylor Aveek Das Guilherme Pereira John Spletzer GRASP.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. A Cluster Validity Measure With Outlier Detection for Support Vector Clustering Presenter : Lin, Shu-Han.
SOUTHERN TAIWAN UNIVERSITY ELECTRICAL ENGINEERING DEPARTMENT
CIS 488/588 Bruce R. Maxim UM-Dearborn
Capturing the Secret Dances in the Brain
Mobile Computing.
Design & Analysis of Algorithms Combinatorial optimization
for Vision-Based Navigation
Area Coverage Problem Optimization by (local) Search
Presentation transcript:

Spatio-Temporal Case-Based Reasoning for Behavioral Selection Maxim Likhachev and Ronald Arkin Mobile Robot Laboratory Georgia Tech

Maxim Likhachev and Ronald Arkin Broad Picture of the Work Part of Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems project at Georgia Tech Sponsored by the DARPA MARS program Spatio-Temporal Case-Based Reasoning for Behavioral Selection

Maxim Likhachev and Ronald Arkin Motivation Constant parameterization of robotic behavior results in inefficient robot performance Manual selection of “right” parameters is difficult and tedious work Spatio-Temporal Case-Based Reasoning for Behavioral Selection

Maxim Likhachev and Ronald Arkin Motivation (cont’d) Use of Case-Based Reasoning methodology for an automatic selection of optimal parameters in run- time Spatio-Temporal Case-Based Reasoning for Behavioral Selection

Maxim Likhachev and Ronald Arkin Evaluated on: Simulations Real robot –ATRV-JR in outdoor environment –Nomad 150 in indoor environment Spatio-Temporal Case-Based Reasoning for Behavioral Selection

Maxim Likhachev and Ronald Arkin Related Work ACBARR, SINS and KINS systems –use of case-based reasoning and reinforcement learning for the optimization of behavioral parameters –contribute to some ideas behind the present algorithm Automatic optimization of parameters –genetic programming –reinforcement learning Spatio-Temporal Case-Based Reasoning for Behavioral Selection

Maxim Likhachev and Ronald Arkin Behavioral Control and CBR Module CBR Module controls: Weights for each behavior BiasMove Vector Noise PersistenceObstacle Sphere Spatio-Temporal Case-Based Reasoning for Behavioral Selection

Maxim Likhachev and Ronald Arkin Input Features for Case Selection Vector of spatial characteristics of environment –D - distance to the goal – - degree of obstruction and distance to the most obstructing cluster of obstacles for each of K angular regions around the robot Spatio-Temporal Case-Based Reasoning for Behavioral Selection

Maxim Likhachev and Ronald Arkin Input Features for Case Selection Vector of temporal characteristics of environment –R s - short term robot movement –R l - long term robot movement Spatio-Temporal Case-Based Reasoning for Behavioral Selection

Maxim Likhachev and Ronald Arkin Computation of Traversability Vector F F: –represents traversability of each region –approximates obstacle density function around the robot –independent of goal distance –smoothed over time: Spatio-Temporal Case-Based Reasoning for Behavioral Selection

Maxim Likhachev and Ronald Arkin Input Features: Example Spatio-Temporal Case-Based Reasoning for Behavioral Selection f 0 =0.92 f 1 =0.58 f 2 =1.0 f 3 =0.68 f 0 =0.02 f 1 =0.22 f 2 =0.63 f 3 =0.02 V temporal ShortTerm: R s =1.0 LongTerm: R l =0.7 V temporal ShortTerm: R s =0.01 LongTerm: R l =1.0

Maxim Likhachev and Ronald Arkin High Level Structure of CBR Module Spatio-Temporal Case-Based Reasoning for Behavioral Selection Current environment Feature Identification Spatial Features & Temporal Features vectors Spatial Features Vector Matching (1st stage of Case Selection) Temporal Features Vector Matching (2nd stage of Case Selection) Set of Spatially Matching cases Set of Spatially and Temporally Matching cases Case switching Decision tree Case Adaptation Case Library All the cases in the library Best Matching or currently used case Case Application Case ready for application Case Output Parameters (Behavioral Assemblage Parameters) Random Selection Process (3rd stage of Case Selection) Best Matching case

Maxim Likhachev and Ronald Arkin Case Example I Spatio-Temporal Case-Based Reasoning for Behavioral Selection CLEARGOAL Spatial Vector: D (goal distance) = 5 density distance Region 0: σ 0 = 0.00; r 0 = 0.00 Region 1: σ 1 = 0.00; r 1 = 0.00 Region 2: σ 2 = 0.00; r 2 = 0.00 Region 3: σ 3 = 0.00; r 3 = 0.00 Temporal Vector: (0 - min, 1 - max) ShortTerm_Motion R s = LongTerm_Motion R l = Case Output Parameters: MoveToGoal_Gain = 2.00 Noise_Gain = 0.00 Noise_Persistence = 10 Obstacle_Gain = 2.00 Obstacle_Sphere = 0.50 Bias_Vector_X = 0.00 Bias_Vector_Y = 0.00 Bias_Vector_Gain = 0.00 CaseTime = 3.0

Maxim Likhachev and Ronald Arkin Case Example II Spatio-Temporal Case-Based Reasoning for Behavioral Selection FRONTOBSTRUCTED_SHORTTERM Spatial Vector: D (goal distance) = 5 density distance Region 0: σ 0 = 1.00; r 0 = 1.00 Region 1: σ 1 = 0.80; r 1 = 1.00 Region 2: σ 2 = 0.00; r 2 = 1.00 Region 3: σ 3 = 0.80; r 3 = 1.00 Temporal Vector: (0 - min, 1 - max) ShortTerm_Motion R s = LongTerm_Motion R l = Case Output Parameters: MoveToGoal_Gain = 0.10 Noise_Gain = 0.02 Noise_Persistence = 10 Obstacle_Gain = 0.80 Obstacle_Sphere = 1.50 Bias_Vector_X = Bias_Vector_Y = 0.70 Bias_Vector_Gain = 0.70 CaseTime = 2.0

Maxim Likhachev and Ronald Arkin Results Spatio-Temporal Case-Based Reasoning for Behavioral Selection Average travel distanceMission success rate Simulations: ATRV-JR: 12% average performance improvement in time steps ( based on 10 runs for each system in outdoor environment)

Maxim Likhachev and Ronald Arkin Simulations & real robot experiments: Performance improvement as a function of obstacle density Spatio-Temporal Case-Based Reasoning for Behavioral Selection SimulationsNomad 150 Based on 10 runs for each system in indoor environment

Maxim Likhachev and Ronald Arkin Real Robot Run with CBR Spatio-Temporal Case-Based Reasoning for Behavioral Selection

Maxim Likhachev and Ronald Arkin Real Robot Run without CBR Spatio-Temporal Case-Based Reasoning for Behavioral Selection

Maxim Likhachev and Ronald Arkin Trajectories of the robot Spatio-Temporal Case-Based Reasoning for Behavioral Selection Robot with CBR moduleRobot without CBR module 11% less travel distance

Maxim Likhachev and Ronald Arkin Conclusions Automatic selection of optimal behavioral parameters results in robot performance improvement (based on simulations and real robot experiments) Careful manual selection of behavioral parameters is no longer required from a user Future Work –Automatic learning of cases: identifying when to create a new case applying reinforcement learning techniques in finding optimal parameters for existing cases –Integration with other adaptation & learning methods (e.g., Learning Momentum) Spatio-Temporal Case-Based Reasoning for Behavioral Selection