Beyond Human Factors: An Approach to Human/Automation Teams Haomiao Huang Jerry Ding Wei Zhang Claire J. Tomlin Hybrid Systems Lab Action Webs Meeting.

Presentation transcript:

Beyond Human Factors: An Approach to Human/Automation Teams. Haomiao Huang, Jerry Ding, Wei Zhang, Claire J. Tomlin. Hybrid Systems Lab. Action Webs Meeting, 11/17/2010.

Advances in complex multi-agent systems require smart integration of human elements. [Image credits: nasa.gov, businessweek.com, tgdaily.com, techeasy.co.za, deere.com, aurore-sciences.org]

This requires new approaches to analyze humans as part of the system! Let’s think about humans as part of the solution, not the problem. [Image credits: foxnews.com, wikipedia, media.weirdworm.com, knowyourmeme.com, adriandayton.com]

Two related problems: 1) Modeling: properly representing humans as components of the overall system. 2) Control: generating useful directives and controls for human agents.

Outline: Motivation; Scenario for Research on Human/Automation Teams; Adversarial Game Problem; Reachability Based Approach; Results; Conclusions & Future Work

Choosing a Research Scenario: Games are representative of hard, real-world problems, yet provide relatively benign “sandbox” environments for development (examples shown: RoboCup, chess, StarCraft). What is a good game to capture the aspects of human-automation teams that we want to explore?

Capture-the-Flag: Capture-the-flag embodies the basic research challenges we are trying to address: time tested and fun, limited information, multiple agents, competing objectives, human players, adversarial.

Automation-Assisted Human Capture-the-Flag: Using mobile phones, computers, and UAVs, we have turned capture-the-flag into a testbed for advanced automation concepts involving human team members. Components: game software on Android phones, STARMAC quadrotor UAVs, server-side management software.

Narrowing the problem: time tested and fun, limited information, multiple agents, competing objectives, human players, adversarial.

Outline: Motivation; Scenario for Research on Human/Automation Teams; Adversarial Game Problem (problem statement, related work, solution insights); Reachability Based Approach; Results; Conclusions & Future Work

Our Problem: Characterize and solve a 1-sided capture-the-flag game with a single attacker and defender. [Figure labels: game domain, attacker, defender, flag, flag region, capture region, return region]

Related Work on Adversarial Games
Multi-agent games on discrete state spaces: greedy search (Hespanha, Kim, and Sastry 1999); approximate DP/reinforcement learning (Lagoudakis and Parr 2002); discrete play matching (Browning, Bruce, and Veloso 2005).
Pursuit-evasion games with continuous states: receding-horizon control (McGrew, How, Bush, Williams, and Roy 2008; Sprinkle, Eklund, Kim, and Sastry 2004); optimal trajectory planning (Earl and D’Andrea 2001; Chasparis and Shamma 2005); analytical game theory approaches (Basar 1989; Lewin 1994; Stipanovic, Melikyan, Hovakimyan 2010); Hamilton-Jacobi reachability (Mitchell, Bayen, and Tomlin 2005; Ding, Sprinkle, and Tomlin).
Slide annotation: assumed, learned, or randomized opponent model.

Reachability Approach, derived from pursuit-evasion games: The CTF game can be posed as a reachability problem. Assume system dynamics driven by two inputs, one for Player I and one for Player II. Define the reach-avoid set as the set of states from which a player can arrive in a goal region within a given time horizon while avoiding a specified region, no matter what the other player does. [The slide's equations and symbols were not recovered.]
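The slide's notation did not survive extraction, so the following is only a sketch of how such a definition is commonly written. Every symbol here ($f$, $u$, $d$, $G$, $A$, $T$, $\mathcal{RA}_T$) is an assumed name, not the authors' own. The quantifier order is also simplified: a rigorous differential-game statement would specify the information pattern (for example, nonanticipative strategies) for the two players.

```latex
% Assumed notation; not recovered from the slide.
% Joint dynamics with Player I input u and Player II input d:
\dot{x} = f(x, u, d), \qquad u \in U, \quad d \in D

% Reach-avoid set: initial states from which Player I can reach the goal G
% within time T while staying out of A, whatever Player II does.
\mathcal{RA}_T(G, A) = \bigl\{\, x_0 \;:\; \exists\, u(\cdot)\ \ \forall\, d(\cdot)\ \ \exists\, t \in [0, T]
  \ \text{with}\ x(t) \in G \ \text{and}\ x(s) \notin A \ \ \forall\, s \in [0, t] \,\bigr\}
```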

Capture-the-Flag as Reachability: Victory conditions for each player can be encoded as reach-avoid sets in the joint state-space. [Figure labels: game domain, attacker, defender, joint capture set, joint return set, flag return set (for attacker)]

1-D Game

Geometric insights: Geometric analysis allows some insight into the 2-D capture-the-flag problem.

Geometric insights (continued): Geometric analysis allows some insight into the 2-D capture-the-flag problem.

Utility of Reachability Analysis: Reachable sets give a complete characterization of the game and are a natural display tool for guiding human decision-making and allowing least-restrictive control (Teo and Tomlin, 2003). Geometric analysis is not terribly general, though…

Outline: Motivation; Scenario for Research on Human/Automation Teams; Adversarial Game Problem; Reachability Based Approach (Hamilton-Jacobi reachability, computation); Results; Conclusions & Future Work

Hamilton-Jacobi Reachability: Reachability in continuous state-spaces can be analyzed as a terminal-cost-only optimization problem, solved backward in time (Tomlin). [Equations not recovered: classic optimal control cost function; reachability cost function]
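As a hedged illustration of the contrast this slide draws, the two cost functions are often written as below. The symbols ($J$, $L$, $g$, the horizon $[t, 0]$) are assumed notation rather than the slide's own, and the sign convention for $g$ is one common choice.

```latex
% Assumed notation; not recovered from the slide.
% Classic optimal control cost: running cost L plus terminal cost g on the horizon [t, 0].
J\bigl(x, t, u(\cdot), d(\cdot)\bigr) = \int_t^0 L\bigl(x(s), u(s), d(s)\bigr)\, ds + g\bigl(x(0)\bigr)

% Reachability cost: terminal cost only, with g chosen so that
% g(x) <= 0 exactly when x lies in the target set.
J\bigl(x, t, u(\cdot), d(\cdot)\bigr) = g\bigl(x(0)\bigr), \qquad g(x) \le 0 \iff x \in \text{target set}
```

With this choice, a negative optimal cost at an initial state means the trajectory can be steered to end inside the target.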

Level-Set Representation: Sets can be represented as sub-level sets of signed distance functions, which serve as terminal cost functions. Set operations using point-wise minimums and maximums can be used to create arbitrary sets (Tomlin 2009, Mitchell).
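A minimal sketch of the level-set idea, assuming NumPy and a small 2-D grid. The grid size, the disc shapes, and the helper names (`circle_sdf`, `union`, `intersection`, `complement`) are illustrative choices, not part of the presentation. A set is the region where its function is at most zero, so union is a point-wise minimum and intersection a point-wise maximum.

```python
import numpy as np

# Grid over a square 2-D domain (domain size and resolution are assumptions).
xs = np.linspace(-1.0, 1.0, 201)
X, Y = np.meshgrid(xs, xs, indexing="ij")

def circle_sdf(cx, cy, r):
    """Signed distance to a disc: negative inside, zero on the boundary, positive outside."""
    return np.hypot(X - cx, Y - cy) - r

def union(phi_a, phi_b):
    """A point is in A or B when the smaller of the two values is <= 0."""
    return np.minimum(phi_a, phi_b)

def intersection(phi_a, phi_b):
    """A point is in both A and B when the larger of the two values is <= 0."""
    return np.maximum(phi_a, phi_b)

def complement(phi):
    """Negating the function swaps inside and outside."""
    return -phi

disc_a = circle_sdf(-0.2, 0.0, 0.5)
disc_b = circle_sdf(0.2, 0.0, 0.5)

in_union = union(disc_a, disc_b) <= 0
in_both = intersection(disc_a, disc_b) <= 0
in_a_not_b = intersection(disc_a, complement(disc_b)) <= 0
```

Note that the minimum or maximum of two signed distance functions is generally no longer an exact signed distance function, although its zero sub-level set is still the intended set.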

Solution Based on HJBI Equation: The cost-to-go function is the unique viscosity solution to the Hamilton-Jacobi-Bellman-Isaacs equation. [Equations not recovered: classic optimal control cost function; Hamilton-Jacobi-Bellman-Isaacs equation; optimal Hamiltonian]

Reachability Via Modified HJBI Equation: The backward reachable set is the zero sub-level set of the viscosity solution to a modified HJBI equation (Mitchell, Bayen, Tomlin). [Equations not recovered: modified HJBI equation; optimal Hamiltonian; reachability cost function]
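For orientation, here is a sketch of the form these equations usually take in the Hamilton-Jacobi reachability literature. The notation ($\phi$, $g$, $H$, $p$, $f$, $u$, $d$, $\tau$) is assumed, and the min/max roles below assume the player with input $u$ is trying to reach the target. The slide's own equations may differ in sign or ordering conventions.

```latex
% Assumed notation; not recovered from the slide.
% HJBI equation for a terminal-cost-only problem, solved backward from t = 0:
\frac{\partial \phi}{\partial t} + H\bigl(x, \nabla_x \phi\bigr) = 0, \qquad \phi(x, 0) = g(x)

% Optimal Hamiltonian (u tries to reach the target, d tries to prevent it):
H(x, p) = \min_{u \in U} \max_{d \in D} \; p^{\top} f(x, u, d)

% Modified HJBI equation: the min with 0 keeps phi from increasing in backward time,
% so states that have reached the target stay counted as reachable.
\frac{\partial \phi}{\partial t} + \min\bigl[\, 0,\; H\bigl(x, \nabla_x \phi\bigr) \bigr] = 0, \qquad \phi(x, 0) = g(x)

% Backward reachable set over a horizon of length tau:
\bigl\{\, x \;:\; \phi(x, -\tau) \le 0 \,\bigr\}
```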

Numerical Solution to the Modified HJBI Equation: The viscosity solution to the modified HJBI equation can be computed on a grid using the Level Set Toolbox from UBC.

Reach-Avoid & Control Inputs: Reach-avoid sets can be computed by masking the reach set at each integration time step with the avoid set. Control inputs can be extracted using the co-states, which can be calculated by numerical differentiation of the value function.
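The two operations on this slide can be sketched in NumPy as follows. This is not the Level Set Toolbox workflow. It applies the mask once to a stand-in value function instead of inside a PDE integration loop, and it assumes single-integrator dynamics with a unit-norm input so that the optimal heading is the negative normalized gradient. The names `mask_with_avoid` and `optimal_heading` and all numbers are hypothetical.

```python
import numpy as np

xs = np.linspace(-1.0, 1.0, 201)
dx = xs[1] - xs[0]
X, Y = np.meshgrid(xs, xs, indexing="ij")

def mask_with_avoid(phi_reach, phi_avoid):
    """Exclude the avoid set (phi_avoid <= 0) from the reach set (phi_reach <= 0).

    In a full solver this would be applied after every integration time step.
    """
    return np.maximum(phi_reach, -phi_avoid)

def optimal_heading(costates, i, j, eps=1e-9):
    """Unit heading that decreases the value function fastest at grid node (i, j).

    Assumes dynamics x_dot = v * u with ||u|| <= 1 (an assumption, not from the slides).
    """
    p = np.array([costates[0][i, j], costates[1][i, j]])
    return -p / (np.linalg.norm(p) + eps)

# Stand-in value function: signed distance to a disc of radius 0.8 at the origin.
phi_reach = np.hypot(X, Y) - 0.8
# Avoid set: disc of radius 0.2 centred at (0.5, 0.5).
phi_avoid = np.hypot(X - 0.5, Y - 0.5) - 0.2

phi_ra = mask_with_avoid(phi_reach, phi_avoid)

# Co-states by numerical differentiation (central differences in the interior).
costates = np.gradient(phi_ra, dx, dx)

# Heading at the grid node nearest (0.6, 0.0); it should point toward the origin.
u_star = optimal_heading(costates, 160, 100)
```

Computing the gradient once and indexing into it keeps the per-query cost small, which matters if headings are requested repeatedly during play.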

Outline: Motivation; Scenario for Research on Human/Automation Teams; Adversarial Game Problem; Reachability Based Approach; Results (HJBI reachability applied to capture-the-flag, simulation results, experimental setup); Conclusions & Future Work

Problem Formulation for 1v1 Capture-the-Flag: HJBI reachability analysis allows us to fully characterize the game. [Equations not recovered: dynamics; optimal Hamiltonian; optimal inputs]
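Since the slide's equations were lost, the following is only a plausible sketch under a simple kinematic model, which is an assumption: each player is a point in the plane that can move in any direction up to a maximum speed. The symbols ($x_a$, $x_d$, $v_a$, $v_d$, $u$, $d$, $p_a$, $p_d$) are assumed names, and the attacker is taken to be the player trying to reach the target set.

```latex
% Assumed model and notation; not recovered from the slide.
% Dynamics: attacker position x_a, defender position x_d, maximum speeds v_a, v_d.
\dot{x}_a = v_a\, u, \qquad \dot{x}_d = v_d\, d, \qquad \|u\| \le 1, \quad \|d\| \le 1

% Optimal Hamiltonian, with costates p = (p_a, p_d) = (\nabla_{x_a}\phi, \nabla_{x_d}\phi):
H(x, p) = \min_{\|u\| \le 1} \max_{\|d\| \le 1} \bigl[ v_a\, p_a^{\top} u + v_d\, p_d^{\top} d \bigr]
        = -\,v_a \|p_a\| + v_d \|p_d\|

% Optimal inputs (where the costates are nonzero):
u^{*} = -\frac{p_a}{\|p_a\|}, \qquad d^{*} = \frac{p_d}{\|p_d\|}
```

Because the Hamiltonian separates into an attacker term and a defender term, the order of the min and max does not matter under this model.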

Flag Return & Flag Capture: Winning regions for each portion of the game can be calculated directly from reach-avoid conditions.

Sequenced Capture and Return: Winning regions for the full sequence (flag capture and subsequent return) can be computed by using the intersection of the flag return set and flag zone as the initial condition for flag capture.

Simulation Results: Simulation results demonstrate the use of the reachability solutions.

Field Experiments in Progress: Reachability-based control and input directives are being implemented on Droid Incredible phones. [Figure labels: game software on Android phones; server-side management software; player positions and state; reachable sets and optimal control inputs]

Outline: Motivation; Scenario for Research on Human/Automation Teams; Adversarial Game Problem; Reachability Based Approach; Results; Conclusions & Future Work

Conclusions: Capture-the-flag is a great platform for developing human-automation systems research. A differential game formulation using HJBI reachability solves the perfect-information, 1v1 CTF game.

Future Work: We have the “correct” answer to the adversarial problem… now what? Remaining challenges: limited information, multiple agents, competing objectives, human players, adversarial.

Thank you! Questions?