Constructing Intelligent Agents via Neuroevolution By Jacob Schrum

Slides:



Advertisements
Similar presentations
Approaches, Tools, and Applications Islam A. El-Shaarawy Shoubra Faculty of Eng.
Advertisements

Constructing Complex NPC Behavior via Multi- Objective Neuroevolution Jacob Schrum – Risto Miikkulainen –
Institute of Intelligent Power Electronics – IPE Page1 Introduction to Basics of Genetic Algorithms Docent Xiao-Zhi Gao Department of Electrical Engineering.
4-1 Management Information Systems for the Information Age Copyright 2002 The McGraw-Hill Companies, Inc. All rights reserved Chapter 4 Decision Support.
Genetic Algorithms, Part 2. Evolving (and co-evolving) one-dimensional cellular automata to perform a computation.
Neuro-Evolution of Augmenting Topologies Ben Trewhella.
Genetic Algorithms. Some Examples of Biologically Inspired AI Neural networks Evolutionary computation (e.g., genetic algorithms) Immune-system-inspired.
Parallelized Evolution System Onur Soysal, Erkin Bahçeci Erol Şahin Dept. of Computer Engineering Middle East Technical University.
Evolving Neural Network Agents in the NERO Video Game Author : Kenneth O. Stanley, Bobby D. Bryant, and Risto Miikkulainen Presented by Yi Cheng Lin.
Introduction to Genetic Algorithms Yonatan Shichel.
Using a GA to Create Prey Tactics Presented by Tony Morelli on 11/29/04.
Evolutionary Algorithms Simon M. Lucas. The basic idea Initialise a random population of individuals repeat { evaluate select vary (e.g. mutate or crossover)
Design of Autonomous Navigation Controllers for Unmanned Aerial Vehicles using Multi-objective Genetic Programming Gregory J. Barlow March 19, 2004.
Genetic Algorithms Learning Machines for knowledge discovery.
Torcs Simulator Presented by Galina Volkinshtein and Evgenia Dubrovsky.
Othello Sean Farrell June 29, Othello Two-player game played on 8x8 board All pieces have one white side and one black side Initial board setup.
McGraw-Hill/Irwin ©2005 The McGraw-Hill Companies, All rights reserved ©2005 The McGraw-Hill Companies, All rights reserved McGraw-Hill/Irwin.
Neural Networks Slides by Megan Vasta. Neural Networks Biological approach to AI Developed in 1943 Comprised of one or more layers of neurons Several.
By Jacob Schrum and Risto Miikkulainen
CHAPTER 12 ADVANCED INTELLIGENT SYSTEMS © 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang.
Evolving Multi-modal Behavior in NPCs Jacob Schrum – Risto Miikkulainen –
Genetic Algorithms and Ant Colony Optimisation
Evolution, Brains and Multiple Objectives
Chapter 11: Artificial Intelligence
On comparison of different approaches to the stability radius calculation Olga Karelkina Department of Mathematics University of Turku MCDM 2011.
Adapting Convergent Scheduling Using Machine Learning Diego Puppin*, Mark Stephenson †, Una-May O’Reilly †, Martin Martin †, and Saman Amarasinghe † *
林偉楷 Taiwan Evolutionary Intelligence Laboratory.
Introduction to Genetic Algorithms and Evolutionary Computation
Study on Genetic Network Programming (GNP) with Learning and Evolution Hirasawa laboratory, Artificial Intelligence section Information architecture field.
Evolution of Teamwork in Multiagent Systems Research Preparation Examination by Jacob Schrum.
Design of a real time strategy game with a genetic AI By Bharat Ponnaluri.
Predator/Prey Simulation for Investigating Emergent Behavior Jay Shaffstall.
CAP6938 Neuroevolution and Developmental Encoding Real-time NEAT Dr. Kenneth Stanley October 18, 2006.
Artificial Life/Agents Creatures: Artificial Life Autonomous Software Agents for Home Entertainment Stephen Grand, 1997 Learning Human-like Opponent Behaviour.
I Robot.
Algorithms and their Applications CS2004 ( ) 13.1 Further Evolutionary Computation.
Game Theory, Social Interactions and Artificial Intelligence Supervisor: Philip Sterne Supervisee: John Richter.
UTeach Presentation By Jacob Schrum. Who Am I? Jacob Schrum Born in Honolulu, Hawaii Undergraduate at Southwestern University PhD Student at University.
Mike Taks Bram van de Klundert. About Published 2005 Cited 286 times Kenneth O. Stanley Associate Professor at University of Central Florida Risto Miikkulainen.
Evolving the goal priorities of autonomous agents Adam Campbell* Advisor: Dr. Annie S. Wu* Collaborator: Dr. Randall Shumaker** School of Electrical Engineering.
1 Genetic Algorithms and Ant Colony Optimisation.
科技英文作業 黃方世 陳竹軒. Introduction Talking about video games. Training agents for what. Focus on training Non-player-character (NPC).
Chapter 9 Genetic Algorithms.  Based upon biological evolution  Generate successor hypothesis based upon repeated mutations  Acts as a randomized parallel.
Pac-Man AI using GA. Why Machine Learning in Video Games? Better player experience Agents can adapt to player Increased variety of agent behaviors Ever-changing.
Reactive and Output-Only HKOI Training Team 2006 Liu Chi Man (cx) 11 Feb 2006.
Artificial Intelligence Research in Video Games By Jacob Schrum
CITS7212: Computational Intelligence An Overview of Core CI Technologies Lyndon While.
CAP6938 Neuroevolution and Artificial Embryogeny Competitive Coevolution Dr. Kenneth Stanley February 20, 2006.
Evolving Multimodal Networks for Multitask Games
Evolving Agent Behavior in Multiobjective Domains Using Fitness-Based Shaping Jacob Schrum and Risto Miikkulainen University of Texas at Austin Department.
Comparative Reproduction Schemes for Evolving Gathering Collectives A.E. Eiben, G.S. Nitschke, M.C. Schut Computational Intelligence Group Department of.
CAP6938 Neuroevolution and Developmental Encoding Intro to Neuroevolution Dr. Kenneth Stanley September 18, 2006.
Riza Erdem Jappie Klooster Dirk Meulenbelt EVOLVING MULTI-MODAL BEHAVIOR IN NPC S.
A field of study that encompasses computational techniques for performing tasks that require intelligence when performed by humans. Simulation of human.
Design of a real time strategy game with a genetic AI By Bharat Ponnaluri.
Botprize 2010 Jacob Schrum, Igor Karpov, and Risto Miikkulainen
CAP6938 Neuroevolution and Artificial Embryogeny Real-time NEAT Dr. Kenneth Stanley February 22, 2006.
Genetic Algorithms An Evolutionary Approach to Problem Solving.
March 1, 2016Introduction to Artificial Intelligence Lecture 11: Machine Evolution 1 Let’s look at… Machine Evolution.
UT^2: Human-like Behavior via Neuroevolution of Combat Behavior and Replay of Human Traces Jacob Schrum Igor V. Karpov
CAP6938 Neuroevolution and Artificial Embryogeny Real-time NEAT Dr. Kenneth Stanley February 22, 2006.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Intelligent Exploration for Genetic Algorithms Using Self-Organizing.
Solving Interleaved and Blended Sequential Decision-Making Problems through Modular Neuroevolution Jacob Schrum Risto Miikkulainen.
Evolutionary Algorithms Jim Whitehead
Evolving Multimodal Networks for Multitask Games
UT^2: Human-like Behavior via Neuroevolution of Combat Behavior and Replay of Human Traces Jacob Schrum, Igor Karpov, and Risto Miikkulainen
Faustino J. Gomez, Doug Burger, and Risto Miikkulainen
Introduction to Artificial Intelligence Lecture 11: Machine Evolution
Lecture 4. Niching and Speciation (1)
Presentation transcript:

Constructing Intelligent Agents via Neuroevolution By Jacob Schrum

Motivation Intelligent agents are needed –Search-and-rescue robots –Mars exploration –Training simulations –Video games Insight into nature of intelligence –Sufficient conditions for emergence of: Cooperation Communication Multimodal behavior

Talk Outline Bio-inspired learning methods –Neural networks –Evolutionary computation My research –Learning multimodal behavior –Modular networks in Ms. Pac-Man –Human-like behavior in Unreal Tournament Future work Conclusion

Artificial Neural Networks Brain = network of neurons ANN = abstraction of brain –Neurons organized into layers Inputs Outputs

What Can Neural Networks Do? In theory, anything! –Universal Approximation Theorem – Can’t program: too complicated In practice, learning/training is hard –Supervised: Backpropagation –Unsupervised: Self-Organizing Maps –Reinforcement Learning: Temporal-Difference and Evolutionary Computation

Evolutionary Computation Computational abstraction of evolution –Descent with modification (mutation) –Sexual reproduction (crossover) –Survival of the fittest (natural selection) Evolution + Neural Nets = Neuroevolution –Population of neural networks –Mutation and crossover modify networks –Net used as control policy to evaluate fitness

Neuroevolution Example Start With Parent Population

Neuroevolution Example Start With Parent Population Evaluate and Assign Fitness

Neuroevolution Example Start With Parent Population Evaluate and Assign Fitness Clone, Crossover and Mutate To Get Child Population

Neuroevolution Example Start With Parent Population Evaluate and Assign Fitness Clone, Crossover and Mutate Children Are Now the New Parents Repeat Process: Fitness Evaluations As the process continues, each successive population improves performance

Neuroevolution Applications F. Gomez and R. Miikkulainen, “2-D Pole Balancing With Recurrent Evolutionary Networks” ICANN 1998 Double Pole Balancing

Neuroevolution Applications F. Gomez and R. Miikkulainen, “Active Guidance for a Finless Rocket Using Neuroevolution” GECCO 2003 Finless Rocket Control

Neuroevolution Applications N. Kohl, K. Stanley, R. Miikkulainen, M. Samples, and R. Sherony, "Evolving a Real-World Vehicle Warning System" GECCO 2006 Vehicle Crash Warning System

Neuroevolution Applications K. O. Stanley, B. D. Bryant, I. Karpov, R. Miikkulainen, "Real-Time Evolution of Neural Networks in the NERO Video Game" AAAI 2006 Training Video Game Agents

What is Missing? NERO agents are specialists –Sniping from a distance –Aggressively rushing in Humans can do all of this, and more Multimodal behavior –Different behaviors for different situations Human-like behavior –Preferred by humans

What I do With Neuroevolution Discover complex agent behavior Discover multimodal behavior Contributions: Use multi-objective evolution –Different objectives for different modes Evolve modular networks –Networks with modules for each mode Human-like behavior –Constrain evolution

Pareto-based Multiobjective Optimization High health but did not deal much damage Dealt lot of damage, but lost lots of health Tradeoff between objectives

Non-dominated Sorting Genetic Algorithm II Population P with size N; Evaluate P Use mutation (& crossover) to get P´ size N; Evaluate P´ Calculate non-dominated fronts of P  P´ size 2N New population size N from highest fronts of P  P´ K. Deb, S. Agrawal, A. Pratap, T. Meyarivan, "A Fast Elitist Non-dominated sorting genetic algorithm for multi-objective optimization: NSGA-II" PPSN VI, 2000

Ms. Pac-Man Popular classic game Predator-prey scenario –Ghosts are predators –Until power pill is eaten Multimodal behavior needed –Running from threats –Chasing edible ghosts –More?

Modular Networks Different areas of brain specialize –Structural modularity → functional modularity Apply to evolved neural networks –Separate module → behavioral mode Preference neurons (grey) arbitrate between modules Use module with highest preference output ( )( )

Module Mutation Let evolution decide how many modules Networks start with one module New modules added by one of several module mutations Previous Random Duplicate

Intelligent Module Usage Evolution discovers a novel task division –Not programmed Dedicates one module to luring (cyan) Improves ghost eating when using other module

Comparison With Other Work AuthorsMethodGameAVGMAX Alhejali and Lucas [1]GPFourMaze16,01444,560 Alhejali and Lucas [2]GP+CampsFourMaze11,41331,850 My Module Mutation Duplicate ResultsFourMaze32,64744,520 Brandstetter and Ahmadi [3]GPCIG ,19833,420 Recio et al. [4]ACOCIG ,03143,467 Alhejali and Lucas [5]GP+MCTSCIG ,64169,010 My Module Mutation Duplicate ResultsCIG ,29984,980 [1] A.M. Alhejali, S.M. Lucas: Evolving diverse Ms. Pac-Man playing agents using genetic programming. UKCI [2] A.M. Alhejali, S.M. Lucas: Using a training camp with Genetic Programming to evolve Ms Pac-Man agents. CIG [3] M.F. Brandstetter, S. Ahmadi: Reactive control of Ms. Pac Man using information retrieval based on Genetic Programming. CIG [4] G. Recio, E. Martín, C. Estébanez, Y. Sáez: AntBot: Ant Colonies for Video Games. TCIAIG [5] A.M. Alhejali, S.M. Lucas: Using genetic programming to evolve heuristics for a Monte Carlo Tree Search Ms Pac-Man agent. CIG 2013.

Types of Intelligence Evolved intelligent Ms. Pac-Man behavior –Surprising module usage –Evolution discovers the unexpected –Diverse collection of solutions Still not human-like –Human-like vs. optimal –Human intelligence

Modern Game: Unreal Tournament 3D world with simulated physics Multiple human and software agents interacting Agents attack, retreat, explore, etc. Multimodal behavior required to succeed

Human-like Behavior: BotPrize International competition at CIG conference A Turing Test for video game bots –Judge as human over 50% of time to win –After 5 years, we won in 2012 Evolved combat behavior –Constrained to be human-like

Guessing Game Coleman: ???? Milford: ???? Moises: ???? Lawerence: ???? Clifford: ???? Kathe: ???? Tristan: ???? Jackie: ????

Judging Game

Player Identities Coleman: UT^2 (Our winning bot) Milford: ICE-2010 (bot) Moises: Discordia (bot) Lawerence: Native UT2004 bot Clifford: w00t (bot) Kathe: Human Tristan: Human Jackie: Native UT2004 bot

Human Subject Study Six participants played the judging game Recorded extensive post-game interviews What criteria to humans claim to judge by?

Lessons Learned Don’t be too skilled –Evolved with accuracy restrictions –Disable elaborate dodging Humans are “tenacious” –Opponent-relative actions –Encourage “focusing” on opponent Don’t repeat mistakes –Database of human traces to get unstuck

Bot Architecture

Future Work Evolving teamwork –Ghosts must cooperate to eat Ms. Pac-Man –Unreal Tournament supports team play Domination, Capture the Flag, etc. Interactive evolution –Evolve in response to human interaction Adaptive opponents/assistants Evolutionary art Content generation

Conclusion Evolution discovers unexpected behavior Modular networks learn multimodal behavior Human behavior not optimal –Evolution can be constrained to be more human-like Many directions for future research

Questions? contact Jacob Schrum