CAP6938 Neuroevolution and Artificial Embryogeny Competitive Coevolution Dr. Kenneth Stanley February 20, 2006.

Slides:



Advertisements
Similar presentations
Genetic Algorithms (Evolutionary Computing) Genetic Algorithms are used to try to “evolve” the solution to a problem Generate prototype solutions called.
Advertisements

Using Parallel Genetic Algorithm in a Predictive Job Scheduling
Constructing Complex NPC Behavior via Multi- Objective Neuroevolution Jacob Schrum – Risto Miikkulainen –
Playing Evolution Games in the Classroom Colin Garvey GK-12 Fellow.
On the Genetic Evolution of a Perfect Tic-Tac-Toe Strategy
Multiobjective Optimization Chapter 7 Luke, Essentials of Metaheuristics, 2011 Byung-Hyun Ha R1.
The Dominance Tournament Method of Monitoring Progress in Coevolution Speaker: Lin, Wei-Kai (2009/04/30) 1.
Genetic Algorithms, Part 2. Evolving (and co-evolving) one-dimensional cellular automata to perform a computation.
Neuro-Evolution of Augmenting Topologies Ben Trewhella.
Coevolutionary Learning with Genetic Algorithms. Problem for learning algorithms: How to select “training environments” appropriate to different stages.
Evolving Neural Network Agents in the NERO Video Game Author : Kenneth O. Stanley, Bobby D. Bryant, and Risto Miikkulainen Presented by Yi Cheng Lin.
Evolutionary Games The solution concepts that we have discussed in some detail include strategically dominant solutions equilibrium solutions Pareto optimal.
Evolutionary Computation and Co-evolution Alan Blair October 2005.
Othello Sean Farrell June 29, Othello Two-player game played on 8x8 board All pieces have one white side and one black side Initial board setup.
Evolutionary Games The solution concepts that we have discussed in some detail include strategically dominant solutions equilibrium solutions Pareto optimal.
Genetic Programming. Agenda What is Genetic Programming? Background/History. Why Genetic Programming? How Genetic Principles are Applied. Examples of.
Evolving Multi-modal Behavior in NPCs Jacob Schrum – Risto Miikkulainen –
1 Evolutionary Growth of Genomes for the Development and Replication of Multi-Cellular Organisms with Indirect Encodings Stefano Nichele and Gunnar Tufte.
Changing Perspective… Common themes throughout past papers Repeated simple games with small number of actions Mostly theoretical papers Known available.
Lecture 2. Co-Evolution (II)
林偉楷 Taiwan Evolutionary Intelligence Laboratory.
Evolutionary Computation. Evolutionary Complexification Two major goals in intelligent systems are the discovery and improvement of solutions to complex.
CAP6938 Neuroevolution and Developmental Encoding Working with NEAT Dr. Kenneth Stanley September 27, 2006.
More on coevolution and learning Jing Xiao April, 2008.
Lecture 8: 24/5/1435 Genetic Algorithms Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
Coevolutionary Models A/Prof. Xiaodong Li School of Computer Science and IT, RMIT University Melbourne, Australia April.
Design of a real time strategy game with a genetic AI By Bharat Ponnaluri.
CAP6938 Neuroevolution and Developmental Encoding Developmental Encoding 2 Dr. Kenneth Stanley October 9, 2006.
Pareto Coevolution Presented by KC Tsui Based on [1]
Coevolution Chapter 6, Essentials of Metaheuristics, 2013 Spring, 2014 Metaheuristics Byung-Hyun Ha R2R3.
Machine Learning for an Artificial Intelligence Playing Tic-Tac-Toe Computer Systems Lab 2005 By Rachel Miller.
CAP6938 Neuroevolution and Developmental Encoding Developmental Encoding Dr. Kenneth Stanley October 2, 2006.
CAP6938 Neuroevolution and Developmental Encoding Real-time NEAT Dr. Kenneth Stanley October 18, 2006.
Artificial Life/Agents Creatures: Artificial Life Autonomous Software Agents for Home Entertainment Stephen Grand, 1997 Learning Human-like Opponent Behaviour.
CAP6938 Neuroevolution and Developmental Encoding Basic Concepts Dr. Kenneth Stanley August 23, 2006.
Game Theory, Social Interactions and Artificial Intelligence Supervisor: Philip Sterne Supervisee: John Richter.
Mike Taks Bram van de Klundert. About Published 2005 Cited 286 times Kenneth O. Stanley Associate Professor at University of Central Florida Risto Miikkulainen.
Learning Othello The quest for general strategy building.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Evolving Reactive NPCs for the Real-Time Simulation Game.
Pac-Man AI using GA. Why Machine Learning in Video Games? Better player experience Agents can adapt to player Increased variety of agent behaviors Ever-changing.
Interactive Evolution in Automated Knowledge Discovery Tomáš Řehořek March 2011.
Artificial Intelligence Research in Video Games By Jacob Schrum
Evolving Multimodal Networks for Multitask Games
CAP6938 Neuroevolution and Developmental Encoding Evolving Adaptive Neural Networks Dr. Kenneth Stanley October 23, 2006.
CAP6938 Neuroevolution and Artificial Embryogeny Leaky Integrator Neurons and CTRNNs Dr. Kenneth Stanley March 6, 2006.
CAP6938 Neuroevolution and Artificial Embryogeny Evolving Adaptive Neural Networks Dr. Kenneth Stanley March 1, 2006.
Comparative Reproduction Schemes for Evolving Gathering Collectives A.E. Eiben, G.S. Nitschke, M.C. Schut Computational Intelligence Group Department of.
CAP6938 Neuroevolution and Developmental Encoding Intro to Neuroevolution Dr. Kenneth Stanley September 18, 2006.
Riza Erdem Jappie Klooster Dirk Meulenbelt EVOLVING MULTI-MODAL BEHAVIOR IN NPC S.
Competition and Cooperation
Design of a real time strategy game with a genetic AI By Bharat Ponnaluri.
CAP6938 Neuroevolution and Artificial Embryogeny Approaches to Neuroevolution Dr. Kenneth Stanley February 1, 2006.
CAP6938 Neuroevolution and Artificial Embryogeny Evolutionary Computation Theory Dr. Kenneth Stanley January 25, 2006.
CAP6938 Neuroevolution and Artificial Embryogeny Evolutionary Comptation Dr. Kenneth Stanley January 23, 2006.
CAP6938 Neuroevolution and Artificial Embryogeny Real-time NEAT Dr. Kenneth Stanley February 22, 2006.
Evolutionary Computing Systems Lab (ECSL), University of Nevada, Reno 1 Authors : Siming Liu, Christopher Ballinger, Sushil Louis
CAP6938 Neuroevolution and Artificial Embryogeny Real-time NEAT Dr. Kenneth Stanley February 22, 2006.
Selected Topics in CI I Genetic Programming Dr. Widodo Budiharto 2014.
Dr. Kenneth Stanley September 11, 2006
Evolutionary Algorithms Jim Whitehead
USING MICROBIAL GENETIC ALGORITHM TO SOLVE CARD SPLITTING PROBLEM.
CAP6938 Neuroevolution and Artificial Embryogeny Artificial Embryogeny
Dr. Kenneth Stanley January 30, 2006
Evolution strategies Can programs learn?
Dr. Kenneth Stanley September 25, 2006
Evolving Multimodal Networks for Multitask Games
Announcements Homework 3 due today (grace period through Friday)
Dr. Kenneth Stanley February 6, 2006
CHAPTER I. of EVOLUTIONARY ROBOTICS Stefano Nolfi and Dario Floreano
Coevolutionary Automated Software Correction
Presentation transcript:

CAP6938 Neuroevolution and Artificial Embryogeny Competitive Coevolution Dr. Kenneth Stanley February 20, 2006

Example: I Want to Evolve a Go Player Go is one of the hardest games for computers I am terrible at it There are no good Go programs either (hypothetically) I have no idea how to measure the fitness of a Go player How can I make evolution solve this problem?

Generally: Fitness May Be Difficult to Formalize Optimal policy in competitive domains unknown Only winner and loser can be easily determined What can be done?

Competitive Coevolution Coevolution: No absolute fitness function Fitness depends on direct comparisons with other evolving agents Hope to discover solutions beyond the ability of fitness to describe Competition should lead to an escalating arms race

The Arms Race

The Arms Race is an AI Dream Computer plays itself and becomes champion No need for human knowledge whatsoever In practice, progress eventually stagnates (Darwen 1996; Floreano and Nolfi 1997; Rosin and Belew 1997)

So Who Plays Against Whom? If evaluation is expensive, everyone can’t play everyone Even if they could, a lot of candidates might be very poor If not everyone, who then is chosen as competition for each candidate? Need some kind of intelligent sampling

Challenges with Choosing the Right Opponents Red Queen Effect: Running in Circles –A dominates B –C dominates B –A dominates B Overspecialization –Optimizing a single skill to the neglect of all others –Likely to happen without diverse opponents in sample Several other failure dynamics

Heuristic in NEAT: Utilize Species Champions Each individual plays all the species champions and keeps a score

Hall of Fame (HOF) (Rosin and Belew 1997) Keep around a list of past champions Add them to the mix of opponents If HOF gets too big, sample from it

More Recently: Pareto Coevolution Separate learners and tests The tests are rewarded for distinguishing learners from each other The learners are ranked in Pareto layers –Each test is an objective –If X wins against a superset of tests that Y wins again, then X Pareto-dominates Y –The first layer is a nondominated front –Think of tests as objectives in a multiobjective optimization problem Potentially costly: All learners play all tests De Jong, E.D. and J.B. Pollack (2004). Ideal Evaluation from Coevolution Evolutionary Computation, Vol. 12, Issue 2, pp , published by The MIT Press.Ideal Evaluation from Coevolution The MIT Press

Choosing Opponents Isn’t Everything How can new solutions be continually created that maintain existing capabilities? Mutations that lead to innovations could simultaneously lead to losses What kind of process ensures elaboration over alteration?

Alteration vs. Elaboration

Answer: Complexification Fixed-length genomes limit progress Dominant strategies that utilize the entire genome must alter and thereby sacrifice prior functionality If new genes can be added, dominant strategies can be elaborated, maintaining existing capabilities

Test Domain: Robot Duel Robot with higher energy wins by colliding with opponent Moving costs energy Collecting food replenishes energy Complex task: When to forage/save energy, avoid/pursue?

Robot Neural Networks

Experimental Setup 13 complexifying runs, 15 fixed-topology runs 500 generations per run 2-population coevolution with hall of fame (Rosin & Belew 1997)

Performance is Difficult to Evaluate in Coevolution How can you tell if things are improving when everything is relative? –Number of wins is relative to each generation No absolute measure is available No benchmark is comprehensive

Expensive Method: Master Tournament (Cliff and Miller 1995; Floreano and Nolfi 1997) Compare all generation champions to each other Requires n^2 evaluations –An accurate evaluation may involve e.g. 288 games Defeating more champions does not establish superiority

Strict and Efficient Performance Measure: Dominance Tournament (Stanley & Miikkulainen 2002)

Result: Evolution of Complexity As dominance increases so does complexity on average Networks with strictly superior strategies are more complex

Comparing Performance

Summary of Performance Comparisons

The Superchamp

Cooperative Coevolution Groups attempt to work with each other instead of against each other But sometimes it’s not clear what’s cooperation and what’s competition Maybe competitive/cooperative is not the best distinction? –Newer idea: Compositional vs. test-based

Summary Picking best opponents Maintaining and elaborating on strategies Measuring performance Different types of coevolution Advanced papers on coevolution: Ideal Evaluation from CoevolutionIdeal Evaluation from Coevolution by De Jong, E.D. and J.B. Pollack (2004) Monotonic Solution Concepts in Coevolution by Ficici, Sevan G. (2005) Monotonic Solution Concepts in Coevolution

Next Topic: Real-time NEAT (rtNEAT) Simultaneous and asynchronous evaluation Non-generational Useful in video games and simulations NERO: Video game with rtNEAT Homework due 2/27/06: Working genotype to phenotype mapping. Genetic representation completed. Saving and loading of genome file I/O functions completed. Turn in summary, code, and examples demonstrating that it works. -Shorter symposium paper: Evolving Neural Network Agents in the NERO Video Game by Kenneth O. Stanley and Risto Miikkulainen (2005) -Optional journal (longer, more detailed) paper: Real-time Neuroevolution in the NERO Video Game by Kenneth O. Stanley and Risto Miikkulainen (2005)Evolving Neural Network Agents in the NERO Video Game Real-time Neuroevolution in the NERO Video Game - -Extra coevolution papers

Project Milestones (25% of grade) 2/6: Initial proposal and project description 2/15: Domain and phenotype code and examples 2/27: Genes and Genotype to Phenotype mapping 3/8: Genetic operators all working 3/27: Population level and main loop working 4/10: Final project and presentation due (75% of grade)