Games of Chance Introduction to Artificial Intelligence COS302 Michael L. Littman Fall 2001.

Slides:



Advertisements
Similar presentations
Lecture 13. Poker Two basic concepts: Poker is a game of skill, not luck. If you want to win at poker, – make sure you are very skilled at the game, and.
Advertisements

Introduction to Game Theory
GAME THEORY.
Adversarial Search We have experience in search where we assume that we are the only intelligent being and we have explicit control over the “world”. Lets.
Two-Player Zero-Sum Games
Operations Research Assistant Professor Dr. Sana’a Wafa Al-Sayegh 2 nd Semester ITGD4207 University of Palestine.
MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)
Copyright (c) 2003 Brooks/Cole, a division of Thomson Learning, Inc
Study Group Randomized Algorithms 21 st June 03. Topics Covered Game Tree Evaluation –its expected run time is better than the worst- case complexity.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Zero-Sum Games (follow-up)
For Friday Finish chapter 5 Program 1, Milestone 1 due.
February 7, 2006AI: Chapter 6: Adversarial Search1 Artificial Intelligence Chapter 6: Adversarial Search Michael Scherger Department of Computer Science.
1 CSC 550: Introduction to Artificial Intelligence Fall 2008 search in game playing  zero-sum games  game trees, minimax principle  alpha-beta pruning.
Lecture 12 Last time: CSPs, backtracking, forward checking Today: Game Playing.
10/19/2004TCSS435A Isabelle Bichindaritz1 Game and Tree Searching.
Games CPSC 386 Artificial Intelligence Ellen Walker Hiram College.
Part 3: The Minimax Theorem
Minimax and Alpha-Beta Reduction Borrows from Spring 2006 CS 440 Lecture Slides.
Markov Models Introduction to Artificial Intelligence COS302 Michael L. Littman Fall 2001.
This time: Outline Game playing The minimax algorithm
Games and adversarial search
Duality Lecture 10: Feb 9. Min-Max theorems In bipartite graph, Maximum matching = Minimum Vertex Cover In every graph, Maximum Flow = Minimum Cut Both.
Matrix Games Mahesh Arumugam Borzoo Bonakdarpour Ali Ebnenasir CSE 960: Selected Topics in Algorithms and Complexity Instructor: Dr. Torng.
Game Trees Introduction to Artificial Intelligence COS302 Michael L. Littman Fall 2001.
Lectures in Microeconomics-Charles W. Upton Minimax Strategies.
Minimax Strategies. Everyone who has studied a game like poker knows the importance of mixing strategies. –With a bad hand, you often fold –But you must.
Game Theory Statistics 802. Lecture Agenda Overview of games 2 player games representations 2 player zero-sum games Render/Stair/Hanna text CD QM for.
Experts Learning and The Minimax Theorem for Zero-Sum Games Maria Florina Balcan December 8th 2011.
Game Playing: Adversarial Search Chapter 6. Why study games Fun Clear criteria for success Interesting, hard problems which require minimal “initial structure”
Games with Hidden Information
Utility Theory & MDPs Tamara Berg CS Artificial Intelligence Many slides throughout the course adapted from Svetlana Lazebnik, Dan Klein, Stuart.
Minimax Trees: Utility Evaluation, Tree Evaluation, Pruning CPSC 315 – Programming Studio Spring 2008 Project 2, Lecture 2 Adapted from slides of Yoonsuck.
Game Playing. Introduction Why is game playing so interesting from an AI point of view? –Game Playing is harder then common searching The search space.
Game Playing ECE457 Applied Artificial Intelligence Spring 2007 Lecture #5.
Agents that can play multi-player games. Recall: Single-player, fully-observable, deterministic game agents An agent that plays Peg Solitaire involves.
Adversarial Search CS311 David Kauchak Spring 2013 Some material borrowed from : Sara Owsley Sood and others.
The Design & Analysis of the Algorithms Lecture by me M. Sakalli Download two pdf files..
For Wednesday Read chapter 7, sections 1-4 Homework: –Chapter 6, exercise 1.
Game Playing. Introduction One of the earliest areas in artificial intelligence is game playing. Two-person zero-sum game. Games for which the state space.
Quiz 4 : Minimax Minimax is a paranoid algorithm. True
For Friday Finish chapter 6 Program 1, Milestone 1 due.
Artificial Intelligence and Searching CPSC 315 – Programming Studio Spring 2013 Project 2, Lecture 1 Adapted from slides of Yoonsuck Choe.
Part 3 Linear Programming
Models for Strategic Marketing Decision Making. Market Entry Decisions To enter first or to wait Sources of First-Mover Advantages –Technological leadership.
1. 2 You should know by now… u The security level of a strategy for a player is the minimum payoff regardless of what strategy his opponent uses. u A.
Game tree search Thanks to Andrew Moore and Faheim Bacchus for slides!
Zero-sum Games The Essentials of a Game Extensive Game Matrix Game Dominant Strategies Prudent Strategies Solving the Zero-sum Game The Minimax Theorem.
ARTIFICIAL INTELLIGENCE (CS 461D) Princess Nora University Faculty of Computer & Information Systems.
Statistics Overview of games 2 player games representations 2 player zero-sum games Render/Stair/Hanna text CD QM for Windows software Modeling.
Game Playing: Adversarial Search chapter 5. Game Playing: Adversarial Search  Introduction  So far, in problem solving, single agent search  The machine.
MiniMax Principle in Game Theory Slides Made by Senjuti Basu Roy.
Game Algorithms Prepared for COSC 6111 By Stephanie Wilson November 15th, 2006.
Lecture 13.
4. Games and adversarial search
Game Theory Just last week:
PENGANTAR INTELIJENSIA BUATAN (64A614)
Adversarial Search and Game Playing (Where making good decisions requires respecting your opponent) R&N: Chap. 6.
Adversarial Search.
David Kauchak CS52 – Spring 2016
Extensive-form games and how to solve them
Game Playing in AI by: Gaurav Phapale 05 IT 6010
Artificial Intelligence
Artificial Intelligence and Searching
Kevin Mason Michael Suggs
Artificial Intelligence
Lecture 20 Linear Program Duality
Adversarial Search CMPT 420 / CMPG 720.
Artificial Intelligence and Searching
CS51A David Kauchak Spring 2019
Minimax Trees: Utility Evaluation, Tree Evaluation, Pruning
Presentation transcript:

Games of Chance Introduction to Artificial Intelligence COS302 Michael L. Littman Fall 2001

Administration Rush hour (10/22). Today not part of midterm (10/24), just final.

Uncertainty in Search We’ve assumed everything is known: starting state, neighbors, goals, etc. Often need to make decisions even though some things are uncertain. Complicates things…

Types of Uncertainty Opponent: What will other player do? MinimaxMinimax Outcome: Which neighbor get? Model via probability distributionModel via probability distribution State: Where are we now? Hidden informationHidden information Transition: What are the rules? Need to use learning to find outNeed to use learning to find out

Nim-Rand Pile of sticks. Lose if take last stick.Lose if take last stick. On your turn, take 1 or 2.On your turn, take 1 or 2. Flip a coin. If H, take 1 more.Flip a coin. If H, take 1 more. Which type of uncertainty?

Value of a Game Without randomness: maximize your winnings in the worst case. With randomness: maximize your expected winnings in the worst case. Want to do well on average. What games are like this?

Nim-Rand Tree (|||)-Xcc (||)-Y (|)-Y(|)-Y()-Y c ()-X()-X()-X (|)-X ()-X ()-Y

Nim-Rand Values (|||)-Xcc (||)-Y (|)-Y(|)-Y()-Y c ()-X()-X()-X (|)-X ()-X ()-Y

Search Model States, terminal states (G), values for terminal states (V). X states (maximizer), Y states (minimizer), Z states (chance) For all s in Z, for all s’ in N(s) P(s’|s) is the probability of reaching s’ from s.

Game Value (no loops) Gameval(s) = { If (G(s)) return V(s) Else if s in X return max s’ in N(s) Gameval(s’) Else if s in Y return min s’ in N(s) Gameval(s’) Else return sum s’ in N(s) P(s’|s) Gameval(s’) }

Games with Loops No known poly time algorithm. Approximated by value iteration: For all s, if G(s), L(s) = V(s), else 0 Repeat until changes are small: for all s, L(s) = max, min, avg L(s’), s’ in N(s) depending on s in X, Y, or Z.

Hidden Information Games like Poker, 2-player bridge, Scrabble ™, Diplomacy, Stratego Don’t fit game tree model, even when chance nodes included.

Pure Strategies X: I : 1=L, 4=L II : 1=L, 4=R III : 1=R, 4=L IV : 1=R, 4=R Y: I : 2=L, 3=R II : 2=M, 3=R III : 2=R, 3=R X Y-2 Y-3 X-4 L R L R L M R R

Matrix Form Summarizes all decisions in one for each, chosen simultaneously X- I X- II X- III X- IV Y- I 7722 Y- II 3322 Y- III 422

Value of Matrix Game X picks column with largest min Y picks row with smallest max X- I X- II X- III X- IV Y- I 7722 Y- II 3322 Y- III 422

Minimax Von Neumann proved zero-sum matrix game, minimax=maximin. Given perfect information (no state uncertainty), there exists optimal pure strategy for each player.

Game w/ Chance Nodes X c Y-3 c L R 0.5 R L Use expected values X- I X- I (L) X- II X- II (R) Y- I Y- I (L) Y- II Y- II (R)-8+3

More General Matrices What game tree leads to this matrix? Does von Neumann’s theorem still hold? X- I X- I (L) X- II X- II (R) Y- I Y- I (L)10 Y- II Y- II (R)01

Hidden Info. Matrices X picks L or R, keeping the choice hidden from Y. Y makes a choice. X’s choice is revealed and game ends. X- I X- I (L) X- II X- II (R) Y- I Y- I (L)10 Y- II Y- II (R)01

Micro Poker X is dealt high or low card, holds/folds. Y folds/sees. High card wins Y can’t see X’s card. c X-L X-H Y fold hold 0.5 Y see fold see hold

Matrix Form Player X can guarantee itself +1 on average. How? It can even announce its strategy. X- I X- I (fold) X- II X- II (hold) Y- I Y- I (fold)-5+10 Y- II Y- II (see)+5-5

Mixed Strategies Pick a number p. X: With prob. p, fold; else hold. Since Y doesn’t know what’s coming, the response will sometimes work, sometimes not.

Guess a Probability X announces p=1/3. Y’s pick? X- I X- I (fold) X- II X- II (hold) Y- I Y- I (fold)-5+10 Y- II Y- II (see)+5-5 Fold: +5 See: -1 2/3 see see

Guess a Probability X announces p=2/3. Y’s pick? X- I X- I (fold) X- II X- II (hold) Y- I Y- I (fold)-5+10 Y- II Y- II (see)+5-5 Fold: +0 See: +1 2/3 fold fold

All Strategies What should X pick for p to maximize its worst case? p=0.6 Payoff +1 see fold p

Randomizing Y If Y random, answer is the same. No matter what, X can guarantee itself +1. see fold

Bluffing c X-L X-H Y fold hold 0.5 Y see fold see hold X: On a low card, bluff with prob Y: On hold, fold with prob. 0.4.

Solving 2x2 Game X- I with prob. p X’s expected gain vs. Y- I : m 11 p+m 12 (1-p) vs. Y- II : m 21 p+m 22 (1-p) X- I X- II Y- I m 11 m 12 Y- II m 21 m 22 Maximize the minimum. Try p=0, p=1, where lines meet.

Solving General mxn Linear program: p 1,…,p n. p 1 +…+p n = 1, p i  0 Maximize X’s gain, g vs Y- I : m 11 p 1 + … +m n1 p n  g vs Y- II : m 12 p 1 + … +m n2 p n  g … Against all Y strategies.

Issues Can we solve poker? More than 2 playersMore than 2 players Not zero sum (collude)Not zero sum (collude) Huge state spaceHuge state space Poker: Opponent modeling Bridge: Use simulation to approximate

What to Learn Minimax value in games of chance and the DFS algorithm for computing it. Converting games to matrix form. Solve 2x2 game.

Homework 5 (due 11/7) 1.The value iteration algorithm from the Games of Chance lecture can be applied to deterministic games with loops. Argue that it produces the same answer as the “Loopy” algorithm from the Game Tree lecture. 2.Write the matrix form of the game tree below.

Game Tree X Y-2 Y-3 X-4 L R L R L R +5 L +2 R