The Parameterized Poker Squares EAAI NSG Challenge

Slides:



Advertisements
Similar presentations
The Poker Squares Family: cards, letters, and hexes Todd W. Neller.
Advertisements

© Glenn Rowe AC Lab 2 A simple card game (twenty- one)
Mock Objects. Unit-testing and TDD are challenging They require some effort to: – Write a test for a small functionality – Refactor production code and.
Todd W. Neller Gettysburg College
Adversarial Search We have experience in search where we assume that we are the only intelligent being and we have explicit control over the “world”. Lets.
CSE 332: Lab 6 overview Overview of Lab 6 Your Lab 6 program will –Correct a glitch in the lab 5 specification Use exceptions only for abnormal flow control.
Techniques for Dealing with Hard Problems Backtrack: –Systematically enumerates all potential solutions by continually trying to extend a partial solution.
CS 484 – Artificial Intelligence
Dealer Comm Hand Player makes Ante bet and optional Bonus bet. Five cards are dealt to each player from the shuffler. Five cards are dealt from the shuffler.
Great Theoretical Ideas in Computer Science.
Midterm Review Solutions Math 210G-04, Spring 2011.
1 search CS 331/531 Dr M M Awais A* Examples:. 2 search CS 331/531 Dr M M Awais 8-Puzzle f(N) = g(N) + h(N)
Shallow Blue Project 2 Due date: April 5 th. Introduction Second in series of three projects This project focuses on getting AI opponent Subsequent project.
1 Probability Parts of life are uncertain. Using notions of probability provide a way to deal with the uncertainty.
1 Section 5.1 Discrete Probability. 2 LaPlace’s definition of probability Number of successful outcomes divided by the number of possible outcomes This.
Intro to Probability & Games
Online Chess Project 3 Due date: April 17 th. Introduction Third in series of three projects This project focuses on adding online support –2 players.
Monte Carlo Go Has a Way to Go Haruhiro Yoshimoto (*1) Kazuki Yoshizoe (*1) Tomoyuki Kaneko (*1) Akihiro Kishimoto (*2) Kenjiro Taura (*1) (*1)University.
double AlphaBeta(state, depth, alpha, beta) begin if depth
A R R A Y ! for Multiplication!!
Laws of Probability What is the probability of throwing a pair of dice and obtaining a 5 or a 7? These are mutually exclusive events. You can’t throw.
VOCABULARY  Deck or pack  Suit  Hearts  Clubs  Diamonds  Spades  Dealer  Shuffle  Pick up  Rank  Draw  Set  Joker  Jack 
The Poker Squares Challenge Todd W. Neller. What is the Poker Squares Challenge? A semester-long contest where Gettysburg College students (individuals.
Dice Games Properly Presented Todd W. Neller. What’s Up With the Title? My favorite book on dice games: – Many games – Diverse games – Well-categorized.
1 The game of poker You are given 5 cards (this is 5-card stud poker) The goal is to obtain the best hand you can The possible poker hands are (in increasing.
Permutations and Combinations
UNIT PLAN PROBABILITY AND STATISTICS FOR 9 th AND 10 th GRADE By EVANS ASUBONTENG.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Combinatorics.
Game Playing.
CSCE 2100: Computing Foundations 1 Probability Theory Tamara Schneider Summer 2013.
Introduction for Rotarians
Poker Solutions.
Game Rules Rules define game objects and define allowable actions by the players.
Counting. Techniques for counting Rule 1 Suppose we carry out have a sets A 1, A 2, A 3, … and that any pair are mutually exclusive (i.e. A 1  A 2 =
Basic Counting Lecture 12: Oct 28. This Lecture We will study some basic rules for counting. Sum rule, product rule, generalized product rule Permutations,
Upper Confidence Trees for Game AI Chahine Koleejan.
Agents that can play multi-player games. Recall: Single-player, fully-observable, deterministic game agents An agent that plays Peg Solitaire involves.
Game-playing AIs Part 1 CIS 391 Fall CSE Intro to AI 2 Games: Outline of Unit Part I (this set of slides)  Motivation  Game Trees  Evaluation.
GOMOKU ALGORITHM STUDY MIN-MAX AND MONTE CARLO APPROACHING
Homework Homework due now. Reading: relations
Games. Adversaries Consider the process of reasoning when an adversary is trying to defeat our efforts In game playing situations one searches down the.
Games 1 Alpha-Beta Example [-∞, +∞] Range of possible values Do DF-search until first leaf.
Senior Project Poster Day 2007, CIS Dept. University of Pennsylvania Reversi Meng Tran Faculty Advisor: Dr. Barry Silverman Strategies: l Corners t Corners.
INTELLIGENT SYSTEM FOR PLAYING TAROK
MA471 Fall 2003 Lecture 2. On With The Games Today we are going to watch each group play a couple of rounds of cards. We will go through the game slowly.
Math-Nuts Janie, Johnny, Jesse, Bob Introduction to Algebra and Geometry Fall 2011.
Introduction Lecture 25 Section 6.1 Wed, Mar 22, 2006.
Game Procedures Who does what, where, when, and how?
Pedagogical Possibilities for the 2048 Puzzle Game Todd W. Neller.
ARTIFICIAL INTELLIGENCE (CS 461D) Princess Nora University Faculty of Computer & Information Systems.
The Standard Genetic Algorithm Start with a “population” of “individuals” Rank these individuals according to their “fitness” Select pairs of individuals.
GOMOKU ALGORITHM STUDY MIN-MAX AND MONTE CARLO APPROACHING
By: John Cook 11/06/2009 PTTE John Cook 3/4/2016.
Stat 35b: Introduction to Probability with Applications to Poker Outline for the day: 1.Expected value. 2.Heads up with AA. 3.Heads up with Gus vs.
Probability and statistics - overview Introduction Basics of probability theory Events, probability, different types of probability Random variable, probability.
Texas Holdem A Poker Variant vs. Flop TurnRiver. How to Play Everyone is dealt 2 cards face down (Hole Cards) 5 Community Cards Best 5-Card Hand Wins.
Arrangements and Selections (and how they may be counted)
POKER-6S ooA, OOD, OOP.
Stochastic tree search and stochastic games
Sequential Placement Optimization Games: Poker Squares, Word Squares, and Take It Easy! Todd W. Neller Abstract: In this talk, we teach how to play “Poker.
Adversarial Search.
A Sampling of Chess and Chip Games
The Birds of a Feather Research Challenge
The Parameterized Poker Squares EAAI NSG Challenge
Strategies for Poker AI player
Probability of casino games
J. J. Divin Astrid Glende Jon McClain Lorenzo Moore
A R R A Y ! for Multiplication!!
Term Project: Poker Game
A R R A Y ! for Multiplication!!
Presentation transcript:

The Parameterized Poker Squares EAAI NSG Challenge

What is the EAAI NSG Challenge? Goal: a fun way to encourage good, faculty-mentored undergraduate research experiences that includes an option for peer-reviewed paper publication and presentation at a major CS conference. Students may work independently or in teams with a faculty mentor to meet the challenge. Challenge submissions and associated papers would be submitted at the following EAAI paper submission deadline (late-summer, one month before AAAI submission deadline). At the next EAAI (February 2016): challenge results, accepted paper presentations, announcement of next NSG Challenge EAAI 2016 NSG Challenge: Parameterized Poker Squares

Poker Squares (Original) Materials: shuffled standard (French) 52-card card deck, paper with 5-by-5 grid, and pencil Each turn, a player draws a card and writes the card rank and suit in an empty grid position. After 25 turns, the grid is full and the player scores each grid row and column as a 5-card poker hand according to a given point system.

American Point System Poker Hand Points Description Example Royal Flush 100 A 10-J-Q-K-A sequence all of the same suit 10, J, Q, K, A Straight Flush 75 Five cards in sequence all of the same suit A, 2, 3, 4, 5 Four of a Kind 50 Four cards of the same rank 9, 9, 9, 9, 6 Full House 25 Three cards of one rank with two cards of another rank 7, 7, 7, 8, 8 Flush 20 Five cards all of the same suit A, 2, 3, 5, 8 Straight 15 Five cards in sequence; Aces may be high or low but not both 8, 9, 10, J, Q Three of a Kind 10 Three cards of the same rank 2, 2, 2, 5, 7 Two Pair 5 Two cards of one rank with two cards of another rank 3, 3, 4, 4, A One Pair 2 Two cards of one rank 5, 5, 9, Q, A High Card None of the above 2, 3, 5, 8, Q

Scoring Examples

Let’s Play Web: http://games.emantranet.com/PokerSquare.aspx Other platforms: See http://tinyurl.com/pokersqrs

Parameterization of Poker Squares The American Point System (0, 2, 5, 10, 15, 20, 25, 50, 75, 100) is based on hand rank in Poker. The British Point System (1, 3, 6, 12, 5, 10, 16, 30, 30) is based on the difficulty of forming the hands in Poker Squares. (a.k.a. English Point System) For our challenge, AI players will be given the scoring system at play time with points in the range [-128, 127]. Examples: Ameritish point systems: random variations on American and British systems Single Hand: 1 point for one hand type, 0 points otherwise Hypercorners: all 1 or -1 score values Random: random score system in range [-128, 127]

Player Interface Java software supplied manages games, tournaments, scoring, partial scoring The Java PokerSquaresPlayer interface is very simple: void setPointSystem(PokerSquaresPointSystem system, long millis) – Player is given a point system and a given amount of milliseconds (300K) to form strategy from the point system void init() – Player initializes before a game (e.g. clear board, game history) int[] getPlay(Card card, long millisRemaining) – Given a drawn card and the number of ms remaining in the 30 second game, return the 0-based row and column the player chooses for the Card String getName() – return the player name

Structure of the Game The game is structured as an alternating sequence of chance nodes and player choice nodes. Each card draw is a probabilistic event where any remaining card is drawn with equal probability. Each player action is a commitment to a card placement. chance choice chance choice

Game Tree Size How big is the Poker Squares game tree? Root chance node: 52 possible cards 52 depth-1 choice nodes: 25 possible placements 52x25 depth-2 chance nodes: 51 possible cards 52x25x51 depth-3 choice nodes: 24 possible placements … 52!/27! x 25! = 52!/(27x26)  1.15x1065 nodes Although: Different draw/play sequences can lead to the same state. Rows/columns may be reordered without affecting score. Still, we will not be able to evaluate entire expectimax trees except for much smaller end-game situations.

Possible Approaches Rule-based Reinforcement Learning Abstract each (partial) hand in play according to scoring potential Learn expected hand score from simulated experience Expectimax variations E.g. Iterative-deepening, iterative-broadening Monte Carlo Tree Search (MCTS) ???

Evaluation Players will be evaluated using 4 to 6 point systems. For each point system: Players will have 5 minutes to process the point system, forming strategy, etc. Players will then play 100 games with 30 seconds of decision time per game. Total scores will be linearly scaled between 0 (min. total score) and 1 (max. total score). The player with the maximum sum of scaled total scores is the winner.

Supplied Code Classes PokerSquares – manages tournaments, individual games, game sequences, etc. Card – models playing card PokerHand – classifies full/partial Poker hand PokerSquaresPointSystem – generates point systems, scores hands and grids, prints grids PokerSquaresPlayer – player interface Sample PokerSquaresPlayer implementations

What’s Next? Share Poker Squares with potential faculty mentors and undergraduate student researchers. Form teams and contact the contest organizer (Todd Neller) to get on the contest mailing list for updates. Download software and resources from http://tinyurl.com/ppokersqrs Enjoy creating and experimenting with your Parameterized Poker Squares players. Submit them for evaluation towards the end of the summer 2015. Optional: Write up your research results with your faculty mentor and submit your paper to EAAI one month later. If accepted, present your work at EAAI in February 2016. Papers will appear with EAAI papers in the AAAI proceedings.

Resources and References Organizer email: Todd Neller <tneller@gettysburg.edu> Poker Squares Page: http://tinyurl.com/pokersqrs References Rules and play grids Parameterized Poker Squares Page: http://tinyurl.com/ppokersqrs Monte Carlo Tree Search (MCTS): C. Browne et al. A Survey of Monte Carlo Tree Search Methods L. Kocsis, C. Szepesvari. Bandit based Monte-Carlo Planning. http://www.mcts.ai/?q=mcts MCTS application to similar problem: R. Lorentz. An MCTS Program to Play EinStein Würfelt Nicht!