Probabilistic Dynamic Programming

Slides:



Advertisements
Similar presentations
Chip amount matches bet amount – Deal button near chips – Clear Button.
Advertisements

This is a game to help you learn all the names of my family.
Chapter 11 Dynamic Programming.
EGR 141 Computer Problem Solving in Engineering and Computer Science
Prof. Anthony Patt, Potsdam Institute and Boston University Pablo Suarez, Boston University Chiedza Gwata, University of Zimbabwe Seasonal Forecasts, Scientific.
An Introduction to Markov Chains. Homer and Marge repeatedly play a gambling game. Each time they play, the probability that Homer wins is 0.4, and the.
Section 5.7 Suppose X 1, X 2, …, X n is a random sample from a Bernoulli distribution with success probability p. Then Y = X 1 + X 2 + … + X n has a distribution.
Mathematical Expectation Making the game fair. Make the Bet = X (we need to figure it out) Create the Table Under the column for outcome (O) subtract.
The game of Craps Rules of play: 1. Played with two dice (six faces to a die – numbers 1-6 per face) 2. Sequence of betting rounds (or just rounds) 3.
ECES 741: Stochastic Decision & Control Processes – Chapter 1: The DP Algorithm 49 DP can give complete quantitative solution Example 1: Discrete, finite.
Suppose someone bets (or raises) you, going all-in. What should your chances of winning be in order for you to correctly call? Let B = the amount bet to.
Copyright Robert J. Marks II ECE 5345 Multiple Random Variables: A Gambling Sequence.
Decision Theory: Outcomes and Consequences Not Considered Decision Outcomes Consequences Decision: choose between betting on green or red chip. Objective:
Dependent and Independent Events. Events are said to be independent if the occurrence of one event has no effect on the occurrence of another. For example,
Copyright 2013, 2010, 2007, Pearson, Education, Inc. Section 12.4 Expected Value (Expectation)
Craps Probability Solution Or: More than you ever wanted to know about conditional probability!
Charting your stocks By Garth Mackenzie. 1.Identify the major trend 2.Time your entry 3.Execute the trade 4.Manage the risk.
Lesson 96 – Expected Value & Variance of Discrete Random Variables HL2 Math - Santowski.
The Features of Best Online Casinos Canada Casino In Canada.
Story Board : UEFA EURO 2016 Knockout stages from the eyes of data.
Unit 7: Chance Variation
An Introduction to Markov Chains
Probability OCR Stage 6.
Probability Grade 7: Make predictions, given a probability
Expected values of games
I like to play games and I like to win!
Chapter 11 Dynamic Programming.
INDIANA’S GAME Indiana’s game, we’re talking basketball.
Lecture 11.
Stat 35b: Introduction to Probability with Applications to Poker
Ten Commandments of Arsenal FC
Buy Best Spy Marked Cheating Playing Cards in Kolkata.
The participants learn to construct a line by 2 points.
Dot Cards Face-Off Game
Whiteboardmaths.com © 2004 All rights reserved
How to Make Money in Online Sports Betting World!
YEAR 11 MATHS REVISION Probability.
Strategies for Poker AI player
IENG 362 Markov Chains.
The Game: Attack and Defend
أنماط الإدارة المدرسية وتفويض السلطة الدكتور أشرف الصايغ
Probability 14.1 Experimental Probability 14.2 Principles of Counting
Gambling Quiz Gambling Quiz.
Multi-Step Equation Game! Play!.
Money and Consumer Choice
10:00.
Pirate Place Value Coloring Game
ROULETTE Yasmina, Phoebe, and Roshen.
How to play indoor soccer
Source: 2018 Delaware State Epidemiological Profile
Continuous Dynamic Programming
A First Grade Number Routine
Connect 4 Michael Yura.
Twenty-one In each round, give all teams their additional cards at the same time (not one at a time). So ask if they will be taking another card. They.
冀教版 三年级上册 Lesson 8 Letters.
Fun… Tree Diagrams… Probability.
Know about the physical aspects of human reproduction
Let’s Win a Lollipop! Door 1 Door 2 Door 3 Win! You pick...and stay with and the Lose.   You pick...and stay with... Door 1 Door 2 Door 3 ...and.
Types of Events Groups of 3 Pick a Card
Goal Space Parts Rules Mechanics Space – Where the game takes place
How well do you KNOW your 2 times table ?
Section 12.4 Expected Value (Expectation)
Connect 4 michael yura.
Connect 4 Michael Yura.
Playing 25 Lines ( Total Bet 25 ).
Presentation transcript:

Probabilistic Dynamic Programming

Gambling Problem Start with 3 chips Have 3 plays of the game P{win} = 2/3 Object: have 5 or more chips at game end

Gambling Problem Let n = play of the game (1, 2, 3) Sn = # chips on hand to begin play n Xn = # chips to bet at play n If s3 = 0, 1, 2, we can not bet enough chips to meet objective

Gambling Problem 3 4 5 2 1 P{win}=2/3 No bet P{Lose}=1/3

Gambling Problem Consider Bet 3 5 4 3 2 1 Consider Bet 3 If S3 > 5, we have met objective, x3 = 0, or x3 = s3 - 5

Gambling Problem Consider Bet 3 5 4 3 2 1 Consider Bet 3 If S3 > 5, we have met objective, x3 = 0, or x3 = s3 - 5 If S3 = 4, we could bet 1, 2, 3, or 4

Gambling Problem Consider Bet 3 5 4 3 2 1 Consider Bet 3 If S3 > 5, we have met objective, x3 = 0, or x3 = s3 - 5 If S3 = 4, we could bet 1, 2, 3, or 4 If S3 = 3, we could bet 2 or 3

Gambling Problem Consider Bet 3 5 4 3 2 1 Consider Bet 3 If S3 > 5, we have met objective, x3 = 0, or x3 = s3 - 5 If S3 = 4, we could bet 1, 2, 3, or 4 If S3 = 3, we could bet 2 or 3 If S3 < 2, we can’t win, no bet

Gambling Problem 3 4 5 2 1

Gambling Problem Consider Bet 2 5 4 3 2 1 Consider Bet 2 If S2 > 5, we have met objective, x2 = 0, or x2 = s2 - 5

Gambling Problem Consider Bet 2 5 4 3 2 1 Consider Bet 2 If S2 > 5, we have met objective, x2 = 0, or x2 = s2 - 5 If S2 = 4, we could bet 0, 1, 2, 3, or 4

Gambling Problem Consider Bet 2 5 4 3 2 1 Consider Bet 2 If S2 > 5, we have met objective, x2 = 0, or x2 = s2 - 5 If S2 = 4, we could bet 0, 1, 2, 3, or 4 If S2 = 3, we could bet 0, 1, 2 or 3 If S2 = 2, we can bet 1 or 2

Gambling Problem 5 4 3 2 1 Bet 2, If Sn = 0, 1 P{objective} = 0 Bet 2, If Sn = 0, 1 P{objective} = 0 If Sn = 2, x2 = 1 or 2

Gambling Problem 5 4 3 2 1 Bet 2, If Sn = 2, x2 = 1 Bet 2, If Sn = 2, x2 = 1 P{objective} = P{win stage 2 and win stage 3}

Gambling Problem 5 4 3 2 1 Bet 2, If xn = 1 Bet 2, If xn = 1 P{objective} = P{lose}P{win 3} + P{win}P{win 3} = 1/3(0) + 2/3(2/3) = 4/9

Gambling Problem 5 4 3 2 1 S2=4, x2=1 P{objective} = P{win2}P{win3} S2=4, x2=1 P{objective} = P{win2}P{win3} + P{lose2}P{win3} = (2/3)(1)+(1/3)(2/3) = 8/9

Gambling Problem 5 4 3 2 1 S1=3, X1=1 P(Objective) = 2/3P{meeting objective from S2=4} + 1/3P{meeting objective from S2=2} = (2/3)(8/9) + (1/3)(4/9) = 20/27 3 4 5 2 1

Recapture Optimal 5 4 3 2 1 x1* = 1 if win x2* = 1 x1* = 1 if win x2* = 1 if lose x2* = 1 or 2