Reinforcement Learning

Presentation on theme: "Reinforcement Learning"— Presentation transcript:

1 Reinforcement Learning

2 Overview: Tabular Methods, Approximate Methods, Deep Reinforcement Learning

3 Tabular Methods

4 Model: mathematical model of the environment's dynamics and reward
Policy: function mapping the agent's states to actions
Value function: expected future rewards from being in a state and/or taking an action when following a particular policy
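
The three ingredients on this slide can be sketched on a toy problem. The two-state MDP, the action names, the discount factor GAMMA and the function evaluate_policy below are all assumptions made for illustration; none of them come from the slides. The sketch stores the model as lookup tables, the policy as a state-to-action mapping, and computes the value function of that policy by repeated Bellman backups.

```python
# Hypothetical two-state, two-action example; every name and number is illustrative.
GAMMA = 0.9  # discount factor (assumed, not given on the slide)

# Model: P[s][a] lists (probability, next_state); R[s][a] is the expected reward.
P = {
    "s0": {"stay": [(1.0, "s0")], "go": [(1.0, "s1")]},
    "s1": {"stay": [(1.0, "s1")], "go": [(1.0, "s0")]},
}
R = {
    "s0": {"stay": 0.0, "go": 0.0},
    "s1": {"stay": 1.0, "go": 0.0},
}

# Policy: a deterministic mapping from each state to an action.
policy = {"s0": "go", "s1": "stay"}


def evaluate_policy(P, R, policy, gamma=GAMMA, tol=1e-10):
    """Value function of `policy`, found by iterating the Bellman expectation backup."""
    V = {s: 0.0 for s in P}
    while True:
        delta = 0.0
        for s in P:
            a = policy[s]
            v_new = R[s][a] + gamma * sum(p * V[s2] for p, s2 in P[s][a])
            delta = max(delta, abs(v_new - V[s]))
            V[s] = v_new
        if delta < tol:
            return V


V = evaluate_policy(P, R, policy)
print(V)
```

Under these assumed numbers, staying in s1 earns a reward of 1 per step, so its value approaches 1 / (1 - 0.9) = 10, and s0 is worth one discounted step less.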

5 MDP

6 Markov Reward Process

7 Markov Reward Process

8

9

10 MDP = MRP + Action

11 MDP + Policy
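
One way to read this slide title: once a policy is fixed, the actions of an MDP can be averaged out, leaving a Markov reward process over states alone. The sketch below illustrates that reduction under stated assumptions; the toy MDP, the stochastic policy pi and the helper induced_mrp are hypothetical and do not appear in the slides.

```python
# Hypothetical toy MDP: P[s][a] lists (probability, next_state); R[s][a] is the reward.
P = {
    "s0": {"stay": [(1.0, "s0")], "go": [(1.0, "s1")]},
    "s1": {"stay": [(1.0, "s1")], "go": [(1.0, "s0")]},
}
R = {
    "s0": {"stay": 0.0, "go": 0.0},
    "s1": {"stay": 1.0, "go": 0.0},
}

# Stochastic policy (assumed): pi[s][a] is the probability of taking action a in state s.
pi = {
    "s0": {"stay": 0.5, "go": 0.5},
    "s1": {"stay": 1.0, "go": 0.0},
}


def induced_mrp(P, R, pi):
    """Average the MDP's dynamics and rewards over the policy's action probabilities."""
    P_pi, R_pi = {}, {}
    for s in P:
        # Expected one-step reward in s under the policy.
        R_pi[s] = sum(pi[s][a] * R[s][a] for a in P[s])
        # State-to-state transition probabilities under the policy.
        row = {}
        for a in P[s]:
            for prob, s2 in P[s][a]:
                row[s2] = row.get(s2, 0.0) + pi[s][a] * prob
        P_pi[s] = row
    return P_pi, R_pi


P_pi, R_pi = induced_mrp(P, R, pi)
print(P_pi)
print(R_pi)
```

Each row of P_pi sums to 1, so the result is a valid transition matrix with no actions left in it.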

12 Compare

13 How to Control?

14 Policy Search

15

16 State-Action Value Q

17 Policy Iteration

18

19

20

21 In the worst case, policy iteration takes at most |A|^|S| iterations* (the number of deterministic policies)
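
To make the bound concrete, here is a minimal policy iteration sketch that counts its own iterations and compares the count with |A|^|S|. The toy MDP, the discount factor of 0.9, and the helpers q_value and policy_iteration are assumptions for illustration, not the presenter's implementation.

```python
# Hypothetical toy MDP: P[s][a] lists (probability, next_state); R[s][a] is the reward.
P = {
    "s0": {"stay": [(1.0, "s0")], "go": [(1.0, "s1")]},
    "s1": {"stay": [(1.0, "s1")], "go": [(1.0, "s0")]},
}
R = {
    "s0": {"stay": 0.0, "go": 0.0},
    "s1": {"stay": 1.0, "go": 0.0},
}


def q_value(P, R, V, s, a, gamma):
    """One-step lookahead: reward for (s, a) plus discounted value of the next state."""
    return R[s][a] + gamma * sum(p * V[s2] for p, s2 in P[s][a])


def policy_iteration(P, R, gamma=0.9, tol=1e-10):
    policy = {s: next(iter(P[s])) for s in P}  # arbitrary initial policy
    iterations = 0
    while True:
        iterations += 1
        # Policy evaluation: iterate the Bellman expectation backup to convergence.
        V = {s: 0.0 for s in P}
        while True:
            delta = 0.0
            for s in P:
                v_new = q_value(P, R, V, s, policy[s], gamma)
                delta = max(delta, abs(v_new - V[s]))
                V[s] = v_new
            if delta < tol:
                break
        # Policy improvement: switch only on a strict gain, so ties cannot cause cycling.
        stable = True
        for s in P:
            best = max(P[s], key=lambda a: q_value(P, R, V, s, a, gamma))
            if q_value(P, R, V, s, best, gamma) > q_value(P, R, V, s, policy[s], gamma) + 1e-9:
                policy[s] = best
                stable = False
        if stable:
            return policy, V, iterations


policy, V, iterations = policy_iteration(P, R)
num_actions = len(next(iter(P.values())))
bound = num_actions ** len(P)  # |A|^|S| deterministic policies
print(policy, iterations, bound)
```

The bound holds because each improvement step either yields a strictly better deterministic policy or stops, so no policy can be visited twice.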

22 Value Iteration
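
As a rough illustration of value iteration, the sketch below applies the Bellman optimality backup (a max over actions) until the values stop changing, then reads off a greedy policy. The toy MDP, the discount factor of 0.9 and the function value_iteration are hypothetical and chosen only to keep the example small.

```python
# Hypothetical toy MDP: P[s][a] lists (probability, next_state); R[s][a] is the reward.
P = {
    "s0": {"stay": [(1.0, "s0")], "go": [(1.0, "s1")]},
    "s1": {"stay": [(1.0, "s1")], "go": [(1.0, "s0")]},
}
R = {
    "s0": {"stay": 0.0, "go": 0.0},
    "s1": {"stay": 1.0, "go": 0.0},
}


def backup(P, R, V, s, a, gamma):
    """Reward for (s, a) plus the discounted value of the next state."""
    return R[s][a] + gamma * sum(p * V[s2] for p, s2 in P[s][a])


def value_iteration(P, R, gamma=0.9, tol=1e-10):
    V = {s: 0.0 for s in P}
    while True:
        delta = 0.0
        for s in P:
            # Bellman optimality backup: take the best action's one-step lookahead.
            v_new = max(backup(P, R, V, s, a, gamma) for a in P[s])
            delta = max(delta, abs(v_new - V[s]))
            V[s] = v_new
        if delta < tol:
            break
    # Extract a policy that is greedy with respect to the converged values.
    greedy = {s: max(P[s], key=lambda a: backup(P, R, V, s, a, gamma)) for s in P}
    return V, greedy


V_star, greedy = value_iteration(P, R)
print(V_star, greedy)
```

Unlike policy iteration, this loop never stores an explicit policy while it runs; the policy is recovered only at the end.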

23

