Download presentation

Presentation is loading. Please wait.

Published byJadyn Ravenscroft Modified over 3 years ago

1
A pre-Weekend Talk on Online Learning TGIF Talk Series Purushottam Kar

2
Outline Some Motivating Examples Discovering customer preferences Learning investment profiles Detecting credit card fraud The Formal Online Learning Framework Notion of regret Formalization of motivating examples Simple Online Algorithms Online classification, regression Online ranking Batch solvers for large scale learning problems Other Feedback-based Learning Frameworks 2

3
Some Motivating Examples Why Online Learning can be Useful 3

4
The Cooks Dilemma 4

5
Discovering Customer Preferences 5 Loss 2 1 1 0 2

6
Learning Investment Profiles 6

7
Detecting Credit Card Fraud 7

8
The Formal Online Learning Framework How we assess Online Learning Algorithms 8

9
The Online Learning Model 9

10
Motivating Examples Revisited 10

11
Motivating Examples Revisited 11

12
Simple Online Algorithms What makes online learning click ? 12

13
Online Linear Classification 13

14
The Perceptron Algorithm in action 14

15
Online Regression 15

16
Online Regression via Online Gradient Descent 16

17
Online Bipartite Ranking 17

18
Batch Solvers 18

19
Batch Solvers 19

20
Other Feedback based Learning Frameworks Two axes of variation: modelling of environment and feedback Online Learning: some modelling of environment and full feedback Losses are simple functions over linear models (can be made non linear though) At each step the loss function itself is given to us: full information Models are agnostic: no realizability assumptions are made Multi-armed Bandits: weak modelling of environment, weak feedback Often no assumptions made on nature of loss function Limited feedback: only loss value on played action made available Contextual bandits try to model loss function but make realizability assumptions Reinforcement Learning: Strong modelling of environment, weak feedback Environment modelled as a state space with adaptive stochastic transitions Reward functions modeled as functions of state space and action Limited feedback available: need to learn, state space as well as reward function 20

Similar presentations

OK

MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.

MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.

© 2017 SlidePlayer.com Inc.

All rights reserved.

Ads by Google

Ppt on shell scripting expect Ppt on bridge construction in india Ppt on international business law Ppt on national education day essay Ppt on conceptual art books Ppt on group development theories Free ppt on save girl child Ppt on chapter 12 electricity bill Ppt on layer 3 switching vs layer Ppt on abo blood grouping system