Outline Classification Linear classifiers Perceptron Multi-class classification Generative approach Naïve Bayes classifier 2.

Slides:



Advertisements
Similar presentations
Classification Classification Examples
Advertisements

Unsupervised Learning Clustering K-Means. Recall: Key Components of Intelligent Agents Representation Language: Graph, Bayes Nets, Linear functions Inference.
Pattern Recognition and Machine Learning
Data Mining Classification: Alternative Techniques
Support Vector Machines
Support Vector Machines and Margins
Supervised Learning Recap
Indian Statistical Institute Kolkata
Lecture 13 – Perceptrons Machine Learning March 16, 2010.
Middle Term Exam 03/01 (Thursday), take home, turn in at noon time of 03/02 (Friday)
Chapter 4: Linear Models for Classification
CS Perceptrons1. 2 Basic Neuron CS Perceptrons3 Expanded Neuron.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
x – independent variable (input)
Prénom Nom Document Analysis: Linear Discrimination Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Lecture 10: Support Vector Machines
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
Linear Discriminant Functions Chapter 5 (Duda et al.)
Statistical Learning: Pattern Classification, Prediction, and Control Peter Bartlett August 2002, UC Berkeley CIS.
Review Rong Jin. Comparison of Different Classification Models  The goal of all classifiers Predicating class label y for an input x Estimate p(y|x)
Machine learning Image source:
Collaborative Filtering Matrix Factorization Approach
Machine learning Image source:
Final review LING572 Fei Xia Week 10: 03/11/
Data mining and machine learning A brief introduction.
DATA MINING LECTURE 10 Classification k-nearest neighbor classifier Naïve Bayes Logistic Regression Support Vector Machines.
Machine Learning Lecture 11 Summary G53MLE | Machine Learning | Dr Guoping Qiu1.
CSE 446 Perceptron Learning Winter 2012 Dan Weld Some slides from Carlos Guestrin, Luke Zettlemoyer.
Overview of Supervised Learning Overview of Supervised Learning2 Outline Linear Regression and Nearest Neighbors method Statistical Decision.
CS Statistical Machine learning Lecture 10 Yuan (Alan) Qi Purdue CS Sept
LARGE MARGIN CLASSIFIERS David Kauchak CS 451 – Fall 2013.
Ch 4. Linear Models for Classification (1/2) Pattern Recognition and Machine Learning, C. M. Bishop, Summarized and revised by Hee-Woong Lim.
Text Classification 2 David Kauchak cs459 Fall 2012 adapted from:
Non-Bayes classifiers. Linear discriminants, neural networks.
Linear Classification with Perceptrons
Linear Models for Classification
Classification (slides adapted from Rob Schapire) Eran Segal Weizmann Institute.
Logistic Regression (Classification Algorithm)
METU Informatics Institute Min720 Pattern Classification with Bio-Medical Applications Part 7: Linear and Generalized Discriminant Functions.
Regress-itation Feb. 5, Outline Linear regression – Regression: predicting a continuous value Logistic regression – Classification: predicting a.
Logistic Regression William Cohen.
Classification Course web page: vision.cis.udel.edu/~cv May 14, 2003  Lecture 34.
Information Retrieval and Organisation Chapter 14 Vector Space Classification Dell Zhang Birkbeck, University of London.
DATA MINING LECTURE 10b Classification k-nearest neighbor classifier
CSE 446 Logistic Regression Perceptron Learning Winter 2012 Dan Weld Some slides from Carlos Guestrin, Luke Zettlemoyer.
METU Informatics Institute Min720 Pattern Classification with Bio-Medical Applications Part 9: Review.
CS 2750: Machine Learning Linear Models for Classification Prof. Adriana Kovashka University of Pittsburgh February 15, 2016.
Giansalvo EXIN Cirrincione unit #4 Single-layer networks They directly compute linear discriminant functions using the TS without need of determining.
Support Vector Machines Optimization objective Machine Learning.
Computer Vision Lecture 7 Classifiers. Computer Vision, Lecture 6 Oleh Tretiak © 2005Slide 1 This Lecture Bayesian decision theory (22.1, 22.2) –General.
Learning by Loss Minimization. Machine learning: Learn a Function from Examples Function: Examples: – Supervised: – Unsupervised: – Semisuprvised:
Linear Discriminant Functions Chapter 5 (Duda et al.) CS479/679 Pattern Recognition Dr. George Bebis.
Linear machines márc Decison surfaces We focus now on the decision surfaces Linear machines = linear decision surface Non-optimal solution but.
Machine Learning – Classification David Fenyő
Large Margin classifiers
MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.
Perceptrons Lirong Xia.
Announcements HW4 due today (11:59pm) HW5 out today (due 11/17 11:59pm)
CS 4/527: Artificial Intelligence
Classification Discriminant Analysis
Linear machines 28/02/2017.
Collaborative Filtering Matrix Factorization Approach
دسته بندی با استفاده از مدل های خطی
Lecture 08: Soft-margin SVM
Pattern Recognition and Machine Learning
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Machine Learning Perceptron: Linearly Separable Supervised Learning
MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.
Perceptrons Lirong Xia.
Outline Announcement Neural networks Perceptrons - continued
Presentation transcript:

Outline Classification Linear classifiers Perceptron Multi-class classification Generative approach Naïve Bayes classifier 2

Classification: Oranges and Lemons 3

4

Classification problem 5

Linear classifiers 6

7

Decision boundary 8

Linear Decision boundary (Perceptron) 9

Linear Decision boundary (Decision Tree) 10 t1t3 t2 Income

Linear Decision boundary (K Nearest Neighbor) 11 O O O x x x Feature 1 Feature 2

Non-Linear Decision boundary 12 Decision Boundary Decision Region 1 Decision Region 2

Decision boundary Linear classifier 13

Non-linear decision boundary Choose non-linear features Classifier still linear in parameters 14

Linear boundary: geometry 15

SSE cost function for classification SSE cost function is not suitable for classification Sum of Squared Errors loss penalizes “too correct” predictions SSE also lack robustness to noise 16

SSE cost function for classification 17

Perceptron algorithm 18

Perceptron criterion 19

Batch gradient for descent Perceptron “Gradient Descent” to solve the optimization problem Batch Perceptron converges in finite number of steps for linearly separable data 20

Stochastic gradient descent for Perceptron 21

Convergence of Perceptron 22

Convergence of Perceptron 23

Multi-class classification 24

Multi-class classification One-vs-all (one-vs-rest) 25

Multi-class classification One-vs-one 26

Multi-class classification: ambiguity regions in which the classification is undefined Converting the multi-class problem to a set of two- class problems can lead to regions in which the classification is undefined 27

Probabilistic approach Bayes’ theorem 28

Bayes’ theorem 29

Bayes decision theory 30

Probabilistic classifiers Probabilistic classification approaches can be divided in two main categories Generative Discriminative 31

Discriminative vs. generative approach 32

Generative approach 33

Discriminative approach 34

Naïve Bayes classifier 35

Naïve Bayes classifier 36

Naïve Bayes: discrete example 37

38