Presentation is loading. Please wait.

Presentation is loading. Please wait.

Lecture 6 K-Nearest Neighbor Classifier

Similar presentations


Presentation on theme: "Lecture 6 K-Nearest Neighbor Classifier"— Presentation transcript:

1 Lecture 6 K-Nearest Neighbor Classifier
Machine Learning Lecture 6 K-Nearest Neighbor Classifier G53MLE Machine Learning Dr Guoping Qiu

2 Objects, Feature Vectors, Points
x1 x2 Elliptical blobs (objects) 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 X(15) X(1) X(7) X(16) X(3) X(8) X(25) X(12) X(13) X(6) X(9) X(10) X(4) X(11) X(14)

3 G53MLE Machine Learning Dr Guoping Qiu
Nearest Neighbours x1 x2 X(j)=(x1(j), x2(j), …,xn(j)) X(i)=(x1(i), x2(i), …,xn(i)) G53MLE Machine Learning Dr Guoping Qiu

4 Nearest Neighbour Algorithm
Given training data (X(1),D(1)), (X(2),D(2)), …, (X(N),D(N)) Define a distance metric between points in inputs space. Common measures are: Euclidean Distance G53MLE Machine Learning Dr Guoping Qiu

5 K-Nearest Neighbour Model
Given test point X Find the K nearest training inputs to X Denote these points as (X(1),D(1)), (X(2), D(2)), …, (X(k), D(k)) x G53MLE Machine Learning Dr Guoping Qiu

6 K-Nearest Neighbour Model
Classification The class identification of X Y = most common class in set {D(1), D(2), …, D(k)} x x G53MLE Machine Learning Dr Guoping Qiu

7 K-Nearest Neighbour Model
Example : Classify whether a customer will respond to a survey question using a 3-Nearest Neighbor classifier Customer Age Income No. credit cards Response John 35 35K 3 No Rachel 22 50K 2 Yes Hannah 63 200K 1 Tom 59 170K Nellie 25 40K 4 David 37 ? G53MLE Machine Learning Dr Guoping Qiu

8 K-Nearest Neighbour Model
Example : 3-Nearest Neighbors 15.16 15 152.23 122 15.74 G53MLE Machine Learning Dr Guoping Qiu

9 K-Nearest Neighbour Model
Example : 3-Nearest Neighbors 15.16 15 152.23 122 15.74 Three nearest ones to David are: No, Yes, Yes G53MLE Machine Learning Dr Guoping Qiu

10 K-Nearest Neighbour Model
Example : 3-Nearest Neighbors 15.16 15 152.23 122 Yes 15.74 Three nearest ones to David are: No, Yes, Yes G53MLE Machine Learning Dr Guoping Qiu

11 K-Nearest Neighbour Model
Picking K Use N fold cross validation – Pick K to minimize the cross validation error For each of N training example Find its K nearest neighbours Make a classification based on these K neighbours Calculate classification error Output average error over all examples Use the K that gives lowest average error over the N training examples G53MLE Machine Learning Dr Guoping Qiu

12 K-Nearest Neighbour Model
Example: For the example we saw earlier, pick the best K from the set {1, 2, 3} to build a K-NN classifier G53MLE Machine Learning Dr Guoping Qiu

13 G53MLE Machine Learning Dr Guoping Qiu
Further Readings T. M. Mitchell, Machine Learning, McGraw-Hill International Edition, 1997 Chapter 8 G53MLE Machine Learning Dr Guoping Qiu

14 Tutorial/Exercise Questions
K nearest neighbor classifier has to store all training data creating high requirement on storage. Can you think of ways to reduce the storage requirement without affecting the performance? (hint: search the Internet, you will find many approximation methods). G53MLE Machine Learning Dr Guoping Qiu


Download ppt "Lecture 6 K-Nearest Neighbor Classifier"

Similar presentations


Ads by Google