
1 Bayes Classifier, Linear Regression
10701/15781 Recitation, January 29, 2008
Parts of the slides are from previous years' recitation and lecture notes, and from Prof. Andrew Moore's data mining tutorials.

2 Classification and Regression
- Classification. Goal: learn the underlying function f: X (features) → Y (class, or category), e.g. words → "spam" or "not spam".
- Regression: f: X (features) → Y (continuous values), e.g. GPA → salary.

3 Supervised Classification
- How to find an unknown function f: X → Y (features → class), or equivalently P(Y|X)?
- Two kinds of classifier:
  1. Find P(X|Y) and P(Y), and use Bayes rule - generative
  2. Find P(Y|X) directly - discriminative

4 Classification
Learn P(Y|X):
1. Bayes rule: P(Y|X) = P(X|Y)P(Y) / P(X) ∝ P(X|Y)P(Y). Learn P(X|Y) and P(Y) → a "generative" classifier.
2. Learn P(Y|X) directly → a "discriminative" classifier (to be covered later in class), e.g. logistic regression.
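To see the generative recipe with made-up numbers (the probabilities below are purely illustrative, not from the slides): suppose P(Y = spam) = 0.3, P("Rolex" appears | spam) = 0.2, and P("Rolex" appears | not spam) = 0.01. For an email containing "Rolex", P(spam | X) ∝ 0.2 × 0.3 = 0.06, while P(not spam | X) ∝ 0.01 × 0.7 = 0.007, so we predict spam. The common denominator P(X) cancels when comparing classes, which is why the proportionality in step 1 suffices.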

5 Generative Classifier: Bayes Classifier
Learn P(X|Y), P(Y). E.g. an email classification problem:
- 3 classes for Y = {spam, not spam, maybe}
- 10,000 binary features for X = {"Cash", "Rolex", …}
How many parameters do we have?
- P(Y): 3 − 1 = 2 (the three class probabilities must sum to 1)
- P(X|Y): 3 × (2^10000 − 1) — one probability per joint configuration of the 10,000 binary features, for each class, minus one per class for normalization

6 Generative learning: Naïve Bayes
Introduce conditional independence:
P(X1, X2 | Y) = P(X1 | Y) P(X2 | Y)
For X = (X1, …, Xn):
P(Y|X) = P(X|Y) P(Y) / P(X)
       = P(X1|Y) … P(Xn|Y) P(Y) / P(X)
       = [ prod_i P(Xi|Y) ] P(Y) / P(X)
Learn P(X1|Y), …, P(Xn|Y), and P(Y), instead of learning P(X1, …, Xn | Y) directly.
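A minimal sketch of how such a classifier could be implemented for binary features (illustrative code, not the recitation's; the function names and add-alpha smoothing are my own choices):

  import numpy as np

  def fit(X, y, n_classes, alpha=1.0):
      # X: (n, d) binary feature matrix, y: (n,) integer class labels
      n, d = X.shape
      counts = np.array([(y == c).sum() for c in range(n_classes)])
      prior = counts / n                          # MLE of P(Y)
      # cond[c, i] = P(X_i = 1 | Y = c), with add-alpha smoothing
      cond = np.array([(X[y == c].sum(axis=0) + alpha) / (counts[c] + 2 * alpha)
                       for c in range(n_classes)])
      return prior, cond

  def predict(X, prior, cond):
      # work in log space to avoid underflow with many features
      log_lik = X @ np.log(cond).T + (1 - X) @ np.log(1 - cond).T
      return np.argmax(np.log(prior) + log_lik, axis=1)

Usage: prior, cond = fit(X_train, y_train, n_classes=3), then predict(X_test, prior, cond). Each class needs only d conditional probabilities, matching the parameter count on the next slide.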

7 Naïve Bayes
- 3 classes for Y = {spam, not spam, maybe}
- 10,000 binary features for X = {"Cash", "Rolex", …}
Now, how many parameters?
- P(Y): still 2
- P(X|Y): 3 × 10,000 = 30,000 — one P(Xi = 1 | Y = y) per feature, per class
Far fewer parameters: the model is "simpler", and hence less likely to overfit.

8 Full Bayes vs. Naïve Bayes
XOR:
X1  X2  Y
 1   0  1
 0   1  1
 1   1  0
 0   0  0
P(Y=1 | (X1,X2) = (0,1)) = ?
- Full Bayes: P(Y=1) = ?  P((X1,X2) = (0,1) | Y=1) = ?
- Naïve Bayes: P(Y=1) = ?  P((X1,X2) = (0,1) | Y=1) = ?
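Filling in the blanks with MLE counts from the table: P(Y=1) = 2/4 = 1/2 for both classifiers. Full Bayes stores the joint directly: P((X1,X2) = (0,1) | Y=1) = 1/2, and since P((X1,X2) = (0,1)) = 1/4, Bayes rule gives P(Y=1 | (0,1)) = (1/2 × 1/2) / (1/4) = 1, the correct answer. Naïve Bayes factorizes: P(X1=0 | Y=1) = P(X2=1 | Y=1) = 1/2, so P((0,1) | Y=1) = 1/4; by symmetry P((0,1) | Y=0) = 1/4 as well, so P(Y=1 | (0,1)) = 1/2. The conditional-independence assumption discards exactly the correlation that defines XOR, reducing naïve Bayes to chance.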

9 Regression
- Prediction of continuous variables, e.g. I want to predict salaries from GPA. I can regress that…
- Learn the mapping f: X → Y
- Model is linear in the parameters (plus some noise) → linear regression
- Assume Gaussian noise; learn the MLE Θ

10 1-parameter linear regression
Normal linear regression: yi = θ xi + εi, with εi ~ N(0, σ²); or equivalently, yi | xi ~ N(θ xi, σ²).
- MLE θ?
- MLE σ²?
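Working these out for the through-the-origin model written above: maximizing the log-likelihood in θ is equivalent to minimizing the squared error Σi (yi − θ xi)². Setting the derivative to zero gives

  θ̂_MLE = (Σi xi yi) / (Σi xi²)

and substituting θ̂ back in and maximizing over σ² gives

  σ̂²_MLE = (1/n) Σi (yi − θ̂ xi)²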

11 Multivariate linear regression
- What if the inputs are vectors? Write the matrix X and vector Y (n data points, k features per data point).
- MLE Θ = (XᵀX)⁻¹ XᵀY (the normal equations)
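A quick sketch in NumPy (synthetic data; the dimensions and the true parameter vector are made up for illustration):

  import numpy as np

  rng = np.random.default_rng(0)
  n, k = 100, 3
  X = rng.normal(size=(n, k))                         # n data points, k features
  theta_true = np.array([1.5, -2.0, 0.5])
  Y = X @ theta_true + rng.normal(scale=0.1, size=n)  # linear model + Gaussian noise

  # normal equations: solve (X^T X) Theta = X^T Y
  # (numerically preferable to forming the explicit inverse)
  theta_hat = np.linalg.solve(X.T @ X, X.T @ Y)
  print(theta_hat)  # close to theta_true

Solving the linear system rather than inverting XᵀX is the standard numerically stable choice.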

12 Constant term?
- We may expect linear data that does not go through the origin.
- Trick: augment each input with a constant feature that is always 1; its learned weight becomes the intercept.

13 The constant term
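A minimal sketch of the constant-term trick (the toy data below is my own, chosen so the true line is roughly y = 2x + 1):

  import numpy as np

  X = np.array([[2.0], [3.0], [5.0]])            # a single feature
  Y = np.array([5.1, 7.0, 11.2])                 # nonzero intercept
  ones = np.ones((X.shape[0], 1))
  X_aug = np.hstack([ones, X])                   # constant feature in the first column
  theta = np.linalg.solve(X_aug.T @ X_aug, X_aug.T @ Y)
  print(theta)                                   # [intercept, slope] ≈ [1, 2]

The augmented design matrix is then used exactly as on the previous slide; no change to the estimator itself is needed.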

14 Regression: another example
- Assume the following model to fit the data. The model has one unknown parameter θ to be learned from the data.
- What is the maximum likelihood estimate of θ?

