Support Vector Machines Elena Mokshyna Odessa 2014.

Slides:



Advertisements
Similar presentations
Introduction to Support Vector Machines (SVM)
Advertisements

Support Vector Machines
Lecture 9 Support Vector Machines
ECG Signal processing (2)
SVM - Support Vector Machines A new classification method for both linear and nonlinear data It uses a nonlinear mapping to transform the original training.
An Introduction of Support Vector Machine
Classification / Regression Support Vector Machines
Linear Classifiers/SVMs

CHAPTER 10: Linear Discrimination
Pattern Recognition and Machine Learning
An Introduction of Support Vector Machine
Support Vector Machines and Kernels Adapted from slides by Tim Oates Cognition, Robotics, and Learning (CORAL) Lab University of Maryland Baltimore County.
SVM—Support Vector Machines
Support vector machine
Search Engines Information Retrieval in Practice All slides ©Addison Wesley, 2008.
Machine learning continued Image source:
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Fuzzy Support Vector Machines (FSVMs) Weijia Wang, Huanren Zhang, Vijendra Purohit, Aditi Gupta.
Support Vector Machines (SVMs) Chapter 5 (Duda et al.)
University of Texas at Austin Machine Learning Group Department of Computer Sciences University of Texas at Austin Support Vector Machines.
Rutgers CS440, Fall 2003 Support vector machines Reading: Ch. 20, Sec. 6, AIMA 2 nd Ed.
Support Vector Machines
Lecture outline Support vector machines. Support Vector Machines Find a linear hyperplane (decision boundary) that will separate the data.
What is Learning All about ?  Get knowledge of by study, experience, or being taught  Become aware by information or from observation  Commit to memory.
Lecture 10: Support Vector Machines
Computational Learning Theory
Ch. Eick: Support Vector Machines: The Main Ideas Reading Material Support Vector Machines: 1.Textbook 2. First 3 columns of Smola/Schönkopf article on.
Support Vector Machines Piyush Kumar. Perceptrons revisited Class 1 : (+1) Class 2 : (-1) Is this unique?
July 11, 2001Daniel Whiteson Support Vector Machines: Get more Higgs out of your data Daniel Whiteson UC Berkeley.
Linear hyperplanes as classifiers Usman Roshan. Hyperplane separators.
Support Vector Machine & Image Classification Applications
CS 8751 ML & KDDSupport Vector Machines1 Support Vector Machines (SVMs) Learning mechanism based on linear programming Chooses a separating plane based.
1 SUPPORT VECTOR MACHINES İsmail GÜNEŞ. 2 What is SVM? A new generation learning system. A new generation learning system. Based on recent advances in.
Machine Learning Using Support Vector Machines (Paper Review) Presented to: Prof. Dr. Mohamed Batouche Prepared By: Asma B. Al-Saleh Amani A. Al-Ajlan.
Kernel Methods A B M Shawkat Ali 1 2 Data Mining ¤ DM or KDD (Knowledge Discovery in Databases) Extracting previously unknown, valid, and actionable.
SVM Support Vector Machines Presented by: Anas Assiri Supervisor Prof. Dr. Mohamed Batouche.
Support Vector Machines Reading: Ben-Hur and Weston, “A User’s Guide to Support Vector Machines” (linked from class web page)
Classifiers Given a feature representation for images, how do we learn a model for distinguishing features from different classes? Zebra Non-zebra Decision.
CS 478 – Tools for Machine Learning and Data Mining SVM.
RSVM: Reduced Support Vector Machines Y.-J. Lee & O. L. Mangasarian First SIAM International Conference on Data Mining Chicago, April 6, 2001 University.
Ohad Hageby IDC Support Vector Machines & Kernel Machines IP Seminar 2008 IDC Herzliya.
컴퓨터 과학부 김명재.  Introduction  Data Preprocessing  Model Selection  Experiments.
Linear hyperplanes as classifiers Usman Roshan. Hyperplane separators.
Biointelligence Laboratory, Seoul National University
Linear Models for Classification
University of Texas at Austin Machine Learning Group Department of Computer Sciences University of Texas at Austin Support Vector Machines.
CS558 Project Local SVM Classification based on triangulation (on the plane) Glenn Fung.
Support vector machine LING 572 Fei Xia Week 8: 2/23/2010 TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A 1.
CSE4334/5334 DATA MINING CSE4334/5334 Data Mining, Fall 2014 Department of Computer Science and Engineering, University of Texas at Arlington Chengkai.
Support Vector Machines. Notation Assume a binary classification problem. –Instances are represented by vector x   n. –Training examples: x = (x 1,
Final Exam Review CS479/679 Pattern Recognition Dr. George Bebis 1.
Feature Selction for SVMs J. Weston et al., NIPS 2000 오장민 (2000/01/04) Second reference : Mark A. Holl, Correlation-based Feature Selection for Machine.
Support Vector Machines Reading: Ben-Hur and Weston, “A User’s Guide to Support Vector Machines” (linked from class web page)
Greg GrudicIntro AI1 Support Vector Machine (SVM) Classification Greg Grudic.
SVMs in a Nutshell.
Support Vector Machine (SVM) Presented by Robert Chen.
Support Vector Machines Optimization objective Machine Learning.
A Brief Introduction to Support Vector Machine (SVM) Most slides were from Prof. A. W. Moore, School of Computer Science, Carnegie Mellon University.
Support Vector Machines Reading: Textbook, Chapter 5 Ben-Hur and Weston, A User’s Guide to Support Vector Machines (linked from class web page)
Support Vector Machines (SVMs) Chapter 5 (Duda et al.) CS479/679 Pattern Recognition Dr. George Bebis.
PREDICT 422: Practical Machine Learning
Support Vector Machine
Support Vector Machines
An Introduction to Support Vector Machines
Support Vector Machines Introduction to Data Mining, 2nd Edition by
CSSE463: Image Recognition Day 14
Support Vector Machines and Kernels
COSC 4368 Machine Learning Organization
Linear Discrimination
Presentation transcript:

Support Vector Machines Elena Mokshyna Odessa 2014

Plan What’s SVM – Intuition & Math behind Kernel trick, huh? SVM for regression Recommendations Some kernels Different R libraries Advantages & Disadvantages 2

A bit of history Support vector machine classifiers have a long history of development starting from the 1960’s. The most important milestone for development of modern SVMs is the 1992 paper by Boser, Guyon, and Vapnik(“A training algorithm for optimal margin classifiers”) 3

What’s SVM - intuition 4 x2x2 x1x1

LIKE and NOT LIKE logistic regression 5 Logistic regression: Support vector machine: If (want ):

Sigmoid function.. again If, we want, 6 Cost of example:

If, we want (not just ) Support Vector Machine 1 1 NOT LIKE logistic regression 7 C =

8

So how does it look like? 9 x2x2 Find a classifier (hyperplane) An infinite number of such hyperplanes exist. Find the hyperplane that maximizes the gap between data points on the boundaries (so- called “support vectors”). margin In general – LARGE MARGIN CLASSIFIERS

“Soft-margin” linear SVM Use so-called slack variable ξ Subject to: 10 Outside the margin or on the margin Inside the margin correctly classified Inside the margin misclassified

SVM: introducing nonlinearity x2x2 11 Need to generate new features!

Kernel trick, huh? There is no straight line (hyper plane in 2 dimensions) which can separate red and blue dots. Generate more features! Inseparable data! T________T What to do?  12

Kernel trick, huh? Simple as it is Inseparable data project all points up to a two dimensional space using the mapping Voila! We can indeed find a hyper plane to separate data with SVM. The mapping in this case is called KERNEL FUNCTION 13

Example of kernel trick 14

Kernel trick in practice x2x2 15 f is approx 1 f is approx 0

SVM with radial basis kernels: sigma 16

Support Vector Regression Do kernel trick Build linear model using parameter ε - precision 17

Recommendations Small : Features vary less smoothly. Lower bias, higher variance. C ( ). Large C: Lower bias, high variance. Small C: Higher bias, low variance. Large : Features vary more smoothly. Higher bias, lower variance. 18 Small λ Large λ

Kernels 19

R: SVM implemetations Five packages that implement SVM in R: e1071 kernlab klaR svmpath shogun 20

Libraries’ comparison 21 Alexandros Karatzoglou. Support Vector Machines in R// http://

SVM advantages Flexibility in the choice of the form of the threshold Robustness towards small number of data points SVMs deliver a unique solution You can do almost anything with the right kernel (tricky!) 22

SVM disadvantages If the points on the boundaries are not informative (e.g., due to noise), SVMs will not do well. Can be computationally expensive 23 "However, from a practical point of view perhaps the most serious problem with SVMs is the high algorithmic complexity and extensive memory requirements.." Horváth (2003) in Suykens et al. p 392 "Besides the advantages of SVMs - from a practical point of view - they have some drawbacks. An important practical question that is not entirely solved, is the selection of the kernel function parameters…" Horváth (2003) in Suykens et al.

What we missed here (for our own good & sake of time) Relevance vector machines (RVM) Quadratic programming Loss functions Different types of kernels (loads of them!) One-class SVM SVM-based variable selection SVM usage to calculate posterior class probabilities All the differences in library implementations 24

References !!! Cousera’s Machine Learning with Andrew Ng: SVM without Tears: -content/Final.pdf -content/Final.pdf Quora: Learning/What-are-Kernels-in-Machine-Learning- and-SVMhttp:// Learning/What-are-Kernels-in-Machine-Learning- and-SVM explain-what-a-kernel-is-in-plain-english explain-what-a-kernel-is-in-plain-english 25

Thank you for the attention! 26

27