Presentation is loading. Please wait.

Presentation is loading. Please wait.

Multi-class SVM with Negative Data Selection for Web Page Classification Chih-Ming Chen, Hahn-Ming Lee and Ming-Tyan Kao International Joint Conference.

Similar presentations


Presentation on theme: "Multi-class SVM with Negative Data Selection for Web Page Classification Chih-Ming Chen, Hahn-Ming Lee and Ming-Tyan Kao International Joint Conference."— Presentation transcript:

1 Multi-class SVM with Negative Data Selection for Web Page Classification Chih-Ming Chen, Hahn-Ming Lee and Ming-Tyan Kao International Joint Conference on Neural Networks 2004

2 Motivation Several new websites are launched everyday Need to search fast and efficiently Search engines organize websites under topic hierarchy (taxonomy) Need a classifier: one-against-all SVM Catch: huge negative data  increased training time

3 Negative Data Selection Support vectors in the negative data are much similar to the positive data than the other negative data

4 Negative Data Selection 1.Feature Selection: top n keywords from the positive data 2.All websites are represented as vectors of these top n keywords. 3.Cosine Similarity:

5 Negative Data Selection Plot similarity scores of negative to positive documents in descending order with negative documents Similarity Scores in Descending order Negative Documents Convergence Point

6 Experiments Reuters dataset (10802 training, 565 test) ClassNumber of Positive Data Number of Negative Data Crude58010222 Trade47510327 Dlr16210640 Nat-gas9210710 Acq23578445

7 Experiments

8


Download ppt "Multi-class SVM with Negative Data Selection for Web Page Classification Chih-Ming Chen, Hahn-Ming Lee and Ming-Tyan Kao International Joint Conference."

Similar presentations


Ads by Google