Download presentation

Presentation is loading. Please wait.

Published byWillie Poulton Modified about 1 year ago

1
A Vector Space Model for Automatic Indexing G. Salton, A. Wong and C. S. Yang Enhanced Vector Space Models for Content-based Recommender Systems Cataldo Musto Presenter Sawood Alam

2
A Vector Space Model for Automatic Indexing G. Salton, A. Wong and C. S. Yang Cornell University

3
Introduction In document retrieval, best indexing space is where each entity lies far away from others Density of the object space becomes a measure of indexing system Retrieval performance correlate inversely with space density

4
Document Space Di = (d i1, d i2, d i3, …, d ij )

5
Document Space (cont.)

6

7
Indexing Performance vs. Space Density

8
Cluster Density vs. Indexing Performance

9
Discrimination Value Model

10
Discrimination Value Model (cont.)

11
Discrimination Value Model Summary

12
Average Recall vs. Precision

13
Summary Recall vs. Precision

14
Enhanced Vector Space Models for Content-based Recommender Systems Cataldo Musto Dept. of Computer Science University of Bari, Italy

15
Introduction Vector Space Models (VSM) in Information Retrieval is an established practice Investigate the impact of vector space models in Information Filtering – Recommender system

16
Problems of VSM High dimensionality – Becoming more serious due to emerging social apps and micro-blogging, generating lots of web content and new vocabulary Inability to manage document semantics – Order of the term occurrence in the document

17
Components Context vector for each term – Values in {-1, 0, 1} Vector Space representation of a term (t) Vector Space representation of a document (d) Vector Space representation of a user profile (p u )

18
Indexing Technique Random Indexing-based model Weighted Random Indexing-based model Semantic Vector-based model Weighted Semantic Vector-based model

19
Experimental Evaluation

20
Conclusions First prototype with naive weighting scheme is comparable to other content based filtering techniques like Bayesian classifier Other complex weighting schemes should perform better User profiles may be studied based on Linked Data rather than keyword based user profiles

Similar presentations

© 2017 SlidePlayer.com Inc.

All rights reserved.

Ads by Google