PROBLEM BEING ATTEMPTED Privacy -Enhancing Personalized Web Search Based on:  User's Existing Private Data Browsing History E-Mails Recent Documents 

Slides:



Advertisements
Similar presentations
Recommender Systems & Collaborative Filtering
Advertisements

Introduction to Information Retrieval
Query Chains: Learning to Rank from Implicit Feedback Paper Authors: Filip Radlinski Thorsten Joachims Presented By: Steven Carr.
Lecture 11 Search, Corpora Characteristics, & Lucene Introduction.
Introduction to Information Retrieval (Manning, Raghavan, Schutze) Chapter 6 Scoring term weighting and the vector space model.
Web Search – Summer Term 2006 VI. Web Search - Ranking (c) Wolfgang Hürst, Albert-Ludwigs-University.
Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan Susan T.Dumains Eric Horvitz MIT,CSAILMicrosoft Researcher Microsoft.
Query Operations: Automatic Local Analysis. Introduction Difficulty of formulating user queries –Insufficient knowledge of the collection –Insufficient.
6/16/20151 Recent Results in Automatic Web Resource Discovery Soumen Chakrabartiv Presentation by Cui Tao.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Re-ranking Documents Segments To Improve Access To Relevant Content in Information Retrieval Gary Madden Applied Computational Linguistics Dublin City.
Scaling Personalized Web Search Glen Jeh, Jennfier Widom Stanford University Presented by Li-Tal Mashiach Search Engine Technology course (236620) Technion.
1 Chapter 19: Information Retrieval. ©Silberschatz, Korth and Sudarshan19.2Database System Concepts - 5 th Edition, Sep 2, 2005 Chapter 19: Information.
Recommender systems Ram Akella February 23, 2011 Lecture 6b, i290 & 280I University of California at Berkeley Silicon Valley Center/SC.
Chapter 5 Searching for Truth: Locating Information on the WWW.
Chapter 19: Information Retrieval
Topic-Sensitive PageRank Taher H. Haveliwala. PageRank Importance is propagated A global ranking vector is pre-computed.
University of Kansas Department of Electrical Engineering and Computer Science Dr. Susan Gauch April 2005 I T T C Dr. Susan Gauch Personalized Search Based.
Recommender systems Ram Akella November 26 th 2008.
Information Retrieval
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Chapter 5: Information Retrieval and Web Search
1 SOCIAL BOOKMARKING 101. HIBA KHALID BILAL SAEED KHAN FARID ALIANI ASKARI HASAN SOCIAL BOOKMARKING.
Chapter 5 Searching for Truth: Locating Information on the WWW.
1 Wikification CSE 6339 (Section 002) Abhijit Tendulkar.
1 Chapter 19: Information Retrieval Chapter 19: Information Retrieval Relevance Ranking Using Terms Relevance Using Hyperlinks Synonyms., Homonyms,
Computing & Information Sciences Kansas State University Monday, 04 Dec 2006CIS 560: Database System Concepts Lecture 41 of 42 Monday, 04 December 2006.
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
WEB SEARCH PERSONALIZATION WITH ONTOLOGICAL USER PROFILES Data Mining Lab XUAN MAN.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Privacy-Enhanced Personalized Search Yabo-Arber Xu, Benyu Zhang, Zheng Chen, Ke Wang Database & Data Mining Fraser University Microsoft Research.
Weighting and Matching against Indices. Zipf’s Law In any corpus, such as the AIT, we can count how often each word occurs in the corpus as a whole =
Personalized Search Xiao Liu
Introduction to Digital Libraries hussein suleman uct cs honours 2003.
Improving Web Search Results Using Affinity Graph Benyu Zhang, Hua Li, Yi Liu, Lei Ji, Wensi Xi, Weiguo Fan, Zheng Chen, Wei-Ying Ma Microsoft Research.
Data Mining for Web Intelligence Presentation by Julia Erdman.
Wikipedia as Sense Inventory to Improve Diversity in Web Search Results Celina SantamariaJulio GonzaloJavier Artiles nlp.uned.es UNED,c/Juan del Rosal,
Personalization with user’s local data Personalizing Search via Automated Analysis of Interests and Activities 1 Sungjick Lee Department of Electrical.
© 2004 Chris Staff CSAW’04 University of Malta of 15 Expanding Query Terms in Context Chris Staff and Robert Muscat Department of.
Automatic Video Tagging using Content Redundancy Stefan Siersdorfer 1, Jose San Pedro 2, Mark Sanderson 2 1 L3S Research Center, Germany 2 University of.
A Knowledge-Based Search Engine Powered by Wikipedia David Milne, Ian H. Witten, David M. Nichols (CIKM 2007)
Web Information Retrieval Prof. Alessandro Agostini 1 Context in Web Search Steve Lawrence Speaker: Antonella Delmestri IEEE Data Engineering Bulletin.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Improving the performance of personal name disambiguation.
Bloom Cookies: Web Search Personalization without User Tracking Authors: Nitesh Mor, Oriana Riva, Suman Nath, and John Kubiatowicz Presented by Ben Summers.
1 Adaptive Subjective Triggers for Opinionated Document Retrieval (WSDM 09’) Kazuhiro Seki, Kuniaki Uehara Date: 11/02/09 Speaker: Hsu, Yu-Wen Advisor:
Cs Future Direction : Collaborative Filtering Motivating Observations:  Relevance Feedback is useful, but expensive a)Humans don’t often have time.
Augmenting (personal) IR Readings Review Evaluation Papers returned & discussed Papers and Projects checkin time.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Xiaoying Gao Computer Science Victoria University of Wellington COMP307 NLP 4 Information Retrieval.
Personalizing Web Search Jaime Teevan, MIT with Susan T. Dumais and Eric Horvitz, MSR.
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
University Of Seoul Ubiquitous Sensor Network Lab Query Dependent Pseudo-Relevance Feedback based on Wikipedia 전자전기컴퓨터공학 부 USN 연구실 G
Database System Concepts, 5th Ed. ©Sang Ho Lee Chapter 19: Information Retrieval.
Recommender Systems & Collaborative Filtering
Future Direction #3: Collaborative Filtering
Guangbing Yang Presentation for Xerox Docushare Symposium in 2011
Text Based Information Retrieval
Information Retrieval
Multimedia Information Retrieval
Information Retrieval
Author: Kazunari Sugiyama, etc. (WWW2004)
Searching for Truth: Locating Information on the WWW
Searching for Truth: Locating Information on the WWW
Searching for Truth: Locating Information on the WWW
Chapter 31: Information Retrieval
Information Retrieval and Web Design
Chapter 19: Information Retrieval
Future Direction : Collaborative Filtering
Presentation transcript:

PROBLEM BEING ATTEMPTED Privacy -Enhancing Personalized Web Search Based on:  User's Existing Private Data Browsing History s Recent Documents  All Data Stored on User's Personal Computer Classify Interests into 2 Categories  General Interests (Less Sensitive to Privacy)  Specific Interests (More Sensitive to Privacy)

PROPOSED SOLUTION Automatically build user profile from available source data  Similar Terms: Two terms that cover the document set with heavy overlaps might indicate the same interest area Use Jaccard Similarity  Parent-Child Terms: Specific terms often appear together with general terms, but opposite not true. e.g. Badminton and Sports Use Conditional Probabilities Control the information sent to the search engine  minDetail: Determines which part of the user profile is protected  expRatio: Measures how much private information is exposed to the server  Inversely Related Wrapper to personalize results  Use previously constructed profile  Parameter α that determines amount of personalization  PPRank = α PersonalRank + (1-α) SearchEngineRank

CRITICISMS Number of Returned Results Considered  Only top 50 results from the original search engine results considered, but user's preferences could project a lower result into the top 10  Measure of Search Quality  Average Precision is used; but authors fail to explain how this ties in with the quality of the personalization in the search Testing the Effect of Manual Privacy Settings  The authors don't provide experiments that test how the system would behave if users, instead of using minDetail all the time, manually excluded some terms from their profile Classification of Terms  A specific term might be classified wrongly as general if it occurs often enough in the user's corpus; but it is extremely revealing as far as privacy is concerned

RELATIONS TO COURSE TOPICS Personalized Search  Query Disambiguation: The authors use a variant of the course example; searching for Rockets, a sports fan wants the basketball team, not links related to space exploration  Relevance Feedback: This system is implicit in seeking relevance feedback through the user's browsing history and other personal information, rather than explicit. Dealing with Unstructured Data  Browsing history, s and recent documents all unstructured  New system to classify terms into either similarity or parent-child relationships based on occurrences Term Similarity  Used to classify terms; use Jaccard Similarity and Conditional Probability Weighted Re-Ranking of Results  Similar idea, but not same, as PageRank  Get the results from search engine for given query  Then re-rank the results based on user's personal profile, using weighting factor α