ON THE SELECTION OF TAGS FOR TAG CLOUDS (WSDM'11)
Advisor: Dr. Koh, Jia-Ling
Speaker: Chiang, Guang-ting
Date: 2011/06/20

Presentation transcript:

Preview
- Introduction
- What Is a Tag Cloud?
- System Model
- Metrics
- Algorithms
- User Model
- Experimental Evaluation
- Conclusions

Introduction
- Present a formal system model for reasoning about tag clouds.
- Present metrics that capture the structural properties of a tag cloud.
- Present a set of tag selection algorithms that are used in current sites (e.g., del.icio.us, Flickr).
- Devise a new integrated user model that is specifically tailored for tag cloud evaluation.
- Evaluate the algorithms under this user model, using two datasets: CourseRank (a Stanford social tool containing information about courses) and del.icio.us (a social bookmarking site).

What Is a Tag Cloud?
- A tag cloud is a visual depiction of text content.
- Tag clouds have been used in social information sharing sites such as Flickr and del.icio.us, as well as in personal and commercial web pages, blogs, and search engines.
- Tags in a tag cloud are extracted from the textual content of the results and are weighted by their frequencies.
- Tags in the cloud are hyperlinks.

Goals of a tag cloud:
- Summarize the available data so that users can better understand what is at their disposal.
- Help users discover unexpected information of interest.

System Model
[Figure: example with three objects (Object1, Object2, Object3) and their tags; tag rows as extracted: A A B / C C D / D]

Metrics
[Figure: running example with three objects (Object1, Object2, Object3); tag rows as extracted: A A A / C C B / D D; object scores as extracted: O1 0.5, values for the other objects not recoverable]

Metrics
Overlap of S: The overlap metric captures the extent of redundancy among the selected tags, that is, how far different tags in S lead to the same objects. Its value lies in [0, 1].
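The formula for overlap did not survive in this transcript, so the following is only a minimal sketch of one plausible formalization, not necessarily the paper's exact definition. It assumes that each tag has an association set (the result objects carrying that tag) and that overlap is the average, over all pairs of selected tags, of the intersection size divided by the size of the smaller set. The function name `overlap` and the sample data are hypothetical.

```python
from itertools import combinations


def overlap(assoc):
    """Average pairwise overlap of association sets (assumed definition).

    assoc maps each selected tag to the set of object ids it is attached to.
    The result lies in [0, 1]: 0 means no two tags share an object, 1 means
    every pair has one set fully contained in the other.
    """
    pairs = list(combinations(assoc.values(), 2))
    if not pairs:
        return 0.0  # fewer than two tags: no redundancy to measure
    total = 0.0
    for a, b in pairs:
        smaller = min(len(a), len(b))
        if smaller:  # skip empty sets to avoid dividing by zero
            total += len(a & b) / smaller
    return total / len(pairs)


# Hypothetical data: tag A on objects 1-3, tag C on objects 1-2, tag B on object 3.
redundant = overlap({"A": {1, 2, 3}, "C": {1, 2}})
disjoint = overlap({"B": {3}, "C": {1, 2}})
```

Here `redundant` is 1.0 because every object reachable through C is also reachable through A, while `disjoint` is 0.0 because B and C share no object.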

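Metrics
Balance of S: The balance metric takes into account the relative sizes of the association sets. Its value lies in [0, 1]; the closer it is to 1, the more balanced the association sets are.
[Figure: example with three objects (Object1, Object2, Object3); tag rows as extracted: A A A / C C B / D C]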
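The balance formula is missing from this transcript. As a hedged illustration, the sketch below assumes balance is the ratio of the smallest to the largest association set size, which matches the stated range [0, 1] and the reading that values near 1 mean evenly sized sets. The function name `balance` and the sample data are hypothetical.

```python
def balance(assoc):
    """Ratio of smallest to largest association set size (assumed definition).

    assoc maps each selected tag to the set of object ids it is attached to.
    """
    sizes = [len(objects) for objects in assoc.values()]
    if not sizes or max(sizes) == 0:
        return 0.0  # no tags, or only empty sets
    return min(sizes) / max(sizes)


# Hypothetical data: A covers three objects, C covers two.
uneven = balance({"A": {1, 2, 3}, "C": {1, 2}})
even = balance({"C": {1, 2}, "D": {2, 3}})
```

With this data `uneven` is 2/3 and `even` is 1.0.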

Algorithms
Goal: Present a set of tag selection algorithms that are used in current sites (e.g., del.icio.us, Flickr).
- A tag selection algorithm can be single-objective or multi-objective.
- A single-objective algorithm has a single goal in mind.
- A multi-objective algorithm tries to strike a balance among two or more goals.
- In this paper we focus on four single-objective algorithms that have been used in practice or that are very close to those used.

Algorithms
[Diagram: inputs are a query q and a parameter k; tags are ranked by Pop(S) and the top-k tags are returned]
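The diagram above can be sketched roughly as follows. This is an illustration under stated assumptions, not the paper's implementation: it assumes the popularity of a tag is the number of result objects for the query that carry it, and that ties are broken alphabetically. The function name `top_k_popular` and the sample results are hypothetical.

```python
from collections import Counter


def top_k_popular(results, k):
    """Return the k tags attached to the most result objects.

    results maps each object returned for the query to its collection of tags.
    Popularity is assumed to be the number of result objects carrying the tag.
    """
    counts = Counter(tag for tags in results.values() for tag in set(tags))
    # Sort by descending count, then by tag name so the output is deterministic.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [tag for tag, _ in ranked[:k]]


# Hypothetical query results.
results = {
    "Object1": {"A", "C", "D"},
    "Object2": {"A", "C", "D"},
    "Object3": {"A", "B"},
}
cloud = top_k_popular(results, 2)
```

For this data `cloud` is `["A", "C"]`: A appears on three objects, and C wins the tie with D alphabetically.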

User Model
Goal: Devise a new integrated user model that is well suited to tag cloud evaluation and assumes an "ideal" user, one who is satisfied in a very predictable and rational way.
The model is developed in stages:
- Base model: coverage
- Incorporating relevance
- Incorporating cohesiveness
- Incorporating overlap
- Proposed user model

User Model
[Figure: example with four objects (Object1, Object2, Object3, Object4); tag rows as extracted: A A A A / C C B B / D D]

User Model
DEFINITION 2. The failure probability (F) for a query q is the probability that the user will not be able to satisfy his search need using the tag cloud S.
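To make Definition 2 concrete, here is a minimal sketch for the coverage-only base case. It rests on simplifying assumptions that are not taken from the slides: the user is interested in exactly one object, the probability of each object being that one is given, and the search need is satisfied whenever some tag in S leads to it. The function name `failure_probability` and the sample data are hypothetical.

```python
def failure_probability(assoc, selected, interest):
    """Probability that no selected tag leads to the object of interest.

    assoc maps each tag to the set of objects it is attached to.
    selected is the list of tags shown in the cloud (S).
    interest maps each object to its probability of being the one the user
    wants; the values are assumed to sum to 1.
    """
    covered = set().union(*(assoc[tag] for tag in selected))
    return 1.0 - sum(p for obj, p in interest.items() if obj in covered)


# Hypothetical data with four equally likely objects.
assoc = {"A": {1, 2, 3, 4}, "B": {3, 4}, "C": {1, 2}, "D": {1, 2}}
uniform = {1: 0.25, 2: 0.25, 3: 0.25, 4: 0.25}

f_redundant = failure_probability(assoc, ["C", "D"], uniform)
f_complementary = failure_probability(assoc, ["C", "B"], uniform)
```

Under these assumptions `f_redundant` is 0.5, because C and D reach the same two objects, while `f_complementary` is 0.0, because C and B together reach all four.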

User Model
Taking scores into account:
- Pr[c]: for each object c, its probability of being of interest to the user.
- Pr[c]': the final probability, obtained by first multiplying every Pr[c] by the score s(q, c).
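The slide says only that every Pr[c] is "first" multiplied by s(q, c); the step that follows is not shown in this transcript. The sketch below assumes that the products are then renormalized so that they sum to 1 again. That renormalization, the function name `reweight`, and the sample values are all assumptions made for illustration.

```python
def reweight(pr, scores):
    """Scale each Pr[c] by s(q, c), then renormalize (assumed second step).

    pr maps each object c to Pr[c].
    scores maps each object c to its score s(q, c); missing objects score 0.
    """
    weighted = {c: p * scores.get(c, 0.0) for c, p in pr.items()}
    total = sum(weighted.values())
    if total == 0:
        return weighted  # nothing scores above zero; leave as is
    return {c: w / total for c, w in weighted.items()}


# Hypothetical values: two equally likely objects, one scoring twice the other.
final = reweight({"c1": 0.5, "c2": 0.5}, {"c1": 1.0, "c2": 0.5})
```

With these values `final` assigns about 2/3 to c1 and 1/3 to c2, so objects that match the query better become more likely to be the object of interest.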

Experimental Evaluation
Goal: Evaluate the algorithms under this user model, using two datasets: CourseRank and del.icio.us.
- CourseRank: A set of 18,774 courses that Stanford University offers annually. Each course represents an object in C. We consider all the words in the course title, course description, and user comments to be the tags for the object, except for stop words and terms that are very frequent and do not describe the content of the course.
- del.icio.us: Consists of URLs and the tags that users assigned to them on del.icio.us. Our dataset contains a month's worth of data (May 25, 2007 to June 26, 2007); we randomly sampled 100,000 URLs and all the tags that were assigned to these 100,000 URLs during that month.

Experimental Evaluation
[Figure: box plot]
The algorithms impact the metrics differently.

Experimental Evaluation
Algorithms impact failure probability differently. The lower the failure probability is, the more likely it is that one of the displayed tags will lead to an object of interest.

Experimental Evaluation
How the ordering of the algorithms changes for different extent and r values.

Conclusions
- Our goal has been to understand the process of building a tag cloud for exploring and understanding a set of objects.
- Under our user model, our maximum coverage algorithm, COV, seems to be a very good choice for most scenarios.
- Our user model is a useful tool for identifying tag clouds preferred by users.
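The details of COV are not preserved in this transcript. As a rough illustration of what a maximum coverage selection could look like, the sketch below uses the standard greedy heuristic for the maximum coverage problem: repeatedly pick the tag that adds the most objects not yet covered. Whether the paper's COV works exactly this way is an assumption; the function name `greedy_cover` and the sample data are hypothetical.

```python
def greedy_cover(assoc, k):
    """Greedily pick up to k tags, each adding the most uncovered objects.

    assoc maps each candidate tag to the set of objects it is attached to.
    Returns the selected tags in pick order and the set of covered objects.
    """
    remaining = dict(assoc)
    selected, covered = [], set()
    for _ in range(min(k, len(remaining))):
        # Iterate in sorted order so ties go to the alphabetically first tag.
        best = max(sorted(remaining), key=lambda tag: len(remaining[tag] - covered))
        if not remaining[best] - covered:
            break  # no remaining tag adds a new object
        selected.append(best)
        covered |= remaining.pop(best)
    return selected, covered


# Hypothetical association sets.
assoc = {"A": {1, 2}, "B": {3, 4}, "C": {1, 2}, "D": {1, 2, 3}}
tags, reached = greedy_cover(assoc, 2)
```

For this data the first pick is D (three new objects) and the second is B (the only tag that still adds one), so `tags` is `["D", "B"]` and all four objects are reached.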

Thank you for your attention!