Entity-Centric Topic-Oriented Opinion Summarization in Twitter Date: 2013/09/03 Author: Xinfan Meng, Furu Wei, Xiaohua Liu, Ming Zhou, Sujian Li and.

Presentation transcript:

Entity-Centric Topic-Oriented Opinion Summarization in Twitter
Date: 2013/09/03
Author: Xinfan Meng, Furu Wei, Xiaohua Liu, Ming Zhou, Sujian Li and Houfeng Wang
Source: KDD'12
Advisor: Jia-ling Koh
Speaker: Yi-hsuan Yeh

Outline
• Introduction
• Topic Extraction
• Opinion Summarization
• Experiment
• Conclusion

Introduction
• Microblogging services such as Twitter have become popular communication channels.
• People not only share daily updates and personal conversations, but also exchange opinions on a broad range of topics.
• However, opinions about a single entity may target different aspects, or topics, of that entity.

Introduction
• Goal: produce opinion summaries that are organized by topic and that emphasize the insight behind the opinions.

Outline
• Introduction
• Topic Extraction
• Opinion Summarization
• Experiment
• Conclusion

Topic Extraction
• #hashtags
  - They are created organically by Twitter users as a way to categorize messages and to highlight topics.
• We use #hashtags as candidate topics.

Topic Extraction
1. Collect a dictionary from ODP and Freebase
   • Rule-based classifier
2. Split each #hashtag into multiple words, then check whether any of the words appear in the person/location dictionary
3. Tagness (threshold = 0.85)
   ex: occurrences of #fb = 95, total occurrences of its content = 100
   tagness = 95/100 = 0.95 > 0.85 (remove)
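The tagness example above can be reproduced with a few lines of Python. This is only a sketch under assumptions: the slide does not say how occurrences are counted, so the tokenizer below, the helper names `tagness` and `keep_as_topic`, and the sample tweets are all hypothetical.

```python
import re

# Assumption: a token is a run of word characters, optionally preceded by '#'.
TOKEN = re.compile(r"(#?)(\w+)")

def tagness(tag, tweets):
    """Share of all occurrences of a word that are written as a #hashtag."""
    word = tag.lstrip("#").lower()
    as_hashtag = 0
    total = 0
    for tweet in tweets:
        for marker, text in TOKEN.findall(tweet.lower()):
            if text == word:
                total += 1
                if marker:
                    as_hashtag += 1
    return as_hashtag / total if total else 0.0

def keep_as_topic(tag, tweets, threshold=0.85):
    """Hashtags whose tagness exceeds the threshold are removed, as on the slide."""
    return tagness(tag, tweets) <= threshold

# Hypothetical data matching the slide: 95 hashtag uses out of 100 occurrences.
tweets = ["status update #fb"] * 95 + ["fb is down again"] * 5
print(tagness("#fb", tweets))        # 0.95
print(keep_as_topic("#fb", tweets))  # False
```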

Graph-based Topic Extraction
• Affinity Propagation algorithm
  - Input: pairwise relatedness matrix of #hashtags
  - Output: #hashtag clusters and the centroids of the clusters
Relatedness
1. Co-occurrence relation
   [Figure: graph over hashtags h1 to h6]
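As a rough illustration of the co-occurrence relation, the sketch below counts how often two hashtags appear in the same tweet. The slide does not give the exact weighting or normalization, so raw counts are an assumption, and `cooccurrence_counts` and the sample tweets are hypothetical.

```python
import re
from collections import Counter
from itertools import combinations

HASHTAG = re.compile(r"#(\w+)")

def cooccurrence_counts(tweets):
    """Count, for every unordered hashtag pair, the tweets that contain both."""
    pair_counts = Counter()
    for tweet in tweets:
        # Deduplicate and sort so each pair has one canonical key.
        tags = sorted({t.lower() for t in HASHTAG.findall(tweet)})
        for a, b in combinations(tags, 2):
            pair_counts[(a, b)] += 1
    return pair_counts

# Hypothetical tweets for illustration only.
sample = [
    "#ipad #apple launch today",
    "new #apple #ipad price",
    "#apple #iphone rumors",
]
counts = cooccurrence_counts(sample)
print(counts[("apple", "ipad")])    # 2
print(counts[("apple", "iphone")])  # 1
```

Counts like these could be combined with the other relatedness measures into the pairwise matrix that a clustering routine takes as input, for example scikit-learn's `AffinityPropagation` with `affinity="precomputed"`.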

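Relatedness
2. Context similarity
ex:
        t1   t2   t3   t4
h_i      4    0    5    3
h_j      2    3    0    6

$$\mathrm{Cosine}(h_i, h_j) = \frac{(4 \cdot 2) + (0 \cdot 3) + (5 \cdot 0) + (3 \cdot 6)}{(4^2 + 0^2 + 5^2 + 3^2)^{1/2} \, (2^2 + 3^2 + 0^2 + 6^2)^{1/2}}$$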
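The cosine example can be checked numerically. In this sketch the two context vectors are read off the numerator of the slide's formula (first factors for h_i, second factors for h_j); the function name `cosine` is just a label chosen here.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length count vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # convention chosen here for empty contexts
    return dot / (norm_u * norm_v)

# Context counts over t1..t4, taken from the slide's numerator.
h_i = [4, 0, 5, 3]
h_j = [2, 3, 0, 6]
print(round(cosine(h_i, h_j), 3))  # 0.525
```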

Relatedness
3. Topic-aware distributional similarity
• Labeled LDA
ex (w: other words in the tweets):
        w1    w2    w3    w4
h_i     0.4   0.3   0.1   0.2
h_j     0.3   0.1   0.5   0.1

$$\mathrm{KL}(h_i, h_j) = 0.4 \ln\frac{0.4}{0.3} + 0.3 \ln\frac{0.3}{0.1} + 0.1 \ln\frac{0.1}{0.5} + 0.2 \ln\frac{0.2}{0.1}$$
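The KL example can be evaluated with a short Python sketch. The two word distributions are read off the slide's formula; the function name `kl_divergence` is chosen here, and the code assumes every word with nonzero probability under the first distribution also has nonzero probability under the second (smoothing is not discussed on the slide).

```python
import math

def kl_divergence(p, q):
    """KL(p || q) = sum over i of p_i * ln(p_i / q_i), skipping terms with p_i = 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Word distributions over w1..w4 for the two hashtags, from the slide.
h_i = [0.4, 0.3, 0.1, 0.2]
h_j = [0.3, 0.1, 0.5, 0.1]
print(round(kl_divergence(h_i, h_j), 3))  # 0.422
```

KL divergence is not symmetric, so `kl_divergence(h_j, h_i)` generally gives a different value; a smaller divergence means the two hashtags are used with more similar words.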

Topic Labeling and Assignment
• For a tweet with #hashtag(s), we assign it the topic(s) corresponding to every #hashtag in the tweet.
• For a tweet without #hashtags, we predict its topic using an SVM classifier.
  - Bag-of-words features
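The assignment rule can be sketched as follows. Everything named here is hypothetical: `hashtag_to_topic` stands for the mapping produced by the clustering step, and `classify` is a stand-in for a trained SVM that takes bag-of-words features. Falling back to the classifier when none of a tweet's hashtags belongs to a known topic is an assumption, not something stated on the slide.

```python
import re
from collections import Counter

HASHTAG = re.compile(r"#(\w+)")

def bag_of_words(tweet):
    """Lowercased token counts, used as features for the fallback classifier."""
    return Counter(re.findall(r"\w+", tweet.lower()))

def assign_topics(tweet, hashtag_to_topic, classify):
    """Return the set of topics for a tweet."""
    tags = [t.lower() for t in HASHTAG.findall(tweet)]
    topics = {hashtag_to_topic[t] for t in tags if t in hashtag_to_topic}
    if topics:
        return topics
    # No usable hashtag: predict one topic from the bag-of-words features.
    return {classify(bag_of_words(tweet))}

# Hypothetical mapping and a toy rule standing in for the SVM.
mapping = {"ipad": "tablet", "iphone": "phone"}
stub_classifier = lambda bow: "phone" if bow["call"] else "tablet"

print(assign_topics("Love my #iPad and #iPhone", mapping, stub_classifier))
print(assign_topics("cannot call anyone today", mapping, stub_classifier))
```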

Outline
• Introduction
• Topic Extraction
• Opinion Summarization
  - Insightful Tweet Classification
  - Opinionated Tweet Classification
  - Summary Generation
• Experiment
• Conclusion

Insightful Tweet Classification
• Stanford Parser
  - Match the pattern syntax trees against the tweet syntax trees.
• To create a high-coverage pattern set, we use a paraphrase generation algorithm.
  - ex: "that is why" → "which is why"

Opinionated Tweet Classification
• A lexicon-based sentiment classifier that relies on sentiment dictionary matching
  - Counts the occurrences of the positive (cp) and negative (cn) words
• Negation expressions
  - If the distance in words between a negation expression (neg) and a sentiment word (w) is smaller than a predefined threshold (5), invert the sentiment orientation.
  - ex: "eliminate", "reduce"
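A minimal sketch of such a lexicon-based classifier is shown below. The tiny word lists are placeholders rather than the dictionary used in the work, the tokenizer is simplistic, and the final decision rule (compare cp with cn) is an assumption, since the slide only describes the counting and the negation window.

```python
import re

# Placeholder lexicons for illustration only.
POSITIVE = {"good", "great", "love"}
NEGATIVE = {"bad", "slow", "hate"}
NEGATIONS = {"not", "no", "never", "eliminate", "reduce"}
WINDOW = 5  # negation applies when the word distance is smaller than this

def classify_sentiment(tweet):
    tokens = re.findall(r"[a-z']+", tweet.lower())
    negation_positions = [i for i, tok in enumerate(tokens) if tok in NEGATIONS]
    cp = cn = 0
    for i, tok in enumerate(tokens):
        if tok in POSITIVE:
            polarity = 1
        elif tok in NEGATIVE:
            polarity = -1
        else:
            continue
        # Invert the orientation if a negation expression is close enough.
        if any(abs(i - j) < WINDOW for j in negation_positions):
            polarity = -polarity
        if polarity > 0:
            cp += 1
        else:
            cn += 1
    if cp > cn:
        return "positive"
    if cn > cp:
        return "negative"
    return "neutral"

print(classify_sentiment("I do not love the new screen"))            # negative
print(classify_sentiment("The update will reduce the slow loading")) # positive
```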

Target-lexicon dependency classification
• A binary SVM classifier determines whether the sentiment word (w) is used to describe the target (e).
• Features:
  1. The distance in words between w and e
  2. Whether there are other entities between w and e
  3. Whether there is punctuation between w and e
  4. Whether there are other sentiment words between w and e
  5. The relative position of w and e: w is before or after e
  6. Whether there is a dependency relation between w and e (MST Parser)
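Features 1 to 5 can be computed directly from token positions, as the sketch below shows. It assumes the tweet is already tokenized and that the positions of entities and sentiment words are known; the function and key names are hypothetical. Feature 6 is left out because it needs a dependency parser.

```python
import string

def target_features(tokens, w_idx, e_idx, entity_idxs, sentiment_idxs):
    """Features describing how sentiment word w relates to target entity e."""
    lo, hi = sorted((w_idx, e_idx))
    between = range(lo + 1, hi)  # positions strictly between w and e

    def is_punct(tok):
        return bool(tok) and all(ch in string.punctuation for ch in tok)

    return {
        "distance": hi - lo,
        "entity_between": any(i in entity_idxs for i in between),
        "punct_between": any(is_punct(tokens[i]) for i in between),
        "sentiment_between": any(i in sentiment_idxs for i in between),
        "w_before_e": w_idx < e_idx,
    }

# Hypothetical tokenized tweet: entities at 0 and 5, sentiment words at 2 and 7.
tokens = ["iPad", "is", "great", ",", "but", "Kindle", "is", "cheap"]
entities = {0, 5}
sentiments = {2, 7}

print(target_features(tokens, 2, 0, entities, sentiments))  # "great" vs "iPad"
print(target_features(tokens, 7, 0, entities, sentiments))  # "cheap" vs "iPad"
```

In the second call every "between" feature fires, which is the kind of evidence a classifier could use to decide that "cheap" does not describe "iPad".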

Summary Generation
• Select a subset of tweets P from the tweet set T_k for topic k
1. Language style score
   ex: "I am Avril Lavigne's biggest fan!! ❤"
   L(t_i) = 1 + (1/7) = 1.143

2. Topic relevance score
• Term distribution of tweet t_i and topic label l_k
ex:
        t1    t2    t3    t4
t_i     0.1   0.5   0.2   0.2
l_k     0.2   0.1   0.6   0.1

$$\mathrm{KL}(t_i, l_k) = 0.1 \ln\frac{0.1}{0.2} + 0.5 \ln\frac{0.5}{0.1} + 0.2 \ln\frac{0.2}{0.6} + 0.2 \ln\frac{0.2}{0.1}$$

3. Redundancy score
• Word distribution of tweet t_i and tweet t_j
ex:
        t1    t2     t3     t4     t5
t_i     0.4   0.1    0.15   0.3    0.05
t_j     0.1   0.35   0.2    0.15   0.2

$$\mathrm{KL}(t_i, t_j) = 0.4 \ln\frac{0.4}{0.1} + 0.1 \ln\frac{0.1}{0.35} + 0.15 \ln\frac{0.15}{0.2} + 0.3 \ln\frac{0.3}{0.15} + 0.05 \ln\frac{0.05}{0.2}$$

Outline
• Introduction
• Topic Extraction
• Opinion Summarization
• Experiment
• Conclusion

Data

Evaluation of Topic Extraction

Evaluation of Opinion Summarization

• Language style score = 1

Outline
• Introduction
• Topic Extraction
• Opinion Summarization
• Experiment
• Conclusion

Conclusion
• We presented an entity-centric, topic-oriented opinion summarization framework for Twitter that produces opinion summaries organized by topic and emphasizes the insight behind the opinions.
• In the future, we will further study the semantics underlying #hashtags, which we can use to extract more comprehensive and interesting topics.