Question Identification on Twitter 1 The Chinese University of Hong Kong, Shatin, N.T., Hong Kong 2 Google Research, Beijing, China 3 AT&T Labs Research,

Slides:



Advertisements
Similar presentations
By Constanza Lermanda G.. Topic: Electronic NewspaperDuration of the lesson: 50 minutesGrade Level: 12th.
Advertisements

Kiran Garimella.  News  Scientific papers   Search Queries  Twitter ◦ Gender ◦ Relationships ◦ Migration ◦ Politics.
Linking Entities in #Microposts ROMIL BANSAL, SANDEEP PANEM, PRIYA RADHAKRISHNAN, MANISH GUPTA, VASUDEVA VARMA INTERNATIONAL INSTITUTE OF INFORMATION TECHNOLOGY,
Twitter – what is it? The School District of Haverford Township |
DISPUTES & INVESTIGATIONS ECONOMICS FINANCIAL ADVISORY MANAGEMENT CONSULTING Joining Twitter How to Register, Follow Navigant & Join the Conversation June.
Twitter Glossary. #: People use the hashtag symbol # before a relevant keyword or phrase (no spaces) in their Tweet to categorize those Tweets and help.
SOCIAL MEDIA ADVOCACY &. WHAT YOU WILL GET OUT OF TODAY’S SESSION: HOW COALITION MEMBERS AND SUPPORTERS CAN ADVOCATE FOR MINNEMINDS CHANNELS TYPES OF.
NHnetWORKS December 14,  Facebook is a global Social Networking website that is operated and privately owned by Facebook, Inc.  Users can add.
Social Media for Health Advocates Twitter
Twitter The Basics. What is Twitter? Tweets are: 140 characters or less Quick to follow and view updates Used to share links, photos, videos, music,hot.
Semi Supervised Recognition of Sarcastic Sentences in Twitter and Amazon Dmitry DavidovOren TsurAri Rappoport.
Chen Cheng1, Haiqin Yang1, Irwin King1,2 and Michael R. Lyu1
Context-Aware Query Classification Huanhuan Cao 1, Derek Hao Hu 2, Dou Shen 3, Daxin Jiang 4, Jian-Tao Sun 4, Enhong Chen 1 and Qiang Yang 2 1 University.
1 PageSim: A Link-based Similarity Measure for the World Wide Web Zhenjiang Lin, Irwin King, and Michael, R., Lyu Computer Science & Engineering, The Chinese.
Extracting Interest Tags from Twitter User Biographies Ying Ding, Jing Jiang School of Information Systems Singapore Management University AIRS 2014, Kuching,
Skills: familiarity with the Twitter user interface and major features, using the hashtag (#) and at-sign searching and tweeting images and videos.
Search Engine Optimization
SIGIR’09 Boston 1 Entropy-biased Models for Query Representation on the Click Graph Hongbo Deng, Irwin King and Michael R. Lyu Department of Computer Science.
WEB FORUM MINING BASED ON USER SATISFACTION PAGE 1 WEB FORUM MINING BASED ON USER SATISFACTION By: Suresh Pokharel Information and Communications Technologies.
Introduction The large amount of traffic nowadays in Internet comes from social video streams. Internet Service Providers can significantly enhance local.
Extracting Key Terms From Noisy and Multi-theme Documents Maria Grineva, Maxim Grinev and Dmitry Lizorkin Institute for System Programming of RAS.
12/2014 Heidi Larson HeidiL_edc.  Setting up an account  Twitter vocabulary – With Strategy tips  How to Tweet  Why to Tweet  How to get started.
Detecting Semantic Cloaking on the Web Baoning Wu and Brian D. Davison Lehigh University, USA WWW 2006.
Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 10/9/20151.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
 Text Representation & Text Classification for Intelligent Information Retrieval Ning Yu School of Library and Information Science Indiana University.
Skills: familiarity with the Twitter user interface and major features, the #hashtag and search Concepts: evolution of Twitter applications.
Skills: familiarity with the Twitter user interface and major features, the #hashtag and search Concepts: evolution of Twitter applications.
A Language Independent Method for Question Classification COLING 2004.
Social Media By Amanda Carter & Nicholle Hinkle.
Zunal: Webquest Creation Website Created by Russell Smith Technology Facilitator North Edgecombe High School Username: edgecombe Password: warrior.
Jargon Busters Presented by Katie Munton and Natalie Dawson.
Chansamooth- The Confidence Coach © 2014 How To Schedule Posts in Facebook.
Question Routing in Community Question Answering: Putting Category in Its Place 1 The Chinese University of Hong Kong, Shatin, N.T., Hong Kong 2 AT&T Labs.
SEO Who knew 3 letters could mean so much?. What is SEO? Search Engine Optimization (SEO) is the practice of improving and promoting a web site in order.
SOCIAL MEDIA Beware the text-heavy presentation ahead Kelley Freeman Communications Associate Secular Student Alliance.
Reporter: Jing Chiu Advisor: Yuh-Jye Lee /3/17 1 Data Mining and Machine Learning Lab.
Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.
Club Overview - Day 2 (Get Excited!!!!!). Agenda I. Log into Canvas II. Choosing a Level III. Learning and Creating IV. Closing.
Created by Branden Maglio and Flynn Castellanos Team BFMMA.
Liangjie Hong and Brian D. Davison Department of Computer Science and Engineering Lehigh University SIGIR 2009.
A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,
11 A Classification-based Approach to Question Routing in Community Question Answering Tom Chao Zhou 1, Michael R. Lyu 1, Irwin King 1,2 1 The Chinese.
Running the OFA’s Social Media Madeline Zukowski Faculty Marketing Intern Summer 2013.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Analyzing and Predicting Question Quality in Community Question Answering Services Baichuan Li, Tan Jin, Michael R. Lyu, Irwin King, and Barley Mak CQA2012,
Poster Spotlights Conference on Uncertainty in Artificial Intelligence Catalina Island, United States August 15-17, 2012 Session: Wed. 15 August 2012,
MMM2005The Chinese University of Hong Kong MMM2005 The Chinese University of Hong Kong 1 Video Summarization Using Mutual Reinforcement Principle and Shot.
Engaging the audience. Social Media is a Universe A way to talk with supporters and key stakeholders So, be a connector. Reciprocate. Empower your audience,
Fabricio Benevenuto, Gabriel Magno, Tiago Rodrigues, and Virgilio Almeida Universidade Federal de Minas Gerais Belo Horizonte, Brazil ACSAC 2010 Fabricio.
Internet Safety Blog
WePS2 Attribute Extraction Task Sekine and Artiles WWW 2009 Workshop.
Measuring User Influence in Twitter: The Million Follower Fallacy Meeyoung Cha Hamed Haddadi Fabricio Benevenuto Krishna P. Gummadi.
Semi-Supervised Recognition of Sarcastic Sentences in Twitter and Amazon -Smit Shilu.
PLN Basics What is a Professional Learning Network? (PLN) How would that help me? How do I create one? Twitter Really Simple Syndicate (RSS)
Wujie Zheng 1, Hao Ma 2, Michael Lyu 1, Tao Xie 3, and Irwin King 1,4 1 CUHK, 2 Microsoft Research, 3 NCSU, 4 AT&T Labs Nov. 9, 2011 Mining Test Oracles.
Twitter Part One – The Fundamentals. First things first… What is Twitter? Social networking platform Short messages – 140 characters maximum Relaxed,
Topic Modeling for Short Texts with Auxiliary Word Embeddings
Your Company Competitor Report {Insert Company Logo Here}
WSRec: A Collaborative Filtering Based Web Service Recommender System
Video Summarization by Spatial-Temporal Graph Optimization
What Is MLA Style? MLA Stands for “Modern Language Association”
All a Twitter About Literature
Zhenjiang Lin, Michael R. Lyu and Irwin King
WorkShop on Community Question Answering on the Web
A Classification-based Approach to Question Routing in Community Question Answering Tom Chao Zhou 22, Feb, 2010 Department of Computer.
Three steps are separately conducted
6) 2) 3) 7) 4) (Character’s full name) twitter name
Web Page Classification with Heterogeneous Data Fusion
Presentation transcript:

Question Identification on Twitter 1 The Chinese University of Hong Kong, Shatin, N.T., Hong Kong 2 Google Research, Beijing, China 3 AT&T Labs Research, San Francisco, CA, USA Baichuan Li 1, Xiance Si 2, Michael R. Lyu 1, Irwin King 13, and Edward Y. Chang 2 People Ask Questions on Twitter! 10% of Twitter Users once asked questions on Twitter 13% of Tweets contain questions 1,600 tweets were posted per second -> 200 questions per second on Twitter were asked How to Find Questions automatically? Tweet is short and noisy Tweets containing questions are not always asking questions Interrogative Tweet Detection Rule-based Approach  Question marks  5W1H words and Refined 5W1H words  H1: They must appear at the beginning of one sentence.  H2: Auxiliary words are added to the original words. E.g., we change “what” to “what is” and “what are”.  Heuristic Rules (Efron and Winget, 2010) Learning-based Approach  Frequent question patterns mining  One-class SVM Taxonomy of Interrogative Tweets Advertisement  Incorporating your business this year? Call us today for a free consultation with one of our attorneys Article or News Title on the Web  New post: Pregnancy Miracle - A Miracle or a Scam? or-a-scam/ II. Approach (cont.)I. Motivations II. Approach ~ Propose a novel problem of automatically identifying questions on Twitter Provide a two-phase classification model to discover interrogative tweets and qweets Investigate different feature sets’ influence on qweet extraction (especially, Tweet- specific features such retweet, and hashtag) III. Experiments IV. Conclusions Data Set 1 – Twitter stream from 11:00am to 12:00am on April 18, 2011 Objective: Discovering interrogative tweets and qweets Content: 2,045 English tweets (227 interrogative tweets and 127 qweets) Data Set 2 – QA pairs from Yahoo! Answers and WikiAnswers Objective: Extracting frequent question patterns Content: Over 850,000 question titles and the corresponding best answers – Experimental Results: Qweet Extraction Figure 1. Two-phase classification model Question with Answer  I even tried staying away from my using my Internet for a couple hours. The result? Insanity! Question as Quotation  I think Brian’s been drinking in there because I’m hearing him complain about girls, and then he goes “Wright,are you sure you’re not gay?” Rhetorical Question  You ruined my life and I’m supposed to like you? Qweet  What’s your favorite Harry Potter scene? Qweet Detection using a Random Forrest classifier Interrogative Tweets: Tweets which contain questions Qweets: Interrogative tweets which require information or help FeatureDescription Question features (Q) Quoted question Whether the question sentence is quoted from other sources Strong feeling Whether the question sentence contains strong feeling such as “???” and “?!” Context features (C) URLWhether the context contains any url Phone number or Whether the context contains any phone number or Strong feeling Whether there is any strong feeling such as “!” follows the question sentence Declarative sentence after question sentence Whether there is any declarative sentence follows the question sentence Word featuresUnigram words appear in the contexts of tweets Question-Context features (QC) Self ask self answer Whether the tweet contains obvious self ask self answer pattern. E.g., Q:...A:... Question-URL sameness Whether the question sentence is the same as the webpage's title linked through the URL Tweet-Specic features the tweet mentions other user's name RetweetWhether the tweet is a Retweet HashtagWhether the tweet contains any hashtag – Experimental Results: Interrogative Tweet Detection Table 1. Features extracted for qweet extraction MethodsPrecisionRecallF1 QM QM or 5W1H QM or refined 5W1H (H1) QM or refined 5W1H (H2) QM or refined 5W1H (H1 and H2) Rules in (Efron and Winget, 2010) Question Patterns (Confidence≥0.7) Question Patterns (Confidence≥0.8) Question Patterns (Confidence≥0.9) Table 2. Accuracies of interrogative tweet detection for various methods (QM: question mark; best results are in bold) Figure 2. Influence of feature sets on qweet extraction