A Novel Approach to Event Duration Prediction

Slides:



Advertisements
Similar presentations
Tim Smyth and Jamie Shutler Assessment of analysis and forecast skill Assessment using satellite data.
Advertisements

Using Query Patterns to Learn the Durations of Events Andrey Gusev joint work with Nate Chambers, Pranav Khaitan, Divye Khilnani, Steven Bethard, Dan Jurafsky.
Bilge Mutlu, Andreas Krause, Jodi Forlizzi, Carlos Guestrin, and Jessica Hodgins Human-Computer Interaction Institute, Carnegie Mellon University Robust,
Named Entity Classification Chioma Osondu & Wei Wei.
Word sense disambiguation and information retrieval Chapter 17 Jurafsky, D. & Martin J. H. SPEECH and LANGUAGE PROCESSING Jarmo Ritola -
Jean-Eudes Ranvier 17/05/2015Planet Data - Madrid Trustworthiness assessment (on web pages) Task 3.3.
1 A scheme for racquet sports video analysis with the combination of audio-visual information Visual Communication and Image Processing 2005 Liyuan Xing,
GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.
Applications Chapter 9, Cimiano Ontology Learning Textbook Presented by Aaron Stewart.
Multivariate Methods Pattern Recognition and Hypothesis Testing.
Predicting Text Quality for Scientific Articles Annie Louis University of Pennsylvania Advisor: Ani Nenkova.
CS Word Sense Disambiguation. 2 Overview A problem for semantic attachment approaches: what happens when a given lexeme has multiple ‘meanings’?
Gimme’ The Context: Context- driven Automatic Semantic Annotation with CPANKOW Philipp Cimiano et al.
Presented by Zeehasham Rasheed
Sentence Classifier for Helpdesk s Anthony 6 June 2006 Supervisors: Dr. Yuval Marom Dr. David Albrecht.
Automatic Acquisition of Lexical Classes and Extraction Patterns for Information Extraction Kiyoshi Sudo Ph.D. Research Proposal New York University Committee:
A Framework for Named Entity Recognition in the Open Domain Richard Evans Research Group in Computational Linguistics University of Wolverhampton UK
ML ALGORITHMS. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of classifying new examples.
Scalable Text Mining with Sparse Generative Models
Activity Recognition from User- Annotated Acceleration Data Presented by James Reinebold CSCI 546.
Finding Advertising Keywords on Web Pages Scott Wen-tau YihJoshua Goodman Microsoft Research Vitor R. Carvalho Carnegie Mellon University.
Na-Rae Han (University of Pittsburgh), Joel Tetreault (ETS), Soo-Hwa Lee (Chungdahm Learning, Inc.), Jin-Young Ha (Kangwon University) May , LREC.
Towards Boosting Video Popularity via Tag Selection Elizeu Santos-Neto, Tatiana Pontes, Jussara Almeida, Matei Ripeanu University of British Columbia -
AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
Keyphrase Extraction in Scientific Documents Thuy Dung Nguyen and Min-Yen Kan School of Computing National University of Singapore Slides available at.
Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification on Reviews Peter D. Turney Institute for Information Technology National.
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Mehdi Ghayoumi Kent State University Computer Science Department Summer 2015 Exposition on Cyber Infrastructure and Big Data.
Automatically Identifying Localizable Queries Center for E-Business Technology Seoul National University Seoul, Korea Nam, Kwang-hyun Intelligent Database.
Using Transactional Information to Predict Link Strength in Online Social Networks Indika Kahanda and Jennifer Neville Purdue University.
2007. Software Engineering Laboratory, School of Computer Science S E Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying.
Iterative Readability Computation for Domain-Specific Resources By Jin Zhao and Min-Yen Kan 11/06/2010.
Automatic Detection of Tags for Political Blogs Khairun-nisa Hassanali and Vasileios Hatzivassiloglou Human Language Technology Research Institute The.
 Text Representation & Text Classification for Intelligent Information Retrieval Ning Yu School of Library and Information Science Indiana University.
Discovery of Manner Relations and their Applicability to Question Answering Roxana Girju 1,2, Manju Putcha 1, and Dan Moldovan 1 University of Texas at.
This work is supported by the Intelligence Advanced Research Projects Activity (IARPA) via Department of Interior National Business Center contract number.
Automatic Detection of Plagiarized Spoken Responses Copyright © 2014 by Educational Testing Service. All rights reserved. Keelan Evanini and Xinhao Wang.
A Weakly-Supervised Approach to Argumentative Zoning of Scientific Documents Yufan Guo Anna Korhonen Thierry Poibeau 1 Review By: Pranjal Singh Paper.
1 Automating Slot Filling Validation to Assist Human Assessment Suzanne Tamang and Heng Ji Computer Science Department and Linguistics Department, Queens.
1 Rated Aspect Summarization of Short Comments Yue Lu, ChengXiang Zhai, and Neel Sundaresan Presented by: Sapan Shah.
Entity Set Expansion in Opinion Documents Lei Zhang Bing Liu University of Illinois at Chicago.
Summarization Focusing on Polarity or Opinion Fragments in Blogs Yohei Seki Toyohashi University of Technology Visiting Scholar at Columbia University.
Some questions -What is metadata? -Data about data.
Semi-automatic Product Attribute Extraction from Store Website
Towards Total Scene Understanding: Classification, Annotation and Segmentation in an Automatic Framework N 工科所 錢雅馨 2011/01/16 Li-Jia Li, Richard.
1 Fine-grained and Coarse-grained Word Sense Disambiguation Jinying Chen, Hoa Trang Dang, Martha Palmer August 22, 2003.
Unsupervised Relation Detection using Automatic Alignment of Query Patterns extracted from Knowledge Graphs and Query Click Logs Panupong PasupatDilek.
From Words to Senses: A Case Study of Subjectivity Recognition Author: Fangzhong Su & Katja Markert (University of Leeds, UK) Source: COLING 2008 Reporter:
Learning Event Durations from Event Descriptions Feng Pan, Rutu Mulkar, Jerry R. Hobbs University of Southern California ACL ’ 06.
Annotating and measuring Temporal relations in texts Philippe Muller and Xavier Tannier IRIT,Université Paul Sabatier COLING 2004.
Concept-Based Analysis of Scientific Literature Chen-Tse Tsai, Gourab Kundu, Dan Roth UIUC.
Language Identification and Part-of-Speech Tagging
Brief Intro to Machine Learning CS539
Automatically Labeled Data Generation for Large Scale Event Extraction
Coarse-grained Word Sense Disambiguation
Sentiment analysis algorithms and applications: A survey
Online Multiscale Dynamic Topic Models
CRF &SVM in Medication Extraction
David Mareček and Zdeněk Žabokrtský
Authorship Attribution Using Probabilistic Context-Free Grammars
Aspect-based sentiment analysis
Mining the Data Charu C. Aggarwal, ChengXiang Zhai
Social Knowledge Mining
Chao Xu, Parth H. Pathak, et al. HotMobile’15
Supervised vs. unsupervised Learning
CSCI 5832 Natural Language Processing
Text Mining & Natural Language Processing
Text Mining & Natural Language Processing
Logistic Regression [Many of the slides were originally created by Prof. Dan Jurafsky from Stanford.]
Automatic Prosodic Event Detection
Presentation transcript:

A Novel Approach to Event Duration Prediction Pranav Khaitan Divye Raj Khilnani Ye Jin

Introduction Liverpool will be playing inter-Milan this Friday. Predicting event duration has been a challenging problem and can solve some major challenges being faced in question answering systems. Examples: Liverpool will be playing inter-Milan this Friday. The United States has been fighting a cold war with the Soviet Union. duration of the match is in hours duration of the war was in decades

Duration is Non-trivial Same event can have different bounds in different contexts. James watched a movie. Hour James watched the birds fly. Minute More Features Subject Object Grammatical Part of Speech Tense Modality Context Class Hypernym Aspect

System Design Feature Extraction Feature Selection Learning and Classification Evaluation Parse Tree Web Count Hypernym Named Entity Recognition X2 score MI score Emperical observation Supervised learning: Naïve Bayes Logistic Regression Maximum Entropy Unsupervised Learning Agglomerative Clustering Multinomial clustering Precision Recall F1 Kappa Approximate Agreement

Feature Analysis Subject-object Base verb lemmatization Tense Jonathan is watching a movie vs Jonathan is watching an advertisement Base verb lemmatization eating, ate, has eaten, will be eating Tense Jonathan will play football in the evening vs Jonathan has been playing football for the past ten years Sentential Dependencies He read the report quickly vs He read the report slowly Part of speech tagging The government’s move was anticipated Named Entity Recognition The body will define the role of the United Nations

Feature Analysis Hypernyms Contextual Features Web Counts Generic Features: Modality, Aspect, Class Report Feature

Results

Results

Feature Selection Total extracted features: 10,000+. Need to scale down.   MI score for features drops quickly Effectiveness of feature selection

Unsupervised Clustering

Conclusion Significant gain in event duration prediction accuracy using supervised learning Unsupervised learning results look promising and gives opportunity to do duration prediction across domains with little annotated data Important to automatically select features and reduce human involvement Classification Task Our Results Feng Pan et al Human Agreement Coarse Grain 75.16% 70.3% 87.7% Fine Grain 63.69% 65.8% 79.8%