Predicting the Semantic Orientation of Adjectives

Slides:

Advertisements

Similar presentations

Chapter 5 Multiple Linear Regression

Advertisements

Experiments and Variables

Sentiment Analysis Learning Sentiment Lexicons. Dan Jurafsky Semi-supervised learning of lexicons Use a small amount of information A few labeled examples.

Data preprocessing before classification In Kennedy et al.: “Solving data mining problems”

Sentiment and Polarity Extraction Arzucan Ozgur SI/EECS 767 January 15, 2010.

Ensembles in Adversarial Classification for Spam Deepak Chinavle, Pranam Kolari, Tim Oates and Tim Finin University of Maryland, Baltimore County Full.

Multiple Criteria for Evaluating Land Cover Classification Algorithms Summary of a paper by R.S. DeFries and Jonathan Cheung-Wai Chan April, 2000 Remote.

Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.

© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.

K nearest neighbor and Rocchio algorithm

CS Word Sense Disambiguation. 2 Overview A problem for semantic attachment approaches: what happens when a given lexeme has multiple ‘meanings’?

Semantic text features from small world graphs Jure Leskovec, IJS + CMU John Shawe-Taylor, Southampton.

Predicting the Semantic Orientation of Adjective Vasileios Hatzivassiloglou and Kathleen R. McKeown Presented By Yash Satsangi.

19-1 Chapter Nineteen MULTIVARIATE ANALYSIS: An Overview.

Learning Subjective Nouns using Extraction Pattern Bootstrapping Ellen Riloff, Janyce Wiebe, Theresa Wilson Presenter: Gabriel Nicolae.

Boosting Applied to Tagging and PP Attachment By Aviad Barzilai.

Learning Subjective Adjectives from Corpora Janyce M. Wiebe Presenter: Gabriel Nicolae.

Article by: Feiyu Xu, Daniela Kurz, Jakub Piskorski, Sven Schmeier Article Summary by Mark Vickers.

Recommender systems Ram Akella November 26 th 2008.

Self-organizing Conceptual Map and Taxonomy of Adjectives Noriko Tomuro, DePaul University Kyoko Kanzaki, NICT Japan Hitoshi Isahara, NICT Japan April.

ML ALGORITHMS. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of classifying new examples.

Towards the automatic identification of adjectival scales: clustering adjectives according to meaning Authors: Vasileios Hatzivassiloglou and Kathleen.

Scaling and Attitude Measurement in Travel and Hospitality Research Research Methodologies CHAPTER 11.

Text Classification With Labeled and Unlabeled Data Presenter: Aleksandar Milisic Supervisor: Dr. David Albrecht.

Ontology Learning and Population from Text: Algorithms, Evaluation and Applications Chapters Presented by Sole.

Mining and Summarizing Customer Reviews

Modeling (Chap. 2) Modern Information Retrieval Spring 2000.

Applying Science Towards Understanding Behavior in Organizations Chapters 2 & 3.

Presented By Wanchen Lu 2/25/2013

Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification on Reviews Peter D. Turney Institute for Information Technology National.

COMP423: Intelligent Agent Text Representation. Menu – Bag of words – Phrase – Semantics – Bag of concepts – Semantic distance between two words.

Automatic Extraction of Opinion Propositions and their Holders Steven Bethard, Hong Yu, Ashley Thornton, Vasileios Hatzivassiloglou and Dan Jurafsky Department.

Processing of large document collections Part 2 (Text categorization) Helena Ahonen-Myka Spring 2006.

Instrumentation.

Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.

Attribute Extraction and Scoring: A Probabilistic Approach Taesung Lee, Zhongyuan Wang, Haixun Wang, Seung-won Hwang Microsoft Research Asia Speaker: Bo.

Processing of large document collections Part 2 (Text categorization, term selection) Helena Ahonen-Myka Spring 2005.

Final Study Guide Research Design. Experimental Research.

2007. Software Engineering Laboratory, School of Computer Science S E Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying.

1 A Graph-Theoretic Approach to Webpage Segmentation Deepayan Chakrabarti Ravi Kumar

1 Query Operations Relevance Feedback & Query Expansion.

Feature selection LING 572 Fei Xia Week 4: 1/29/08 1.

Modelling Human Thematic Fit Judgments IGK Colloquium 3/2/2005 Ulrike Padó.

Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.

Mining Binary Constraints in Feature Models: A Classification-based Approach Yi Li.

Minimally Supervised Event Causality Identification Quang Do, Yee Seng, and Dan Roth University of Illinois at Urbana-Champaign 1 EMNLP-2011.

SemiBoost : Boosting for Semi-supervised Learning Pavan Kumar Mallapragada, Student Member, IEEE, Rong Jin, Member, IEEE, Anil K. Jain, Fellow, IEEE, and.

The Scientific Method. Objectives Explain how science is different from other forms of human endeavor. Identify the steps that make up scientific methods.

DATA MINING WITH CLUSTERING AND CLASSIFICATION Spring 2007, SJSU Benjamin Lam.

Collocations and Terminology Vasileios Hatzivassiloglou University of Texas at Dallas.

Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.

Multivariate Analysis and Data Reduction. Multivariate Analysis Multivariate analysis tries to find patterns and relationships among multiple dependent.

Molecular Classification of Cancer Class Discovery and Class Prediction by Gene Expression Monitoring.

4. Relationship Extraction Part 4 of Information Extraction Sunita Sarawagi 9/7/2012CS 652, Peter Lindes1.

Improved Video Categorization from Text Metadata and User Comments ACM SIGIR 2011:Research and development in Information Retrieval - Katja Filippova -

Unit 2: Research & Statistics n Psychology deals with many experiments and studies n WHO? Every experimenter must decide on a SAMPLE, which is a group.

Relation Strength-Aware Clustering of Heterogeneous Information Networks with Incomplete Attributes ∗ Source: VLDB.

Physical Science and You Chapter One: Studying Physics and Chemistry Chapter Two: Experiments and Variables Chapter Three: Key Concepts in Physical Science.

Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:

Word classes and part of speech tagging. Slide 1 Outline Why part of speech tagging? Word classes Tag sets and problem definition Automatic approaches.

Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,

Adaptive Cluster Ensemble Selection Javad Azimi, Xiaoli Fern {azimi, Oregon State University Presenter: Javad Azimi. 1.

Sentiment Analysis Using Common- Sense and Context Information Basant Agarwal 1,2, Namita Mittal 2, Pooja Bansal 2, and Sonal Garg 2 1 Department of Computer.

Tree and Forest Classification and Regression Tree Bagging of trees Boosting trees Random Forest.

Methods of multivariate analysis Ing. Jozef Palkovič, PhD.

Machine Learning: Ensemble Methods

CSE 4705 Artificial Intelligence

Introduction to Data Science Lecture 7 Machine Learning Overview

Text Categorization Berlin Chen 2003 Reference:

Chapter 4: More on Two-Variable Data

Presentation transcript:

Predicting the Semantic Orientation of Adjectives Vasileios Hatzivassiloglou and Kathleen R. McKeown Presenter: Gabriel Nicolae

Introduction Orientation/polarity = direction of deviation from the norm Nearly synonymous simple vs. simplistic Antonyms hot vs. cold

Introduction In linguistic constructs such as conjunctions the choice of arguments and connectives are mutually constrained. The tax proposal was simple and well-received simplistic but well-received simplistic and well-received by the public.

Exceptions

Goals Automatically identify antonyms Distinguish near synonyms How? by retrieving semantic orientation information using indirect information collected from a large corpus Why? dictionaries and similar sources (thesauri, WordNet) do not include explicitly semantic orientation information lack of links between antonyms and synonyms when they depend on the domain of the discourse

Overview of their approach Correlation between indicators and semantic orientation  direct indicators: affixes (in-, un-) mostly negatives exceptions: independent, unbiased  indirect indicators: conjunctions conjoined adjectives usually are of the same orientation for most connectives the situation is reversed for but fair and legitimate corrupt and brutal fair and brutal corrupt and legitimate vs. from corpus semantically anomalous

General algorithm Extract conjunctions of adjectives and morphological relations Label each two conjoined adjectives as being of the same or different orientation using a log-linear regression model Separate adjectives into two subsets of different orientation using a clustering algorithm The group with the higher average frequency is labeled as positive

Data collection Corpus: 21 million word 1987 Wall Street Journal Training data: a set of adjectives with predetermined (hand-annotated) orientation labels (+ or -) 1,336 adjectives (657 +, 679 -) The training set was validated by four other people 500 adjectives: 89.15% agreement Test data: 15,048 conjunction tokens 9,296 distinct pairs of conjoined adjectives (type)

Data collection (cont.) Each conjunction token is classified according to three variables: conjunction used and, or, but, either-or, neither-nor type of modification attributive, predicative, appositive, resultative number of the modified noun singular, plural

Validation of the conjunction hypothesis Results Their conjunction hypothesis is validated overall and for almost all individual cases There are small differences in the behavior of conjunctions between linguistic environments (as represented by the three attributes) Conjoint antonyms appear far more frequently than expected by chance in conjunctions other than but

Prediction of link type Baseline 1: always guessing that a link is of the same orientation type => 77.84% accuracy Baseline 2: Baseline 1 + but exhibits the opposite pattern => 80.82% accuracy Morphological relationships: Adjectives related in form almost always have different semantic orientations Highly accurate (97.06%), but applies only to 1,336 labeled adjectives (891,780 possible pairs) E.g. adequate-inadequate, thoughtful-thoughtless Baseline 1 + Morphology => 78.86% accuracy Baseline 2 + Morphology => 81.75% accuracy

Prediction of link type (cont.) Log-linear regression model x: the vector of the observed counts in the various conjunction categories w: the vector of weights to be learned y: the response of the system Using the method of iterative stepwise refinement they selected 9 predictor variables from all 90 possible predictor variables. Small improvement: 80.97% accuracy (82.05% accuracy using Morphology) but now each prediction is rated between 0 and 1

Clustering Input: a graph of adjectives connected by dissimilarity links Small dissimilarity value => same-orientation link High dissimilarity value => different-orientation link Method used: apply an iterative optimization procedure on each connected component, based on the exchange method, a non-hierarchical clustering algorithm Idea: find the partition P such that the objective function Φ is minimized

Labeling the clusters as + or - In oppositions of gradable adjectives where one member is semantically unmarked, the unmarked member is the most frequent one about 81% of the time Unmarked => positive orientation almost always So, label as positive the group that has the highest average frequency of words.

Graph connectivity and performance They tested how graph connectivity affects the overall performance