W ORD S ENSE D ISAMBIGUATION By Mahmood Soltani Tehran University 2009/12/24 1.

Slides:



Advertisements
Similar presentations
Semantic Access to Data from the Web Raquel Trillo *, Laura Po +, Sergio Ilarri *, Sonia Bergamaschi + and E. Mena * 1st International Workshop on Interoperability.
Advertisements

The Impact of Task and Corpus on Event Extraction Systems Ralph Grishman New York University Malta, May 2010 NYU.
Automatic Identification of Cognates, False Friends, and Partial Cognates University of Ottawa, Canada University of Ottawa, Canada.
Part of Speech Tagging Importance Resolving ambiguities by assigning lower probabilities to words that don’t fit Applying to language grammatical rules.
Word sense disambiguation and information retrieval Chapter 17 Jurafsky, D. & Martin J. H. SPEECH and LANGUAGE PROCESSING Jarmo Ritola -
CALTS, UNIV. OF HYDERABAD. SAP, LANGUAGE TECHNOLOGY CALTS has been in NLP for over a decade. It has participated in the following major projects: 1. NLP-TTP,
Building an Ontology-based Multilingual Lexicon for Word Sense Disambiguation in Machine Translation Lian-Tze Lim & Tang Enya Kong Unit Terjemahan Melalui.
Semantic similarity, vector space models and word- sense disambiguation Corpora and Statistical Methods Lecture 6.
April 22, Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Doerre, Peter Gerstl, Roland Seiffert IBM Germany, August 1999 Presenter:
Semi-supervised learning and self-training LING 572 Fei Xia 02/14/06.
CS Word Sense Disambiguation. 2 Overview A problem for semantic attachment approaches: what happens when a given lexeme has multiple ‘meanings’?
CS 4705 Lecture 19 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised.
1 Empirical Learning Methods in Natural Language Processing Ido Dagan Bar Ilan University, Israel.
1 Noun Homograph Disambiguation Using Local Context in Large Text Corpora Marti A. Hearst Presented by: Heng Ji Mar. 29, 2004.
Creating a Bilingual Ontology: A Corpus-Based Approach for Aligning WordNet and HowNet Marine Carpuat Grace Ngai Pascale Fung Kenneth W.Church.
Designing clustering methods for ontology building: The Mo’K workbench Authors: Gilles Bisson, Claire Nédellec and Dolores Cañamero Presenter: Ovidiu Fortu.
Resources Primary resources – Lexicons, structured vocabularies – Grammars (in widest sense) – Corpora – Treebanks Secondary resources – Designed for a.
CS 4705 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised –Dictionary-based.
Comments on Guillaume Pitel: “Using bilingual LSA for FrameNet annotation of French text from generic resources” Gerd Fliedner Computational Linguistics.
WSD using Optimized Combination of Knowledge Sources Authors: Yorick Wilks and Mark Stevenson Presenter: Marian Olteanu.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Ontology Learning and Population from Text: Algorithms, Evaluation and Applications Chapters Presented by Sole.
Lexical Semantics CSCI-GA.2590 – Lecture 7A
Machine translation Context-based approach Lucia Otoyo.
Evaluating the Contribution of EuroWordNet and Word Sense Disambiguation to Cross-Language Information Retrieval Paul Clough 1 and Mark Stevenson 2 Department.
Ontology Learning from Text: A Survey of Methods Source: LDV Forum,Volume 20, Number 2, 2005 Authors: Chris Biemann Reporter:Yong-Xiang Chen.
Lemmatization Tagging LELA /20 Lemmatization Basic form of annotation involving identification of underlying lemmas (lexemes) of the words in.
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Processing of large document collections Part 2 (Text categorization) Helena Ahonen-Myka Spring 2006.
Word Sense Disambiguation (WSD)
2007. Software Engineering Laboratory, School of Computer Science S E Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying.
Natural Language Processing word sense disambiguation Updated 1/12/2005.
Word Sense Disambiguation Many words have multiple meanings –E.g, river bank, financial bank Problem: Assign proper sense to each ambiguous word in text.
Natural Language Processing Introduction. 2 Natural Language Processing We’re going to study what goes into getting computers to perform useful and interesting.
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
Word Sense Disambiguation UIUC - 06/10/2004 Word Sense Disambiguation Another NLP working problem for learning with constraints… Lluís Màrquez TALP, LSI,
1 Statistical NLP: Lecture 9 Word Sense Disambiguation.
Information Retrieval and Web Search Cross Language Information Retrieval Instructor: Rada Mihalcea Class web page:
Paper Review by Utsav Sinha August, 2015 Part of assignment in CS 671: Natural Language Processing, IIT Kanpur.
Improving Subcategorization Acquisition using Word Sense Disambiguation Anna Korhonen and Judith Preiss University of Cambridge, Computer Laboratory 15.
SYMPOSIUM ON SEMANTICS IN SYSTEMS FOR TEXT PROCESSING September 22-24, Venice, Italy Combining Knowledge-based Methods and Supervised Learning for.
An Effective Word Sense Disambiguation Model Using Automatic Sense Tagging Based on Dictionary Information Yong-Gu Lee
CS 4705 Lecture 19 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised.
Modelling Human Thematic Fit Judgments IGK Colloquium 3/2/2005 Ulrike Padó.
Approaches to Machine Translation CSC 5930 Machine Translation Fall 2012 Dr. Tom Way.
HyperLex: lexical cartography for information retrieval Jean Veronis Presented by: Siddhanth Jain( ) Samiulla Shaikh( )
1 CSI 5180: Topics in AI: Natural Language Processing, A Statistical Approach Instructor: Nathalie Japkowicz Objectives of.
Bootstrapping for Text Learning Tasks Ramya Nagarajan AIML Seminar March 6, 2001.
Lecture 21 Computational Lexical Semantics Topics Features in NLTK III Computational Lexical Semantics Semantic Web USCReadings: NLTK book Chapter 10 Text.
CS460/IT632 Natural Language Processing/Language Technology for the Web Lecture 24 (14/04/06) Prof. Pushpak Bhattacharyya IIT Bombay Word Sense Disambiguation.
1 Masters Thesis Presentation By Debotosh Dey AUTOMATIC CONSTRUCTION OF HASHTAGS HIERARCHIES UNIVERSITAT ROVIRA I VIRGILI Tarragona, June 2015 Supervised.
Number Sense Disambiguation Stuart Moore Supervised by: Anna Korhonen (Computer Lab)‏ Sabine Buchholz (Toshiba CRL)‏
Multi-level Bootstrapping for Extracting Parallel Sentence from a Quasi-Comparable Corpus Pascale Fung and Percy Cheung Human Language Technology Center,
1 Gloss-based Semantic Similarity Metrics for Predominant Sense Acquisition Ryu Iida Nara Institute of Science and Technology Diana McCarthy and Rob Koeling.
1 Fine-grained and Coarse-grained Word Sense Disambiguation Jinying Chen, Hoa Trang Dang, Martha Palmer August 22, 2003.
Word classes and part of speech tagging. Slide 1 Outline Why part of speech tagging? Word classes Tag sets and problem definition Automatic approaches.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
Finding Predominant Word Senses in Untagged Text Diana McCarthy & Rob Koeling & Julie Weeds & Carroll Department of Indormatics, University of Sussex {dianam,
A Simple English-to-Punjabi Translation System By : Shailendra Singh.
Sentiment Analysis Using Common- Sense and Context Information Basant Agarwal 1,2, Namita Mittal 2, Pooja Bansal 2, and Sonal Garg 2 1 Department of Computer.
Approaches to Machine Translation
Coarse-grained Word Sense Disambiguation
SENSEVAL: Evaluating WSD Systems
Sentiment analysis algorithms and applications: A survey
Using UMLS CUIs for WSD in the Biomedical Domain
Statistical NLP: Lecture 9
A method for WSD on Unrestricted Text
Approaches to Machine Translation
Statistical NLP : Lecture 9 Word Sense Disambiguation
Presentation transcript:

W ORD S ENSE D ISAMBIGUATION By Mahmood Soltani Tehran University 2009/12/24 1

Introduction Knowledge source Lexical Learned World WSD Approaches Knowledge-base Curpos-base Unsupervised Approaches Supervised Approaches Some approach in detail A GENDA 2 WSD 12/24/09 Tehran University

Defining word sense : The phenomenon of lexical ambiguity is traditionally subdivided into polysemy and homonymy Polysemy: one work having several related meaning. Homonymy: two words have the same lexical form but different etymoloies and unrelated meaning. In dictionaries, each word is listed with a number of discrete senses and subsenses. possibly dierent from dictionary to dictionary The first step involved in the task of WSD is the determination of dierent senses for all words in the text to be disambiguated, the sense inventory 3 I NTRODUCTION WSD 12/24/09 Tehran University

I NTRODUCTION ( CONT ) What is word sense disambiguation? WSD refers to a task that automatically assigns a sense, selected from a set of pre-defined word senses to an instance of a polysemous word in a particular context. identify the correct sense of an ambiguous word in a sentence. determine which of the senses of an ambiguous word is invoked in a particular use of the word. … 4 WSD 12/24/09 Tehran University

I NTRODUCTION ( CONT ) word sense disambiguation step: Step 1: pre-defines sense A list of sense such as those found in everyday dictionaries A group of feature, catagories, or associated words An entry in transfer dictionary which includes translations in another language Step 2 : assignment of words to senses. The context of the word to be disambiguation External knowledge 5 WSD 12/24/09 Tehran University

I NTRODUCTION ( CONT ) Where is the use of WSD? Sense disambiguation is an“intermediate task”,which is not an end in itself, but rather is necessary at one level or another to accomplish most natural language processing tasks. Machine translation Information retrieval Speech processing Text processing (spelling detection) 6 WSD 12/24/09 Tehran University

I NTRODUCTION ( CONT ) Conceptual Model of WSD WSD is the matching of sense knowledge and word context. Sense knowledge can either be lexical knowledge defined in dictionaries, or world knowledge learned from training corpora. 7 WSD 12/24/09 Tehran University

8 Lexical Knowledge Lexical knowledge is usually released with a dictionary. It can be either symbolic, or empirical. It is the foundation of unsupervised WSD approaches. Learned World Knowledge World knowledge is too complex or trivial to be verbalized completely. So it is a smart strategy to automatically acquire world knowledge from the context of training corpora on demand by machine learning techniques Trend Use the interaction of multiple knowledge sources to approach WSD. K NOWLEDGE SOURCES WSD 12/24/09 Tehran University

L EXICAL K NOWLEDGE Sense frequency Usage frequency of each sense of a word. Sense gloss Sense definition and examples By counting common words between the gloss and the context of the target word, we can nai Concept Tree Represent the related concepts of the target in the form of semantic network as is done by WordNet. 9 WSD 12/24/09 Tehran University

L EXICAL K NOWLEDGE ( CONT ) Selectional Restrictions Syntactic and semantic restric Subject Code Refers to the category to which one sense of the target word belongs. Part of Speech (POS) POS is associated with a subset of the word senses in both WordNet and LDOCE. That is, given the POS of the target, we may fully or partially disambiguate its sense (Stevenson & Wilks, 2001). 10 WSD 12/24/09 Tehran University

L EARND WORLD K NOWLEDGE Domain-specific Knowledge Like selectional restrictions, it is the semantic restriction placed on the use of each sense of the target word. The restriction is more specific. Parallel Corpora Also called bilingual corpora, one serving as primary language, and the other as a secondary language. Using some third-party software packages, we can align the major words (verb and noun) between two languages. Because the translation process implies that aligned pair words share the same sense or concept, we can use this information to sense the major words in the primary language (Bhattacharya et al. 2004). 11 WSD 12/24/09 Tehran University

E ARLY WSD WORK IN MT The first attempts at automated sense disambiguation were made in the context of machine translation Weaver (1949) : outlines the basis of an approach to WSD. Reifler’s (1955) “semantic coincidences” between a word and its context,the role of syntactic relations, was also recognized Weaver’s (1949) Memorandum discusses the role of the domain in sense disambiguation Oswald,1952, 1957; Oettinger, 1955; etc : micro-glossaries contain only the meaning of a given word relevant for texts in a particular domain Several researchers attempted to devise an “interlingua” based on logical and mathematical principles …. without large-scale resources most of these ideas remained untested and to large extent, forgotten until several decades later. 12 WSD 12/24/09 Tehran University

WSD A PPROACHES there are three ways to approach the problem of WSD: a knowledge-based approach, which uses an explicit lexicon (machine readable dictionary (MRD), thesaurus) or ontology (e.g. WordNet). Corpus-based disambiguation, where the relevant information about word senses is gathered from training on a large corpus. Hybrid approach combining aspects of both of the forementioned methodologies 13 WSD 12/24/09 Tehran University

K NOWLEDGE - BASED APPROACH WSD systems building on the information contained in MRDs use the available material in various ways: Lesk (1986) was the first to use dictionary denitions to WSD: counts overlapping content words in the sense denitions & in the denitions of context words occurring nearby Yarowsky (1992), the sense of a word is dened as its category in Roget's International Thesaurus Yarowsky (1992), thesense of a word is dened as its category in Roget's International Thesaurus WordNet includes various potential sources of information Leacock et al. (1998) employ WordNet to counter data sparseness. 14 WSD 12/24/09 Tehran University

C ORPUS - BASED APPROACH A corpus-based approach extracts information on word senses from a large annotated data collection. Distributional information about an ambiguous word refers to the frequency distribution of its senses collocational or co-occurrence information part-of-speech … 15 WSD 12/24/09 Tehran University

C ORPUS - BASED APPROACH ( CONT ) There are two possible approaches to corpus-based WSD systems: Supervised approaches use annotated training data and basically amount to a classication task Unsupervised algorithms are applied to raw text material and annotated data is only needed for evaluation They correspond to a clustering task rather than a classication. Bootstrapping, looks like supervised approaches, but it needs only a few seeds instead of a large number of training examples 16 WSD 12/24/09 Tehran University

C ORPUS - BASED APPROACH ( CONT ) Supervised exemplar-based : the k-nearest neighbor technique has been employed most Rule-base :use algorithms, e.g. decision lists, which search for discriminatory features in the training data Probabilistic-base : use of dierent probabilistic classiers. Despite its relative simplicity, naive Bayes has been frequently applied 17 WSD 12/24/09 Tehran University

C ORPUS - BASED APPROACH ( CONT ) Unsupervised cluster the contexts of an ambiguous word into a number of groups and discriminate between them without labeling them. A clear disadvantage is that, so far, the performance of unsupervised systems lies a lot lower than that of supervised systems 18 WSD 12/24/09 Tehran University

N AIVE BAYES CLASSIFIER 19  Test Training  WSD 12/24/09 Tehran University

K-NN 20  Test Training  WSD 12/24/09 Tehran University

Any Question? 21 WSD 12/24/09 Tehran University