CS460/IT632 Natural Language Processing/Language Technology for the Web Lecture 24 (14/04/06) Prof. Pushpak Bhattacharyya IIT Bombay Word Sense Disambiguation.

Slides:



Advertisements
Similar presentations
School of something FACULTY OF OTHER School of Computing FACULTY OF ENGINEERING Word Sense Disambiguation semantic tagging of text, for Confusion Set Disambiguation.
Advertisements

CS460/IT632 Natural Language Processing/Language Technology for the Web Lecture 2 (06/01/06) Prof. Pushpak Bhattacharyya IIT Bombay Part of Speech (PoS)
Word Sense Disambiguation MAS.S60 Catherine Havasi Rob Speer.
A method for unsupervised broad-coverage lexical error detection and correction 4th Workshop on Innovative Uses of NLP for Building Educational Applications.
Lexical Semantics and Word Senses Hongning Wang
11 Chapter 20 Computational Lexical Semantics. Supervised Word-Sense Disambiguation (WSD) Methods that learn a classifier from manually sense-tagged text.
Word sense disambiguation and information retrieval Chapter 17 Jurafsky, D. & Martin J. H. SPEECH and LANGUAGE PROCESSING Jarmo Ritola -
CS621: Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 29– AI and Probability (exemplified through NLP) 4 th Oct, 2010.
NLP and Speech Course Review. Morphological Analyzer Lexicon Part-of-Speech (POS) Tagging Grammar Rules Parser thethe – determiner Det NP → Det.
CS Word Sense Disambiguation. 2 Overview A problem for semantic attachment approaches: what happens when a given lexeme has multiple ‘meanings’?
CS 4705 Lecture 19 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised.
Collective Word Sense Disambiguation David Vickrey Ben Taskar Daphne Koller.
I256 Applied Natural Language Processing Fall 2009
Introduction to CL Session 1: 7/08/2011. What is computational linguistics? Processing natural language text by computers  for practical applications.
Part of speech (POS) tagging
CS 4705 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised –Dictionary-based.
تمرين شماره 1 درس NLP سيلابس درس NLP در دانشگاه هاي ديگر ___________________________ راحله مکي استاد درس: دکتر عبدالله زاده پاييز 85.
1/17 Acquiring Selectional Preferences from Untagged Text for Prepositional Phrase Attachment Disambiguation Hiram Calvo and Alexander Gelbukh Presented.
Lexical Semantics CSCI-GA.2590 – Lecture 7A
9/8/20151 Natural Language Processing Lecture Notes 1.
COMP423: Intelligent Agent Text Representation. Menu – Bag of words – Phrase – Semantics – Bag of concepts – Semantic distance between two words.
WORD SENSE DISAMBIGUATION Presented By Roshan R. Karwa Guided By Dr. M.B.Chandak A Technical Seminar on.
CS : Speech, Natural Language Processing and the Web/Topics in Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 12: Deeper.
Word Sense Disambiguation (WSD)
Word Sense Disambiguation Many words have multiple meanings –E.g, river bank, financial bank Problem: Assign proper sense to each ambiguous word in text.
Natural Language Processing Introduction. 2 Natural Language Processing We’re going to study what goes into getting computers to perform useful and interesting.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
Prof. Pushpak Bhattacharyya, IIT Bombay.1 Application of Noisy Channel, Channel Entropy CS 621 Artificial Intelligence Lecture /09/05.
NL Question-Answering using Naïve Bayes and LSA By Kaushik Krishnasamy.
Word Sense Disambiguation UIUC - 06/10/2004 Word Sense Disambiguation Another NLP working problem for learning with constraints… Lluís Màrquez TALP, LSI,
1 Statistical NLP: Lecture 9 Word Sense Disambiguation.
Paper Review by Utsav Sinha August, 2015 Part of assignment in CS 671: Natural Language Processing, IIT Kanpur.
WORD SENSE DISAMBIGUATION STUDY ON WORD NET ONTOLOGY Akilan Velmurugan Computer Networks – CS 790G.
W ORD S ENSE D ISAMBIGUATION By Mahmood Soltani Tehran University 2009/12/24 1.
CS : Language Technology for the Web/Natural Language Processing Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 12: WSD approaches (contd.)
Lexical Semantics Chapter 16
CS460/IT632 Natural Language Processing/Language Technology for the Web Lecture 3 (10/01/06) Prof. Pushpak Bhattacharyya IIT Bombay Statistical Formulation.
An Effective Word Sense Disambiguation Model Using Automatic Sense Tagging Based on Dictionary Information Yong-Gu Lee
CS 4705 Lecture 19 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised.
CS626: NLP, Speech and the Web Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 15, 17: Parsing Ambiguity, Probabilistic Parsing, sample seminar 17.
Word Sense Disambiguation Kyung-Hee Sung Foundations of Statistical NLP Chapter 7.
WordNet: Connecting words and concepts Peng.Huang.
Unsupervised Word Sense Disambiguation REU, Summer, 2009.
11 Chapter 19 Lexical Semantics. 2 Lexical Ambiguity Most words in natural languages have multiple possible meanings. –“pen” (noun) The dog is in the.
CS460/IT632 Natural Language Processing/Language Technology for the Web Guest Lecture (31/03/06) Prof. Niladri Chatterjee IIT Delhi Guest Lecture on Machine.
Lecture 21 Computational Lexical Semantics Topics Features in NLTK III Computational Lexical Semantics Semantic Web USCReadings: NLTK book Chapter 10 Text.
Intelligent Database Systems Lab Presenter : Kung, Chien-Hao Authors : Yoong Keok Lee and Hwee Tou Ng 2002,EMNLP An Empirical Evaluation of Knowledge Sources.
Disambiguation Read J & M Chapter 17.1 – The Problem Washington Loses Appeal on Steel Duties Sue caught the bass with the new rod. Sue played the.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Word sense disambiguation of WordNet glosses Presenter: Chun-Ping Wu Author: Dan Moldovan, Adrian Novischi.
CS : Speech, NLP and the Web/Topics in AI Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture-14: Probabilistic parsing; sequence labeling, PCFG.
CS460/IT632 Natural Language Processing/Language Technology for the Web Lecture 1 (03/01/06) Prof. Pushpak Bhattacharyya IIT Bombay Introduction to Natural.
CS460/IT632 Natural Language Processing/Language Technology for the Web Lecture 6 (14/02/06) Prof. Pushpak Bhattacharyya IIT Bombay Top-Down and Bottom-Up.
Zdroje jazykových dat Word senses Sense tagged corpora.
Introduction to Machine Learning Dmitriy Dligach.
CS460/IT632 Natural Language Processing/Language Technology for the Web Lecture 13 (17/02/06) Prof. Pushpak Bhattacharyya IIT Bombay Top-Down Bottom-Up.
CS : Speech, NLP and the Web/Topics in AI Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture-15: Probabilistic parsing; PCFG (contd.)
Overview of Statistical NLP IR Group Meeting March 7, 2006.
CS : Language Technology for the Web/Natural Language Processing Pushpak Bhattacharyya CSE Dept., IIT Bombay Parsing Algos.
Lexical Semantics and Word Senses Hongning Wang
Statistical Methods in NLP Course 10 Diana Trandab ă ț
Word Sense Disambiguation Algorithms in Hindi
CS : Speech, NLP and the Web/Topics in AI
Lecture 21 Computational Lexical Semantics
Using UMLS CUIs for WSD in the Biomedical Domain
CSC 594 Topics in AI – Applied Natural Language Processing
A method for WSD on Unrestricted Text
Linguistic Essentials
Classical Part of Speech (PoS) Tagging
Prof. Pushpak Bhattacharyya, IIT Bombay
Presentation transcript:

CS460/IT632 Natural Language Processing/Language Technology for the Web Lecture 24 (14/04/06) Prof. Pushpak Bhattacharyya IIT Bombay Word Sense Disambiguation

Word sense disambiguation (WSD) Cardinal problem in NLP also called as “Lexical Disambiguation” Other disambiguation are: –Structure disambiguation –Prepositional phrase attachment

Example I saw the mountain with a telescope. I saw the dog with two tails

WSD and POS tagging POS tagging is also disambiguation (local context needed). WSD is sense finding (long distance sense needed) Example –“The play could not be held due to rain, which flooded the open air theatre.”

WSD usage WSD is crucial for –Information extraction QA Summary generation –IR

Ambiguity Ambiguity arises from / refers to (written) –Homography The concept have no relation but are represented by same word/graph. –Polysemy The sense are related (many semantics).

Example Homography – “Bank” River side of Financial institute Depend (verb) Polysemy –“Fall” The tree chopped at he root, fall. The kingdom completely mismanaged, fall.

Approaches to WSD Knowledge based –Uses human-crafted knowledge base Data driven –Uses training data and machine learning.

Knowledge based WSD Resources needed –Argument-selection preference –Sense repository Wordnet Ontology

Example for Knowledge based –“The restaurant serves many delicious dishes.” –“Washing dishes after meals is a pain.” Hit the argument frame of the verb. Verb have arguments: Agent and Object. Example: –Serve:- Agent: dishObject: restaurant –Wash:- Agent: ?Object: utensil, car

Pitfalls Pitfall in Argument-selection preference: –Metaphor Example: –“I had to eat lots of cabbage in France” –“I had to eat my words” –“When she saw the demon eat the building, she ran in panic.” Here selection preference won’t work.

Lesk algorithm Another knowledge based approach. Uses Wordnet, heavily. Steps: –Create words from context –Watch words in Wordnet –Find intersection to obtain the sense Performance is not so good.

Data Driven approaches Have to use sense marked training data. Most common is “Semcor corpus”. Supervised category: –Naïve Bayes –Information Theoretic Un-supervised category: –Clustering

Naïve Bayes S ^ = argmax P(s|w) s Є S P(s|w) = P(w|s). P(s) P(w) Naïve Bayes uses following formula: