Structured lexicons and Lexical semantics Especially WordNet ® See D Jurafsky & JH Martin: Speech and Language Processing, Upper Saddle River NJ (2000):

Slides:



Advertisements
Similar presentations
Building Wordnets Piek Vossen, Irion Technologies.
Advertisements

Using Link Grammar and WordNet on Fact Extraction for the Travel Domain.
The WordNet Lexical Database Bernardo Magnini ITC-irst, Istituto per la Ricerca Scientifica e Tecnologica Trento - Italy.
The Meaning of Language
SEMANTICS.
Statistical NLP: Lecture 3
Lexical Semantics and Word Senses Hongning Wang
Introduction to Computational Linguisitics The Lexicon.
Ewa Rudnicka, Wojciech Witkowski, Maciej Piasecki G4.19 Research Group Institute of Informatics, Wrocław University of Technology nlp.pwr.wroc.pl plwordnet.pwr.wroc.pl.
Complete and Consistent Annotation of WordNet with the Top Concept Ontology Javier Álvez, Jordi Atserias, Jordi Carrera, Salvador Climent, Egoitza Laparra,
Building an Ontology-based Multilingual Lexicon for Word Sense Disambiguation in Machine Translation Lian-Tze Lim & Tang Enya Kong Unit Terjemahan Melalui.
1/27 Semantics Going beyond syntax. 2/27 Semantics Relationship between surface form and meaning What is meaning? Lexical semantics Syntax and semantics.
A STUDY ON THE KNOWLEDGE SOURCES OF TURKISH EFL LEARNERS IN LEXICAL INFERENCING İlknur İSTİFÇİ Anadolu University Eskişehir, TURKEY Eskişehir, TURKEY.
Using resources WordNet and the BNC. WordNet: History 1985: a group of psychologists and linguists start to develop a “lexical database” –Princeton University.
C SC 620 Advanced Topics in Natural Language Processing Lecture Notes 2 1/20/04.
Article by: Feiyu Xu, Daniela Kurz, Jakub Piskorski, Sven Schmeier Article Summary by Mark Vickers.
From Semantic Similarity to Semantic Relations Georgeta Bordea, November 25 Based on a talk by Alessandro Lenci titled “Will DS ever become Semantic?”,
1 Indo WordNet A WordNet for Hindi Centre for Technology Development for Indian Languages Computer Science and Engineering Department, IIT Bombay.
Course G Web Search Engines 3/9/2011 Wei Xu
INTRODUCTION TO ARTIFICIAL INTELLIGENCE
WORDNET Approach on word sense techniques - AKILAN VELMURUGAN.
Adam Pease and Christiane Fellbaum Presenter: 吳怡安
1 Natural Language Processing (2a) Zhao Hai 赵海 Department of Computer Science and Engineering Shanghai Jiao Tong University
COMP423.  Query expansion  Two approaches ◦ Relevance feedback ◦ Thesaurus-based  Most Slides copied from ◦
“How much context do you need?” An experiment about context size in Interactive Cross-language Question Answering B. Navarro, L. Moreno-Monteagudo, E.
WordNet ® and its Java API ♦ Introduction to WordNet ♦ WordNet API for Java Name: Hao Li Uni: hl2489.
Oana Adriana Şoica Building and Ordering a SenDiS Lexicon Network.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
Meaning. Semantics (the study of meaning) Semantics: the study of meaning, or to be more specific, the study of the meaning of linguistic units, words.
1 Query Operations Relevance Feedback & Query Expansion.
WORD SENSE DISAMBIGUATION STUDY ON WORD NET ONTOLOGY Akilan Velmurugan Computer Networks – CS 790G.
WORDNET. THE WORDNET SYSTEM  Lexicographer files  Code: Lexico files  database  Search Routines and Interfaces.
Lexical Semantics Chapter 16
10/22/2015ACM WIDM'20051 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis Voutsakis.
WordNet: Connecting words and concepts Christiane Fellbaum Cognitive Science Laboratory Princeton University.
WordNet: Connecting words and concepts Peng.Huang.
Terminology and documentation*  Object of the study of terminology:  analysis and description of the units representing specialized knowledge in specialized.
LEXICAL RELATIONS Presented by ‘the big family’ group 3 Rauwan Harahap (Opung) Riza Nirmala Putri Salmah Silih Warni Siti Anifah Siti Juariyah.
23- November-091 WordNet and Extended WordNet Sriram Rajaraman.
Wordnet - A lexical database for the English Language.
Semantic distance & WordNet Serge B. Potemkin Moscow State University Philological faculty.
Ontology Engineering: from Cognitive Science to the Semantic Web Maria Teresa Pazienza University of Roma Tor Vergata, Italy 1.
Word Relations Slides adapted from Dan Jurafsky, Jim Martin and Chris Manning.
Word Meaning and Similarity
Lecture 19 Word Meanings II Topics Description Logic III Overview of MeaningReadings: Text Chapter 189NLTK book Chapter 10 March 27, 2013 CSCE 771 Natural.
Using Wikipedia for Hierarchical Finer Categorization of Named Entities Aasish Pappu Language Technologies Institute Carnegie Mellon University PACLIC.
2/10/2016Semantic Similarity1 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis.
Information Retrieval Search Engine Technology (7) Prof. Dragomir R. Radev.
Lexical Semantics and Word Senses Hongning Wang
Detecting and Exploiting Figurative Language in WordNet Wim Peters Department of Computer Science University of Sheffield.
SEMANTICS Chapter 10 Ms. Abrar Mujaddidi. What is semantics?  Semantics is the study of the conventional meaning conveyed by the use of words, phrases.
Query expansion COMP423. Menu Query expansion Two approaches Relevance feedback Thesaurus-based Most Slides copied from
Introduction to Computational Linguisitics The Lexicon.
Lexicons, Concept Networks, and Ontologies
Statistical NLP: Lecture 3
Generating sets of synonyms between languages
LEXICAL RELATIONS IN DISCOURSE
Word Relations Slides adapted from Dan Jurafsky, Jim Martin and Chris Manning.
Information Retrieval (7)
Comparing Two Thesaurus Representations for Russian
What is Linguistics? The scientific study of human language
CSC 594 Topics in AI – Applied Natural Language Processing
WordNet: A Lexical Database for English
Bulgarian WordNet Svetla Koeva Institute for Bulgarian Language
WordNet WordNet, WSD.
Word Relations Slides adapted from Dan Jurafsky, Jim Martin and Chris Manning.
Linguistic Essentials
Lecture 19 Word Meanings II
Giannis Varelas Epimenidis Voutsakis Paraskevi Raftopoulou
Semantics Going beyond syntax.
Presentation transcript:

Structured lexicons and Lexical semantics Especially WordNet ® See D Jurafsky & JH Martin: Speech and Language Processing, Upper Saddle River NJ (2000): Prentice Hall, Chapter 16. and and explore WordNet:

2/27 Structured lexicons Alternative to alphabetical dictionary List of words grouped according to meaning Classic example Roget’s Thesaurus Hierarchical organization is important Hierarchies familiar as taxonomies, eg in natural sciences –Daughters are “types of” and share certain properties, inherited from the mother Similar idea for ordinary words: hyponymy and synonymy

3/27 animal bird fish... canary eagle trout shark bald e. golden e. hawk e. bateleur space in general dimensions form motion size expansion distance interval contiguity reduction, deflation, shrinkage, curtailment, condensation.... hyponymy synonymy

4/27 Thesaurus A way to show the structure of (lexical) knowledge Much used for technical terminology Can be enriched by having other lexical relations: –Antonyms (as well as synonyms) –Different hyponymy relations, not just is-a-type-of, but has-as-part/member Thesaurus can be explored in any direction –across, up, down –Some obvious distance metrics can be used to measure similarity between words

5/27 WordNet: History 1985: a group of psychologists and linguists start to develop a “lexical database” –Princeton University –theoretical basis: results from psycholinguistics and psycholexicology –What are properties of the “mental lexicon”?

6/27 Global organisation division of the lexicon into five categories: –Nouns –Verbs –Adjectives –Adverbs –function words (“probably stored separately as part of the syntactic component of language” [Miller et al.]

7/27 Global organization nouns: organized as topical hierarchies verbs: entailment relations adjectives: multi-dimensional hyperspaces adverbs: multi-dimensional hyperspaces

8/27 Lexical semantics How are word meanings represented in WordNet? –synsets (synonym sets) as basic units –a word ‘meaning’ is represented by simply listing the word forms that can be used to express it example: senses of board –a piece of lumber vs. a group of people assembled for some purpose –synsets as unambiguous designators: –{board, plank,...} vs. {board, committee,...} Members of synsets are rarely true synonyms –WordNet does not attempt to capture subtle distinctions among members of the synset –may be due to specific details, or simply connotation, collocation

9/27 Synsets synsets often sufficient for differential purposes –if an appropriate synonym is not available a short gloss may be used –e.g. {board, (a person’s meals, provided regularly for money)} –Preferable for cardinality of synset to be >1 –WordNet also gives a gloss for each word meaning, and (often) an example

10/27

11/27 WordNet is big

12/27 Lexical relations in WordNet WordNet is organized by semantic relations. –It is characteristic of semantic relations that they are reciprocated –if there is a semantic relation R between meaning {x1, x2,...} and meaning {y1, y2,...}, then there is a relation R between {y1,y2,...} and {x1, x2,...} –Individual relations may or may not be Symmetric R(A,B)  R(B,A) (eg synonymy, not hyponymy) Transitive R(A,B) & R(B,C)  R(A,C) (eg synonymy may be) Reflexive R(A,A) is true (synonymy is, antonymy isn’t)

13/27 Lexical relations Nouns –Synonym ~ antonym (opposite of) –Hypernyms (is a kind of) ~ hyponym (for example) –Coordinate (sister) terms: share the same hypernym –Holonym (is part of) ~ meronym (has as part) Verbs –Synonym ~ antonym –Hypernym ~ troponym (eg lisp – talk) –Entailment (eg snore – sleep) –Coordinate (sister) terms: share the same hypernym Adjectives/Adverbs in addition to above –Related nouns –Verb participles –Derivational information

14/27 Lexical relations: synonymy similarity of meaning –Leibniz: two expressions are synonymous if the substitution of one for the other never changes the truth value of a sentence in which the substitution is made such global synonymy is rare (it would be redundant) –synonymy relative to a context: two expressions are synonymous in a linguistic context C if the substitution of one for the other in C does not alter the truth value –consequence of this synonymy in terms of substitutability: words in different syntactic categories cannot be synonyms

15/27 Lexical relations: antonymy antonym of a word x is sometimes not-x, but not always –rich and poor are antonyms –but: not rich does not imply poor –(because many people consider themselves neither rich nor poor) antonymy is a lexical relation between word forms, not a semantic relation between word meanings –meanings {rise, ascend} and {fall, descend} are conceptual opposites, but they are not antonyms [rise/fall] and [ascend/descend] are pairs of antonyms

16/27 Lexical relations: hyponymy hyponymy is a semantic relation between word meanings –{maple} is a hyponym of {tree} inverse: hypernymy –{tree} is a hypernym of {maple} also called: subordination/superordination; subset/superset; ISA relation test for hyponomy: –native speaker must accept sentences built from the frame “An x is a (kind of) y” called troponomy when applied to verbs

17/27 Lexical relations: meronymy A concept represented by the synset {x1, x2,...} is a meronym of a concept represented by the synset {y1, y2,...} if native speakers of English accept sentences constructed from such frames as “A y has an x (as a part)”, “An x is a part of y”. inverse relation: holonymy HAS-AS-PART –part hierarchy –part-of is asymmetric and (with caution) transitive

18/27 Lexical relations: meronymy failures of transitivity caused by different part- whole relations, e.g. –A musician has an arm. –An orchestra has a musician. –but: ? An orchestra has an arm. Types of meronymy in WordNet: –component [most frequently found] –member –composition –phase process

19/27

20/27 WordNet’s noun hierarchy noun hierarchy partitioned into separate hierarchies with unique top hypernyms vague abstractions would be semantically empty, e.g. {entity} with immediate hyponyms {object, thing} and {idea}

21/27 {act,action,activity} {animal,fauna} {artifact} {attribute,property} {body,corpus} {cognition,knowledge} {communication} {event,happening} {feeling,emotion} {food} {group,collection} {location,place} {motive} {natural object} {natural phenomenon} {person,human being} {plant,flora} {possession} {process} {quantity,amount} {relation} {shape} {state,condition} {substance} {time}

22/27 Nouns in WordNet noun hierarchy as lexical inheritance system –seldom goes more than ten levels deep, –the deepest examples usually contain technical levels that are not part of everyday vocabulary –shallowest levels are too vague –“Inherited hypernym” option shows full hierarchy

23/27 deep shallow

24/27 Nouns in WordNet man-made artefacts: sometimes six or seven levels deep –roadster → car → motor vehicle → wheeled vehicle → vehicle → conveyance → artefact hierarchy of persons: about three or four levels –televangelist → evangelist → preacher → clergyman → spiritual leader → person Like all thesaurus structures, words can have multiple hypernyms

25/27 WordNets for other languages Idea has been widely copied Sometimes by “translating” Princeton WordNet –Lexical relations in general are universal... –But are they in practice? –Are synsets universal? EuroWordNet: combining multilingual WordNets to include cross-language equivalence –Inherent difficulties, as above

26/27 What can WordNet be used for? As a lexical resource, an online dictionary, for human use Word-sense disambiguation (including homophone correction) – neighbouring words will be more closely related to correct sense (desert/dessert ~ camel) Document classification –What is this text about? Look for recurring hypernyms

27/27 What can WordNet be used for? Document retrieval –eg looking for texts about sports cars, search for synonyms and hyponyms of sports car Open-domain Q/A –Searching texts (eg WWW) to answer questions expressed in natural language –eg [example]example Textual entailment –Answering questions implied by text

28/27