Guessing Hierarchies and Symbols for Word Meanings through Hyperonyms and Conceptual Vectors Mathieu Lafourcade LIRMM - France

Slides:



Advertisements
Similar presentations
S é mantique lexicale Vecteur conceptuels et TALN Mathieu Lafourcade LIRMM - France
Advertisements

Conceptual vectors for NLP Lexical functions
Conceptual vectors for NLP MMA 2001 Mathieu Lafourcade LIRMM - France
Automatically Populating Acception Lexical Database through Bilingual Dictionaries and Conceptual Vectors PAPILLON 2002 Mathieu Lafourcade LIRMM - France.
Synonymies and conceptual vectors NLPRS 2001 Mathieu Lafourcade, Violaine Prince LIRMM - France.
Computational Learning An intuitive approach. Human Learning Objects in world –Learning by exploration and who knows? Language –informal training, inputs.
Ciro Cattuto, Dominik Benz, Andreas Hotho, Gerd Stumme Presented by Smitashree Choudhury.
Applications Chapter 9, Cimiano Ontology Learning Textbook Presented by Aaron Stewart.
Query Operations: Automatic Local Analysis. Introduction Difficulty of formulating user queries –Insufficient knowledge of the collection –Insufficient.
Video retrieval using inference network A.Graves, M. Lalmas In Sig IR 02.
Constructing Natural Knowledge Ontologies to Implement Semantic Organisational Memory Dr. Laura Campoy-Gómez Information Systems Institute /IRIS University.
/ faculty of mathematics and computer science TU/e eindhoven university of technology 1 MOT Adaptive Course Authoring: My Online Teacher Alexandra Cristea.
TFIDF-space  An obvious way to combine TF-IDF: the coordinate of document in axis is given by  General form of consists of three parts: Local weight.
Antonymy and Conceptual Vectors Didier Schwab, Mathieu Lafourcade, Violaine Prince Laboratoire d’informatique, de robotique Et de microélectronique de.
Recall: Query Reformulation Approaches 1. Relevance feedback based vector model (Rocchio …) probabilistic model (Robertson & Sparck Jones, Croft…) 2. Cluster.
The Terms that You Have to Know! Basis, Linear independent, Orthogonal Column space, Row space, Rank Linear combination Linear transformation Inner product.
A Self Learning Universal Concept Spotter By Tomek Strzalkowski and Jin Wang Presented by Iman Sen.
Designing clustering methods for ontology building: The Mo’K workbench Authors: Gilles Bisson, Claire Nédellec and Dolores Cañamero Presenter: Ovidiu Fortu.
Latent Semantic Analysis (LSA). Introduction to LSA Learning Model Uses Singular Value Decomposition (SVD) to simulate human learning of word and passage.
Word Sense Disambiguation for Automatic Taxonomy Construction from Text-Based Web Corpora 12th International Conference on Web Information System Engineering.
COMP423: Intelligent Agent Text Representation. Menu – Bag of words – Phrase – Semantics – Bag of concepts – Semantic distance between two words.
1 Abstract Syntax Tree--motivation The parse tree –contains too much detail e.g. unnecessary terminals such as parentheses –depends heavily on the structure.
Geometric Conceptual Spaces Ben Adams GEOG 288MR Spring 2008.
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
Mining the Semantic Web: Requirements for Machine Learning Fabio Ciravegna, Sam Chapman Presented by Steve Hookway 10/20/05.
Name : Emad Zargoun Id number : EASTERN MEDITERRANEAN UNIVERSITY DEPARTMENT OF Computing and technology “ITEC547- text mining“ Prof.Dr. Nazife Dimiriler.
Artificial Neural Nets and AI Connectionism Sub symbolic reasoning.
M. Lafourcade (LIRMM & Ch. Boitet (GETA, CLIPS)LREC-02, Las Palmas, 31/5/ LREC-2002, Las Palmas, May 2002 Mathieur Lafourcade & Christian Boitet.
Word Sense Disambiguation in Queries Shaung Liu, Clement Yu, Weiyi Meng.
Prepared by: Mahmoud Rafeek Al-Farra College of Science & Technology Dep. Of Computer Science & IT BCs of Information Technology Data Mining
VIKEF – Take the VIKEF train towards smart services …
10/22/2015ACM WIDM'20051 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis Voutsakis.
Katrin Erk Vector space models of word meaning. Geometric interpretation of lists of feature/value pairs In cognitive science: representation of a concept.
Math 205 – Calculus III Andy Rosen [If you are trying to crash the course, that will be the first thing we talk about.]
Efficiently Computed Lexical Chains As an Intermediate Representation for Automatic Text Summarization H.G. Silber and K.F. McCoy University of Delaware.
SDMX DATA STRUCTURE DEFINITION SDMX Training BANK INDONESIA SEPTEMBER 2015 YOGYAKARTA, INDONESIA.
Using Surface Syntactic Parser & Deviation from Randomness Jean-Pierre Chevallet IPAL I2R Gilles Sérasset CLIPS IMAG.
Team Members Dilip Narayanan Gaurav Jalan Nithya Janarthanan.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
1Ellen L. Walker Category Recognition Associating information extracted from images with categories (classes) of objects Requires prior knowledge about.
Improving Translation Selection using Conceptual Vectors LIM Lian Tze Computer Aided Translation Unit School of Computer Sciences Universiti Sains Malaysia.
Conceptual structures in modern information retrieval Claudio Carpineto Fondazione Ugo Bordoni
1 Latent Concepts and the Number Orthogonal Factors in Latent Semantic Analysis Georges Dupret
Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR.
Flat clustering approaches
2/10/2016Semantic Similarity1 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis.
Multimedia Analytics Jianping Fan Department of Computer Science University of North Carolina at Charlotte.
Semantic Grounding of Tag Relatedness in Social Bookmarking Systems Ciro Cattuto, Dominik Benz, Andreas Hotho, Gerd Stumme ISWC 2008 Hyewon Lim January.
1.Learn appearance based models for concepts 2.Compute posterior probabilities or Semantic Multinomial (SMN) under appearance models. -But, suffers from.
Marko Grobelnik, Janez Brank, Blaž Fortuna, Igor Mozetič.
Houses of Mirrors: Deeply Adaptive Designs for Machine Cognition Deborah Duong, Michael Ross.
Learning Analogies and Semantic Relations Nov William Cohen.
Lens effects in autonomous terminology and conceptual vector learning Mathieu Lafourcade LIRMM - France
Contextual Text Cube Model and Aggregation Operator for Text OLAP
Chapter 4 Vector Spaces Linear Algebra. Ch04_2 Definition 1: ……………………………………………………………………. The elements in R n called …………. 4.1 The vector Space R n Addition.
Semantic search-based image annotation Petra Budíková, FI MU CEMI meeting, Plzeň,
Soft Computing Lecture 15 Constructive learning algorithms. Network of Hamming.
Constructing A Yami Language Lexicon Database from Yami Archiving Projects Meng-Chien Yang(Providence University, Taiwan) D. Victoria Rau(National Chung.
SERVICE ANNOTATION WITH LEXICON-BASED ALIGNMENT Service Ontology Construction Ontology of a given web service, service ontology, is constructed from service.
Vectors Def. A vector is a quantity that has both magnitude and direction. v is displacement vector from A to B A is the initial point, B is the terminal.
Conceptual vectors for NLP MMA 2001 Mathieu Lafourcade LIRMM - France
2-4 The Distributive Property
Antonymy and Conceptual Vectors
Advanced Database Models
Presented by: Prof. Ali Jaoua
Synonymies and conceptual vectors
Giannis Varelas Epimenidis Voutsakis Paraskevi Raftopoulou
Automatically Populating Acception Lexical Database through Bilingual Dictionaries and Conceptual Vectors PAPILLON 2002 Mathieu Lafourcade LIRMM -
Lens effects in autonomous terminology and conceptual vector learning
Presentation transcript:

Guessing Hierarchies and Symbols for Word Meanings through Hyperonyms and Conceptual Vectors Mathieu Lafourcade LIRMM - France

Overwiew & Objectives lexical semantic representations conceptual vector model (cvm) autonomous learning by the system from a given « semantic space » (ontology) Constructing texonomies Hierarchical - findind hyperonyms Multiple inheritance - views ambiguity as noise towards self contained WSD annotations « I made a deposit at the bank »  « I made a deposit at the bank »

Conceptual vectors vector space An idea Concept combination — a vector Idea space = vector space A concept = an idea = a vector V with augmentation: V + neighboorhood Meaning space = vector space + {v}* 

2D view of « meaning space » “ cat ” “ product ”

Conceptual vectors Thesaurus H : thesaurus hierarchy — K concepts Thesaurus Larousse = 873 concepts V(C i ) : a j = 1/ (2 ** D um (H, i, j)) 1/41 1/16 1/64 264

Conceptual vectors Concept c4:peace peace hierarchical relations conflict relations The world, manhood society

Conceptual vectors Term “peace” c4:peace

finance profit exchange

Angular distance D A (x, y) = angle (x, y) 0  D A (x, y)   if 0 then x & y colinear — same idea if  /2 then nothing in common if  then D A (x, -x) with -x — anti-idea of x  x’ y x

Angular distance D A (x, y) = acos(sim(x,y)) D A (x, y) = acos(x.y/|x||y|)) D A (x, x) = 0 D A (x, y) = D A (y, x) D A (x, y) + D A (y, z)  D A (x, z) D A (0, 0) = 0 and D A (x, 0) =  /2 by definition D A (  x,  y) = D A (x, y) with   0 D A (  x,  y) =  - D A (x, y) with  < 0 D A (x+x, x+y) = D A (x, x+y)  D A (x, y)

Thematic distance Examples D A (tit, tit) = 0 D A (tit, passerine) = 0.4 D A (tit, bird) = 0.7 D A (tit, train) = 1.14 D A (tit, insect) = 0.62 tit = insectivorous passerine bird …

Some vector operations Addition  : Z = X  Y z i = x i + y ivector Z is normalized Term to term mult  : Z = X  Y z i = (x i * y i ) 1/2 vector Z is not normalized Weak contextualization  : Z = X  (X  Y) =  (X,Y) “ Z is X augmented by its mutual information with Y ”

2D view of weak contextualization Y X XYXY XYXY Y  (X  Y) XYXY X  (X  Y)   

Autonomous learning 1/2 set of known words K, set of unknow words U revise a word w of K OR (try to) learn a word w of U From the web : for w ask for a def D specific sites : dicts, synonyms list, etc.  def analysis general sites : google, etc.  corpus analysis for each word wd of D if not in K then add wd to U AND add VO to V* otherwise get the vector of wd AND add V(wd) to V* compute the new vector of w from def(D) and V* words for senses (vectors) learned in 3 years French « ever » looping process

Autonomous learning 2/2 insectivorous passerine bird … ADJ, … N, GOV … … PH TXT VVV V V V V V  (X,Y) Weighted sum

Hyperonyms identifications Extraction Try all terms  too costly and unproductive Extract potential candidates  From definitions, cooccurence lists etc. Ex: Cand(emerald) = precious stone, stone, beryl, gem, … Evaluation of cand (m) to meaning (m) Contextualize :  (c,m) = c  (c  m) Retain c such as  (c,m) is the closest to m Loop: extracting hyper helps identifying meanings

Émeraude/pierre précieuseÉmeraude/béryl béryl Pierre précieuse Gemme/pierre précieuseGemme/bourgeonGemme/résine closest vector Émeraude/gemme … v v v vv v Émeraude/pierre précieuseÉmeraude/béryl béryl Pierre précieuse Gemme/pierre précieuseGemme/bourgeonGemme/résine Émeraude/gemme … v v v vv v

Émeraude/pierre précieuseÉmeraude/béryl béryl Pierre précieuse Gemme/pierre précieuseGemme/bourgeonGemme/résine Émeraude/béryl béryl Pierre précieuse Gemme/pierre précieuse Émeraude/vertÉmeraude/couleur Émeraude/vert Vert/couleur des signaux Couleur/matièreCouleur/sensation Vert/couleur … … …

Voiture/wagon wagon Moyen de transport véhicule/Moyen de transportvéhicule/vecteur automobile Voiture/automobile Cheval/moyen de transport Cheval/mammifère mammifère Cheval/viande Viande/nourriture aliment nourriture artefact Cheval/unité de puissance animal hypo

Last words Switching of representation From subsymbolic to symbolic … and vice-versa  readabily of symbols … of words global and local test functions for vector quality assessment decision taking about number of meanings … or views detectors when combined to lexical functions (antonymy, etc.) the basis for self adjustement toward a vector space of constant density wsd as a reduction of noise (in context or out of context) unification of ontologies self emergent structuration of terminology