Lens effects in autonomous terminology and conceptual vector learning Mathieu Lafourcade LIRMM - France

Slides:



Advertisements
Similar presentations
S é mantique lexicale Vecteur conceptuels et TALN Mathieu Lafourcade LIRMM - France
Advertisements

Conceptual vectors for NLP Lexical functions
Conceptual vectors for NLP MMA 2001 Mathieu Lafourcade LIRMM - France
Automatically Populating Acception Lexical Database through Bilingual Dictionaries and Conceptual Vectors PAPILLON 2002 Mathieu Lafourcade LIRMM - France.
Synonymies and conceptual vectors NLPRS 2001 Mathieu Lafourcade, Violaine Prince LIRMM - France.
Ciro Cattuto, Dominik Benz, Andreas Hotho, Gerd Stumme Presented by Smitashree Choudhury.
FP7 meeting - Gent - Carlos Rodríguez - April 18 WP4: Conceptual Mining from Text for Knowledge Engineering State of the Art WP Coordinators: Alfonso Valencia.
A Robust Approach to Aligning Heterogeneous Lexical Resources Mohammad Taher Pilehvar Roberto Navigli MultiJEDI ERC
The Google Similarity Distance  We’ve been talking about Natural Language parsing  Understanding the meaning in a sentence requires knowing relationships.
Jean-Eudes Ranvier 17/05/2015Planet Data - Madrid Trustworthiness assessment (on web pages) Task 3.3.
Applications Chapter 9, Cimiano Ontology Learning Textbook Presented by Aaron Stewart.
Query Operations: Automatic Local Analysis. Introduction Difficulty of formulating user queries –Insufficient knowledge of the collection –Insufficient.
Introduction to Traversing using distances and directions of lines between Traversing is the method of using distances and directions of lines between.
Guessing Hierarchies and Symbols for Word Meanings through Hyperonyms and Conceptual Vectors Mathieu Lafourcade LIRMM - France
Chapter 5: Query Operations Baeza-Yates, 1999 Modern Information Retrieval.
Semantic text features from small world graphs Jure Leskovec, IJS + CMU John Shawe-Taylor, Southampton.
Gimme’ The Context: Context- driven Automatic Semantic Annotation with CPANKOW Philipp Cimiano et al.
Two-Level Semantic Annotation Model BYU Spring Conference 2007 Yihong Ding Sponsored by NSF.
June 19-21, 2006WMS'06, Chania, Crete1 Design and Evaluation of Semantic Similarity Measures for Concepts Stemming from the Same or Different Ontologies.
Antonymy and Conceptual Vectors Didier Schwab, Mathieu Lafourcade, Violaine Prince Laboratoire d’informatique, de robotique Et de microélectronique de.
Recall: Query Reformulation Approaches 1. Relevance feedback based vector model (Rocchio …) probabilistic model (Robertson & Sparck Jones, Croft…) 2. Cluster.
A New Web Semantic Annotator Enabling A Machine Understandable Web BYU Spring Research Conference 2005 Yihong Ding Sponsored by NSF.
Dimension of Meaning Author: Hinrich Schutze Presenter: Marian Olteanu.
Deviation = The sum of the variables on each side of the mean will add up to 0 X
Automatically obtain a description for a larger cluster of relevant documents Identify terms related to query terms  Synonyms, stemming variations, terms.
Clustering Unsupervised learning Generating “classes”
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Claudia Marzi Institute for Computational Linguistics (ILC) National Research Council (CNR) - Italy.
Exploiting Wikipedia as External Knowledge for Document Clustering Sakyasingha Dasgupta, Pradeep Ghosh Data Mining and Exploration-Presentation School.
Geometric Conceptual Spaces Ben Adams GEOG 288MR Spring 2008.
Modeling Documents by Combining Semantic Concepts with Unsupervised Statistical Learning Author: Chaitanya Chemudugunta America Holloway Padhraic Smyth.
Name : Emad Zargoun Id number : EASTERN MEDITERRANEAN UNIVERSITY DEPARTMENT OF Computing and technology “ITEC547- text mining“ Prof.Dr. Nazife Dimiriler.
M. Lafourcade (LIRMM & Ch. Boitet (GETA, CLIPS)LREC-02, Las Palmas, 31/5/ LREC-2002, Las Palmas, May 2002 Mathieur Lafourcade & Christian Boitet.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Annotating Words using WordNet Semantic Glosses Julian Szymański Department of Computer Systems Architecture, Faculty of Electronics, Telecommunications.
1 Statistical NLP: Lecture 9 Word Sense Disambiguation.
Latent Semantic Analysis Hongning Wang Recap: vector space model Represent both doc and query by concept vectors – Each concept defines one dimension.
10/22/2015ACM WIDM'20051 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis Voutsakis.
Katrin Erk Vector space models of word meaning. Geometric interpretation of lists of feature/value pairs In cognitive science: representation of a concept.
Course 9 Texture. Definition: Texture is repeating patterns of local variations in image intensity, which is too fine to be distinguished. Texture evokes.
Using Surface Syntactic Parser & Deviation from Randomness Jean-Pierre Chevallet IPAL I2R Gilles Sérasset CLIPS IMAG.
LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.
Wikipedia as Sense Inventory to Improve Diversity in Web Search Results Celina SantamariaJulio GonzaloJavier Artiles nlp.uned.es UNED,c/Juan del Rosal,
Team Members Dilip Narayanan Gaurav Jalan Nithya Janarthanan.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Learning to Share Meaning in a Multi-Agent System (Part I) Ganesh Padmanabhan.
Improving Translation Selection using Conceptual Vectors LIM Lian Tze Computer Aided Translation Unit School of Computer Sciences Universiti Sains Malaysia.
Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR.
Whole Numbers Section 3.4 Properties of Whole-Number Operations
A Novel Visualization Model for Web Search Results Nguyen T, and Zhang J IEEE Transactions on Visualization and Computer Graphics PAWS Meeting Presented.
Flat clustering approaches
Discretization Methods Chapter 2. Training Manual May 15, 2001 Inventory # Discretization Methods Topics Equations and The Goal Brief overview.
From Words to Senses: A Case Study of Subjectivity Recognition Author: Fangzhong Su & Katja Markert (University of Leeds, UK) Source: COLING 2008 Reporter:
2/10/2016Semantic Similarity1 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis.
Semantic Grounding of Tag Relatedness in Social Bookmarking Systems Ciro Cattuto, Dominik Benz, Andreas Hotho, Gerd Stumme ISWC 2008 Hyewon Lim January.
1.Learn appearance based models for concepts 2.Compute posterior probabilities or Semantic Multinomial (SMN) under appearance models. -But, suffers from.
Houses of Mirrors: Deeply Adaptive Designs for Machine Cognition Deborah Duong, Michael Ross.
Constructing A Yami Language Lexicon Database from Yami Archiving Projects Meng-Chien Yang(Providence University, Taiwan) D. Victoria Rau(National Chung.
Defect-Defect Interaction in Carbon Nanotubes under Mechanical Loading Topological defects can be formed in carbon nanotubes (CNTs) during processing or.
SERVICE ANNOTATION WITH LEXICON-BASED ALIGNMENT Service Ontology Construction Ontology of a given web service, service ontology, is constructed from service.
Conceptual vectors for NLP MMA 2001 Mathieu Lafourcade LIRMM - France
The Needs for Coding and Classification Systems
Antonymy and Conceptual Vectors
A method for WSD on Unrestricted Text
Semantic Similarity Methods in WordNet and their Application to Information Retrieval on the Web Yizhe Ge.
Synonymies and conceptual vectors
Giannis Varelas Epimenidis Voutsakis Paraskevi Raftopoulou
Automatically Populating Acception Lexical Database through Bilingual Dictionaries and Conceptual Vectors PAPILLON 2002 Mathieu Lafourcade LIRMM -
Lens effects in autonomous terminology and conceptual vector learning
Presentation transcript:

Lens effects in autonomous terminology and conceptual vector learning Mathieu Lafourcade LIRMM - France

Overwiew & Objectives lexical semantic representations conceptual vector model (cvm) autonomous learning by the system from a given « semantic space » (ontology) effects of swithing ontologies (general  spec) global effects on the lexicon local effects on particular word ambiguity as noise towards self contained WSD annotations « I made a deposit at the bank »  « I made a deposit at the bank »

Conceptual vectors vector space An idea Concept combination — a vector Idea space = vector space A concept = an idea = a vector V with augmentation: V + neighboorhood Meaning space = vector space + {v}* 

2D view of « meaning space » “ cat ” “ product ”

Conceptual vectors Thesaurus H : thesaurus hierarchy — K concepts Thesaurus Larousse = 873 concepts V(C i ) : a j = 1/ (2 ** D um (H, i, j)) 1/41 1/16 1/64 264

Conceptual vectors Concept c4:peace peace hierarchical relations conflict relations The world, manhood society

Conceptual vectors Term “peace” c4:peace

finance profit exchange

Angular distance D A (x, y) = angle (x, y) 0  D A (x, y)   if 0 then x & y colinear — same idea if  /2 then nothing in common if  then D A (x, -x) with -x — anti-idea of x  x’ y x

Angular distance D A (x, y) = acos(sim(x,y)) D A (x, y) = acos(x.y/|x||y|)) D A (x, x) = 0 D A (x, y) = D A (y, x) D A (x, y) + D A (y, z)  D A (x, z) D A (0, 0) = 0 and D A (x, 0) =  /2 by definition D A (  x,  y) = D A (x, y) with   0 D A (  x,  y) =  - D A (x, y) with  < 0 D A (x+x, x+y) = D A (x, x+y)  D A (x, y)

Thematic distance Examples D A (tit, tit) = 0 D A (tit, passerine) = 0.4 D A (tit, bird) = 0.7 D A (tit, train) = 1.14 D A (tit, insect) = 0.62 tit = insectivorous passerine bird …

Some vector operations Addition  : Z = X  Y z i = x i + y ivector Z is normalized Term to term mult  : Z = X  Y z i = (x i * y i ) 1/2 vector Z is not normalized Weak contextualization  : Z = X  (X  Y) =  (X,Y) “ Z is X augmented by its mutual information with Y ”

2D view of weak contextualization Y X XYXY XYXY Y  (X  Y) XYXY X  (X  Y)   

Autonomous learning 1/2 set of known words K, set of unknow words U revise a word w of K OR (try to) learn a word w of U From the web : for w ask for a def D specific sites : dicts, synonyms list, etc.  def analysis general sites : google, etc.  corpus analysis for each word wd of D if not in K then add wd to U AND add VO to V* otherwise get the vector of wd AND add V(wd) to V* compute the new vector of w from def(D) and V* words for senses (vectors) learned in 3 years French « forever » looping process

Autonomous learning 2/2 insectivorous passerine bird … ADJ, … N, GOV … … PH TXT VVV V V V V V  (X,Y) Weighted sum

Local expansion of vector space G GS c1c1 cncn “ cat ” “ product ” finer mesh locally over the space

S G  G  GS (v G ) = v GS vGvG vSvS Specialized ontology General ontology Point of G without link in S  GS  G (v GS ) = v G a bc + a + b + c a… … a+bb+cb………… Folding and unfolding

Local lexical density given a point P count the number of points at distance d1, d2, dn P 0 ≤ d1 < d2 < … dn ≤ π/2

Lexical Distribution from Local density high density curve shifted on the left medium density curve top centered low density case left as an exercise

Macro level Local density variation G GS

Micro level Distance variation G GS small angle = high similarity larger angle = less similarity

Last words Switching of representation coarse grained to fine grained  better semantic discrimation … and vice-versa  conservation of resource global and local test functions for vector quality assessment decision taking about level of representation detectors when combined to lexical functions (antonymy, etc.) the basis for self adjustement toward a vector space of constant density wsd as a reduction of noise (in context or out of context) unification of ontologies self emergent structuration of terminology