Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 Word senses: a computational response Adam Kilgarriff.

Similar presentations


Presentation on theme: "1 Word senses: a computational response Adam Kilgarriff."— Presentation transcript:

1 1 Word senses: a computational response Adam Kilgarriff

2 Kivik 2013 Kilgarriff: Word senses: a computational response2

3 Kivik 2013 Kilgarriff: Word senses: a computational response3 My PhD (in 5 slides)  What is a word sense

4 Kivik 2013 Kilgarriff: Word senses: a computational response4 The lexicographers  They create them  Methods Introspection Other dictionaries Corpus  Atkins, Hanks, Krishnamurthy

5 Kivik 2013 Kilgarriff: Word senses: a computational response5 What is a word sense (1)  SFIP Sufficiently frequent insufficiently predictable  (a glass of) whisky  x (a glass of) tequila

6 Kivik 2013 Kilgarriff: Word senses: a computational response6 What is a word sense (2) homonymy analogy polysemy rules collocation

7 Kivik 2013 Kilgarriff: Word senses: a computational response7 What is a word sense (3)  A cluster Of instances of use  Operationalised as: corpus lines Clustered by lexicographers

8 Kivik 2013 Kilgarriff: Word senses: a computational response8 What is a word sense (3)

9 Kivik 2013 Kilgarriff: Word senses: a computational response9 What is a word sense (3)

10 Kivik 2013 Kilgarriff: Word senses: a computational response10 What is a word sense (3)

11 Kivik 2013 Kilgarriff: Word senses: a computational response11 What is a word sense (3)

12 Kivik 2013 Kilgarriff: Word senses: a computational response12 What is a word sense (3)  A cluster Of instances of use  Operationalised as: corpus lines Clustered by lexicographers  Makes sense of Overlapping senses Different dictionaries, different senses Lumping and splitting

13 Kivik 2013 Kilgarriff: Word senses: a computational response13 I don’t believe in word senses  Believe in: resurrection ghost witch vampire god miracle fairy  Philosophy: Ontological commitment (same meaning different register)  “good entities to build belief systems on”

14 Kivik 2013 Kilgarriff: Word senses: a computational response14 A word sense is a cluster of corpus lines  But I’m an NLP person  Automatic clustering?  Inspiration: Hindle 1991, Schütze 1993, Grefenstette 1993, Lin 1999 You can get semantic sense from corpora+stats

15 Kivik 2013 Kilgarriff: Word senses: a computational response15 First attempt  Longman 1994  Abject failure No grammar Corpus too small and noisy Naïve clustering Useless programmer

16 Kivik 2013 Kilgarriff: Word senses: a computational response16 Collocations  Easy Most words don’t go with most other words  Then build on what we can do well  metaphor, analogy, homonymy, rules all much harder

17 Kivik 2013 Kilgarriff: Word senses: a computational response17 Clustering  Word sketch Collocates organised by grammar  Dictionary Collocates (and other things) organised by meaning  How to re-organise

18 Kivik 2013 Kilgarriff: Word senses: a computational response18 Observation:  corpus: arbitrary sample  dictionary ( =lexicon) : systematic account Children  encounter arbitrary samples  develop systematic account

19 Kivik 2013 Kilgarriff: Word senses: a computational response19 Corpus  provisional, dispensable  used to develop lexicon

20 Kivik 2013 Kilgarriff: Word senses: a computational response20 Levels of abstraction  Direct linkage:  Fragile Updates (to C or D) break links  Dictionary: abstract  Corpus: raw  Intermediate level needed CorpusDictionary ===   ===

21 Kivik 2013 Kilgarriff: Word senses: a computational response21  How most automatic word sense disambiguation (WSD) works Analyse dictionary to give set of collocates Match to collocates in a corpus  Dispensable corpus CorpusDictionary ===   === ===   === Collocates

22 Kivik 2013 Kilgarriff: Word senses: a computational response22 Not just collocates  triples  parse the corpus  some “unary relations” I hear him singing  domain-based clues Collocates, Constructions, Domains = CoCoDo

23 Kivik 2013 Kilgarriff: Word senses: a computational response23  Automatically extract CoCoDos from corpus  How linked to senses? Automatic (WSD techniques) ‏  Manual “dictionary-free”: ideal for new dictionaries Labour costs  Mixed WSD with manual confirmation/correction CorpusDictionary ===   === ===   === CoCoDo CoCoDo Linking CoCoDo’s to senses

24 Kivik 2013 Kilgarriff: Word senses: a computational response24 Semi-automatic dictionary drafting (SADD) ‏  CoCoDo database  Automatic clustering  Lexicographer input  More clustering  Dictionary with corpus inside

25 Kivik 2013 Kilgarriff: Word senses: a computational response25 Semi-automatic dictionary drafting (SADD) ‏  CoCoDo database  Automatic clustering  Lexicographer input  More clustering  Dictionary with corpus inside  hard

26 Related projects  Dante (completed 2010)  Tickbox lexicography Demo  Automatic collocations dictionaries SkE Language Resources And Tools Kivik 2013 Kilgarriff: Word senses: a computational response26


Download ppt "1 Word senses: a computational response Adam Kilgarriff."

Similar presentations


Ads by Google