Mapping meaning onto use: a Pattern Dictionary of English Verbs Patrick Hanks Faculty of Informatics, Masaryk University, Brno, Czech Republic

Slides:



Advertisements
Similar presentations
Building Wordnets Piek Vossen, Irion Technologies.
Advertisements

1 Lexical Semantics for the Semantic Web Patrick Hanks Masaryk University, Brno Czech Republic UFAL, Mathematics Faculty, Charles University.
Metaphorical Uses of Language in Native and Non-native Student Writing: A corpus-based study By: Claudia Marcela Chapetón Castro M.A. in Applied Linguistics.
1 The Generative Lexicon (GL) meets Corpus Pattern Analysis (CPA) Patrick Hanks Institute of Formal and Applied Linguistics, Charles University in Prague,
1 Why do CPA? Patrick Hanks Research Institute for Information and Language Processing, University of Wolverhampton; Bristol Centre for Linguistics, University.
CL Research ACL Pattern Dictionary of English Prepositions (PDEP) Ken Litkowski CL Research 9208 Gue Road Damascus,
Syntax Lecture 9: Verb Types 2.
1 Elliptical Arguments Patrick Hanks Institute of Formal and Applied Linguistics, Charles University in Prague, Czech Republic ***
Ontology From Wikipedia, the free encyclopedia In philosophy, ontology (from the Greek oν, genitive oντος: of being (part. of εiναι: to be) and –λογία:
Statistical NLP: Lecture 3
Introduction to phrases & clauses
1 Computing Real Language Meaning for the Semantic Web Patrick Hanks Masaryk University, Brno Czech Republic UFAL, Mathematics Faculty,
LING NLP 1 Introduction to Computational Linguistics Martha Palmer April 19, 2006.
Linguistic Theory Lecture 8 Meaning and Grammar. A brief history In classical and traditional grammar not much distinction was made between grammar and.
What is a corpus?* A corpus is defined in terms of  form  purpose The word corpus is used to describe a collection of examples of language collected.
January 12, Statistical NLP: Lecture 2 Introduction to Statistical NLP.
Term 2 Week 3 Semantics.
Amirkabir University of Technology Computer Engineering Faculty AILAB Efficient Parsing Ahmad Abdollahzadeh Barfouroush Aban 1381 Natural Language Processing.
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Fall 2005-Lecture 2.
Basic Scientific Writing in English Lecture 3 Professor Ralph Kirby Faculty of Life Sciences Extension 7323 Room B322.
Resources Primary resources – Lexicons, structured vocabularies – Grammars (in widest sense) – Corpora – Treebanks Secondary resources – Designed for a.
Corpora and Language Teaching
Second Language Acquisition and Real World Applications Alessandro Benati (Director of CAROLE, University of Greenwich, UK) Making.
Type shifting and coercion Henriëtte de Swart November 2010.
Presented by Jennifer Robison TexTESOL II March 12, 2010 San Antonio, TX.
Science and Engineering Practices
Lexical Patterns: from Hornby to Hunston and beyond
Deny A. Kwary Internal Structures of Dictionary Entries.
Building the Valency Lexicon of Arabic Verbs Viktor Bielický Otakar Smrž LREC 2008, Marrakech, Morocco.
Albert Gatt LIN 3098 Corpus Linguistics. In this lecture Some more on corpora and grammar Construction Grammar as a theoretical framework Collostructional.
1 Syntagmatic Preferences Patrick Hanks Masaryk University In honour of Yorick Wilks BCS, London, June 22, 2007.
McEnery, T., Xiao, R. and Y.Tono Corpus-based language studies. Routledge. Unit A 2. Representativeness, balance and sampling (pp13-21)
Introduction to English Syntax Level 1 Course Ron Kuzar Department of English Language and Literature University of Haifa Chapter 2 Sentences: From Lexicon.
Linguistics, Pragmatics & Natural Grammar
The DVC project: Disambiguation of Verbs by Collocation ____ an introduction to the linguistic theory of norms and exploitations Patrick Hanks Research.
Syntax Lecture 8: Verb Types 1. Introduction We have seen: – The subject starts off close to the verb, but moves to specifier of IP – The verb starts.
1 Statistical NLP: Lecture 10 Lexical Acquisition.
1 How to Compute the Meaning of Natural Language Utterances Patrick Hanks, Research Institute of Information and Language Processing, University of Wolverhampton.
Probabilistic Parsing Reading: Chap 14, Jurafsky & Martin This slide set was adapted from J. Martin, U. Colorado Instructor: Paul Tarau, based on Rada.
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
Linguistics and Language
Dr. Monira Al-Mohizea MORPHOLOGY & SYNTAX WEEK 12.
The Current State of FrameNet CLFNG June 26, 2006 Fillmore.
Improving Subcategorization Acquisition using Word Sense Disambiguation Anna Korhonen and Judith Preiss University of Cambridge, Computer Laboratory 15.
1 Using Corpora in Language Research -also Introduction to the Sketch Engine (WS15) part 1 Adam Kilgarriff Lexical Computing Ltd Universities of Leeds.
An Intelligent Analyzer and Understander of English Yorick Wilks 1975, ACM.
Modern Lexicography – Developments, Prospects, and Problems Patrick Hanks Research Institute of Information and Language Processing University of Wolverhampton.
Using Corpora in Language Research Adam Kilgarriff Lexical Computing Ltd Universities of Leeds January 2013Adam Kilgarriff.
The Next Generation Science Standards: 4. Science and Engineering Practices Professor Michael Wysession Department of Earth and Planetary Sciences Washington.
1 Statistical NLP: Lecture 7 Collocations. 2 Introduction 4 Collocations are characterized by limited compositionality. 4 Large overlap between the concepts.
Albert Gatt LIN3021 Formal Semantics Lecture 4. In this lecture Compositionality in Natural Langauge revisited: The role of types The typed lambda calculus.
Rules, Movement, Ambiguity
Introduction Chapter 1 Foundations of statistical natural language processing.
Number Sense Disambiguation Stuart Moore Supervised by: Anna Korhonen (Computer Lab)‏ Sabine Buchholz (Toshiba CRL)‏
Supertagging CMSC Natural Language Processing January 31, 2006.
CPSC 422, Lecture 27Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 27 Nov, 16, 2015.
SYNTAX.
◦ Process of describing the structure of phrases and sentences Chapter 8 - Phrases and sentences: grammar1.
SIMS 296a-4 Text Data Mining Marti Hearst UC Berkeley SIMS.
Word classes and part of speech tagging. Slide 1 Outline Why part of speech tagging? Word classes Tag sets and problem definition Automatic approaches.
1 CPA: Where do we go from here? Research Institute for Information and Language Processing, University of Wolverhampton; UPF Barcelona; University of.
© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Quality Assurance, Ontology Engineering, and Semantic Interoperability.
In this lecture, we will learn about: Translation.
2. The standards of textuality: cohesion Traditional approach to the study of lannguage: sentence as conventional object of study Structuralism (Bloofield,
Lec. 10.  In this section we explain which constituents of a sentence are minimally required, and why. We first provide an informal discussion and then.
Figure and Ground Part 2 APLNG 597C LEJIAO WANG 03/16/2015.
© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Quality Assurance, Ontology Engineering, and Semantic Interoperability.
Use of Concordancers A corpus (plural corpora) – a large collection of texts, written or spoken, stored on a computer. A concordancer – a computer programme.
Statistical NLP: Lecture 7
Statistical NLP: Lecture 3
Presentation transcript:

Mapping meaning onto use: a Pattern Dictionary of English Verbs Patrick Hanks Faculty of Informatics, Masaryk University, Brno, Czech Republic American Association for Corpus Linguistics, BYU, March 15,

Outline of the talk What is a pattern dictionary? Distinguishing norms from exploitations Pattern dictionary and pattern grammar Measuring collocations Pattern dictionary and FrameNet Conclusions 2

What is a pattern dictionary? A semantically motivated inventory of word uses and their meanings. – Shows comparative frequency of each pattern of a polysemous word. Meanings are associated with patterns, not with words. – The colligational preferences of a word are part of its patterns. Driven by Corpus Pattern Analysis (CPA). 3

Patterns, not senses Meanings taken from WordNet or a dictionary do not yield reliable data for disambiguating senses (Ide and Wilks 2005). – WordNet lists synonym sets and other semantic relations – but not senses. – WordNet did not do contrastive analysis of word senses. – In standard dictionaries, word senses are not mutually exclusive. – There is much fuzzy overlap between senses – which may be OK for sophisticated human users, but not for learners or computers. The patterns of all and only the normal uses of a lexical item are (normally) mutually exclusive. – However, teasing them out from corpus data is hard. 4

Norms and exploitations A pattern dictionary aims to record all and only the normal uses of each word. – Exploitation of norms is a subject for separate analysis. – Types of ‘exploitation’ include creative metaphor, ellipsis, and anomalous arguments. Consider: The goat ate the newspaper. The verb eat has a preference for nouns of semantic type [[Food]] in the direct object clause role. ‘[[Animate]] eat [[Document]]’ is not a normal pattern of English. Compare John devoured the newspaper. ‘[[Human]] devour [[Document]]’ is a normal pattern of English. It is a conventional metaphor. 5

Specifically,... The Brno-Brandeis Pattern Dictionary of English Verbs will list all normal patterns of each verb lemma in BNC. Providing a benchmark for identification of norms in other corpora – by time period: patterns in historical corpora, future corpora. – by region: e.g. patterns in American English. – by domain, e.g.: ‘[[Human]] abate [[Problem = Nuisance]]’ is a domain-specific norm in the domain of legal jargon abate is not normally a transitive verb. 6

A typical pattern dictionary entry irritate PATTERN 1 (90%): [[Anything]] irritate [[Human]] IMPLICATURE : [[Anything]] causes [[Human]] to feel mildly annoyed. PATTERN 2 (8%): [[Stuff]] irritate [[Body Part]] IMPLICATURE : [[Stuff]] causes [[Body Part]] to become inflamed and somewhat painful. Notes: Both patterns are transitive (V n), but they have different meanings. They are distinguished by the semantic types of the nouns. Getting the right level of semantic generalization for each n is hard. 7

Semantic type vs. semantic role Mr Woods sentenced Bailey to life imprisonment. – [[Human 1]] sentence [[Human 2]] to [[Punishment]] Semantic type: [[Human]] Semantic roles: [[Human = Judge]], [[Human = Convicted Criminal]] – Semantic type is an intrinsic property of a lexical item. – Semantic role is assigned by the context. 8

Nouns and verbs The apparatus required for analysing nouns is different from that required for predicators (verbs, adjectives, prepositions). – Nouns are grouped into lexical sets in relation to the predicators that they normally colligate with. – Typically, the lexical sets are united by a semantic type. – A shallow ontology of nouns (grouped by their semantic type) is therefore part of the apparatus of a pattern dictionary. – Semantic typing in real texts is more complex than might be expected from invented examples. – Lexical sets include alternations, parts, and attributes of types – Example: calm (next slide) 9

What would an empirically well- founded ontology be like? It would have to take account of verb-specific alternations, parts, and attributes of semantic types. For example, Pattern 2 (of 8) for calm, verb, is: [[Human 1 | Event]] calm [[Human 2]] – Alternation of Human (2 – direct object): [[Animal]] – Parts of Human (2 – direct object): nerves – Attributes: fear, anxiety, agitation,.... [[Emotion]] 10

Pattern Grammar Hunston and Francis (2000): Pattern Grammar: a corpus- driven approach to the lexical grammar of English “One of the most important observations in a corpus-driven description of English is that patterns and meanings are connected.” PG is founded on real texts and is a real attempt at empirically valid generalizations. 11

How is a Pattern Dictionary different from Pattern Grammar? Pattern Grammar seeks similarities – words with similar meanings grouped together according to syntactic similarity. By contrast, a pattern dictionary seeks systematic differences: – In particular, the differences in pattern that pick out different meanings of a polysemous word. 12

Apparatus The Pattern Grammar has an admirably simple apparatus. – target word; part of speech categories; word order; and certain function words (mainly prepositions). Simplicity can be overdone. To represent the distinctive features of meaning in use, we also need at least: – Systematic analysis and categorization of colligations – Lexical items grouped by semantic type – Valencies – a.k.a. clause roles (S P O C A will do) 13

execute VerbPattern Grammar Pattern Dictionary executeV n (passive be V-ed) [[Human 1]] execute [[Human 2]] 14 Example sentence: Private Joseph Byers was the first Kitchener volunteer to be executed. In the Pattern Dictionary (but not in PG) semantic types distinguish this sense from other “V n” patterns of the same verb, e.g. ‘execute an order’.

enlist VerbPattern Grammar Pattern Dictionary enlistV in n [[Human]] enlist [NO OBJ] {in [[Human Group = Military]]} 15 Example sentence: He was 17 And under age when he enlisted in the 1 st Royal Scots Fusiliers. Pattern Grammar and Pattern Dictionary agree in contrasting this sense with patterns such as “[[Human]] enlist [[Assistance]]” (V n).

go VerbPattern Grammar Pattern Dictionary go V adj[[Human]] go [NO OBJ] {absent | AWOL} 16 Example sentence: His inexperience and the horrors he witnessed caused him to go absent without leave. This is a light verb (“delexical verb” in Sinclair’s terminology), with many patterns. The “adj” in PG is a Subject Complement. The small lexical set, {absent | AWOL}, in the Pattern Dictionary activates a particular meaning of go, contrasting with other patterns of go having a Subject Complement, e.g. go {mad | bananas}.

plead VerbPattern Grammar Pattern Dictionary plead V adj[[Human]] plead [NO OBJ] {guilty | {not guilty} 17 Example sentence: Byers pleaded guilty. The adj in this pattern is an Object Complement. The Object Complement is populated by a lexical set of just two possible (normal) items. (“plead innocent” is plausible but not idiomatic.)

fire VerbPattern Grammar Pattern Dictionary fire V adj[[Human]] fire [NO OBJ] ([Advl[Direction]]) 18 Example sentence: … the firing squad had fired wide to avoid killing the youth. The “adj” in this sentence has the clause role of Adjunct or (in my terminology) Adverbial of Direction, as in: The police fired into the crowd. They fired over their heads.

Collocations (1) Hoey (2005), quoting Partington (1998), distinguishes textual, statistical, and psychological definitions of ‘collocation’. – The statistical definition: “The relationship a lexical item has with items that appear with greater than random probability in its textual context” (Hoey 1991). – That’s exactly right. – But it does not say how “greater than random probability” is to be measured. – Later: “The [statistical] definition says nothing interesting about the phenomenon.” (Hoey 2005). – This is exactly wrong. – To understand how word meaning works, measuring the statistics of collocation is both essential and interesting. 19

How to Measure Collocations? Various statistical tools are available, e.g.: Mutual information (“MI”; Church and Hanks 1990) – tends to favor content words as collocates t-score tends to favor function words as collocates. Log likelihood ratios (e.g. Dunning 1993) – Results in collocational analysis are intuitively unsatisfactory Word Sketch Engine (Kilgarriff, Rychlý, et al., 2004) – measures salience scores for pre-determined colligational patterns Take your pick – but it must be done, one way or the other. 20

Pattern Dictionary and FrameNet CPA investigates syntagmatic criteria for distinguishing different meanings of polysemous words, in a “semantically shallow” way. FrameNet: expresses the deep semantics of situations (frames); proceeds frame by frame, not word by word; analyses situations in terms of frame elements; studies meaning differences and similarities between different words in a frame; does not explicitly study meaning differences of polysemous words; does not analyse corpus data systematically, but goes fishing in corpora for examples in support of hypotheses; has problems grouping words into frames, and misses some; has no established inventory of frames; has no criteria for completeness of a lexical entry. 21

Construction Grammar (1) Focus on meaning, not just well-formedness. Challenges reductionist theories of language Meaning is associated with constructions. Anything from a word to a clause can be a construction. – Example: ‘she slept her way to the top.’ – Sleep is not normally a goal-achievement verb. – But in this sentence, it is coerced into being one by the construction “[V] one’s way to [[Status]]”. – This meaning is not arrived at by a concatenation of the meanings of the lexical items of which the sentence is composed. 22

Construction Grammar (2) So far so good – but Construction Grammar is in the speculative tradition. It is not based on analysis of evidence. It is based largely on made-up examples, many of which are bizarre, e.g. The gardener watered the flowers flat. Corpus evidence shows that the verb water does not normally participate in the resultative construction. A distinction between normal usage and exploitation of norms is needed. – Abnormal examples are conducive to distortions in the theory. – CG needs corpus analysis. – Some sort of synthesis between PG and CG is desirable. 23

Theoretical consequences and practical applications (1) Pedagogical: Anyone acquiring a language must learn competence in two kinds of rule-governed linguistic behaviour: – How to use words normally – How to exploit the norms (creative metaphors, ellipsis, etc.) A pattern dictionary gives comparative frequency of patterns. – A lexical syllabus could select only primary norms. – “Primary norms” are a) high-frequency norms and b) concrete norms. In error analysis: what norm was aimed at? – If learners are exploiting norms creatively, do you (the teacher) really want them to? 24

Theoretical consequences and practical applications (2) For theoretical linguistics: Are some grammars better than others for representing how words are used to make meanings? ‘S  NP VP’: a confusion of language with logic? The third argument (‘adjunct’, ‘adverbial’): – CPA shows that a new grammar of adverbials is needed. Metaphor analysis: – CPA distinguishes conventional metaphors from exploitations. Ontologies: – The relationship between a possible ontology of words in use and scientific conceptual ontologies such as WordNet. 25

Theoretical consequences and practical applications (3) For computational linguistics and AI: Improving machine translation – Getting the pattern right is more likely to select the right translation. Parsing and word-class tagging: – CLAWS achieves ~90% accuracy in word-class tagging in BNC – CPA reveals some systematic errors in CLAWS tagging. Anaphora resolution: – He found a glass of water on the table and drank it. – ‘[[Animate]] drink [[Liquid]]’ selects water as a direct object 26

Conclusions Goal: to work out how people use words to make meanings. Building a framework for this purpose requires: 1.Clause role analysis of corpus data 2.Statistical analysis of colligations in a corpus. The result: An inventory of mutually exclusive patterns of usage of polysemous verbs, with a meaning (or “implicature”) attached to each pattern. A “shimmering ontology” of lexical sets of nouns: Each lexical set consists not only of a semantic type, but also of: a) prototypical members, b) canonical members, c) ad-hoc members, d) parts, and e) attributes. 27

Thanks To you, for listening, To the late John Sinclair and the (still extant) James Pustejovsky, who have inspired this approach, To Karel Pala, Pavel Rychlý, Adam Rambousek, and Adam Kilgarriff, who have created tools that make this kind of analysis possible, and to the Academy of Sciences of the Czech Republic (project T ) and the Czech Ministry of Education (National Research Program II project 2C06009), who, in part, are funding the research. 28

How to browse the Pattern Dictionary and associated tools Fire up Firefox. GO TO: Click on “install firefox”. This will install the software and create an icon “CPA” in your toolbar. Restart Firefox. Click on CPA in the toolbar. You will be asked to log in twice (2 databases). – Login: guest – Password: guest 29