When Compositionality Fails to Predict Systematicity Reinhard Blutner, Petra Hendriks, Helen de Hoop, Oren Schwartz.

Slides:



Advertisements
Similar presentations
Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 4 Shieber 1993; van Deemter 2002.
Advertisements

P.M van Hiele Mathematics Learning Theorist Rebecca Bonk Math 610 Fall 2009.
Chapter Two The Scope of Semantics.
A semantic argument against the existence of universally held real properties Emanuel Rutten Faculty of Philosophy VU University.
Optimality Theory Lexical Semantics Tandem workshop on Optimality Theory in language and geometric approaches to language.
The Nature of Systematicity in Natural Language. 27 October, 2004Blutner & Spenader KNAW-Colloquium 2 Systematicity Introduced in Fodor & Pylyshyn (1988):
Knowledge Representation
Cognitive Linguistics Croft & Cruse 9
Albert Gatt LIN3021 Formal Semantics Lecture 5. In this lecture Modification: How adjectives modify nouns The problem of vagueness Different types of.
LEARNING FROM OBSERVATIONS Yılmaz KILIÇASLAN. Definition Learning takes place as the agent observes its interactions with the world and its own decision-making.
1 Measurement PROCESS AND PRODUCT. 2 MEASUREMENT The assignment of numerals to phenomena according to rules.
LEARNING FROM OBSERVATIONS Yılmaz KILIÇASLAN. Definition Learning takes place as the agent observes its interactions with the world and its own decision-making.
Let remember from the previous lesson what is Knowledge representation
The Nature of Systematicity in Natural Language. 27 October, 2004Blutner & Spenader KNAW-Colloquium 2 Systematicity Introduced in Fodor & Pylyshyn (1988):
Physical Symbol System Hypothesis
CS 330 Programming Languages 09 / 16 / 2008 Instructor: Michael Eckmann.
The Scientific Method n See the problem n Look for the relevant variables n Construct a hypothesis, if possible n Create a research design n Collect data.
Conceptual modelling. Overview - what is the aim of the article? ”We build conceptual models in our heads to solve problems in our everyday life”… ”By.
Meaning and Language Part 1.
Its Grammatical Categories
The Scientific Method Lecture – Natural History of Cleveland.
Linguistic Theory Lecture 2 Phrase Structure. What was there before structure? Classical studies: Classical studies: –Languages such as Latin Rich morphology.
Copyright © Cengage Learning. All rights reserved.
What makes communication by language possible? Striking fact (a) If someone utters a sentence and you know which proposition her utterance expresses, then.
FIRST ORDER LOGIC Levent Tolga EREN.
February 2009Introduction to Semantics1 Logic, Representation and Inference Introduction to Semantics What is semantics for? Role of FOL Montague Approach.
Tuesday, December 2 GUM 5.7 Composition 3.7 Literary Analysis and Composition
FUZZY LOGIC Babu Appat. OVERVIEW What is Fuzzy Logic? Where did it begin? Fuzzy Logic vs. Neural Networks Fuzzy Logic in Control Systems Fuzzy Logic in.
Modeling (Chap. 2) Modern Information Retrieval Spring 2000.
SOFTWARE ENGINEERING BIT-8 APRIL, 16,2008 Introduction to UML.
Chapter 6: Objections to the Physical Symbol System Hypothesis.
LDK R Logics for Data and Knowledge Representation Context Logic Originally by Alessandro Agostini and Fausto Giunchiglia Modified by Fausto Giunchiglia,
Geometric Conceptual Spaces Ben Adams GEOG 288MR Spring 2008.
Protege OWL Plugin Short Tutorial. OWL Usage The world wide web is a natural application area of ontologies, because ontologies could be used to describe.
Quantum Mind Symposium · Salzburg 1 Reinhard Blutner · Universiteit van Amsterdam · 19th July 2007 Combining Prototypes: Quantal Macrostates & Entanglement.
November 2003CSA4050: Semantics I1 CSA4050: Advanced Topics in NLP Semantics I What is semantics for? Role of FOL Montague Approach.
SYNTAX Lecture -1 SMRITI SINGH.
Boolean Algebra and Computer Logic Mathematical Structures for Computer Science Chapter 7.1 – 7.2 Copyright © 2006 W.H. Freeman & Co.MSCS Slides Boolean.
Formal Models in AGI Research Pei Wang Temple University Philadelphia, USA.
Pattern-directed inference systems
1 Logical Agents CS 171/271 (Chapter 7) Some text and images in these slides were drawn from Russel & Norvig’s published material.
ARTIFICIAL INTELLIGENCE [INTELLIGENT AGENTS PARADIGM] Professor Janis Grundspenkis Riga Technical University Faculty of Computer Science and Information.
LOGIC AND ONTOLOGY Both logic and ontology are important areas of philosophy covering large, diverse, and active research projects. These two areas overlap.
7. Parsing in functional unification grammar Han gi-deuc.
The interface between model-theoretic and corpus-based semantics
Machine Learning Chapter 5. Artificial IntelligenceChapter 52 Learning 1. Rote learning rote( โรท ) n. วิถีทาง, ทางเดิน, วิธีการตามปกติ, (by rote จากความทรงจำ.
1 The Theoretical Framework. A theoretical framework is similar to the frame of the house. Just as the foundation supports a house, a theoretical framework.
Albert Gatt LIN3021 Formal Semantics Lecture 4. In this lecture Compositionality in Natural Langauge revisited: The role of types The typed lambda calculus.
LECTURE 2: SEMANTICS IN LINGUISTICS
Programming Languages and Design Lecture 3 Semantic Specifications of Programming Languages Instructor: Li Ma Department of Computer Science Texas Southern.
SCIENTIFIC THINKING Presented by: Dr. Salem M. Al-Ghamdi Associate Professor of Management King Fahd University of Petroleum and Minerals.
1 Logical Agents CS 171/271 (Chapter 7) Some text and images in these slides were drawn from Russel & Norvig’s published material.
Logical Agents Chapter 7. Outline Knowledge-based agents Logic in general Propositional (Boolean) logic Equivalence, validity, satisfiability.
Sense and Denotation (Part 2) Caroline Bardini – Université Paris 7 April 23rd 2004.
Developing the theoretical and conceptual framework From R.E.Khan ( J199 lecture)
WORDS The term word is much more difficult to define in a technical sense, and like many other linguistic terms, there are often arguments about what exactly.
Lecture 2 (Chapter 2) Introduction to Semantics and Pragmatics.
Transient Unterdetermination and the Miracle Argument Paul Hoyningen-Huene Leibniz Universität Hannover Center for Philosophy and Ethics of Science (ZEWW)
1 First Order Logic CS 171/271 (Chapter 8) Some text and images in these slides were drawn from Russel & Norvig’s published material.
The Language of Thought : Part II Joe Lau Philosophy HKU.
From NARS to a Thinking Machine Pei Wang Temple University.
PHILOSOPHY OF LANGUAGE Some topics and historical issues of the 20 th century.
Pattern Recognition. What is Pattern Recognition? Pattern recognition is a sub-topic of machine learning. PR is the science that concerns the description.
WHAT IS THE NATURE OF SCIENCE?
CS 4700: Foundations of Artificial Intelligence
Language, Logic, and Meaning
Artificial Intelligence Lecture 11: First Order Logic
Introduction to Semantics
Thinking about Thinking
Normativism A phenomenon is normative if it cannot be adequately described in merely descriptive terms (but must instead be described using such terms.
Presentation transcript:

When Compositionality Fails to Predict Systematicity Reinhard Blutner, Petra Hendriks, Helen de Hoop, Oren Schwartz

The immediate appeal of compositionality: Frege (1923) It is astonishing what language can do. With a few syllables it can express an incalculable number of thoughts, so that even a thought grasped by a terrestrial being for the first time can be put into a form of words which will be understood by someone to whom the thought is entirely new. This would be impossible, were we not able to distinguish parts in the thoughts corresponding to the parts of a sentence, so that the structure of the sentence serves as the image of the structure of the thoughts.

Compositionality The meaning of a complex expression is determined by its structure and the meanings of its constitutents. equivalently There is a homomorphism between the syntactic term algebra to the algebra of meanings.

Compositionality Compositionality is necessary to explain the productivity of language. Compositionality entails productivity.

Systematicity The term and empirical hypothesis was introduced by Fodor & Pylyshyn (1988), who didn’t attempt a precise definition of the concept or a complete description of the empirical phenomenon. Van Gelder & Niklasson (1994) summarize F&P’s notion of systematicity as: [the] ability to do some things of a given cognitive type (including at least “thinking a thought” and making an inference) is intrinsically connected with their ability to do other, structurally related things of that type.

Systematicity When an agent understands the expressions brown triangle and black square, she understands the expressions brown square and black triangle as well. Does compositionality entail systematicity?

A Classical Example Agent understands brown triangle and black square. She constructs the conceptual representations BROWN  TRIANGLE and BLACK  SQUARE (via compositionality). She knows the truth conditional impact of the corresponding constitutents, and she extracts the lexicon entries: brown -> BROWN, square- >SQUARE, etc. Using these, she constructs BROWN  SQUARE and BLACK  TRIANGLE (via compositionality) Thus compositionality may derive systematicity

Systematicity If an agent understands within an hour and without a watch, does she also understand within a watch and without an hour? (Szabo 2004)

Our View Language is (mostly) systematic and compositional, but not always. Traditional semantic analyses have difficulty with cases where the systematicity of meaning is weak or missing. Connectionist models are well suited to handling these types of cases, and can account for the semantic contribution of a constitutent that is both systematic (in general) and idiosyncratic (in certain cases).

Why compositionality fails to explain systsematicity In his “Grundlagen der Mathematik” Frege (1884) noticed the context- dependence of words (and took this as an argument against compositionality in NL) “One should ask for the meaning of a word only in the context of a sentence, and not in isolation.“

Empirical Phenomena Quine (1960) was the first who noted the contrast between red apple (red on the outside) and pink grapefruit (pink on the inside). Lahav (1993) argues that an adjective such as brown doesn’t make a simple and fixed contribution to any composite expression in which it appears: brown cow, brown crystal, brown book, brown newspaper

Empirical Phenomena Different colors typically denoted by red in red apple and red hair. the actual color value deviates in a systematic way from the prototypical color value that can be assigned to the color adjective in isolation – in dependency on the conceptual properties of the modified noun Similarly, the same color is often described with different adjectives In Japanese, brown sugar, aka-zatoo (lit. ‘red sugar’) comes in the same range of colors as shira-miso, (lit. ‘white bean paste’)

Three consequences Intersectivity, ||A(B)|| = ||A(x)||  ||B||, doesn’t hold, even for most ‘absolute’ adjectives Systematicity statements cannot be derived from compositionality in these cases Encyclopedic knowledge is required to determine the truth conditional content of an utterance (explicature in Relevance Theory).

A Related Phenomenon Typicality A specific instance of a red apple is more typical of a “red apple” than of an “apple” Incompatible Conjunctions The typicality effects are greater for “incompatible conjunctions” (striped apple) than for “compatible conjunctions” (green apple)

A Concrete Question How should we account for the semantic contribution of an (absolute) adjective to an (Adjective Noun) N complex expression in the face of these phenomena?

Radical underspecification with contextual enrichment small  x small(x,N) * small terrier  x [small(x,N) & terrier(x)] Analogously for red apple with place-holders for the relevant parts red  x [part(Y,x) & red(Y)] red apple  x [part(Y,x) & red(Y) & apple(x)] How to determine the proper values for N and Y, respectively? Contextual enrichment, as in Probabilistic Theory of Relevance (van Rooy 2000) * with small(x,N)  size(x) < N

Problems with underspecification Does not really clarify how to determine the border line between the underspecified representation and the contextual enrichment It is difficult to see how the available mechanisms account for the prototype effects found in adjectival modification

A connectionist approach Inspired by the (symbolic) selective modification model (Smith, Osherson, Rips & Keane 1988) Prototype representations consisting of Symbolic, structured concepts Attribute-value pairs, weighted by salience Modifiers act on prototypes by increasing salience of attribute and changing its value Tversky’s (1977) contrast rule for similarity

A connectionist approach AdjectiveNoun Conceptual Layer

A connectionist approach AdjectiveNoun Conceptual Layer Color

A connectionist approach AdjectiveNoun Conceptual Layer Taste

A connectionist approach AdjectiveNoun Conceptual Layer Peel

A connectionist approach AdjectiveNoun Conceptual Layer Pulp

A connectionist approach AdjectiveNoun Conceptual Layer Color Peel (Color x Peel)

A connectionist approach AdjectiveNoun Conceptual Layer red

A connectionist approach AdjectiveNoun Conceptual Layer white

A connectionist approach AdjectiveNoun Conceptual Layer red peel white pulp red

A connectionist approach AdjectiveNoun Conceptual Layer

A connectionist approach AdjectiveNoun Conceptual Layer

A connectinist approach Default uniform connections within conceptual layer In training, instances are presented at the conceptual layer and at the (localist) adjective and noun layers. The prototypes learned by the model are found by turning on units in the adjective and noun layers and letting the network settle into a representation at the conceptual layer.

Prototype effects Using Tversky's (1977) contrast rule (formulated for activation vectors) sim(s,t) =  i min(s i,t i )   i |s i  t i | sim(s red apple, t 1 ) > sim(s apple, t 1 ) sim(s purple apple,t 2 )  sim(s apple,t 2 ) > sim(s red apple,t 1 )  sim(s apple,t 1 ) t 1 = a red apple, t 2 = a purple apple

Conclusions 1. Compositionality alone is not formally restrictive enough to explain the type of productivity in NL 2. Semantic contributions of constituents to their complex expressions are more complex and less systematic than we might expect 3. Encyclopedic knowledge is necessary to account for the variability in the semantic contribution of constituents to complex phrases 4. Connectionist approaches easily handle the accumulation and application of this knowledge.