Schema Theorem in Language Acquisition A Rags to Riches Story BOOT-LA, Indiana University, April 23, 2003.

Slides:



Advertisements
Similar presentations
1 Machine Learning: Lecture 1 Overview of Machine Learning (Based on Chapter 1 of Mitchell T.., Machine Learning, 1997)
Advertisements

COGNITIVE DEVELOPMENT, PART 1
Music and Evolution View music from evolutionary perspective From an evolutionary perspective, ask question why is there music at all? Parallel with Chomsky’s.
What is research? Lecture 2 INFO61003 Harold Somers.
Module 14 Thought & Language.
Linguistic Theory Lecture 11 Explanation.
I NNATIST HYPOTHESIS, (UG) Second language acquisition.
Intro to Linguistics Class # 2 Chapter 1: What is Language?
FIRST LANGUAGE ACQUISITION
AVRAM NOAM CHOMSKY Biography Criticisms and problems
© Cambridge International Examinations 2013 Component/Paper 1.
Introduction: The Chomskian Perspective on Language Study.
Nativism: Noam Chomsky
Week 3a. UG and L2A: Background, principles, parameters CAS LX 400 Second Language Acquisition.
Critical Thinking: Chapter 10
What differences are there between the brains of these two primates?
Module 14 Thought & Language. INTRODUCTION Definitions –Cognitive approach method of studying how we process, store, and use information and how this.
Language, Mind, and Brain by Ewa Dabrowska Chapter 10: The cognitive enterprise.
COGNITION AND LANGUAGE Pertemuan 6 Matakuliah: O0072 / Pengantar Psikologi Tahun: 2008.
Psych 56L/ Ling 51: Acquisition of Language Lecture 8 Phonological Development III.
Language Development Major Questions: 1) What is language/what is involved in language? 2) What are the stages of language development? 3) Is language.
Topic: Theoretical Bases for Cognitive Method Objectives Trainees will be able to give reasons for the design and procedures of the Cognitive Method.
Language, Mind, and Brain by Ewa Dabrowska Chapter 3: Language acquisition: robustness.
COMP305. Part II. Genetic Algorithms. Genetic Algorithms.
1 Validation and Verification of Simulation Models.
Sampling Distributions
CS Bayesian Learning1 Bayesian Learning. CS Bayesian Learning2 States, causes, hypotheses. Observations, effect, data. We need to reconcile.
Lecture 1 Introduction: Linguistic Theory and Theories
Generative Grammar(Part ii)
Linguistic Theory Lecture 3 Movement. A brief history of movement Movements as ‘special rules’ proposed to capture facts that phrase structure rules cannot.
Theories of Development. Cognitive Development Early psychologists believed that children were not capable of meaningful thought and that there actions.
Genetic Programming.
X Language Acquisition
Chapter 6: Objections to the Physical Symbol System Hypothesis.
Jean Piaget ( ) Started out as a biologist but specialized in psychology. He was interested in the nature of knowledge and how the child acquires.
Language PERTEMUAN Communication Psycholinguistics –study of mental processes and structures that underlie our ability to produce and comprehend.
Evolution of Universal Grammar Pia Göser Universität Tübingen Seminar: Sprachevolution Dozent: Prof. Jäger
Cristian Urs and Ben Riveira. Introduction The article we chose focuses on improving the performance of Genetic Algorithms by: Use of predictive models.
A Brief Introduction to GA Theory. Principles of adaptation in complex systems John Holland proposed a general principle for adaptation in complex systems:
Schemata Theory Chapter 11. A.E. Eiben and J.E. Smith, Introduction to Evolutionary Computing Theory Why Bother with Theory? Might provide performance.
Chapter 8 Language & Thinking
Multiplying Whole Numbers © Math As A Second Language All Rights Reserved next #5 Taking the Fear out of Math 9 × 9 81 Single Digit Multiplication.
Cognitive Psychology: Thinking, Intelligence, and Language
인공지능 연구실 황명진 FSNLP Introduction. 2 The beginning Linguistic science 의 4 부분 –Cognitive side of how human acquire, produce, and understand.
Classical Rationalism The fundamental source of knowledge is reason. Knowledge should be of the essential properties of things. Such knowledge is knowledge.
I. INTRODUCTION.
Universal Grammar Noam Chomsky.
Theories of first language acquisition.  We are not born speaking!  Language must be acquired. ◦ Learning vs. acquisition  If we think of all that.
Cognitive Processes Chapter 8. Studying CognitionLanguage UseVisual CognitionProblem Solving and ReasoningJudgment and Decision MakingRecapping Main Points.
Question paper 1997.
Linguistic Anthropology Bringing Back the Brain. What Bloomfield Got “Right” Emphasized spoken language rather than written language The role of the linguist.
MAE 552 Heuristic Optimization Instructor: John Eddy Lecture #12 2/20/02 Evolutionary Algorithms.
Introduction Chapter 1 Foundations of statistical natural language processing.
Theories of Second Language Acquisition. Behaviorism A change in external behavior achieved through a large amount of repetition of desired actions. The.
First Language Acquisition
Stages involve Discontinuous (qualitative) change Invariant sequence –Stages are never skipped.
Chapter 3 Language Acquisition: A Linguistic Treatment Jang, HaYoung Biointelligence Laborotary Seoul National University.
Does the brain compute confidence estimates about decisions?
Chapter 10 Language acquisition Language acquisition----refers to the child’s acquisition of his mother tongue, i.e. how the child comes to understand.
Chapter 8 Thinking and Language.
Syntax 1 Introduction.
What is Language Acquisition?
2nd Language Learning Chapter 2 Lecture 4.
Explaining Second Language Learning
Theories of Language Development
Over the past fifty years, three main theoretical positions have been advanced to explain language development from infancy through the early school years:
Over the past fifty years, three main theoretical positions have been advanced to explain language development from infancy through the early school years:
Traditional Grammar VS. Generative Grammar
Over the past fifty years, three main theoretical positions have been advanced to explain language development from infancy through the early school years:
Presentation transcript:

Schema Theorem in Language Acquisition A Rags to Riches Story BOOT-LA, Indiana University, April 23, 2003

Schema Theorem in Language Acquisition

Poverty of the Stimulus The poverty-of-the-stimulus argument, otherwise known as Platos Problem, claims that the nature of language knowledge is such that it could not have been acquired from the actual samples of language available to the human child.Cook & Newson(1996:86)

Schema Theorem in Language Acquisition Poverty of the Stimulus What counts as evidence? positive evidence requirement: no correction, explanation etc. occurrence requirement: must occur in normal language situations uniformity requirement: must be available to all children regardless of culture, class, language take-up requirement: must be used by children

Schema Theorem in Language Acquisition Poverty of the Stimulus Rational Steps for Inclusion in UG/LAD A. A native speaker of a particular language knows a particular aspect of syntax. Ex. structure- dependency, Binding Principles, etc. B. This aspect of syntax could not have been acquired from the language input available to children. C. This aspect of syntax is not learnt from outside. D. This aspect of syntax is built-in to the mind. Cook & Newson(1996:86)

Schema Theorem in Language Acquisition Poverty of the Stimulus A Problem: A. A native speaker of a particular language knows a particular aspect of syntax. Ex. structure- dependency, Binding Principles, etc. B. This aspect of syntax could not have been acquired from the language input available to children. C. This aspect of syntax is not learnt from outside. D. This aspect of syntax is built-in to the mind.

Schema Theorem in Language Acquisition Poverty of the Stimulus Step B is in practice assumed, and rarely rigorously demonstrated increasingly we find existence proofs of acquisition tasks previously believed impossible via statistical, data-driven methods (ex. Chalmers, 1990; Elman, 1995)

Schema Theorem in Language Acquisition Poverty of the Stimulus Faulty Step B Reasoning: a) Helen said that Jane i voted for herself i. b)*Helen i said that Jane voted for herself i. Cook & Newson (1996:84) no context could let them unerringly distinguish the binding of anaphors and of pronominals. implicitly assumes that at this point, the only utterances / experience the child has access to are these two possible interpretations in fact, by the time children produce / understand sentences of this level of complexity, theyve had extensive experience producing and interpreting anaphors and pronominals (OGrady, 1997) moreover, from the outset children show a bias towards binding to the nearest antecedent – they have the most trouble with sentences like: *Helen said that Jane i voted for her i.

Schema Theorem in Language Acquisition Poverty of the Stimulus Faulty Step B Reasoning: a) It is likely that John will be delayed. b) It is probable that John will be delayed. c) John is likely to be delayed. d)*John is probable to be delayed. OGrady (1997:246) common argument against analogy as a learning method denies analogy based on anything but these specific cases – by the time a child produces / understands sentences such as these, they already have extensive linguistic knowledge that would preclude such naive analogies Other studies have shown analogy can be a useful technique for the acquisition of categories and grammatical structure (McLennan, ms.; Tomasello, 2000 for example)

Schema Theorem in Language Acquisition What to do? Simply denying UG doesnt solve our problem since traditional linguists intuitions about the input remain unchanged and lead us back to the same conclusions Genetic Algorithms seem to have a similar problem – they look more efficient than they possibly could be – similar sense of getting something for nothing

Schema Theorem in Language Acquisition Genetic Algorithms problem solving technique which is capable of assessing an extremely large and complicated problem space on the basis of a restricted impoverished input set Three primary elements: 1. a population of chromosomes (bit string) 2. a fitness function (judges goodness) 3. mating and procreation (Holland, 1975; Mitchell, 1996)

Schema Theorem in Language Acquisition Genetic Algorithms from purely random beginnings a solution emerges very quickly – even for optimizations that cant be performed by traditional serial computational methods

Schema Theorem in Language Acquisition Genetic Algorithms Schema Theorem: explanation of how GAs work 101 is an instantiation of the categories (schemata): {***, 1**, *0*, **1, 10*, 1*1, *01, 101} (of a possible 27) 1** is a category representation of {100, 101, 110, 111, (1*1, 1*0, 11*, 10*)}

Schema Theorem in Language Acquisition Genetic Algorithms If 101 is judged as being 75% fit, it simultaneously guestimates {***, 1**, *0*, **1, 10*, 1*1, *01, 101} as being 75% fit Given a population with multiple instantiations, implicit calculation of category fitness becomes more accurate Fuzzy judgments are still useful Selection, biased by fitness, selects not for highly fit individuals but (implicitly) highly fit categories by targeting highly fit individuals

Schema Theorem in Language Acquisition Genetic Algorithms the profound insight: GAs make use of category information without explicit category definitions, explicit biases, or explicit reference to category information. It implicitly acts on categories through category instantiations

Schema Theorem in Language Acquisition Genetic Algorithms taken in this light it is easier to see how GAs skip a great deal of the computational load through implicit parallelism Critical characteristics use a population of tokens (parallelism) a selection process that targets / discovers salient / relevant dimensions of substructure within those tokens

Schema Theorem in Language Acquisition Wealth of the Stimulus Schema Theorem in Language Acquisition Schema Theorem tokens evaluation outcome GAs chromosomes fitness function optimal solution Acquisition experience learning grammar

Schema Theorem in Language Acquisition Wealth of the Stimulus Experiences entire sensory experiences that include linguistic stimuli importantly, all sensory information impacts memory and is available to be correlated infants are exquisitely sensitive to detailed and correlated sensory information – at least until they learn what to ignore (Rovee-Collier, 1991) population because stored distributed within the same neural structures – continuous, not digital

Schema Theorem in Language Acquisition Wealth of the Stimulus Learning in most basic neural sense – continuous, correlative, passive reduces sensory noise – reinforces correlated multimodal sensory experience a type of selection process because salient dimensions emerge through the process

Schema Theorem in Language Acquisition Wealth of the Stimulus Grammar Schematic / analogical (following Tomasello, 2000; Hofstadter; and usage based models) More subtle correlations, or higher level correlations will take more time to be distinguished from noise – results in a course of development Acquisitional prerequisites may exist, but its a mistake to believe that relevant information isnt being collected long before certain phenomena appear – all input has a physiological impact

Schema Theorem in Language Acquisition Wealth of the Stimulus Traditional Progression 1. infants attend to phonetic features 2. phonetic features allow access to phonological system 3. access to phonology allows access to words and short phrases 4. access to words gives access to syntax matches the observed developmental increase in grammatical complexity input is only informative to the linguistic module acquired at each stage linguistic evidence sets innate parameters serial, computationally expensive (thus UG)

Schema Theorem in Language Acquisition Wealth of the Stimulus Schema Theorem Based Progression 1. Every utterance an infant hears provides a tiny bit of information about the phonetics, phonotactics, phonology, morphology, word categories, syntax, tense and aspect system, pragmatics, semantic categories, diexis, references – every aspect of their language will also match the observed developmental increase in grammatical complexity input is informative to every aspect of language even though its contribution may not clearly surface or be attended to immediately parallel, computationally efficient, flexible, adaptable in line with whats going on in other fields

Schema Theorem in Language Acquisition Conclusion A population of tokens implicitly carries exponentially more information about the set than the tokens themselves represent. Parallel systems (of which GAs and the brain are examples) that act on that population can make use of category information that is not explicitly stated. Formal systems cannot. Without changing our observations of the input, development, or the outcome, by taking a more biologically plausible perspective on the information processing going on, we can see that the linguistic environment is far richer than impoverished