Emergence of Semantic Knowledge from Experience Jay McClelland Stanford University.

Slides:

Advertisements

Similar presentations

Artificial Intelligence 12. Two Layer ANNs

Advertisements

Summer 2011 Tuesday, 8/ No supposition seems to me more natural than that there is no process in the brain correlated with associating or with.

Cognitive - knowledge.ppt © 2001 Laura Snodgrass, Ph.D.1 Knowledge Structure of semantic memory –relationships among concepts –organization of memory –memory.

Probabilistic inference in human semantic memory Mark Steyvers, Tomas L. Griffiths, and Simon Dennis 소프트컴퓨팅연구실오근현 TRENDS in Cognitive Sciences vol. 10,

Does the Brain Use Symbols or Distributed Representations? James L. McClelland Department of Psychology and Center for Mind, Brain, and Computation Stanford.

Emergence in Cognitive Science: Semantic Cognition Jay McClelland Stanford University.

Emergence of Semantic Structure from Experience Jay McClelland Stanford University.

Concepts and Categories. Functions of Concepts By dividing the world into classes of things to decrease the amount of information we need to learn, perceive,

Concepts and Categories. Functions of Concepts By dividing the world into classes of things to decrease the amount of information we need to learn, perceive,

Physical Symbol System Hypothesis

Categorization  How do we organize our knowledge?  How do we retrieve knowledge when we need it?

CS Bayesian Learning1 Bayesian Learning. CS Bayesian Learning2 States, causes, hypotheses. Observations, effect, data. We need to reconcile.

Cognitive Psychology, 2 nd Ed. Chapter 8 Semantic Memory.

Lecture 10 – Semantic Networks 1 Two questions about knowledge of the world 1.How are concepts stored? We have already considered prototype and other models.

CHAPTER 12 ADVANCED INTELLIGENT SYSTEMS © 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang.

General Knowledge Dr. Claudia J. Stanny EXP 4507 Memory & Cognition Spring 2009.

Development and Disintegration of Conceptual Knowledge: A Parallel-Distributed Processing Approach Jay McClelland Department of Psychology and Center for.

Dynamics of learning: A case study Jay McClelland Stanford University.

Using Backprop to Understand Apects of Cognitive Development PDP Class Feb 8, 2010.

Representation, Development and Disintegration of Conceptual Knowledge: A Parallel-Distributed Processing Approach James L. McClelland Department of Psychology.

Emergence of Semantic Structure from Experience Jay McClelland Stanford University.

Semantic Memory Psychology Introduction This is our memory for facts about the world This is our memory for facts about the world How do we know.

The changing face of face research Vicki Bruce School of Psychology Newcastle University.

Department of Computer Science, University of Waikato, New Zealand Geoffrey Holmes, Bernhard Pfahringer and Richard Kirkby Traditional machine learning.

Dynamic Probabilistic Relational Models Paper by: Sumit Sanghai, Pedro Domingos, Daniel Weld Anna Yershova Presentation slides are adapted.

Bayesian and Connectionist Approaches to Learning Tom Griffiths, Jay McClelland Alison Gopnik, Mark Seidenberg.

Integrating New Findings into the Complementary Learning Systems Theory of Memory Jay McClelland, Stanford University.

The PDP Approach to Understanding the Mind and Brain J. McClelland Cognitive Core Class Lecture March 7, 2011.

The PDP Approach to Understanding the Mind and Brain Jay McClelland Stanford University January 21, 2014.

Disintegration of Conceptual Knowledge In Semantic Dementia James L. McClelland Department of Psychology and Center for Mind, Brain, and Computation Stanford.

Awakening from the Cartesian Dream: The PDP Approach to Understanding the Mind and Brain Jay McClelland Stanford University February 7, 2013.

Contrasting Approaches To Semantic Knowledge Representation and Inference Psychology 209 February 15, 2013.

The Influence of Feature Type, Feature Structure and Psycholinguistic Parameters on the Naming Performance of Semantic Dementia and Alzheimer’s Patients.

The Past Tense Model Psych /719 Feb 13, 2001.

Development, Disintegration, and Neural Basis of Semantic Cognition: A Parallel-Distributed Processing Approach James L. McClelland Department of Psychology.

Emergence of Semantic Structure from Experience Jay McClelland Stanford University.

Similarity and Attribution Contrasting Approaches To Semantic Knowledge Representation and Inference Jay McClelland Stanford University.

Rapid integration of new schema- consistent information in the Complementary Learning Systems Theory Jay McClelland, Stanford University.

Semantic Cognition: A Parallel Distributed Processing Approach James L. McClelland Center for the Neural Basis of Cognition and Departments of Psychology.

Cognitive Processes PSY 334 Chapter 5 – Meaning-Based Knowledge Representation.

Classification (slides adapted from Rob Schapire) Eran Segal Weizmann Institute.

1 How is knowledge stored? Human knowledge comes in 2 varieties: Concepts Concepts Relations among concepts Relations among concepts So any theory of how.

The Origins of Knowledge Debate How do people gain knowledge about the world around them? Are we born with some fundamental knowledge about concepts like.

Origins of Cognitive Abilities Jay McClelland Stanford University.

The Emergent Structure of Semantic Knowledge

Semantic Memory Psychology Introduction This is our memory for facts about the world How do we know that the capital of Viet Nam is Hanoi How is.

Verbal Representation of Knowledge

Emergent Semantics: Meaning and Metaphor Jay McClelland Department of Psychology and Center for Mind, Brain, and Computation Stanford University.

From NARS to a Thinking Machine Pei Wang Temple University.

Semantic Knowledge: Its Nature, its Development, and its Neural Basis James L. McClelland Department of Psychology and Center for Mind, Brain, and Computation.

Organization and Emergence of Semantic Knowledge: A Parallel-Distributed Processing Approach James L. McClelland Department of Psychology and Center for.

Development and Disintegration of Conceptual Knowledge: A Parallel-Distributed Processing Approach James L. McClelland Department of Psychology and Center.

Chapter 9 Knowledge. Some Questions to Consider Why is it difficult to decide if a particular object belongs to a particular category, such as “chair,”

Brief Intro to Machine Learning CS539

Bayesian inference in neural networks

Psychology 209 – Winter 2017 January 31, 2017

What is cognitive psychology?

Psychology 209 – Winter 2017 Feb 28, 2017

Development and Disintegration of Conceptual Knowledge: A Parallel-Distributed Processing Approach James L. McClelland Department of Psychology and Center.

Does the Brain Use Symbols or Distributed Representations?

Bayesian inference in neural networks

Emergence of Semantic Structure from Experience

Emergence of Semantics from Experience

Knowledge Pt 2 Chapter 10 Knowledge Pt 2.

Prepared by: Mahmoud Rafeek Al-Farra

Artificial Intelligence 12. Two Layer ANNs

How is knowledge stored?

CLS, Rapid Schema Consistent Learning, and Similarity-weighted Interleaved learning Psychology 209 Feb 26, 2019.

Presentation transcript:

Emergence of Semantic Knowledge from Experience Jay McClelland Stanford University

Approaches to Understanding Intelligence Symbolic approaches – explicit symbolic structures – structure-sensitive rules – discrete computations even if probabilistic Emergence-based approaches – Symbolic structures and processes as approximate characterizations of emergent consequences of  Neural mechanisms  Development  Evolution ……

Emergent vs. Stipulated Structure Old Boston Midtown Manhattan

Explorations of a Neural Network Model Neurobiological basis Initial implementation Emergence of semantic knowledge Disintegration of semantic knowledge in neurodegenerative illness Characterizing the behavior of the model Further explorations

Kiani et al (2007) Pattern Similarity From Monkey Neurons

Goals: 1.Show how a neural network could capture semantic knowledge implicitly 2.Demonstrate that learned internal representations can capture hierarchical structure 3.Show how the model could make inferences as in a symbolic model Rumelhart’s Distributed Representation Model

The Quillian Model

The Rumelhart Model

The Training Data: All propositions true of items at the bottom level of the tree, e.g.: Robin can {grow, move, fly} 7

Start with neutral pattern. Adjust to find a pattern that accounts for new information.

The result is a pattern similar to that of the average bird…

Use this pattern to infer what the new thing can do.

Phenomena in Development (Rogers & McClelland, 2004) Progressive differentiation U-shaped over-generalization of – Typical properties – Frequent names Emergent domain-specificity Basic level, expertise & frequency effects Conceptual reorganization Tim Rogers

Phenomena in Development (Rogers & McClelland, 2004) Progressive differentiation U-shaped over-generalization of – Typical properties – Frequent names Emergent domain-specificity Basic level, expertise & frequency effects Conceptual reorganization Tim Rogers

5

Differentiation over time

ExperienceExperience Early Later Later Still

Overgeneralization of Typical Properties Activation Epochs of Training Pine has leaves

Overgeneralization of Frequent Names Children typically see and talk about far more dogs than any other animal They often call other, less familiar animals ‘dog’ or ‘doggie’ But when they are a little older they stop This occurs in the model, too

Overgeneralization of Frequent Names Epochs of Training Activation

Reorganization of Conceptual Knowledge (Carey, 1985) Young children don’t really understand what it means to be a living thing By 10-12, they have a very different understanding Carey argues this requires integration of many different kinds of information The model can exhibit reorganization, too

The Rumelhart Model small

The Rumelhart Model small

The Rumelhart Model small

Reorganization Simulation Results EARLY LATER

Disintegration in Semantic Dementia Loss of differentiation Overgeneralization

language Grounding the Model in The Brain Specialized brain areas subserve each kind of semantic information Semantic dementia results from degeneration near the temporal pole Initial learning and use of knowledge depends on the medial temporal lobe

Architecture for the Organization of Semantic Memory color form motion action valance Temporal pole name Medial Temporal Lobe

Explorations of a Neural Network Model Neurobiological basis Initial implementation Emergence of semantic knowledge Disintegration of semantic knowledge in neurodegenerative illness Characterizing the behavior of the model Further explorations

Neural Networks and Probabilistic Models The model learns the conditional probability structure of the training data: P(A i = 1|I j & C k ) for all i,j,k … subject to constraints imposed by initial weights and architecture. Input representations are important too The structure in the training data and lead the network to behave as though it is learning a – Hierarchy – Linear Ordering – Two-dimensional similarity space…

The Hierarchical Naïve Bayes Classifier as a Model of the Rumelhart Network Items are organized into categories Categories may contain sub- categories Features are probabilistic and depend on the category We start with a one-category model, and learn p(F|C) for each feature We differentiate as evidence accumulates supporting a further differentiation Brain damage erases the finer sub-branches, causing ‘reversion’ to the feature probabilities of the parent Living Things … Animals Plants Birds Fish Flowers Trees

Overgeneralization of Typical Properties Activation Epochs of Training Pine has leaves

Accounting for the network’s feature attributions with mixtures of classes at different levels of granularity Regression Beta Weight Epochs of Training Property attribution model: P(f i |item) =  k p(f i |c k ) + (1-  k )[(  j p(f i |c j ) + (1-  j )[…])

Should we replace the PDP model with the Naïve Bayes Classifier? It explains a lot of the data, and offers a succinct abstract characterization But –It only characterizes what’s learned when the data actually has hierarchical structure –In natural data, all items don’t neatly fit in just one place, and some important dimensions of similarity cut across the tree. So it may be a useful approximate characterization in some cases, but can’t really replace the real thing.

Further Explorations Modeling cross-domain knowledge transfer and ‘grounding’ of one kind of knowledge in another Mathematical characterization of natural structure, encompassing hierarchical organization as well as other structural forms Exploration of the protective effects of ongoing experience on preservation of knowledge during early phases of semantic dementia

Thanks!