Visual Cognition II Object Perception. Theories of Object Recognition Template matching models Feature matching Models Recognition-by-components Configural.

Slides:



Advertisements
Similar presentations
Perception Chapter 4.
Advertisements

Perception and Pattern Recognition  What types of information do we use to perceive the world correctly?  What are the major theories about how we recognize.
Last week... why object recognition is difficult, the template model the feature recognition model, word recognition as a case study Today... Recognition.
Chapter 1: Introduction to Pattern Recognition
Perception Putting it together. Sensation vs. Perception A somewhat artificial distinction Sensation: Analysis –Extraction of basic perceptual features.
Cognitive Processes PSY 334 Chapter 2 – Perception April 9, 2003.
I. Face Perception II. Visual Imagery. Is Face Recognition Special? Arguments have been made for both functional and neuroanatomical specialization for.
PSY 402 Theories of Learning Chapter 8 – Stimulus Control How Stimuli Guide Instrumental Action.
Cognitive Processes PSY 334 Chapter 2 – Perception June 30, 2003.
1 3 Processes of Pattern Recognition Sensation – you have to detect or see the pattern Perception – you have to organize the features into a whole Memory.
Pattern Recognition Pattern - complex composition of sensory stimuli that the human observer may recognize as being a member of a class of objects Issue.
Object Perception. Perceptual Grouping and Gestalt Laws Law of Good continuation. This is perceived as a square and triangle, not as a combination of.
Types of Perceptual Processes Bottom-up - work up from sensory info. Top-down - apply knowledge and experience.
Visual Cognition II Object Perception. Theories of Object Recognition Template matching models Feature matching Models Recognition-by-components Configural.
Visual Cognition I basic processes. What is perception good for? We often receive incomplete information through our senses. Information can be highly.
Reading. Reading Research Processes involved in reading –Orthography (the spelling of words) –Phonology (the sound of words) –Word meaning –Syntax –Higher-level.
Michael Arbib & Laurent Itti: CS664 – USC, spring Lecture 6: Object Recognition 1 CS664, USC, Spring 2002 Lecture 6. Object Recognition Reading Assignments:
Processing and Constraint Satisfaction: Psychological Implications The Interactive-Activation (IA) Model of Word Recognition Psychology /719 January.
PSY 369: Psycholinguistics Language Comprehension: Visual perception.
Cognitive Processes PSY 334 Chapter 2 – Perception.
Categorization: Scenes & Objects (P) Lavanya Sharan March 16th, 2011.
An aside: peripheral drift illusion illusion of motion is strongest when reading text (such as this) while viewing the image in your periphery. Blinking.
Cognitive Processes PSY 334 Chapter 2 – Perception.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
Pattern Recognition Vidya Manian Dept. of Electrical and Computer Engineering University of Puerto Rico INEL 5046, Spring 2007
THE PROBLEM OF VISUAL RECOGNITION (Ch. 3, Farah) Why is it difficult to identify real world objects from the retinal image? Why is it difficult to identify.
Course Overview Knowledge ch. 3: Vision. How are objects recognized? -It looks easy but it’s not The Brain Acquisition (perception) ch.4: Attention. Use.
Biases: An Example Non-accidental properties: Properties that appear in an image that are very unlikely to have been produced by chance, and therefore.
Pattern recognition = perception Template theory  has problems Prototype theory  better Distinctive features theory  better.
Perceptual Processes: Visual & Auditory Recognition Dr. Claudia J. Stanny EXP 4507 Memory & Cognition Spring 2009.
2 2  Background  Vision in Human Brain  Efficient Coding Theory  Motivation  Natural Pictures  Methodology  Statistical Characteristics  Models.
The Word Superiority Effect OR How humans use context to see without really seeing and how can it help the field of computational vision.
1 Computational Vision CSCI 363, Fall 2012 Lecture 31 Heading Models.
Cognitive Processes PSY 334
Outline »Visual Pattern recognition ◊Template theory ◊Feature Theory ◊Top down influences »Object recognition »Auditory Pattern recognition ◊Physiology.
1 Perceptual Processes  Introduction Pattern Recognition Pattern Recognition Top-down Processing & Pattern Recognition Top-down Processing & Pattern Recognition.
Lecture 3 - Race against Time 1 Three points for today Sensory memory (SM) contains highly transient information about the dynamic sensory array. Stabilizing.
Reicher (1969): Word Superiority Effect Dr. Timothy Bender Psychology Department Missouri State University Springfield, MO
Cognitive Psychology PSYC231 Perception 2 Dr. Jan Lauwereyns, EA619, ext
Korea University Dept.of Industrial System & Information Engineering User Interface Lab Chapter 3 _ Object Recognition + 이병용.
1 Perceptual Processes Introduction –Pattern Recognition –Top-down Processing & Pattern Recognition –Face Perception Attention –Divided attention –Selective.
Perception. Question of the Day Why is recognizing an object so easy for humans, but so difficult for computers?
Perception.
Thinking part I Mental Representations and Visual Imagery Mind Reading
EMPATH: A Neural Network that Categorizes Facial Expressions Matthew N. Dailey and Garrison W. Cottrell University of California, San Diego Curtis Padgett.
Chapter 3: Sensation and Perception Sensation: activity of receptor organs Perception: interpretation of sensory system activity Visual system organization:
3:01 PM Three points for today Sensory memory (SM) contains highly transient information about the dynamic sensory array. Stabilizing the contents of SM.
Introduction to Psychology Sensation and Perception Prof. Jan Lauwereyns
High level vision.
PSY 402 Theories of Learning Chapter 8 – Stimulus Control How Stimuli Guide Instrumental Action.
Cognitive - perception.ppt © 2001 Laura Snodgrass, Ph.D.1 Perception The final image we are consciously aware of is a “constructed” representation of the.
High-Level Vision Object Recognition.
Thinking part I Visual Imagery Mind Reading. Solving problems through imagery What shape are mickey mouse’s ears? How many windows are there in your apartment?
1 Computational Vision CSCI 363, Fall 2012 Lecture 32 Biological Heading, Color.
Thinking part I Mental Representations and Visual Imagery Mind Reading.
Face Detection 蔡宇軒.
Perception & Pattern Recognition 1 Perception Pattern Recognition Theories of Pattern Recognition Bottom-up vs. Top-Down Processing & Pattern Recognition.
Recognizing Visual and Auditory Stimuli
CS201 Lecture 02 Computer Vision: Image Formation and Basic Techniques
Cognitive Processes PSY 334
© 2016 by W. W. Norton & Company Recognizing Objects Chapter 4 Lecture Outline.
Perceiving and Recognizing Objects
Neuropsychology of Vision Anthony Cate April 19, 2001
Brief Review of Recognition + Context
Pattern recognition (…and object perception).
Introduction to Neural Networks And Their Applications - Basics
Creating Data Representations
Cognitive Processes PSY 334
III. Introduction to Neural Networks And Their Applications - Basics
Perception & Pattern Recognition
Presentation transcript:

Visual Cognition II Object Perception

Theories of Object Recognition Template matching models Feature matching Models Recognition-by-components Configural models

Template matching Detect patterns by matching visual input with a set of templates stored in memory – see if any template matches. TEST INSTANCE “J” TEMPLATE “T” TEMPLATE match

Problem: what if the object differs slightly from the template? E.g., it is rotated or scaled differently? Solution: use a set of transformations to best align the object with a template (using translation, rotation, scaling) TEST INSTANCE “J” TEMPLATE “T” TEMPLATE rotation match

Template-matching works well in constrained environments

Figure 2-15 (p. 58) Examples of the letter M. Problem: template matching is not powerful enough for general object recognition

Feature Theories Detect objects by the presence of features Each object is broken down into features E.g. A = + +

Problem Many objects consist of the same collection of features Need to also know how the features relate to each other  structural theories One theory is recognition by components Different objects, similar sets of features

Recognition by Components (RBC) Biederman (1987): Complex objects are made up of arrangements of basic, component parts: geons. “Alphabet” of 24 geons Recognition involves recognizing object elements (geons) and their configuration

Why these geons? Choice of shape vocabulary seems a bit arbitrary However, choice of geons was based on non-accidental properties. The same geon can be recognized across a variety of different perspectives: except for a few “accidental” views:

Viewpoint Invariance Viewpoint invariance is possible except for a few accidental viewpoints, where geons cannot be uniquely identified

Prediction Recognition is easier when geons can be recovered Disrupting vertices disrupts geon processing more than just deleting parts of lines Object Deleting line segments Deleting vertices

Evidence from priming experiments

Problem for RBC Theory does not say how color, texture and small details are processed. These are often important to tell apart specific exemplars or similar objects. E.g.:

Configural models of recognition Individual instances are not stored; what is stored is an “exemplar” or representative element of a category Recognition based on “distance” between perceived item and prototype prototype match “Face space” no match

Prediction: Caricatures might be better recognized than original face from: Face Recognition by Humans: 20 Results all Computer Vision Researchers Should Know About. Sinha et al. (2005). average female face “veridical” face caricature of B

a)no, nothing b)same mouth c)same nose d)same eyes Do these faces have anything in common?

How about these ones? By disrupting holistic (configural) processing, it becomes easier to process the individual parts

Configural effects often disappear when stimulus is inverted Face Inversion

Top-down and Context Effects in Object Recognition

Slide from Rob Goldstone

Context can often help in identification of an object Later identification of objects is more accurate when object is embedded in coherent context

Context can alter the interpretation of an object

Context Effects in Letter Perception The word superiority effect: discriminating between letters is easier in the context of a word than as letters alone or in the context of a nonword string. DEMO: (Reicher, 1969)

Interactive Activation Model Word superiority effect suggests that information at the word level might affect interpretation at the letter level Interactive activation model: neural network model for how different information processing levels interact Levels interact –bottom up: how letters combine to form words –top-down: how words affect detectability of letters

The Interactive Activation Model Three levels: feature, letter, and word level Nodes represent features, letters and words; each has an activation level Connections between nodes are excitatory or inhibitory Activation flows from feature to letter to word level and back to letter level (McClelland & Rumelhart, 1981)

The Interactive Activation Model Bottom-up: –feature to word level Top-down: –word to letter level Model predicts word superiority effect because of top-down processing (McClelland & Rumelhart, 1981)

Predictions of the IA model – stimulus is “WORK” At word level, evidence for “WORK” accumulates over time Small initial increase for “WORD” WORK WORD WEAR

Predictions of the IA model – stimulus is “WORK” Why does the letter “K” get activated? a)because of (partial) activation from feature level b)because of activation from word level back to feature level c)both a) and b) K R D

Predictions of the IA model – stimulus is “WORK” Why does the letter “R” get partially activated? a)because of (partial) activation from feature level b)because of activation from word level back to feature level c)both a) and b) K R D

For a demo of the IA model, see:

“Mind reading”

Predicting What Somebody is Seeing (“mind reading”) Viewing a BottleViewing a Shoe If the brain response is different for different kinds of stimuli, can we predict what somebody is thinking of solely based on the brain’s response? bold response

Pattern Classification Method 1.Acquire brain data for different stimuli (e.g. bottles and shoes) 2.Train a classifier (such as the neural network on right) to discriminate between bottle voxel patterns and shoe voxel patterns 3.Test classifier on novel images (slide from Ken Norman) bottle shoe Input Layer (voxels) Output layer (categories)

Faces Cats Scissors Chairs Houses Bottles Shoes Scrambled Pictures slides courtesy of Jim Haxby Haxby et al. (2001) can predict with 96% accuracy stimuli from 8 categories

Reconstructing the Mental Image If we can predict what somebody is looking at, can we also reconstruct what somebody might be looking at from just the brain’s response? Image Brain’s response Mathematical Model Reconstructed image

Reconstructing simple patterns from fMRI Miyawaki et al. (2008) from: Miyawaki et al. (2008). Neuron, 60(5), pp movie at:

Brain Computer Interfaces ATR Laboratories in Japan developed a robotic hand that can be controlled using fMRI Rainer Goebel’s team had two patients play mental ping-pong in fMRI machines