Chapter 2: Marr’s theory of vision. Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Overview Introduce Marr’s distinction between.

Slides:



Advertisements
Similar presentations
Top-Down versus Bottom-Up Perception
Advertisements

Applications of one-class classification
Chapter 4: Cognitive science and the integration challenge
Chapter 5: Space and Form Form & Pattern Perception: Humans are second to none in processing visual form and pattern information. Our ability to see patterns.
Chapter 1: Information and information-processing
ARCHITECTURES FOR ARTIFICIAL INTELLIGENCE SYSTEMS
Chapter 3: Neural Processing and Perception. Lateral Inhibition and Perception Experiments with eye of Limulus –Ommatidia allow recordings from a single.
Dynamic Occlusion Analysis in Optical Flow Fields
Chapter 4: Local integration 2: Neural correlates of the BOLD signal
DREAM PLAN IDEA IMPLEMENTATION Introduction to Image Processing Dr. Kourosh Kiani
PDP: Motivation, basic approach. Cognitive psychology or “How the Mind Works”
PSY 5018H: Math Models Hum Behavior, Prof. Paul Schrater, Spring 2004 Vision as Optimal Inference The problem of visual processing can be thought of as.
Visualization CSC 485A, CSC 586A, SENG 480A Instructor: Melanie Tory.
Cognitive Processes PSY 334 Chapter 2 – Perception June 30, 2003.
Overview of Computer Vision CS491E/791E. What is Computer Vision? Deals with the development of the theoretical and algorithmic basis by which useful.
Sebastian Bitzer Seminar Neurophysiological Foundations of Consciousness University of Osnabrueck A sensorimotor.
Green’s Tri-Level Hypothesis Behavioral: a person’s performance on specific experimental tasks Cognitive: the postulated cognitive or affective systems.
What is Cognitive Science? … is the interdisciplinary study of mind and intelligence, embracing philosophy, psychology, artificial intelligence, neuroscience,
Chapter Four The Cognitive Approach I: History, Vision, and Attention.
Summer 2011 Wednesday, 8/3. Biological Approaches to Understanding the Mind Connectionism is not the only approach to understanding the mind that draws.
Ch 31 Sensation & Perception Ch. 3: Vision © Takashi Yamauchi (Dept. of Psychology, Texas A&M University) Main topics –convergence –Inhibition, lateral.
CS292 Computational Vision and Language Visual Features - Colour and Texture.
PY202 Overview. Meta issue How do we internalise the world to enable recognition judgements to be made, visual thinking, and actions to be executed.
What is Cognitive Science? … is the interdisciplinary study of mind and intelligence, embracing philosophy, psychology, artificial intelligence, neuroscience,
The Cognitive Approach I: History, Vision, and Attention
Cognitive Processes PSY 334 Chapter 2 – Perception.
Cognitive Processes PSY 334 Chapter 2 – Perception.
55:148 Digital Image Processing Chapter 11 3D Vision, Geometry Topics: Basics of projective geometry Points and hyperplanes in projective space Homography.
1B50 – Percepts and Concepts Daniel J Hulme. Outline Cognitive Vision –Why do we want computers to see? –Why can’t computers see? –Introducing percepts.
Chapter 2: Modeling mental imagery. Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 The ingredients Encountered some of the basic.
THE PROBLEM OF VISUAL RECOGNITION (Ch. 3, Farah) Why is it difficult to identify real world objects from the retinal image? Why is it difficult to identify.
MIND: The Cognitive Side of Mind and Brain  “… the mind is not the brain, but what the brain does…” (Pinker, 1997)
1 Computational Vision CSCI 363, Fall 2012 Lecture 26 Review for Exam 2.
Chapter 4: Global responses to the integration challenge.
Active Vision Key points: Acting to obtain information Eye movements Depth from motion parallax Extracting motion information from a spatio-temporal pattern.
Visual Perception Is a Creative Process Instructor : Dr. S. Gharibzadeh Presented By : J. Razjouyan.
Lecture 2b Readings: Kandell Schwartz et al Ch 27 Wolfe et al Chs 3 and 4.
1 Computational Vision CSCI 363, Fall 2012 Lecture 20 Stereo, Motion.
Course 9 Texture. Definition: Texture is repeating patterns of local variations in image intensity, which is too fine to be distinguished. Texture evokes.
Information: Perception and Representation Lecture #7 Part A.
Chapter 3: Neural Processing and Perception. Neural Processing and Perception Neural processing is the interaction of signals in many neurons.
Korea University Dept.of Industrial System & Information Engineering User Interface Lab Chapter 3 _ Object Recognition + 이병용.
1 Perception and VR MONT 104S, Fall 2008 Lecture 4 Lightness, Brightness and Edges.
1 Self-Calibration and Neural Network Implementation of Photometric Stereo Yuji IWAHORI, Yumi WATANABE, Robert J. WOODHAM and Akira IWATA.
Fundamentals of Sensation and Perception RECOGNIZING VISUAL OBJECTS ERIK CHEVRIER NOVEMBER 23, 2015.
Colour and Texture. Extract 3-D information Using Vision Extract 3-D information for performing certain tasks such as manipulation, navigation, and recognition.
Perception and VR MONT 104S, Fall 2008 Lecture 8 Seeing Depth
Image features and properties. Image content representation The simplest representation of an image pattern is to list image pixels, one after the other.
Ko Youngkil.  Biological Evidence  5 prediction  compare with experimental results  Primary visual cortex (V1) is involved in image segmentation,
1 Computational Vision CSCI 363, Fall 2012 Lecture 2 Introduction to Vision Science.
Pattern Recognition. What is Pattern Recognition? Pattern recognition is a sub-topic of machine learning. PR is the science that concerns the description.
Correspondence and Stereopsis. Introduction Disparity – Informally: difference between two pictures – Allows us to gain a strong sense of depth Stereopsis.
From local motion estimates to global ones - physiology:
Sight Our Visual Perception
What is cognitive psychology?
Vision.
Gestalt Perception Cognitive Robotics David S. Touretzky &
Computational Vision --- a window to our brain
Cognitive Processes PSY 334
Recognizing Deformable Shapes
Dr.safeyya Adeeb Alchalabi
What is Information Visualization?
Pattern recognition (…and object perception).
Cognitive Processes PSY 334
On Symmetry, Illusory Contours and Visual Perception
?. ? White Fuzzy Color Oblong Texture Shape Most problems in vision are underconstrained White Color Most problems in vision are underconstrained.
Cognitive Processes PSY 334
Presented by Hanan Shteingart Physiology A – 2009, ICNC May 2009
Grouping/Segmentation
Experiencing the World
Presentation transcript:

Chapter 2: Marr’s theory of vision

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Overview Introduce Marr’s distinction between levels of explanation Explain Marr’s top-down model of how the levels relate Sketch out the basic elements of Marr’s theory of early vision

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Background The different disciplines involved in cognitive science operate at very different levels and use different techniques Experimental psychology Computer science/AI Theoretical perspectives from information theory and theory of computation

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Different levels Miller’s psychophysics experiments  explore the cognitive capacities and limitations of the subject Broadbent’s information-processing model  explores how information flows within and between various subpersonal sub- systems and mechanisms Kosslyn focusing on different ways of coding information And we haven’t even started thinking about the brain...

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Marr’s approach Distinguished different explanatory tasks at different levels Gave a general theoretical framework for combining them Apply the framework in considerable detail to a single example  the early visual system

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Marr’s three levels 3 different types of analysis of an information- processing system Computational Algorithmic Implementational

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Computational analysis Form of task analysis of a cognitive system (a) Identifies the specific information-processing problem that the system is configured to solve (b) Identify general constraints upon any solution to that problem

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Algorithmic analysis Explains how the cognitive system actually performs the information-processing task identifies input information and output information identifies algorithm for transforming input into required output specifies how information is encoded

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Implementational analysis Finds a physical realization for the algorithm Identify neural structures realizing the basic representational states to which the algorithm applies [e.g. populations of neurons] Identify neural mechanisms that transform those representational states according to the algorithm

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Turing machine example Computational Characterization of multiplication function AlgorithmicTuring machine table ImplementationalConstruction of a physical Turing machine

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Marr’s computational analysis of visual system Two basic conclusions from his task analysis The visual system’s job is to provide a 3D representation of the visual environment that can serve as input to recognition and classification processes – primarily information about shape of objects and their spatial distribution This 3D representation is on an object-centered rather than viewer-centered frame of reference

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Experimental evidence Possibility of double dissociations between perceptual abilities and recognitional abilities Right parietal lesions – recognition abilities preserved, but problems in perceiving shapes from unusual perspectives Left parietal lesions - shape perception intact, but recognition and identification impaired Suggested to Marr that visual system provides input to recognition systems

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Theoretical considerations Recognitional abilities are constant across changes in how things look to the perceiver due to orientation of object its distance from perceiver partial occlusion by other objects So - visual system provides information to recognition systems that abstracts away from these perspectival features  – observer-independent representation

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Algorithmic analysis Input = light arriving at retina Output = 3D representation of environment Questions: what sort of information is extracted from the light at the retina? how does the system get from this information to a 3D representation of the environment?

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 The challenge “From an information-processing point of view, our primary purpose is to define a representation of the image of reflectance changes on a surface that is suitable for detecting changes in the image’s geometrical organization that are due to changes in the reflectance of the surface itself or to changes in the surface’s orientation or distance from the viewer” (Marr, Vision p. 44) Need to find representational primitives that allow inference backwards from structure of image to structure of environment

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Representational primitives Basic information at retina = intensity value of light at each point in the retinal image changes in intensity value provide clues as to surface boundaries Primitives allow structure to be imposed on patterns of intensity changes E.g. zero crossings (sudden intensity changes)

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Zero crossings If we plot changes in intensity on a graph, then radical discontinuities will be signalled by the curve crossing zero Marr proposed a Laplacian/Gaussian filter to detect zero crossings

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Primal sketch identifies intensity changes in the 2D image basic information about the geometric organization of those intensity changes Primitives include: zero-crossings virtual lines groups

Cognitive Science  José Luis Bermúdez / Cambridge University Press D sketch Displays orientation of visible surfaces in viewer-centered coordinates Represents distance of each point in visual field from viewer Also orientation of each point and contours of discontinuities Very basic information about depth

Cognitive Science  José Luis Bermúdez / Cambridge University Press D sketch characterizes shapes and their spatial organization object-centered basic volumetric and surface primitives are schematic (facilitates recognition)

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Representation in the 3D sketch depends upon many shapes being recognizable as ensembles of generalized cones Generalized cones are easy to represent vector describing path of the figures axis of symmetry vector specifying perpendicular distance from every point on axis to shape’s surface

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Basic top-down assumption Explanation is top-down because of underdetermination Many different algorithms can in principle compute the same task There are many different ways of implementing a given algorithm Multiple realizability  more informative to work at higher levels Relatively little implementational detail in Marr

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Implementing zero crossings

Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Marr key points 1)Trilevel hypothesis very influential 2)Classic example of top-down analysis 3)Most detail comes at algorithmic level 4)Neurobiology only comes in at the implementational level