Cognitive Neuroscience and Embodied Intelligence

Slides:



Advertisements
Similar presentations
Perception Chapter 4 Visual Process beyond the Retina
Advertisements

Chapter 4: The Visual Cortex and Beyond
The Primary Visual Cortex
What do we know about Primary Visual Cortex (V1)
Higher Visual Areas Anatomy of higher visual areas
Chapter 2.
Visual Sensation & Perception How do we see?. Structure of the eye.
Central Visual Processes. Anthony J Greene2 Central Visual Pathways I.Primary Visual Cortex Receptive Field Columns Hypercolumns II.Spatial Frequency.
Chapter 3: Neural Processing and Perception. Lateral Inhibition and Perception Experiments with eye of Limulus –Ommatidia allow recordings from a single.
The Central Visual System
Anatomy/Physiology of Binocular Vision Goals –Follow the M and P pathway out of primary visual cortex –Answer where binocularly and disparity driven cells.
Perception of Stimuli Stephen Taylor.
Chapter 6 The Visual System
Higher Processing of Visual Information: Lecture III
Blue= rods Green = Cones Pathways from the Retina In the brain, retinal ganglion axons travel to… –the hypothalamus: control bodily rhythms.
Exam 1 week from today in class assortment of question types including written answers.
The Visual System Into. to Neurobiology 2010.
COGNITIVE SCIENCE 17 The Visual System: Color Vision Part 2 Jaime A. Pineda, Ph.D.
Visual Sensation & Perception How do we see?. Structure of the eye.
Visual Cognition I basic processes. What is perception good for? We often receive incomplete information through our senses. Information can be highly.
Color vision Different cone photo- receptors have opsin molecules which are differentially sensitive to certain wavelengths of light – these are the physical.
VISUAL PATHWAYS Organization of LGN of thalamus Organization of Visual Cortex What Stream How Stream The Binding Problem.
Human Sensing: The eye and visual processing Physiology and Function Martin Jagersand.
The Human Visual System Vonikakis Vasilios, Antonios Gasteratos Democritus University of Thrace
Ch 31 Sensation & Perception Ch. 3: Vision © Takashi Yamauchi (Dept. of Psychology, Texas A&M University) Main topics –convergence –Inhibition, lateral.
Chapter 10 The Central Visual System. Introduction Neurons in the visual system –Neural processing results in perception Parallel pathway serving conscious.
Basic Processes in Visual Perception
The visual system Lecture 1: Structure of the eye
Laurent Itti: CS599 – Computational Architectures in Biological Vision, USC Lecture 5: Introduction to Vision 2 1 Computational Architectures in.
EE141 1 Perception and attention Janusz A. Starzyk Computational Intelligence Based on a course taught by Prof. Randall O'ReillyRandall O'Reilly University.
Copyright © 2007 Wolters Kluwer Health | Lippincott Williams & Wilkins Neuroscience: Exploring the Brain, 3e Chapter 10: The Central Visual System.
University Studies 15A: Consciousness I The Neurobiology of Vision.
Vision. Light is electromagnetic energy. One nm = one billionth of a meter The Visible Spectrum.
Sensation and Perception Part 1: Intro and Vision.
Vision Biology/Psychology Some introductory thoughts Sensory world in general is basically a representation of the real world So, we have a rich.
1 Computational Vision CSCI 363, Fall 2012 Lecture 3 Neurons Central Visual Pathways See Reading Assignment on "Assignments page"
Visual System: Photons to memory. -- Each sensory system responds with some specificity to a stimulus and each employs specialized cells - the peripheral.
THE VISUAL SYSTEM: EYE TO CORTEX Outline 1. The Eyes a. Structure b. Accommodation c. Binocular Disparity 2. The Retina a. Structure b. Completion c. Cone.
Slide 1 Neuroscience: Exploring the Brain, 3rd Ed, Bear, Connors, and Paradiso Copyright © 2007 Lippincott Williams & Wilkins Bear: Neuroscience: Exploring.
Occipital Lobe Videos: –Brain modules 8,9,10, 11 –Consciousness- Blindsight.
Visual System: Photons to memory. -- Each sensory system responds with some specificity to a stimulus and each employs specialized cells - the peripheral.
DT2350, Lecture 4: Introduction to vision, including perception of objects and scenes Dr. Ginevra Castellano Department of Information Technology Uppsala.
Ch 31 Sensation & Perception Ch. 3: Vision © Takashi Yamauchi (Dept. of Psychology, Texas A&M University) Main topics –convergence –Inhibition, lateral.
The primate visual systemHelmuth Radrich, The primate visual system 1.Structure of the eye 2.Neural responses to light 3.Brightness perception.
Vision Psychology Some introductory thoughts Sensory world in general is basically a representation of the real world Sensory world in general is.
Chapter 3: Neural Processing and Perception. Neural Processing and Perception Neural processing is the interaction of signals in many neurons.
1 Computational Vision CSCI 363, Fall 2012 Lecture 5 The Retina.
EE141 1 Perception - Vision Janusz A. Starzyk
The Eye. Energy v. Chemical senses Energy SensesChemical Senses.
Vision Photoreceptor cells Rod & Cone cells Bipolar Cells Connect in between Ganglion Cells Go to the brain.
Cognition, Brain and Consciousness: An Introduction to Cognitive Neuroscience Edited by Bernard J. Baars and Nicole M. Gage 2007 Academic Press Chapter.
Higher Visual Areas 1.Anatomy of higher visual areas 2.Two processing pathways - “ Where ” pathway for motion and depth - “ What ” pathway for form and.
Cognitive Architectures
1 Copyright © 2014 Elsevier Inc. All rights reserved. Chapter 19 Visual Network Moran Furman.
Keith Clements Introduction to Neuroscience
Understanding Psychophysics: Spatial Frequency & Contrast
Outline Of Today’s Discussion 1.LGN Projections & Color Opponency 2.Primary Visual Cortex: Structure 3.Primary Visual Cortex: Individual Cells.
Sensation and Perception. Transformation of stimulus energy into a meaningful understanding –Each sense converts energy into awareness.
CHAPTER 10 Vision and visual perception Form Vision.
1 Perception and VR MONT 104S, Spring 2008 Lecture 3 Central Visual Pathways.
Vision.
Cognitive Architectures
Brodmann’s Areas. Brodmann’s Areas The Primary Visual Cortex Hubel and Weisel discovered simple, complex and hypercomplex cells in the striate.
Early Processing in Biological Vision
Optic Nerve Projections
The Visual System: Higher Cortical Mechanisms
Mind, Brain & Behavior Wednesday February 12, 2003.
Fundamentals of Sensation and Perception
(Do Now) Journal What is psychophysics? How does it connect sensation with perception? What is an absolute threshold? What are some implications of Signal.
Outline Announcements Human Visual Information Processing
Presentation transcript:

Cognitive Neuroscience and Embodied Intelligence Perception - Vision Based on book Cognition, Brain and Consciousness ed. Bernard J. Baars courses taught by Prof. Randall O'Reilly, University of Colorado, and Prof. Włodzisław Duch, Uniwersytet Mikołaja Kopernika and http://wikipedia.org/ http://grey.colorado.edu/CompCogNeuro/index.php/CECN_CU_Boulder_OReilly http://grey.colorado.edu/CompCogNeuro/index.php/Main_Page Janusz A. Starzyk

Motivation Perception is comparatively the easiest to understand although for many specific questions there are no clear answers. General questions: Why does the primary visual cortex react to oriented edges? Why does the visual system separate information into the dorsal stream, connected to motion and representation of object locations, and the ventral stream, connected to object recognition? Why does damage to the parietal cortex lead to spatial orientation and attention disorders? In what way do we recognize objects in different places, orientations, distances, with different projections of the image onto the retina?

Introduction The purpose of vision: ‘to know what is where’ (David Marr) The visual perception is far more complicated than simply taking a picture with digital camera The camera doesn’t really do anything with this image and doesn’t have any knowledge about what is stored in the image

Introduction Knowing what: perceiving features, groups and objects Studies of human visual perception and neuroscience suggest that there are many levels of perception. The human brain appears to process basic visual features, such as color, orientation, motion, texture and stereoscopic depth. Neurons are highly tuned to specific features like a line at particular angle, a particular color, or particular motion detection. The activity of each neuron represents only a small part of the visual field. How is the brain able to combine this information across many neurons? The brain is able to organize basic feature elements into organized perceptual groups. Psychologists proposed the Gestalt laws of perceptual grouping, such as the laws of similarity, proximity, good continuation, common fate and so forth

Perceptual grouping Grouping by similarity: Grouping by proximity: White dots grouped with white dots, squares with squares Grouping by proximity: Here we perceive two separate groups of dots that are near each other Grouping by good continuation: On the left we perceive a single object. When the same lines are separated we do not

Functional organization of the visual system Visual pathway

Visual pathways Visual pathways: retina => lateral geniculate nucleus (LGN) of the thalamus => visual radiation => area of the primary cortex V1 => higher levels of the visual system => associative and multimodal areas. V1 cells are organized in ocular dominance columns and orientation columns, retinoscopic. Simple layer 4 cells react to bands with a specific slant, contrasting edges, stimulus from one eye. A substantial part of the central V1 area reacts to signals from fovea, where the density of receptors is the greatest.

Functional organization of the visual system The hierarchical organization begins in the retina, passes through the lateral geniculate nucleus (LGN - part of the thalamus), reaching the primary visual cortex V1, from where it's distributed further. (c) 1999. Tralvex Yeap. All Rights Reserved 8

Functional organization of the visual system Objects in environment are projected to the back of the eye – the retina. Retina contains millions of photoreceptors

The retina Two types of light-sensitive photoreceptors Cones cone-shaped less sensitive operate in high light color vision Rods rod-shaped highly sensitive operate at night gray-scale vision

The retina http://www.iit.edu/~npr/DrJennifer/visual/retina.html The signals from photoreceptors are processed by a collection of intermediary neurons, bipolar cells, horizontal cells and amacrine cells, before they reach the ganglion cells

Retina The retina is not a passive camera registering images. Crucial rule: enhancing contrasts underlining changes in space and time, strengthening edges, uniformly lit areas are less important. Photoreceptors in rods and cones, 3-layer network, ganglion cells =>LGN. Receptive fields: areas, which stimulate a given cell. The combination of signals in the retina gives center-surround receptive fields (on-center) and vice versa, detects edges. Each individual field of cells can be modeled as a Gaussian model, so these fields are obtained as a difference of Gaussians (DOG).

On-center off surround ganglion cells No stimuli: both fire at base rate Stimuli in center: ON-center-OFF-surround fires rapidly OFF-center-ON-surround doesn’t fire Stimuli in both regions: both fire slowly Stimuli in surround: OFF-center-ON-surround fires rapidly ON-center-OFF-surround doesn’t fire

On-center off surround ganglion cells on-center cell David Hubel & Torsten Wiesel Received Nobel price for their discovery of on-center off-surround cells in retina http://www.physiology.wisc.edu/yin/public/

On-center off surround ganglion cells Retina ganglion cells receive both excitatory and inhibitory inputs from bipolar neurons In the figure shown, the ganglion cell receives excitatory inputs from cells corresponding to the on-center region, and inhibitory inputs from the off-center region

On-center off surround ganglion cells Original image Image based on edges Lateral inhibition is important in enhancing neural representation of edges, where the light intensity changes sharply and indicate a presence of contours, shapes, or objects. Uniform parts of a picture are less interesting.

On-center off surround ganglion cells Sometimes this later inhibition leads to a surprising visual illusions as shown on this figure. Notice black dots appear on intersection of white lines.

Retinal ganglion cells There are two types of ganglion cells in the retina: Large magnocellular ganglion cells, or M cells, carry information about: Movement Location depth perception. Smaller parvocellular ganglion cells, or P cells, transmit signals that pertain to: Colour Form texture of objects in the visual field.

Lateral geniculate nucleus (LGN) From the eye, retinal ganglion cells send their axons to a structure in the thalamus called lateral geniculate nucleus (LGN) The inputs from the nasal portion of each retina must cross at the optic chiasm to project to the opposite LGN

LGN pathway The M cells send their information to layers 1 & 2 of LGN. The P cells send their information to layers 3-6. So, layers 3-6 are involved in processing information concerning fine detail and color. Layers 1 & 2 process information concerning movement.

Pathways in visual system Propagation of the visual input from the left and right visual fields. Signals propagate through eye, retina, optic nerve, chiasm, optic tract, LGN to visual cortex V1

Lateral geniculate nucleus Signal compression – partly already done in the retina. Different types of information find their way to different LGN layers. Intermediate station – all sensory signals (except olfactory) go through different nuclei of the thalamus. Dynamic information processing: steering attention and fast large-celled pathway reacting to motion. Retroactive projections V1=>LGN are an order of magnitude more numerous than LGN=>V1 (role - prediction). The competitive dynamic selects signals from the visual field, especially involving motion.

LGN of the Thalamus Parvocellular layers 3-6 Magnocellular layers 1& 2

Primary visual cortex V1 Neurons in V1 are sensitive to a whole host of visual features, not seen in the LGN, like orientation, direction of motion, color differences, or binocular disparities. Orientation helps to detect edges and contours. Direction of motion is important to determine dangerous moves of an attacker. Color helps to differentiate and identify objects particularly in a camouflage environment. Binocular disparities between images in two eyes allow us to perceive stereo-depth when we look at object with both eyes.

Primary visual cortex V1 Neurons in V1 respond with different strength to orientation edges, depending on location of their receptive fields. Neuron’s response is strongest if the excitation aligns with its receptive field.

Edge detectors Contrasting signals points from the LGN are organized by the V1 cortex into edge detectors oriented at a specific angle. Simple V1 cells combine into edge detectors, enabling the determination of shapes, other cells react to color and texture. Properties of edge detectors: different orientation; high frequency = fast changes, narrow bands; low frequency = gentle changes, wide bands; polarization = dark-light or vice-versa, dark-light-dark or vice-versa.

Representation in the V1 cortex Oriented edge detectors can be created by correlational Hebbian learning based on natural scenes. What happens with information about color, texture, motion?

Retinoscopic maps in V1 The spatial position of the ganglion cells within the retina is preserved by the spatial organization of the neurons within the LGN layers. The posterior LGN contains neurons whose receptive field are near the fovea.

Area V1: The Primary Visual Cortex V1 is made up of 6 layers (no relation to 6 layers in LGN). LGN sends axons to layer IV of V1. M and P cells are separate. Right and Left eye are separate.

Hierarchy of visual processing From retina, LGN, V1, through V4 and to ventral temporal cortex (VTC) neurons gradually respond to more complex stimuli: Retina and LGN extract small dots In V1 small dots are combined into edges, In V4 edges are combined into simple shapes and color features In VTC simple shapes are combined into objects

Extrastriate visual areas – outside of V1 Flattened map of higher visual areas V1 sends signals to many higher visual areas, including areas such as V2, V3, V4 and motion-sensitive area MT. Area V4 is important for the perception of color and some neurons in V4 respond to more complex features or their combination (like corners or curves). The middle-temporal area (MT), is important for motion perception. Almost all of the neurons in MT are direction-selective, and respond selectively to a certain range of motion directions or patterns of motion.

The ventral and dorsal pathways: knowing what and where The projections from V2 to higher areas in the cortex can be roughly divided according to two major parallel pathways: a ventral pathway to temporal lobe (what) and a dorsal pathway to parietal lobe (where) The ‘what’ pathway includes ventral areas like V4, LOC, and IT

Two streams where?/what? depth direction “where” pathway “what” pathway orientation shape color Recognized Object ready for perception “Where" = large-celled pathway, heading for the parietal lobe. "What"= small-celled pathway heading for the temporal lobe (IT).

The ventral and dorsal pathways: knowing what and where In the dorsal pathway, signals from V1 travel to dorsal areas like MT and V3A, which then send major projections to many regions of the parietal lobe. In the ventral pathway, many signals from V1 travel to ventral areas V2, V3 and V4 and onward to many areas of the temporal lobe. Dorsal and ventral pathways in a monkey brain

Two streams where?/what? - temporal lobe Where? - parietal lobe Milner and Goodale (1995): visual pathways don't so much determine where and what, as much as they enable action and perception. There is also the old limbic pathway, enabling rapid action in dangerous situations (after which follows a wave of fear).

Two streams Ungerleider and Mishkin (1982): there exist two notably divided pathways for processing visual information, running from the eye. Large-grained PA retina cells, 3 types of photoreceptive cones, large receptive fields, rapidly-conducting axons, activation for light in a wide band. Small-grained PB cells, 1 or 2 types of photoreceptive cones, small receptive fields, slowly conducting axons, recognize color oppositions. Large-celled pathway: runs to two large-celled LGN layers, it's characterized by a low spatial resolution, high sensitivity to contrast, rapid signal transfer, without information about color. The small-celled pathway has 4 small-grained layers in the LGN, high spatial resolution, color, slower information transfer, low sensitivity to contrast.

Dorsal pathway Large-celled pathway: from the occipital lobe through the dorsal pathway to the parietal cortex. Arrives at the 4B layer in V1, from here to the thick dark stripes of the V2 region, analyzes information about object motion. In V1, layer 4B => V5, localization in the field of vision, motion. V5 stimulates the parietal lobe, PPC (posterior parietal cortex), regions 7 and 5; this enables spacial orientation, depth and motion perception(eye orientation).

Ventral pathway Small-celled pathway: the ventral pathway, to the inferior temporal cortex. V1 => V2 interblob region, reacts to line orientation, gives a large visual acuity, without color. V1 => V3 blob region, reacts to shapes, reaction to color in the neurons in the dark stripes of V3. V2 => V4, main area of color analysis, information arrives at the inferior temporal cortex (IT). The IT area in the inferior temporal lobe has neurons which react to complex objects.

Model v1rf.proj.gz, Chapter 8 How receptive fields are formed in V1? Inputs: 12x12, signals from LGN cells on (pos) and off (neg) center. Input samples: randomly selected parts of 24x24 from 4 600x800 natural pictures. Hidden layer 14x14; links: random excitatory connections. Project description in chapter 8.3.2. Natural shapes and textures lead to specific receptive fields: from here reactions to edges. (c) 1999. Tralvex Yeap. All Rights Reserved 39

Areas involved in object recognition Neurons in V1 respond to scrambled pictures of a cat equally well or even stronger While neurons in lateral occipital (LOC) respond much less to scrambled pictures than to a picture of a cat Human neuroimaging studies have revealed many brain areas involved in processing objects. These object-sensitive areas, which lie anterior to visual areas V1-V4, respond more strongly to coherent shapes and objects, as compared to scrambled, meaningless stimulus.

Areas involved in object recognition Lateral occipital complex (LOC) The lateral occipital complex seems to have a general role in object recognition and responds strongly to a variety of shapes and objects. Presumably neurons in this region respond best to different kinds of objects. Fusiform face area (FFA) Human neuroimaging studies have shown that there is a region in the fusiform gyrus, called the fusiform face area (FFA) that responds more strongly to faces than to just about any other category of objects. This region responds more to human, animal and cartoon faces than to a variety of non-face stimuli. Neurons in this area specialize in facial expression, particular identity or viewpoint (e.g. profile) Parahippocampal place area (PPA) The parahippocampal place area is another strongly category-selective region that responds best to houses, landmarks, indoor and outdoor scenes. This area responds weakly to faces, body parts, and animals.

Hierarchical and interactive theories of vision Is it possible to invoke visual experience without seeing? Yes we can bypass retina and LGN and stimulate area V1. However, it seems impossible to recover full visual experience from higher visual areas bypassing primary visual cortex. Hierarchical and interactive theories of vision According to hierarchical theory, visual consciousness is organized in a hierarchical fashion with increasingly higher visual areas being more closely related to our internal conscious experience. But if this is the case how to explain awareness of all details in the observed image? The interactive theory of visual consciousness emphasizes interactions between lower and higher visual areas where higher areas send feedback signals down to early visual area.

Some answers Why does the primary visual cortex react to oriented edges? Because correlational learning in a natural environment leads to this type of detector. Why does the visual system separate information into the dorsal pathway and the ventral pathway? Because signal transformations extract qualitatively different information, strengthening some contrasts and weakening others. Why does damage to the parietal cortex lead to disorders of spatial orientation and attention (neglect)? Because attention is an emergent property of systems with competition. How do we recognize objects in different locations, orientations, distances, with different images projected on the retina? Thanks to transformations, which create distributed representations based on increasingly complex and spatially invariant features.

Model v1rf.proj.gz, Chapt. 8 How do receptive fields form? Where do these V1 properties come from? Inputs: 12x12, signals from LGN cells: on- and off-center. Input images: randomly chosen from a natural 512x512 scene. Hidden layer 14x14; connections: coincidental with the input, excitatory with the surroundings. Description of the project in Chapt. 8.3.2. Natural shapes and textures lead to specific receptive fields: from this come reactions to edges.

Model properties The V1 cortex receives from the LGN an on/off signal with heightened contrast, input to V1 through layer 4, processing in this model responds to overlapping processes mainly in layers 2 and 3. The model includes one hypercolumn, analyzing a small sector of the image from images of landscapes and plants => all elements see the same thing. Properties: spherical geometry, i.e. top = bottom, left = right; independent inputs for on/off cells, in accordance with biology; strong and widespread excitatory horizontal connections – like in SOM; kWTA leaves ~10% active neurons. Contrast for weights is small ~1, because these aren't decision-making neurons, thresholds are large (~2) to force sparse representations, strong correlations. Noise helps in avoiding weak solutions.

Exercises from v1rf Check the structure, connection weights (r.wt): strong activations within the hidden layer, random connections with on/off inputs. LoadEnv to load the 512x512 image - for the training 10 images were used, here is one random one, processed into on/off points. StepTrain – observe the oscillation of learning for phases – and + Complementarity of on/off: stronger "on" activation for images which are brighter in the middle than on the edges, dark = extra "off" activation. Question: what can we expect if horizontal connections will dominate? Check your predictions, temporarily changing lat_wt_scale 0.04 => 0.2. LoadNet to load the trained network, after 100,000 presentations of images and several days of calculations...

Receptive fields How do receptive fields form? Where do these V1 properties come from? Check the activation r.wt, change the color scale so we can better see the field orientation, check several hidden elements, bi- and tri-polar fields of both types. Load all: View, RFIELDS activation on=red, off=blue. Orientation, position, size, polarity are 4 different traits of receptive fields. Radial orientation changes (pinwheels), singular points. View, PROBE_ENV shows 4 different probe stimuli, StepProbe will show activation of hidden units.