Image Understanding Roxanne Canosa, Ph.D.. Introduction Computer vision  Give machines the ability to see  The goal is to duplicate the effect of human.

Slides:



Advertisements
Similar presentations
Human-Computer Interaction
Advertisements

Parts of the Eye - Pupil.
Processing Digital Images. Filtering Analysis –Recognition Transmission.
Vision Computing An Introduction. Visual Perception Sight is our most impressive sense. It gives us, without conscious effort, detailed information about.
The Human Visual System Vonikakis Vasilios, Antonios Gasteratos Democritus University of Thrace
Ch 31 Sensation & Perception Ch. 3: Vision © Takashi Yamauchi (Dept. of Psychology, Texas A&M University) Main topics –convergence –Inhibition, lateral.
Visual Perception Chapter 3 Pages
1. Vision Stimulus: Light (Elecro-magnetic radiation) Receptor: Cones and Rods.
Vision Our most dominant sense
Sensation Chapter 5 Myers AP Psychology. Transduction  Conversion of one form of energy into another.  In sensation, the transforming of stimulus energies,
DO NOW: What do you know about our sense of sight and vision? What parts of the eye do you know? What do you know about light?
Vision By: Bethany, Iqra, Clint, Cameron, Nick. The Process Light enters eye through the cornea Then, it goes through the pupil which is surrounded by.
The Visual System: The Structure of the Visual System Module 9: Sensation.
Vision Biology/Psychology Some introductory thoughts Sensory world in general is basically a representation of the real world So, we have a rich.
Vision – our most dominant sense. Vision Purpose of the visual system –transform light energy into an electro-chemical neural response –represent characteristics.
Sensation & Perception
As you read this, do you wonder why you’re able to? Or what's involved?
Digital Image Processing & Analysis Spring Definitions Image Processing Image Analysis (Image Understanding) Computer Vision Low Level Processes:
Biology Journal 3/25/2014 Hair cells are the receptors inside of the cochlea that are stimulated by vibrations in the liquid in the cochlea. A person may.
Mr. Koch AP Psychology Forest Lake High School
Computer Vision – Fundamentals of Human Vision Hanyang University Jong-Il Park.
3.2 VISION 70% of your receptor cells are in your eyes taste and touch need direct contact where as sight and smell don’t Sight can be experienced from.
The Eye. Energy v. Chemical senses Energy SensesChemical Senses.
© by Yu Hen Hu 1 Human Visual System. © by Yu Hen Hu 2 Understanding HVS, Why? l Image is to be SEEN! l Perceptual Based Image Processing.
The Visual System. The Nature of Light Electromagnetic Spectrum – An energy spectrum that includes X-rays, radar, and radio waves – A small portion of.
Myers EXPLORING PSYCHOLOGY Module 14 Introduction to Sensation and Perception: Vision James A. McCubbin, PhD Clemson University Worth Publishers.
.  Sensation: process by which our sensory receptors and nervous system receive and represent stimulus energy  Perception: process of organizing and.
Psychology, Ninth Edition in Modules David Myers PowerPoint Slides Aneeq Ahmad Henderson State University Worth Publishers, © 2010.
Ch 31 Sensation & Perception Ch. 3: Vision © Takashi Yamauchi (Dept. of Psychology, Texas A&M University) Main topics –convergence –Inhibition, lateral.
Digital Image Processing & Analysis Dr. Samir H. Abdul-Jauwad Electrical Engineering Department King Fahd University of Petroleum & Minerals.
Intelligent Vision Systems Image Geometry and Acquisition ENT 496 Ms. HEMA C.R. Lecture 2.
The primate visual systemHelmuth Radrich, The primate visual system 1.Structure of the eye 2.Neural responses to light 3.Brightness perception.
Vision Psychology Some introductory thoughts Sensory world in general is basically a representation of the real world Sensory world in general is.
Digital Image Processing (DIP) Lecture # 5 Dr. Abdul Basit Siddiqui Assistant Professor-FURC 1FURC-BCSE7.
1 Computational Vision CSCI 363, Fall 2012 Lecture 5 The Retina.
The Eye. Energy v. Chemical senses Energy SensesChemical Senses.
11 PSYCHOLOGY (8th Edition, in Modules) David Myers PowerPoint Slides Worth Publishers, © 2007 PowerPoint Slides Worth Publishers, © 2007.
1 Perception and VR MONT 104S, Fall 2008 Lecture 4 Lightness, Brightness and Edges.
University of Kurdistan Digital Image Processing (DIP) Lecturer: Kaveh Mollazade, Ph.D. Department of Biosystems Engineering, Faculty of Agriculture,
The Visual System: The Structure of the Visual System Module 9: Sensation.
Sensation and Perception Module 18 Vision. Energy=Light We only see a small spectrum of light rays 2 characteristics determine our sensory experiences.
Image Perception ‘Let there be light! ‘. “Let there be light”
The eye is connected to the brain via the optic nerve. Images are interpreted by the occipital lobe.
Intelligent Vision Systems Image Geometry and Acquisition ENT 496 Ms. HEMA C.R. Lecture 2.
Name them!! There are 5! Hearing Sight Touch Taste Smell.
Human Visual System.
JHS AP Psychology Unit 4: Sensation, Perception Essential Task 4-2 :Describe the sensory process of vision including the specific nature of energy transduction.
Sensation and Perception. Transformation of stimulus energy into a meaningful understanding –Each sense converts energy into awareness.
The Visual System: The Structure of the Visual System.
DO NOW. VisionVision Our most dominating sense. Visual Capture.
Vision Our most dominant sense. Our Essential Questions What are the major parts of the eye? How does the eye translate light into neural impulses?
Unit 4: Sensation & Perception
MODULE #13: VISION. Vision Transduction: transformation of stimulus energy (light, sound, smells, etc.) to neural impulses our brains can interpret. Our.
The Visual System: The Structure of the Visual System Module 9: Sensation.
Sensation. The process by which our sensory systems (eyes, ears, and other sensory organs) and nervous system receive stimuli from the environment A person’s.
Vision. The Eye and Vision It’s the most complex and most important sense for humans. The vision “system” transfers light waves into neural messages that.
Image Perception ‘Let there be light! ‘. “Let there be light”
1. 2 What is Digital Image Processing? The term image refers to a two-dimensional light intensity function f(x,y), where x and y denote spatial(plane)
The Visual Sense: Sight EQ: What is the process though which we see and how do we differentiate between different objects and types of motion?
Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.
The Visual System: The Structure of the Visual System
Human Visual System.
EE663-Digital Image Processing & Analysis Dr. Samir H
Mr. Koch AP Psychology Forest Lake High School
THE VISUAL SYSTEM: ESSENTIALS OF SIGHT
VISION Module 18.
Changing Light Waves to Neural Impulses
IT523 Digital Image Processing
Experiencing the World
(Do Now) Journal What is psychophysics? How does it connect sensation with perception? What is an absolute threshold? What are some implications of Signal.
Presentation transcript:

Image Understanding Roxanne Canosa, Ph.D.

Introduction Computer vision  Give machines the ability to see  The goal is to duplicate the effect of human visual processing  We live in a 3-D world, but camera sensors can only capture 2-D information.  Flip side of computer graphics?

Introduction Computer vision is composed of:  Image processing  Image analysis  Image understanding

Introduction Image processing  The goal is to present the image to the system in a useful form image capture and early processing remove noise detect luminance differences detect edges enhance image

Introduction Image analysis  The goal is to extract useful information from the processed image identify boundaries find connected components label regions segment parts of objects group parts together into whole objects

Introduction Image understanding  The goal is to make sense of the information. Draw qualitative, or semantic, conclusions from the quantitative information. make a decision about the quantitative information classify the parts recognize objects understand the objects’ usage and the meaning of the scene

Introduction Computer vision uses techniques and methods from:  electronics - sensor technology  mathematics - statistics and differential calculus  spatial pattern recognition  artificial intelligence  psychophysics

Low-level Representations Low-level: little knowledge about the content of the image  The data that is manipulated usually resembles the input image. For example, if the image is captured using a CCD camera (2-D), the representation can be described by an image function whose value is brightness depending on 2 parameters: the x-y coordinates of the location of the brightness value.

Low-Level Mechanisms Low-level vision only takes us to the sophistication of a very expensive digital camera

High-level Representations High-level: extract meaningful information from the low-level representation.  Image may be mapped to a formalized model of the world (model may change dynamically as new information becomes available)  Data to be processed is dramatically reduced: instead of dealing with pixel values, deal with features such as shape, size, relationships, etc  Usually expressed in symbolic form

High-Level Mechanisms High-level vision and perception requires brain functions that we do not fully understand yet

Bottom-up vs. Top-down Bottom-up: processing is content-driven Top-down: processing is context-driven Goal: combine knowledge about content as well as context.  Goals, plans, history, expectations  Imitate human cognition and the ability to make decisions based on extracted information

Bottom-up v. Top-down Top-Down?Bottom-up? Information flow

Visual Completion: Top-down Control

Visual Completion: Top-down Control

Visual Completion: Top-down Control

Visual Completion: Top-down Control

Visual Completion: Top-down Control

Old Women or Young Girl?

Expectation and Learning From Palmer (1999)

Zolner Illusion Are the black and yellow lines parallel?

Visual Illusions Demos

The Human Visual System Optical information from the eyes is transmitted to the primary visual cortex in the occipital lobe at the back of the head.

The Human Visual System - 20 mm focal length lens - iris controls amount of light entering eye by changing the size of the pupil

The Human Visual System Light enters the eye through the cornea, aqueous humor, lens, and vitreous humor before striking the light-sensitive receptors of the retina. After striking the retina, light is converted into electrochemical signals that are carried to the brain via the optic nerve.

The Human Visual System image from

Multi-Resolution Vision +

+ If you can read this you must be cheating

Multi-Resolution Vision From Palmer (1999) The distribution of rods and cones across the retina is highly uneven The fovea contains the highest concentration of cones for high visual acuity

Contrast Sensitivity Spatial frequency (cpd) Contrast sensitivity low high

Lateral Inhibition

A biological neural network in which neurons inhibit spatially neighboring neurons. Architecture of first few layers of retina. Input light level Layer n Layer n Output perception = = = =

Simultaneous Contrast Two regions that have identical spectra result in different color (lightness) perceptions due to the spectra of the surrounding regions Background color can visibly affect the perceived color of the target

Simultaneous Contrast

Original Painting Task-Oriented Vision Free ViewingEstimate the economic level of the people Judge their ages Guess what they had been doing before the visitor’s arrival Remember the clothes worn by the people

Change Blindness Lack of attention to an object causes failure to perceive it People find it difficult to detect major changes in a scene if those changes occur in objects that are not the focus of attention Our impression that our visual capabilities give us a rich, complete, and detailed representation of the world around us is a grand illusion!

Change Blindness Demos

Modeling Attention How do we decide where to look next while performing a task? What factors influence our decision to look at something? Can we model visual behavior?

Modeling Attention - Saliency Maps Koch & Ullman (1985), Itti & Koch (2000), Parkhurst, Law, & Neibur (2002), Turano, Geruschat, & Baker (2003) Input Image Saliency Map

colorintensityorient center surrounds Input Image Saliency Map Computational Model of Saliency

Proto-object Map G0 G45G90 G135 XYZ transform rods LMSLMS A C1 C2 Intensity Map Input Image (RGB) Conspicuity Map Orientation Map Color Map Object Module Pre-processing Module Oriented Edge Module

Proto-object Map G0 G45G90 G135 XYZ transform rods Orientation Map LMSLMS A C1 C2 Intensity Map Input Image (RGB) Conspicuity Map Color Map Oriented Edge Module Object Module

Proto-object Map G0 G45G90 G135 XYZ transform rods LMSLMS A C1 C2 Intensity Map Input Image (RGB) Orientation Map Conspicuity Map Color Map Pre-processing Module Object Module

Proto-object Map G0 G45G90 G135 XYZ transform rods LMSLMS A C1 C2 Intensity Map Input Image (RGB) Orientation Map Conspicuity Map Color Map Oriented Edge Module Pre-processing Module

Spatial frequency (cpd) Contrast sensitivity low high weight Weight Output with Contrast Sensitivity Function CSF = 2.6 ( f) e -(o.114f) ^ Manno and Sakrison (1974)

Input Image CIE Map Conspicuity Map (C_Map)

Verification of Model

Task Differences Free-view“Get supplies from the closet” “Work at the computer”“Make a photocopy”

External mirror - IR reflecting, visible passing Scene camera LASER Optics module, includes IR source and eye camera Head-tracking receiver Head-Mounted Eye-Tracker

Portable Eye-Tracker

Newell’s temporal hierarchy of brain organization: The Benefits of Eye-Tracking Cognition Working Memory Visual Routines Neural Operations 10 seconds 1 second 300 msec 80 msec

Verification of Model

Comparison of Models

Task Differences

Possible M.S. Projects Comparison of saliency map generation techniques  Feature-based  Graph-based  Information theory-based Object detection from salient keypoints  SIFT features  Multi-resolution images

Possible M.S. Projects Multi-modal tumor classification  PET, MRI, CT Solving visual CAPTCHAs  Text-based  Image-based 

Possible M.S. Projects Using reasoning or logic for learning about the world from visual observations  Bayes nets  Reinforcement learning  Inductive logic programming (ILP)