1 CSE 455: Computer Vision Winter 2007 Instructor: Professor Linda Shapiro Additional Instructor: Dr. Matthew Brown

Slides:



Advertisements
Similar presentations
Digital Image Fundamentals Selim Aksoy Department of Computer Engineering Bilkent University
Advertisements

Digital Image Fundamentals Selim Aksoy Department of Computer Engineering Bilkent University
Digital Image Fundamentals Selim Aksoy Department of Computer Engineering Bilkent University
5/13/2015CAM Talk G.Kamberova Computer Vision Introduction Gerda Kamberova Department of Computer Science Hofstra University.
Computer Vision Chapter 6 Color.
COLORCOLOR A SET OF CODES GENERATED BY THE BRAİN How do you quantify? How do you use?
Computer Vision: CSE 803 A brief intro MSU/CSE Fall 2014.
MSU/CSE Fall Computer Vision: Imaging Devices Summary of several common imaging devices covered ( Chapter 2 of Shapiro and Stockman)
Data Structures For Image Analysis
OpenCV Stacy O’Malley CS-590 Summer, What is OpenCV? Open source library of functions relating to computer vision. Cross-platform (Linux, OS X,
1 Imaging and Image Representation  Sensing Process  Typical Sensing Devices  Problems with Digital Images  Image Formats  Relationship of 3D Scenes.
Stockman MSU/CSE Fall Computer Vision: Imaging Devices Summary of several common imaging devices covered in Chapter 2 of Shapiro and Stockman.
Computer Vision Basics Image Terminology Binary Operations Filtering Edge Operators.
Digital Audio, Image and Video Hao Jiang Computer Science Department Sept. 6, 2007.
Stockman MSU Fall Computing Motion from Images Chapter 9 of S&S plus otherwork.
Stockman MSU/CSE Fall 2009 Computer Vision: CSE 803 A brief intro.
1 Introduction What IS computer vision? Where do images come from? the analysis of digital images by a computer.
Computer Vision (CSE P576) Staff Prof: Steve Seitz TA: Jiun-Hung Chen Web Page
Computer Vision (CSE/EE 576) Staff Prof: Steve Seitz TA: Aseem Agarwala Web Page
Content-based Image Retrieval (CBIR)
Introduction to Machine Vision Systems
November 29, 2004AI: Chapter 24: Perception1 Artificial Intelligence Chapter 24: Perception Michael Scherger Department of Computer Science Kent State.
CIS 601 Fall 2004 Introduction to Computer Vision and Intelligent Systems Longin Jan Latecki Parts are based on lectures of Rolf Lakaemper and David Young.
A Brief Overview of Computer Vision Jinxiang Chai.
SCCS 4761 Introduction What is Image Processing? Fundamental of Image Processing.
Department of Physics and Astronomy DIGITAL IMAGE PROCESSING
Image Formation. Input - Digital Images Intensity Images – encoding of light intensity Range Images – encoding of shape and distance They are both a 2-D.
Map Scale, Resolution and Data Models. Components of a GIS Map Maps can be displayed at various scales –Scale - the relationship between the size of features.
1 Image Basics Hao Jiang Computer Science Department Sept. 4, 2014.
Sensing for Robotics & Control – Remote Sensors R. R. Lindeke, Ph.D.
CS 376b Introduction to Computer Vision 02 / 08 / 2008 Instructor: Michael Eckmann.
Factors affecting CT image RAD
Computer Vision Why study Computer Vision? Images and movies are everywhere Fast-growing collection of useful applications –building representations.
Introduction to Computer Vision Sebastian van Delden USC Upstate
Chapter 2 : Imaging and Image Representation Computer Vision Lab. Chonbuk National University.
Intelligent Vision Systems Image Geometry and Acquisition ENT 496 Ms. HEMA C.R. Lecture 2.
Addressing Image Compression Techniques on current Internet Technologies By: Eduardo J. Moreira & Onyeka Ezenwoye CIS-6931 Term Paper.
1-1 Chapter 1: Introduction 1.1. Images An image is worth thousands of words.
CS 376b Introduction to Computer Vision 02 / 11 / 2008 Instructor: Michael Eckmann.
1 Introduction What IS computer vision? Where do images come from? the analysis of digital images by a computer.
1 Machine Vision. 2 VISION the most powerful sense.
PENGENALAN POLA DAN VISI KOMPUTER PENDAHULUAN. Vision Vision is the process of discovering what is present in the world and where it is by looking.
Image Perception ‘Let there be light! ‘. “Let there be light”
Introduction to Image Processing. What is Image Processing? Manipulation of digital images by computer. Image processing focuses on two major tasks: –Improvement.
Intelligent Vision Systems Image Geometry and Acquisition ENT 496 Ms. HEMA C.R. Lecture 2.
Introduction to Image Processing Course Notes Anup Basu, Ph.D. Professor, Dept of Computing Sc. University of Alberta.
1 Review and Summary We have covered a LOT of material, spending more time and more detail on 2D image segmentation and analysis, but hopefully giving.
Image Perception ‘Let there be light! ‘. “Let there be light”
Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.
COMP 9517 Computer Vision Digital Images 1/28/2018 COMP 9517 S2, 2009.
CSE 484: Computer Vision Fatoş T. Yarman Vural Atıl İşçen
Introduction Computer vision is the analysis of digital images
CSE/EE 576 Computer Vision Spring 2007
CSE/EE 576 Computer Vision Spring 2011
CSE 455 Computer Vision Autumn 2014
EE 596 Machine Vision Autumn 2015
CSEP 576 Computer Vision Winter 2008
Outline Image formats and basic operations Image representation
Digital Image Fundamentals
Introduction Computer vision is the analysis of digital images
Introduction What IS computer vision?
CSE/EE 576 Computer Vision Spring 2012
CSE 455 Computer Vision Autumn 2010
Computer Vision (CSE 490CV, EE400B)
Introduction What IS computer vision?
CSE/EE 576 Computer Vision Spring 2010
Computer Vision (CSE 455) Staff Web Page Handouts
Computer Vision (CSE 455) Staff Web Page Handouts
Introduction Computer vision is the analysis of digital images
Presentation transcript:

1 CSE 455: Computer Vision Winter 2007 Instructor: Professor Linda Shapiro Additional Instructor: Dr. Matthew Brown TAs: Masa Kobashi Peter Davis Text: Shapiro and Stockman, Computer Vision (chapters available from class web page) Evaluation: 70% programming projects, 30% exams

2 Topics Basics: images, binary operations, filtering, edge operators Basics: images, binary operations, filtering, edge operators Color, texture, segmentation Color, texture, segmentation Interest operators: detectors and descriptors Interest operators: detectors and descriptors Use of interest operators: object recognition, stitching, tracking Use of interest operators: object recognition, stitching, tracking Content-based image retrieval Content-based image retrieval 2D object recognition 2D object recognition Motion Motion 3D: sensing, camera calibration, reconstruction, recognition 3D: sensing, camera calibration, reconstruction, recognition

3 What IS computer vision? Where do images come from? the analysis of digital images by a computer

4 Applications Medical Imaging CT image of a patient’s abdomen liver kidney

5 Visible Man Slice Through Lung

6 3D Reconstruction of the Blood Vessel Tree

7 Symbolic Shape Descriptors for Classifying Craniosynostosis sagittal synostosis metopic synostosis

8 Robotics 2D Gray-tone or Color Images 3D Range Images “Mars” rover What am I?

9 Recognition of 3D Object Classes from Range Data

10 Image Databases: Content-Based Retrieval What categories of image databases exist today? Images from my Ground-Truth collection.

11 Similarity Retrieval of Brain Data

12 CBIR of Mouse Eye Images for Genetic Studies

13 Abstract Regions for Object Recognition Original ImagesColor RegionsTexture RegionsLine Clusters

14 Insect Identification for Ecology Studies Yoraperla (Yor) Calineuria Calineuria (Cal) Doroneuria Doroneuria (Dor)

15 Document Analysis

16

17 Original Video Frame Structure RegionsColor Regions Surveillance: Object and Event Recognition in Aerial Videos

18 Video Analysis What are the objects? What are the events?

19 3D Scanning Scanning Michelangelo’s “The David” The Digital Michelangelo Project - UW Prof. Brian Curless, collaboratorBrian Curless 2 BILLION polygons, accuracy to.29mm

20 The Digital Michelangelo Project, Levoy et al.

21

22

23

24 M otio n Capture, Games UW Professor: Zoran Popovich works in this area.Zoran Popovich

25 Andy Serkis, Gollum, Lord of the Rings Effects

26 Imaging

27 Digital Image Terminology: pixel (with value 94) its 3x3 neighborhood binary image gray-scale (or gray-tone) image color image multi-spectral image range image labeled image region of medium intensity resolution (7x7)

28 Goals of Image and Video Analysis Segment an image into useful regions Perform measurements on certain areas Determine what object(s) are in the scene Calculate the precise location(s) of objects Visually inspect a manufactured object Construct a 3D model of the imaged object Find “interesting” events in a video liver kidney spleen

29 The Three Stages of Computer Vision low-level mid-level high-level image image features features analysis

30 Low-Level blurring sharpening

31 Low-Level Canny ORT Mid-Level original image edge image circular arcs and line segments data structure

32 Mid-level K-means clustering original color image regions of homogeneous color (followed by connected component analysis) data structure

33 edge image consistent line clusters low-level mid-level high-level Low- to High-Level Building Recognition

34 Imaging and Image Representation Sensing Process Sensing Process Typical Sensing Devices Typical Sensing Devices Problems with Digital Images Problems with Digital Images Image Formats Image Formats Relationship of 3D Scenes to 2D Images Relationship of 3D Scenes to 2D Images Other Types of Sensors Other Types of Sensors

35 Images: 2D projections of 3D The 3D world has color, texture, surfaces, volumes, light sources, objects, motion, … The 3D world has color, texture, surfaces, volumes, light sources, objects, motion, … A 2D image is a projection of a scene from a specific viewpoint. A 2D image is a projection of a scene from a specific viewpoint.

36 Images as Functions  A gray-tone image is a function: g(x,y) = val or f(row, col) = val  A color image is just three functions or a vector-valued function: f(row,col) =(r(row,col), g(row,col), b(row,col))

37 Image vs Matrix Digital images (or just “images”) are typically stored in a matrix. There are many different file formats.

38 Gray-tone Image as 3D Function

39 Imaging Process Light reaches surfaces in 3D Light reaches surfaces in 3D Surfaces reflect Surfaces reflect Sensor element receives light energy Sensor element receives light energy Intensity counts Intensity counts Angles count Angles count Material counts Material counts What are radiance and irradiance?

40 Radiometry and Computer Vision* From Sonka, Hlavac, and Boyle, Image Processing, Analysis, and Machine Vision, ITP, Radiometry is a branch of physics that deals with the measurement of the flow and transfer of radiant energy. Radiance is the power of light that is emitted from a unit surface area into some spatial angle; the corresponding photometric term is brightness. Irradiance is the amount of energy that an image- capturing device gets per unit of an efficient sensitive area of the camera. Quantizing it gives image gray tones.

41 CCD type camera: Commonly used in industrial applications Array of small fixed elements Array of small fixed elements Can read faster than TV rates Can read faster than TV rates Can add refracting elements to get Can add refracting elements to get color in 2x2 neighborhoods color in 2x2 neighborhoods 8-bit intensity common 8-bit intensity common

42 Blooming Problem with Arrays Difficult to insulate adjacent sensing elements. Difficult to insulate adjacent sensing elements. Charge often leaks from hot cells to neighbors, making bright regions larger. Charge often leaks from hot cells to neighbors, making bright regions larger.

43 8-bit intensity can be clipped Dark grid intersections at left were actually brightest of scene. In A/D conversion the bright values were clipped to lower values.

44 Lens distortion distorts image “Barrel distortion” of rectangular grid is common for cheap lenses ($50) “Barrel distortion” of rectangular grid is common for cheap lenses ($50) Precision lenses can cost $1000 or more. Precision lenses can cost $1000 or more. Zoom lenses often show severe distortion. Zoom lenses often show severe distortion.

45 Resolution resolution: precision of the sensor nominal resolution: size of a single pixel in scene coordinates (ie. meters, mm) common use of resolution: num_rows X num_cols (ie. 515 x 480) subpixel resolution: measurement that goes into fractions of nominal resolution field of view (FOV): size of the scene a sensor can sense

46 Resolution Examples Resolution decreases by one half in cases at left Human faces can be recognized at 64 x 64 pixels per face

47 Image Formats Portable gray map (PGM) older form Portable gray map (PGM) older form GIF was early commercial version GIF was early commercial version JPEG (JPG) is modern version JPEG (JPG) is modern version Many others exist: header plus data Many others exist: header plus data Do they handle color? Do they handle color? Do they provide for compression? Do they provide for compression? Are there good packages that use them Are there good packages that use them or at least convert between them? or at least convert between them?

48 PGM image with ASCII info. P2 means ASCII gray P2 means ASCII gray Comments Comments W=16; H=8 W=16; H=8 192 is max intensity 192 is max intensity Can be made with editor Can be made with editor Large images are usually not stored as ASCII Large images are usually not stored as ASCII

49 PBM/PGM/PPM Codes P1: ascii binary (PBM) P2: ascii grayscale (PGM) P3: ascii color (PPM) P4: byte binary (PBM) P5: byte grayscale (PGM) P6: byte color (PPM)

50 JPG current popular form Public standard Public standard Allows for image compression; often 10:1 or 30:1 are easily possible Allows for image compression; often 10:1 or 30:1 are easily possible 8x8 intensity regions are fit with basis of cosines 8x8 intensity regions are fit with basis of cosines Error in cosine fit coded as well Error in cosine fit coded as well Parameters then compressed with Huffman coding Parameters then compressed with Huffman coding Common for most digital cameras Common for most digital cameras

51 From 3D Scenes to 2D Images Object World Camera Real Image Pixel Image

52 3D Sensors Laser range finders Laser range finders CT, MRI, and ultrasound machines CT, MRI, and ultrasound machines Sonar sensors Sonar sensors Tactile sensors (pressure arrays) Tactile sensors (pressure arrays) Structured light sensors Structured light sensors Stereo Stereo MRA (angiograph) MRA (angiograph) showing blood flow.

53 Where do we go next? So we’ve got an image, say a single gray-tone image. What can we do with it? The simplest types of analysis is binary image analysis. Convert the gray-tone image to a binary image (0s and 1s) and perform analysis on the binary image, with possible reference back to the original gray tones in a region.