Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Scaled representations Big bars (resp. spots, hands, etc.) and little.

Slides:



Advertisements
Similar presentations
Linear Filtering – Part II Selim Aksoy Department of Computer Engineering Bilkent University
Advertisements

Linear Filtering – Part II Selim Aksoy Department of Computer Engineering Bilkent University
Linear Filtering – Part II Selim Aksoy Department of Computer Engineering Bilkent University
Linear Filtering – Part II Selim Aksoy Department of Computer Engineering Bilkent University
3-D Computational Vision CSc Image Processing II - Fourier Transform.
November 12, 2013Computer Vision Lecture 12: Texture 1Signature Another popular method of representing shape is called the signature. In order to compute.
Multiscale Analysis of Images Gilad Lerman Math 5467 (stealing slides from Gonzalez & Woods, and Efros)
Ted Adelson’s checkerboard illusion. Motion illusion, rotating snakes.
Slides from Alexei Efros
CS 691 Computational Photography
Pyramids and Texture. Scaled representations Big bars and little bars are both interesting Spots and hands vs. stripes and hairs Inefficient to detect.
Linear Filtering – Part II Selim Aksoy Department of Computer Engineering Bilkent University
Texture. Edge detectors find differences in overall intensity. Average intensity is only simplest difference. many slides from David Jacobs.
Lecture 6 Color and Texture
Computer Vision - A Modern Approach Set: Linear Filters Slides by D.A. Forsyth Differentiation and convolution Recall Now this is linear and shift invariant,
Edge detection. Edge Detection in Images Finding the contour of objects in a scene.
Image processing. Image operations Operations on an image –Linear filtering –Non-linear filtering –Transformations –Noise removal –Segmentation.
Sampling and Pyramids : Rendering and Image Processing Alexei Efros …with lots of slides from Steve Seitz.
Computer Vision - A Modern Approach
Immagini e filtri lineari. Image Filtering Modifying the pixels in an image based on some function of a local neighborhood of the pixels
CSCE 641 Computer Graphics: Fourier Transform Jinxiang Chai.
Announcements For future problems sets: matlab code by 11am, due date (same as deadline to hand in hardcopy). Today’s reading: Chapter 9, except.
Introduction to Computer Vision CS / ECE 181B  Handout #4 : Available this afternoon  Midterm: May 6, 2004  HW #2 due tomorrow  Ack: Prof. Matthew.
Texture Reading: Chapter 9 (skip 9.4) Key issue: How do we represent texture? Topics: –Texture segmentation –Texture-based matching –Texture synthesis.
2D Fourier Theory for Image Analysis Mani Thomas CISC 489/689.
CPSC 641 Computer Graphics: Fourier Transform Jinxiang Chai.
Lecture 2: Image filtering
Computational Photography: Fourier Transform Jinxiang Chai.
Computer Vision P. Schrater Spring 2003
Fourier Analysis : Rendering and Image Processing Alexei Efros.
CSC589 Introduction to Computer Vision Lecture 9 Sampling, Gaussian Pyramid Bei Xiao.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean 5409 T-R 10:30am – 11:50am.
2D Image Fourier Spectrum.
Applications of Image Filters Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 02/04/10.
CSC589 Introduction to Computer Vision Lecture 8
CAP5415: Computer Vision Lecture 4: Image Pyramids, Image Statistics, Denoising Fall 2006.
Computer Vision - A Modern Approach Set: Sources, shadows and shading Slides by D.A. Forsyth Sources and shading How bright (or what colour) are objects?
EECS 274 Computer Vision Pyramid and Texture. Filter, pyramid and texture Frequency domain Fourier transform Gaussian pyramid Wavelets Texture Reading:
Lecture 6: Edge Detection CAP 5415: Computer Vision Fall 2008.
09/19/2002 (C) University of Wisconsin 2002, CS 559 Last Time Color Quantization Dithering.
Texture Key issue: representing texture –Texture based matching little is known –Texture segmentation key issue: representing texture –Texture synthesis.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean 5403 T-R 3:00pm – 4:20pm.
Many slides from Steve Seitz and Larry Zitnick
Edge Detection and Geometric Primitive Extraction Jinxiang Chai.
Edges.
Image hole-filling. Agenda Project 2: Will be up tomorrow Due in 2 weeks Fourier – finish up Hole-filling (texture synthesis) Image blending.
Computer Graphics & Image Processing Chapter # 4 Image Enhancement in Frequency Domain 2/26/20161.
Last Lecture photomatix.com. Today Image Processing: from basic concepts to latest techniques Filtering Edge detection Re-sampling and aliasing Image.
Linear filters. CS8690 Computer Vision University of Missouri at Columbia What is Image Filtering? Modify the pixels in an image based on some function.
Recall: Gaussian smoothing/sampling G 1/4 G 1/8 Gaussian 1/2 Solution: filter the image, then subsample Filter size should double for each ½ size reduction.
Miguel Tavares Coimbra
Prof. Adriana Kovashka University of Pittsburgh September 14, 2016
Sampling, Template Matching and Pyramids
Linear Filters and Edges Chapters 7 and 8
- photometric aspects of image formation gray level images
… Sampling … … Filtering … … Reconstruction …
Linear Filtering – Part II
Linear Filters and Edges Chapters 7 and 8
Department of Computer Engineering
General Functions A non-periodic function can be represented as a sum of sin’s and cos’s of (possibly) all frequencies: F() is the spectrum of the function.
Fourier Transform.
Outline Linear Shift-invariant system Linear filters
CSCE 643 Computer Vision: Thinking in Frequency
Gaussian Pyramid and Texture
Texture.
Phase and Amplitude in Fourier Transforms, Meaning of frequencies
Oh, no, wait, sorry, I meant, welcome to the implementation of 20th-century science fiction literature, or a small part of it, where we will learn about.
Linear Filtering CS 678 Spring 2018.
Department of Computer Engineering
Templates and Image Pyramids
Presentation transcript:

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Scaled representations Big bars (resp. spots, hands, etc.) and little bars are both interesting –Stripes and hairs, say Inefficient to detect big bars with big filters –And there is superfluous detail in the filter kernel Alternative: –Apply filters of fixed size to images of different sizes –Typically, a collection of images whose edge length changes by a factor of 2 (or root 2) –This is a pyramid (or Gaussian pyramid) by visual analogy

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth A bar in the big images is a hair on the zebra’s nose; in smaller images, a stripe; in the smallest, the animal’s nose

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Aliasing Can’t shrink an image by taking every second pixel If we do, characteristic errors appear –In the next few slides –Typically, small phenomena look bigger; fast phenomena can look slower –Common phenomenon Wagon wheels rolling the wrong way in movies Checkerboards misrepresented in ray tracing Striped shirts look funny on colour television

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Resample the checkerboard by taking one sample at each circle. In the case of the top left board, new representation is reasonable. Top right also yields a reasonable representation. Bottom left is all black (dubious) and bottom right has checks that are too big.

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Constructing a pyramid by taking every second pixel leads to layers that badly misrepresent the top layer

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Open questions What causes the tendency of differentiation to emphasize noise? In what precise respects are discrete images different from continuous images? How do we avoid aliasing? General thread: a language for fast changes The Fourier Transform

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth The Fourier Transform Represent function on a new basis –Think of functions as vectors, with many components –We now apply a linear transformation to transform the basis dot product with each basis element In the expression, u and v select the basis element, so a function of x and y becomes a function of u and v basis elements have the form

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth To get some sense of what basis elements look like, we plot a basis element --- or rather, its real part --- as a function of x,y for some fixed u, v. We get a function that is constant when (ux+vy) is constant. The magnitude of the vector (u, v) gives a frequency, and its direction gives an orientation. The function is a sinusoid with this frequency along the direction, and constant perpendicular to the direction.

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Here u and v are larger than in the previous slide.

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth And larger still...

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Phase and Magnitude Fourier transform of a real function is complex –difficult to plot, visualize –instead, we can think of the phase and magnitude of the transform Phase is the phase of the complex transform Magnitude is the magnitude of the complex transform Curious fact –all natural images have about the same magnitude transform –hence, phase seems to matter, but magnitude largely doesn’t Demonstration –Take two pictures, swap the phase transforms, compute the inverse - what does the result look like?

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth This is the magnitude transform of the cheetah pic

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth This is the phase transform of the cheetah pic

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth This is the magnitude transform of the zebra pic

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth This is the phase transform of the zebra pic

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Reconstruction with zebra phase, cheetah magnitude

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Reconstruction with cheetah phase, zebra magnitude

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Various Fourier Transform Pairs Important facts –The Fourier transform is linear –There is an inverse FT –if you scale the function’s argument, then the transform’s argument scales the other way. This makes sense --- if you multiply a function’s argument by a number that is larger than one, you are stretching the function, so that high frequencies go to low frequencies –The FT of a Gaussian is a Gaussian. The convolution theorem –The Fourier transform of the convolution of two functions is the product of their Fourier transforms –The inverse Fourier transform of the product of two Fourier transforms is the convolution of the two inverse Fourier transforms There’s a table in the book.

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Sampling in 1D takes a continuous function and replaces it with a vector of values, consisting of the function’s values at a set of sample points. We’ll assume that these sample points are on a regular grid, and can place one at each integer for convenience.

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Sampling in 2D does the same thing, only in 2D. We’ll assume that these sample points are on a regular grid, and can place one at each integer point for convenience.

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth A continuous model for a sampled function We want to be able to approximate integrals sensibly Leads to –the delta function –model on right

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth The Fourier transform of a sampled signal

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Smoothing as low-pass filtering The message of the FT is that high frequencies lead to trouble with sampling. Solution: suppress high frequencies before sampling –multiply the FT of the signal with something that suppresses high frequencies –or convolve with a low-pass filter A filter whose FT is a box is bad, because the filter kernel has infinite support Common solution: use a Gaussian –multiplying FT by Gaussian is equivalent to convolving image with Gaussian.

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Sampling without smoothing. Top row shows the images, sampled at every second pixel to get the next; bottom row shows the magnitude spectrum of these images.

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Sampling with smoothing. Top row shows the images. We get the next image by smoothing the image with a Gaussian with sigma 1 pixel, then sampling at every second pixel to get the next; bottom row shows the magnitude spectrum of these images.

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Sampling with smoothing. Top row shows the images. We get the next image by smoothing the image with a Gaussian with sigma 1.4 pixels, then sampling at every second pixel to get the next; bottom row shows the magnitude spectrum of these images.

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Applications of scaled representations Search for correspondence –look at coarse scales, then refine with finer scales Edge tracking –a “good” edge at a fine scale has parents at a coarser scale Control of detail and computational cost in matching –e.g. finding stripes –terribly important in texture representation

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth The Gaussian pyramid Smooth with gaussians, because –a gaussian*gaussian=another gaussian Synthesis –smooth and sample Analysis –take the top image Gaussians are low pass filters, so repn is redundant

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Texture Key issue: representing texture –Texture based matching little is known –Texture segmentation key issue: representing texture –Texture synthesis useful; also gives some insight into quality of representation –Shape from texture cover superficially

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Representing textures Textures are made up of quite stylised subelements, repeated in meaningful ways Representation: –find the subelements, and represent their statistics But what are the subelements, and how do we find them? –recall normalized correlation –find subelements by applying filters, looking at the magnitude of the response What filters? –experience suggests spots and oriented bars at a variety of different scales –details probably don’t matter What statistics? –within reason, the more the merrier. –At least, mean and standard deviation –better, various conditional histograms.

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Gabor filters at different scales and spatial frequencies top row shows anti-symmetric (or odd) filters, bottom row the symmetric (or even) filters.

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth The Laplacian Pyramid Synthesis –preserve difference between upsampled Gaussian pyramid level and Gaussian pyramid level –band pass filter - each level represents spatial frequencies (largely) unrepresented at other levels Analysis –reconstruct Gaussian pyramid, take top layer

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Oriented pyramids Laplacian pyramid is orientation independent Apply an oriented filter to determine orientations at each layer –by clever filter design, we can simplify synthesis –this represents image information at a particular scale and orientation

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Reprinted from “Shiftable MultiScale Transforms,” by Simoncelli et al., IEEE Transactions on Information Theory, 1992, copyright 1992, IEEE

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Analysis

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth synthesis

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Final texture representation Form an oriented pyramid (or equivalent set of responses to filters at different scales and orientations). Square the output Take statistics of responses –e.g. mean of each filter output (are there lots of spots) –std of each filter output –mean of one scale conditioned on other scale having a particular range of values (e.g. are the spots in straight rows?)

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Texture synthesis Use image as a source of probability model Choose pixel values by matching neighbourhood, then filling in Matching process – look at pixel differences –count only synthesized pixels

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Figure from Texture Synthesis by Non-parametric Sampling, A. Efros and T.K. Leung, Proc. Int. Conf. Computer Vision, 1999 copyright 1999, IEEE

Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Variations Texture synthesis at multiple scales Texture synthesis on surfaces Texture synthesis by tiles “Analogous” texture synthesis