Pyramids and Texture. Scaled representations Big bars and little bars are both interesting Spots and hands vs. stripes and hairs Inefficient to detect.

Slides:



Advertisements
Similar presentations
Linear Filtering – Part II Selim Aksoy Department of Computer Engineering Bilkent University
Advertisements

Linear Filtering – Part II Selim Aksoy Department of Computer Engineering Bilkent University
Linear Filtering – Part II Selim Aksoy Department of Computer Engineering Bilkent University
Linear Filtering – Part II Selim Aksoy Department of Computer Engineering Bilkent University
November 12, 2013Computer Vision Lecture 12: Texture 1Signature Another popular method of representing shape is called the signature. In order to compute.
University of Ioannina - Department of Computer Science Wavelets and Multiresolution Processing (Background) Christophoros Nikou Digital.
Spatial Filtering (Chapter 3)
Multiscale Analysis of Images Gilad Lerman Math 5467 (stealing slides from Gonzalez & Woods, and Efros)
Slides from Alexei Efros
CS 691 Computational Photography
Linear Filtering – Part II Selim Aksoy Department of Computer Engineering Bilkent University
Computational Photography CSE 590 Tamara Berg Filtering & Pyramids.
Chapter 8 Content-Based Image Retrieval. Query By Keyword: Some textual attributes (keywords) should be maintained for each image. The image can be indexed.
Texture. Edge detectors find differences in overall intensity. Average intensity is only simplest difference. many slides from David Jacobs.
Templates, Image Pyramids, and Filter Banks Slides largely from Derek Hoeim, Univ. of Illinois.
Lecture 6 Color and Texture
Sampling and Pyramids : Rendering and Image Processing Alexei Efros …with lots of slides from Steve Seitz.
Edges and Scale Today’s reading Cipolla & Gee on edge detection (available online)Cipolla & Gee on edge detection Szeliski – From Sandlot ScienceSandlot.
Computer Vision - A Modern Approach
Immagini e filtri lineari. Image Filtering Modifying the pixels in an image based on some function of a local neighborhood of the pixels
CSCE 641 Computer Graphics: Fourier Transform Jinxiang Chai.
Announcements For future problems sets: matlab code by 11am, due date (same as deadline to hand in hardcopy). Today’s reading: Chapter 9, except.
Introduction to Computer Vision CS / ECE 181B  Handout #4 : Available this afternoon  Midterm: May 6, 2004  HW #2 due tomorrow  Ack: Prof. Matthew.
Texture Reading: Chapter 9 (skip 9.4) Key issue: How do we represent texture? Topics: –Texture segmentation –Texture-based matching –Texture synthesis.
2D Fourier Theory for Image Analysis Mani Thomas CISC 489/689.
CPSC 641 Computer Graphics: Fourier Transform Jinxiang Chai.
Computer Vision - A Modern Approach Set: Pyramids and Texture Slides by D.A. Forsyth Scaled representations Big bars (resp. spots, hands, etc.) and little.
Computational Photography: Fourier Transform Jinxiang Chai.
4 – Image Pyramids. Admin stuff Change of office hours on Wed 4 th April – Mon 31 st March pm (right after class) Change of time/date of last.
Fourier Analysis : Rendering and Image Processing Alexei Efros.
CS448f: Image Processing For Photography and Vision Wavelets Continued.
CSC589 Introduction to Computer Vision Lecture 9 Sampling, Gaussian Pyramid Bei Xiao.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean 5409 T-R 10:30am – 11:50am.
2D Image Fourier Spectrum.
Chapter 2. Image Analysis. Image Analysis Domains Frequency Domain Spatial Domain.
Applications of Image Filters Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 02/04/10.
Templates, Image Pyramids, and Filter Banks Computer Vision Derek Hoiem, University of Illinois 01/31/12.
CSC589 Introduction to Computer Vision Lecture 8
CAP5415: Computer Vision Lecture 4: Image Pyramids, Image Statistics, Denoising Fall 2006.
EECS 274 Computer Vision Pyramid and Texture. Filter, pyramid and texture Frequency domain Fourier transform Gaussian pyramid Wavelets Texture Reading:
Templates and Image Pyramids Computational Photography Derek Hoiem, University of Illinois 09/03/15.
Digital Image Processing CSC331 Image Enhancement 1.
Texture Key issue: representing texture –Texture based matching little is known –Texture segmentation key issue: representing texture –Texture synthesis.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean 5403 T-R 3:00pm – 4:20pm.
Templates, Image Pyramids, and Filter Banks
Edges.
Mestrado em Ciência de Computadores Mestrado Integrado em Engenharia de Redes e Sistemas Informáticos VC 15/16 – TP7 Spatial Filters Miguel Tavares Coimbra.
Image hole-filling. Agenda Project 2: Will be up tomorrow Due in 2 weeks Fourier – finish up Hole-filling (texture synthesis) Image blending.
Projects Project 1a due this Friday Project 1b will go out on Friday to be done in pairs start looking for a partner now.
Last Lecture photomatix.com. Today Image Processing: from basic concepts to latest techniques Filtering Edge detection Re-sampling and aliasing Image.
Linear filters. CS8690 Computer Vision University of Missouri at Columbia What is Image Filtering? Modify the pixels in an image based on some function.
Recall: Gaussian smoothing/sampling G 1/4 G 1/8 Gaussian 1/2 Solution: filter the image, then subsample Filter size should double for each ½ size reduction.
Spatial Filtering (Chapter 3) CS474/674 - Prof. Bebis.
Miguel Tavares Coimbra
Prof. Adriana Kovashka University of Pittsburgh September 14, 2016
Sampling, Template Matching and Pyramids
Linear Filters and Edges Chapters 7 and 8
Linear Filtering – Part II
Linear Filters and Edges Chapters 7 and 8
Department of Computer Engineering
Fourier Transform.
Outline Linear Shift-invariant system Linear filters
CSCE 643 Computer Vision: Thinking in Frequency
Gaussian Pyramid and Texture
Texture.
Phase and Amplitude in Fourier Transforms, Meaning of frequencies
Oh, no, wait, sorry, I meant, welcome to the implementation of 20th-century science fiction literature, or a small part of it, where we will learn about.
Linear Filtering CS 678 Spring 2018.
Department of Computer Engineering
Templates and Image Pyramids
Presentation transcript:

Pyramids and Texture

Scaled representations Big bars and little bars are both interesting Spots and hands vs. stripes and hairs Inefficient to detect big bars with big filters And there is superfluous detail in the filter kernel Alternative: Apply filters of fixed size to images of different sizes Typically, a collection of images whose edge length changes by a factor of 2 (or root 2) This is a pyramid (or Gaussian pyramid) by visual analogy

A bar in the big images is a hair on the zebra’s nose; in smaller images, a stripe; in the smallest, the animal’s nose

Aliasing Can ’ t shrink an image by taking every second pixel If we do, characteristic errors appear In the next few slides Typically, small phenomena look bigger; fast phenomena can look slower Common phenomenon Wagon wheels rolling the wrong way in movies Checkerboards misrepresented in ray tracing Striped shirts look funny on color television

Constructing a pyramid by taking every second pixel leads to layers that badly misrepresent the top layer

Open questions What causes the tendency of differentiation to emphasize noise? In what precise respects are discrete images different from continuous images? How do we avoid aliasing? General thread: a language for fast changes The Fourier Transform

Represent function on a new basis Think of functions as vectors, with many components We now apply a linear transformation to transform the basis dot product with each basis element In the expression, u and v select the basis element, so a function of x and y becomes a function of u and v basis elements have the form vectorized image Fourier transform base, also possible Wavelets, steerable pyramids, etc. transformed image

Fourier basis element example, real part F u,v (x,y) F u,v (x,y)=const. for (ux+vy)=const. Vector (u,v) Magnitude gives frequency Direction gives orientation.

Here u and v are larger than in the previous slide.

And larger still...

Phase and Magnitude Fourier transform of a real function is complex difficult to plot, visualize instead, we can think of the phase and magnitude of the transform Phase is the phase of the complex transform Magnitude is the magnitude of the complex transform Curious fact all natural images have about the same magnitude transform hence, phase seems to matter, but magnitude largely doesn ’ t Demonstration Take two pictures, swap the phase transforms, compute the inverse - what does the result look like?

This is the magnitude transform of the cheetah pic

This is the phase transform of the cheetah pic

This is the magnitude transform of the zebra pic

This is the phase transform of the zebra pic

Reconstruction with zebra phase, cheetah magnitude

Reconstruction with cheetah phase, zebra magnitude

Smoothing as low-pass filtering The message of the FT is that high frequencies lead to trouble with sampling. Solution: suppress high frequencies before sampling multiply the FT of the signal with something that suppresses high frequencies or convolve with a low-pass filter A filter whose FT is a box is bad, because the filter kernel has infinite support Common solution: use a Gaussian multiplying FT by Gaussian is equivalent to convolving image with Gaussian.

Sampling without smoothing. Top row shows the images, sampled at every second pixel to get the next; bottom row shows the magnitude spectrum of these images.

Sampling with smoothing. Top row shows the images. We get the next image by smoothing the image with a Gaussian with sigma 1 pixel, then sampling at every second pixel to get the next; bottom row shows the magnitude spectrum of these images.

Sampling with smoothing. Top row shows the images. We get the next image by smoothing the image with a Gaussian with sigma 1.4 pixels, then sampling at every second pixel to get the next; bottom row shows the magnitude spectrum of these images.

Applications of scaled representations Search for correspondence look at coarse scales, then refine with finer scales Edge tracking a “ good ” edge at a fine scale has parents at a coarser scale Control of detail and computational cost in matching e.g. finding stripes terribly important in texture representation

Example: CMU face detection

The Gaussian pyramid Smooth with gaussians, because a gaussian*gaussian=another gaussian Synthesis smooth and sample Analysis take the top image Gaussians are low pass filters, so representation is redundant

Texture Key issue: representing texture Texture based matching little is known Texture segmentation key issue: representing texture Texture synthesis useful; also gives some insight into quality of representation Shape from texture will skip discussion

Texture synthesis Given example, generate texture sample (that is large enough, satisfies constraints, … )

Texture analysis Compare; is this the same “ stuff ” ?

pre-attentive texture discrimination

same or not?

pre-attentive texture discrimination

same or not?

Representing textures Textures are made up of quite stylized subelements, repeated in meaningful ways Representation: find the subelements, and represent their statistics But what are the subelements, and how do we find them? recall normalized correlation find subelements by applying filters, looking at the magnitude of the response What filters? experience suggests spots and oriented bars at a variety of different scales details probably don ’ t matter What statistics? within reason, the more the merrier. At least, mean and standard deviation better, various conditional histograms.

Spots and bars at a fine scale

Spots and bars at a coarser scale

Fine scale Coarse scale How many filters and what orientations?

Texture Similarity based on Response Statistics Collect statistics of responses over an image or subimage Mean of squared response Mean and variance of squared response Euclidean distance between vectors of response statistics for two images is measure of texture similarity

Example 1: Squared response

Example 2: Mean and variance of squared response Compute the mean and standard deviation of the filter outputs over the window, and use these for the feature vector. (Ma and Manjunath, 1996)Ma and Manjunath, 1996 Decreasing response vector similarity

The Choice of Scale One approach: start with a small window and increase the size of the window until an increase does not cause a significant change.

Laplacian Pyramids as Band-Pass Filters Each level is the difference of a more smoothed and less smoothed image ! It contains the band of frequencies in between courtesy of Wolfram from Forsyth & Ponce

Oriented Pyramids Laplacian pyramid + direction sensitivity v from Forsyth & Ponce

Oriented Pyramids Reprinted from “Shiftable MultiScale Transforms,” by Simoncelli et al., IEEE Transactions on Information Theory, 1992.

Gabor Filters “Localized Fourier transforms”: Make each kernel from product of Fourier basis image and Gaussian Larger scale Smaller scale Odd Even Frequency from Forsyth & Ponce

Gabor Filters (cont’d) Symmetric kernel (even): Anti-symmetric kernel (odd):

Application: Texture synthesis Use image as a source of probability model Choose pixel values by matching neighborhood, then filling in Matching process look at pixel differences count only synthesized pixels