Computational Photography lecture 5 – filtering and frequencies CS 590 Spring 2014 Prof. Alex Berg (Credits to many other folks on individual slides)

Slides:



Advertisements
Similar presentations
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean Hall 5409 T-R 10:30am – 11:50am.
Advertisements

CS 691 Computational Photography Instructor: Gianfranco Doretto Frequency Domain.
15-463: Computational Photography Alexei Efros, CMU, Spring 2010 Most slides from Steve Marschner Sampling and Reconstruction.
Computational Photography: Sampling + Reconstruction Connelly Barnes Slides from Alexei Efros and Steve Marschner.
Thinking in Frequency Computer Vision Brown James Hays Slides: Hoiem, Efros, and others.
Computer Vision University of Illinois Derek Hoiem
Templates, Image Pyramids, and Filter Banks Slides largely from Derek Hoeim, Univ. of Illinois.
Templates and Image Pyramids Computational Photography Derek Hoiem, University of Illinois 09/06/11.
1 Image filtering Hybrid Images, Oliva et al.,
Convolution, Edge Detection, Sampling : Computational Photography Alexei Efros, CMU, Fall 2006 Some slides from Steve Seitz.
CSCE 641 Computer Graphics: Image Sampling and Reconstruction Jinxiang Chai.
Recap from Monday Spectra and Color Light capture in cameras and humans.
1 Image Filtering Readings: Ch 5: 5.4, 5.5, 5.6,5.7.3, 5.8 (This lecture does not follow the book.) Images by Pawan SinhaPawan Sinha formal terminology.
CSCE 641 Computer Graphics: Fourier Transform Jinxiang Chai.
15-463: Computational Photography Alexei Efros, CMU, Fall 2011 Many slides from Steve Marschner Sampling and Reconstruction.
Fourier Analysis Without Tears : Computational Photography Alexei Efros, CMU, Fall 2005 Somewhere in Cinque Terre, May 2005.
The Frequency Domain : Computational Photography Alexei Efros, CMU, Fall 2011 Somewhere in Cinque Terre, May 2005 Many slides borrowed from Steve.
Convolution and Edge Detection : Computational Photography Alexei Efros, CMU, Fall 2005 Some slides from Steve Seitz.
CPSC 641 Computer Graphics: Fourier Transform Jinxiang Chai.
Templates, Image Pyramids, and Filter Banks Computer Vision James Hays, Brown Slides: Hoiem and others.
The Frequency Domain : Computational Photography Alexei Efros, CMU, Fall 2008 Somewhere in Cinque Terre, May 2005 Many slides borrowed from Steve.
Computational Photography: Fourier Transform Jinxiang Chai.
Slides from Alexei Efros, Steve Marschner Filters & fourier theory.
1 Image Filtering Slides by Steve Seitz. 2 Salvador Dali, “Gala Contemplating the Mediterranean Sea, which at 30 meters becomes the portrait of Abraham.
Fourier Analysis : Rendering and Image Processing Alexei Efros.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean 5403 T-R 3:00pm – 4:20pm.
CSE 185 Introduction to Computer Vision Image Filtering: Frequency Domain.
Slide credit Fei Fei Li. Image filtering Image filtering: compute function of local neighborhood at each position Really important! – Enhance images.
Computer Vision Spring ,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30 – 11:50am.
Applications of Image Filters Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 02/04/10.
Templates, Image Pyramids, and Filter Banks Computer Vision Derek Hoiem, University of Illinois 01/31/12.
The Frequency Domain Somewhere in Cinque Terre, May 2005 Many slides borrowed from Steve Seitz CS194: Image Manipulation & Computational Photography Alexei.
Recap of Friday linear Filtering convolution differential filters
Computer Vision James Hays Thinking in Frequency Computer Vision James Hays Slides: Hoiem, Efros, and others.
CSC589 Introduction to Computer Vision Lecture 7 Thinking in Frequency Bei Xiao.
Templates, Image Pyramids, and Filter Banks
CSE 185 Introduction to Computer Vision Image Filtering: Frequency Domain.
Templates and Image Pyramids Computational Photography Derek Hoiem, University of Illinois 09/03/15.
Lecture 3: Edge detection CS4670/5670: Computer Vision Kavita Bala From Sandlot ScienceSandlot Science.
Mestrado em Ciência de Computadores Mestrado Integrado em Engenharia de Redes e Sistemas Informáticos VC 14/15 – TP6 Frequency Space Miguel Tavares Coimbra.
Thinking in Frequency Computational Photography University of Illinois Derek Hoiem 09/01/15.
Instructor: S. Narasimhan
The Frequency Domain : Computational Photography Alexei Efros, CMU, Fall 2010 Somewhere in Cinque Terre, May 2005 Many slides borrowed from Steve.
Recap of Monday linear Filtering convolution differential filters filter types boundary conditions.
Image as a linear combination of basis images
Mestrado em Ciência de Computadores Mestrado Integrado em Engenharia de Redes e Sistemas Informáticos VC 15/16 – TP7 Spatial Filters Miguel Tavares Coimbra.
CS 691B Computational Photography
CS559: Computer Graphics Lecture 3: Image Sampling and Filtering Li Zhang Spring 2010.
Lecture 3: Filtering and Edge detection
Lecture 5: Fourier and Pyramids
Cornell CS465 Fall 2004 Lecture 5© 2004 Steve Marschner 1 Sampling and reconstruction CS 465 Lecture 5.
Last Lecture photomatix.com. Today Image Processing: from basic concepts to latest techniques Filtering Edge detection Re-sampling and aliasing Image.
Slide credit Fei Fei Li.
The Frequency Domain, without tears
Computer Vision Brown James Hays 09/16/11 Thinking in Frequency Computer Vision Brown James Hays Slides: Hoiem, Efros, and others.
Slow mo guys – Saccades
Sampling and reconstruction
Lecture 1.26 Spectral analysis of periodic and non-periodic signals.
Miguel Tavares Coimbra
Frequency domain analysis and Fourier Transform
The Frequency Domain : Computational Photography
Recap from Wednesday Spectra and Color Light capture in cameras and humans.
Sampling and Reconstruction
CSCE 643 Computer Vision: Image Sampling and Filtering
CSCE 643 Computer Vision: Thinking in Frequency
Motion illusion, rotating snakes
Instructor: S. Narasimhan
Oh, no, wait, sorry, I meant, welcome to the implementation of 20th-century science fiction literature, or a small part of it, where we will learn about.
Fourier Analysis Without Tears
Templates and Image Pyramids
Presentation transcript:

Computational Photography lecture 5 – filtering and frequencies CS 590 Spring 2014 Prof. Alex Berg (Credits to many other folks on individual slides)

Thanks for the assignments! I will post links later today. Please look over each other’s work and send me a vote for your favorite {assignment, specific image} with at least a sentence about why. Today filtering and frequency analysis of images. Next week, guest lecturer! Prof. Marc Niethammer, an expert in medical image analysis. Another chance to see how to get to know the pixels. I will be at a conference – the European Conference on Computer Vision (ECCV) presenting some research, but will be thinking of you all, will be on , and will hold office hours over google hangouts (alex.c.berg) during the same after-class time, or will announce alternatives. Today

Slides from Derek Hoiem

Sampling and Reconstruction

© 2006 Steve Marschner 5 Sampled representations How to store and compute with continuous functions? Common scheme for representation: samples –write down the function’s values at many points [FvDFH fig.14.14b / Wolberg]

© 2006 Steve Marschner 6 Reconstruction Making samples back into a continuous function –for output (need realizable method) –for analysis or processing (need mathematical method) –amounts to “guessing” what the function did in between [FvDFH fig.14.14b / Wolberg]

1D Example: Audio lowhigh frequencies

© 2006 Steve Marschner 8 Sampling in digital audio Recording: sound to analog to samples to disc Playback: disc to samples to analog to sound again –how can we be sure we are filling in the gaps correctly?

© 2006 Steve Marschner 9 Sampling and Reconstruction Simple example: a sign wave

© 2006 Steve Marschner 10 Undersampling What if we “missed” things between the samples? Simple example: undersampling a sine wave –unsurprising result: information is lost

© 2006 Steve Marschner 11 Undersampling What if we “missed” things between the samples? Simple example: undersampling a sine wave –unsurprising result: information is lost –surprising result: indistinguishable from lower frequency

© 2006 Steve Marschner 12 Undersampling What if we “missed” things between the samples? Simple example: undersampling a sine wave –unsurprising result: information is lost –surprising result: indistinguishable from lower frequency –also, was always indistinguishable from higher frequencies –aliasing: signals “traveling in disguise” as other frequencies

Aliasing in video Slide by Steve Seitz

Aliasing in images

What’s happening? Input signal: x = 0:.05:5; imagesc(sin((2.^x).*x)) Plot as image: Alias! Not enough samples

Antialiasing What can we do about aliasing? Sample more often –Join the Mega-Pixel craze of the photo industry –But this can’t go on forever Make the signal less “wiggly” –Get rid of some high frequencies –Will loose information –But it’s better than aliasing

© 2006 Steve Marschner 17 Preventing aliasing Introduce lowpass filters: –remove high frequencies leaving only safe, low frequencies –choose lowest frequency in reconstruction (disambiguate)

© 2006 Steve Marschner 18 Linear filtering: a key idea Transformations on signals; e.g.: –bass/treble controls on stereo –blurring/sharpening operations in image editing –smoothing/noise reduction in tracking Key properties –linearity: filter(f + g) = filter(f) + filter(g) –shift invariance: behavior invariant to shifting the input delaying an audio signal sliding an image around Can be modeled mathematically by convolution

© 2006 Steve Marschner 19 Moving Average basic idea: define a new function by averaging over a sliding window a simple example to start off: smoothing

© 2006 Steve Marschner 20 Weighted Moving Average Can add weights to our moving average Weights […, 0, 1, 1, 1, 1, 1, 0, …] / 5

© 2006 Steve Marschner 21 Weighted Moving Average bell curve (gaussian-like) weights […, 1, 4, 6, 4, 1, …]

© 2006 Steve Marschner 22 Moving Average In 2D What are the weights H? Slide by Steve Seitz

© 2006 Steve Marschner 23 Cross-correlation filtering Let’s write this down as an equation. Assume the averaging window is (2k+1)x(2k+1): We can generalize this idea by allowing different weights for different neighboring pixels: This is called a cross-correlation operation and written: H is called the “filter,” “kernel,” or “mask.” Slide by Steve Seitz

Gaussian filtering A Gaussian kernel gives less weight to pixels further from the center of the window This kernel is an approximation of a Gaussian function: Slide by Steve Seitz

Mean vs. Gaussian filtering Slide by Steve Seitz

Convolution cross-correlation: A convolution operation is a cross-correlation where the filter is flipped both horizontally and vertically before being applied to the image: It is written: Suppose H is a Gaussian or mean kernel. How does convolution differ from cross-correlation? Slide by Steve Seitz

© 2006 Steve Marschner 27 Convolution is nice! Notation: Convolution is a multiplication-like operation –commutative –associative –distributes over addition –scalars factor out –identity: unit impulse e = […, 0, 0, 1, 0, 0, …] Conceptually no distinction between filter and signal Usefulness of associativity –often apply several filters one after another: (((a * b 1 ) * b 2 ) * b 3 ) –this is equivalent to applying one filter: a * (b 1 * b 2 * b 3 )

Tricks with convolutions =

Slides from Alexei Eros and Steve Seitz

Salvador Dali, “Gala Contemplating the Mediterranean Sea, which at 30 meters becomes the portrait of Abraham Lincoln”, 1976 Salvador Dali “Gala Contemplating the Mediterranean Sea, which at 30 meters becomes the portrait of Abraham Lincoln”, 1976

A nice set of basis This change of basis has a special name… Teases away fast vs. slow changes in the image.

Jean Baptiste Joseph Fourier ( ) had crazy idea (1807): Any univariate function can be rewritten as a weighted sum of sines and cosines of different frequencies. Don’t believe it? –Neither did Lagrange, Laplace, Poisson and other big wigs –Not translated into English until 1878! But it’s (mostly) true! –called Fourier Series –there are some subtle restrictions...the manner in which the author arrives at these equations is not exempt of difficulties and...his analysis to integrate them still leaves something to be desired on the score of generality and even rigour. Laplace Lagrange Legendre

A sum of sines Our building block: Add enough of them to get any signal f(x) you want! How many degrees of freedom? What does each control? Which one encodes the coarse vs. fine structure of the signal?

Fourier Transform We want to understand the frequency  of our signal. So, let’s reparametrize the signal by  instead of x: f(x) F(  ) Fourier Transform F(  ) f(x) Inverse Fourier Transform For every  from 0 to inf, F(  ) holds the amplitude A and phase  of the corresponding sine How can F hold both? Complex number trick! We can always go back:

Time and Frequency example : g(t) = sin(2pf t) + (1/3)sin(2p(3f) t)

Time and Frequency example : g(t) = sin(2pf t) + (1/3)sin(2p(3f) t) = +

Frequency Spectra example : g(t) = sin(2pf t) + (1/3)sin(2p(3f) t) = +

Frequency Spectra Usually, frequency is more interesting than the phase

= + = Frequency Spectra

= + =

= + =

= + =

= + =

=

FT: Just a change of basis * = M * f(x) = F(  )

IFT: Just a change of basis * = M -1 * F(  ) = f(x)

Finally: Scary Math

…not really scary: is hiding our old friend: So it’s just our signal f(x) times sine at frequency  phase can be encoded by sin/cos pair

Extension to 2D in Matlab, check out: imagesc(log(abs(fftshift(fft2(im)))));

Fourier analysis in images Intensity Image Fourier Image

Signals can be composed += More:

Man-made Scene

Can change spectrum, then reconstruct

Low and High Pass filtering

The Convolution Theorem The greatest thing since sliced (banana) bread! –The Fourier transform of the convolution of two functions is the product of their Fourier transforms –The inverse Fourier transform of the product of two Fourier transforms is the convolution of the two inverse Fourier transforms –Convolution in spatial domain is equivalent to multiplication in frequency domain!

2D convolution theorem example * f(x,y)f(x,y) h(x,y)h(x,y) g(x,y)g(x,y) |F(s x,s y )| |H(s x,s y )| |G(s x,s y )|

Why does the Gaussian give a nice smooth image, but the square filter give edgy artifacts? GaussianBox filter Filtering

Gaussian

Box Filter

Fourier Transform pairs

Low-pass, Band-pass, High-pass filters low-pass: High-pass / band-pass:

Edges in images

What does blurring take away? original

What does blurring take away? smoothed (5x5 Gaussian)

High-Pass filter smoothed – original

Band-pass filtering Laplacian Pyramid (subband images) Created from Gaussian pyramid by subtraction Gaussian Pyramid (low-pass images)

Laplacian Pyramid How can we reconstruct (collapse) this pyramid into the original image? Need this! Original image

Why Laplacian? Laplacian of Gaussian Gaussian delta function

Project 2: Hybrid Images rid.html Gaussian Filter! Laplacian Filter! Project Instructions: A. Oliva, A. Torralba, P.G. Schyns, “Hybrid Images,” SIGGRAPH 2006 “Hybrid Images,” Gaussian unit impulse Laplacian of Gaussian

Early processing in humans filters for various orientations and scales of frequency Perceptual cues in the mid frequencies dominate perception When we see an image from far away, we are effectively subsampling it Early Visual Processing: Multi-scale edge and blob filters Clues from Human Perception

Frequency Domain and Perception Campbell-Robson contrast sensitivity curve

Da Vinci and Peripheral Vision

Leonardo playing with peripheral vision

Unsharp Masking - = = + 

Freq. Perception Depends on Color R GB

Lossy Image Compression (JPEG) Block-based Discrete Cosine Transform (DCT)

Using DCT in JPEG The first coefficient B(0,0) is the DC component, the average intensity The top-left coeffs represent low frequencies, the bottom right – high frequencies

Image compression using DCT Quantize –More coarsely for high frequencies (which also tend to have smaller values) –Many quantized high frequency values will be zero Encode –Can decode with inverse dct Quantization table Filter responses Quantized values

JPEG Compression Summary Subsample color by factor of 2 –People have bad resolution for color Split into blocks (8x8, typically), subtract 128 For each block a.Compute DCT coefficients for b.Coarsely quantize Many high frequency components will become zero c.Encode (e.g., with Huffman coding)

Block size in JPEG Block size –small block faster correlation exists between neighboring pixels –large block better compression in smooth regions –It’s 8x8 in standard JPEG

JPEG compression comparison 89k12k

Image gradient The gradient of an image: The gradient points in the direction of most rapid change in intensity The gradient direction is given by: how does this relate to the direction of the edge? The edge strength is given by the gradient magnitude

Effects of noise Consider a single row or column of the image –Plotting intensity as a function of position gives a signal Where is the edge? How to compute a derivative?

Where is the edge? Solution: smooth first Look for peaks in

Derivative theorem of convolution This saves us one operation:

Laplacian of Gaussian Consider Laplacian of Gaussian operator Where is the edge? Zero-crossings of bottom graph

2D edge detection filters is the Laplacian operator: Laplacian of Gaussian Gaussianderivative of Gaussian

Try this in MATLAB g = fspecial('gaussian',15,2); imagesc(g); colormap(gray); surfl(g) gclown = conv2(clown,g,'same'); imagesc(conv2(clown,[-1 1],'same')); imagesc(conv2(gclown,[-1 1],'same')); dx = conv2(g,[-1 1],'same'); imagesc(conv2(clown,dx,'same')); lg = fspecial('log',15,2); lclown = conv2(clown,lg,'same'); imagesc(lclown) imagesc(clown +.2*lclown) Find your own image, clown doesn’t seem to be a default image anymore 

For next class I will send out a pointer to the links to look at other folks’ assignment 1s. Please look over these and send me your votes for best {assignment, image} with 1-2 sentence comments. Prof. Marc Niethammer will talk more about finding edges and regions of images next week. I will put the next assignment online soon – image compositing with variations. It will be due September 18. In the meantime, please experiment with the matlab from the lecture slides. One of the class goals is to get familiar with manipulating images!