The Frequency Domain : Computational Photography

Slides:



Advertisements
Similar presentations
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean Hall 5409 T-R 10:30am – 11:50am.
Advertisements

CS 691 Computational Photography Instructor: Gianfranco Doretto Frequency Domain.
Thinking in Frequency Computer Vision Brown James Hays Slides: Hoiem, Efros, and others.
Computer Vision University of Illinois Derek Hoiem
Linear Filtering – Part II Selim Aksoy Department of Computer Engineering Bilkent University
Templates, Image Pyramids, and Filter Banks Slides largely from Derek Hoeim, Univ. of Illinois.
Templates and Image Pyramids Computational Photography Derek Hoiem, University of Illinois 09/06/11.
Computer Graphics Recitation 6. 2 Motivation – Image compression What linear combination of 8x8 basis signals produces an 8x8 block in the image?
Convolution, Edge Detection, Sampling : Computational Photography Alexei Efros, CMU, Fall 2006 Some slides from Steve Seitz.
Digital Image Processing Chapter 4: Image Enhancement in the Frequency Domain.
CSCE 641 Computer Graphics: Fourier Transform Jinxiang Chai.
Fourier Analysis Without Tears : Computational Photography Alexei Efros, CMU, Fall 2005 Somewhere in Cinque Terre, May 2005.
The Frequency Domain : Computational Photography Alexei Efros, CMU, Fall 2011 Somewhere in Cinque Terre, May 2005 Many slides borrowed from Steve.
Convolution and Edge Detection : Computational Photography Alexei Efros, CMU, Fall 2005 Some slides from Steve Seitz.
CPSC 641 Computer Graphics: Fourier Transform Jinxiang Chai.
Templates, Image Pyramids, and Filter Banks Computer Vision James Hays, Brown Slides: Hoiem and others.
The Frequency Domain : Computational Photography Alexei Efros, CMU, Fall 2008 Somewhere in Cinque Terre, May 2005 Many slides borrowed from Steve.
Computational Photography: Fourier Transform Jinxiang Chai.
Fourier Analysis : Rendering and Image Processing Alexei Efros.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean 5403 T-R 3:00pm – 4:20pm.
CSE 185 Introduction to Computer Vision Image Filtering: Frequency Domain.
Slide credit Fei Fei Li. Image filtering Image filtering: compute function of local neighborhood at each position Really important! – Enhance images.
Computational Photography lecture 5 – filtering and frequencies CS 590 Spring 2014 Prof. Alex Berg (Credits to many other folks on individual slides)
Computer Vision Spring ,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30 – 11:50am.
Templates, Image Pyramids, and Filter Banks Computer Vision Derek Hoiem, University of Illinois 01/31/12.
The Frequency Domain Somewhere in Cinque Terre, May 2005 Many slides borrowed from Steve Seitz CS194: Image Manipulation & Computational Photography Alexei.
CS559: Computer Graphics Lecture 3: Digital Image Representation Li Zhang Spring 2008.
Recap of Friday linear Filtering convolution differential filters
Computer Vision James Hays Thinking in Frequency Computer Vision James Hays Slides: Hoiem, Efros, and others.
CSC589 Introduction to Computer Vision Lecture 7 Thinking in Frequency Bei Xiao.
Templates, Image Pyramids, and Filter Banks
CSE 185 Introduction to Computer Vision Image Filtering: Frequency Domain.
DCT cs195g: Computational Photography James Hays, Brown, Spring 2010 Somewhere in Cinque Terre, May 2005 Slides from Alexei Efros.
Templates and Image Pyramids Computational Photography Derek Hoiem, University of Illinois 09/03/15.
Mestrado em Ciência de Computadores Mestrado Integrado em Engenharia de Redes e Sistemas Informáticos VC 14/15 – TP6 Frequency Space Miguel Tavares Coimbra.
Thinking in Frequency Computational Photography University of Illinois Derek Hoiem 09/01/15.
Instructor: S. Narasimhan
The Frequency Domain : Computational Photography Alexei Efros, CMU, Fall 2010 Somewhere in Cinque Terre, May 2005 Many slides borrowed from Steve.
Recap of Monday linear Filtering convolution differential filters filter types boundary conditions.
Image as a linear combination of basis images
Mestrado em Ciência de Computadores Mestrado Integrado em Engenharia de Redes e Sistemas Informáticos VC 15/16 – TP7 Spatial Filters Miguel Tavares Coimbra.
Lecture 5: Fourier and Pyramids
Last Lecture photomatix.com. Today Image Processing: from basic concepts to latest techniques Filtering Edge detection Re-sampling and aliasing Image.
Frequency domain analysis and Fourier Transform
Fourier Transform and Frequency Domain
Miguel Tavares Coimbra
Slide credit Fei Fei Li.
The Frequency Domain, without tears
Edge Detection slides taken and adapted from public websites:
Computer Vision Brown James Hays 09/16/11 Thinking in Frequency Computer Vision Brown James Hays Slides: Hoiem, Efros, and others.
Sampling, Template Matching and Pyramids
Slow mo guys – Saccades
Linear Filtering – Part II
Lecture 1.26 Spectral analysis of periodic and non-periodic signals.
Image Pyramids and Applications
Miguel Tavares Coimbra
Frequency domain analysis and Fourier Transform
Computer Vision Jia-Bin Huang, Virginia Tech
Jeremy Bolton, PhD Assistant Teaching Professor
CSCE 643 Computer Vision: Image Sampling and Filtering
CSCE 643 Computer Vision: Thinking in Frequency
Edge Detection Today’s reading
Lecture 5: Resampling, Compositing, and Filtering Li Zhang Spring 2008
Instructor: S. Narasimhan
Oh, no, wait, sorry, I meant, welcome to the implementation of 20th-century science fiction literature, or a small part of it, where we will learn about.
Fourier Analysis Without Tears
Edge Detection Today’s reading
Review and Importance CS 111.
Thinking in Frequency Computational Photography University of Illinois
Templates and Image Pyramids
Presentation transcript:

The Frequency Domain 15-463: Computational Photography Somewhere in Cinque Terre, May 2005 15-463: Computational Photography Alexei Efros, CMU, Fall 2012 Many slides borrowed from Steve Seitz

Salvador Dali “Gala Contemplating the Mediterranean Sea, which at 30 meters becomes the portrait of Abraham Lincoln”, 1976 Salvador Dali, “Gala Contemplating the Mediterranean Sea, which at 30 meters becomes the portrait of Abraham Lincoln”, 1976 Salvador Dali, “Gala Contemplating the Mediterranean Sea, which at 30 meters becomes the portrait of Abraham Lincoln”, 1976

A nice set of basis Teases away fast vs. slow changes in the image. This change of basis has a special name…

Jean Baptiste Joseph Fourier (1768-1830) ...the manner in which the author arrives at these equations is not exempt of difficulties and...his analysis to integrate them still leaves something to be desired on the score of generality and even rigour. had crazy idea (1807): Any univariate function can be rewritten as a weighted sum of sines and cosines of different frequencies. Don’t believe it? Neither did Lagrange, Laplace, Poisson and other big wigs Not translated into English until 1878! But it’s (mostly) true! called Fourier Series there are some subtle restrictions Laplace Legendre Lagrange

A sum of sines Our building block: Add enough of them to get any signal f(x) you want! How many degrees of freedom? What does each control? Which one encodes the coarse vs. fine structure of the signal?

Fourier Transform f(x) F(w) F(w) f(x) We want to understand the frequency w of our signal. So, let’s reparametrize the signal by w instead of x: f(x) F(w) Fourier Transform For every w from 0 to inf, F(w) holds the amplitude A and phase f of the corresponding sine How can F hold both? Complex number trick! We can always go back: F(w) f(x) Inverse Fourier Transform

Time and Frequency example : g(t) = sin(2pf t) + (1/3)sin(2p(3f) t)

Time and Frequency example : g(t) = sin(2pf t) + (1/3)sin(2p(3f) t) =

Frequency Spectra example : g(t) = sin(2pf t) + (1/3)sin(2p(3f) t) = +

Frequency Spectra Usually, frequency is more interesting than the phase

Frequency Spectra = + =

Frequency Spectra = + =

Frequency Spectra = + =

Frequency Spectra = + =

Frequency Spectra = + =

Frequency Spectra =

Frequency Spectra

FT: Just a change of basis M * f(x) = F(w) * = .

IFT: Just a change of basis M-1 * F(w) = f(x) = * .

Finally: Scary Math

Finally: Scary Math …not really scary: is hiding our old friend: So it’s just our signal f(x) times sine at frequency w phase can be encoded by sin/cos pair

Extension to 2D in Matlab, check out: imagesc(log(abs(fftshift(fft2(im)))));

Fourier analysis in images Intensity Image Fourier Image http://sharp.bu.edu/~slehar/fourier/fourier.html#filtering

Signals can be composed + = http://sharp.bu.edu/~slehar/fourier/fourier.html#filtering More: http://www.cs.unm.edu/~brayer/vision/fourier.html

Man-made Scene

Can change spectrum, then reconstruct

Low and High Pass filtering

The Convolution Theorem The greatest thing since sliced (banana) bread! The Fourier transform of the convolution of two functions is the product of their Fourier transforms The inverse Fourier transform of the product of two Fourier transforms is the convolution of the two inverse Fourier transforms Convolution in spatial domain is equivalent to multiplication in frequency domain!

2D convolution theorem example |F(sx,sy)| f(x,y) * h(x,y) |H(sx,sy)| g(x,y) |G(sx,sy)|

Filtering Why does the Gaussian give a nice smooth image, but the square filter give edgy artifacts? Gaussian Box filter Things we can’t understand without thinking in frequency 32

Gaussian

Box Filter

Fourier Transform pairs

Low-pass, Band-pass, High-pass filters High-pass / band-pass:

Edges in images

What does blurring take away? original

What does blurring take away? smoothed (5x5 Gaussian)

High-Pass filter smoothed – original

Band-pass filtering Gaussian Pyramid (low-pass images) Laplacian Pyramid (subband images) Created from Gaussian pyramid by subtraction

Laplacian Pyramid Need this! Original image How can we reconstruct (collapse) this pyramid into the original image?

Why Laplacian? Gaussian delta function Laplacian of Gaussian

Project 2: Hybrid Images A. Oliva, A. Torralba, P.G. Schyns, “Hybrid Images,” SIGGRAPH 2006 Gaussian Filter! Gaussian unit impulse Laplacian of Gaussian Laplacian Filter! Project Instructions: http://www.cs.illinois.edu/class/fa10/cs498dwh/projects/hybrid/ComputationalPhotography_ProjectHybrid.html 44

Clues from Human Perception Early processing in humans filters for various orientations and scales of frequency Perceptual cues in the mid frequencies dominate perception When we see an image from far away, we are effectively subsampling it Early Visual Processing: Multi-scale edge and blob filters

Frequency Domain and Perception Campbell-Robson contrast sensitivity curve 46

Da Vinci and Peripheral Vision

Leonardo playing with peripheral vision

Unsharp Masking - = = + a

Freq. Perception Depends on Color G B

Lossy Image Compression (JPEG) Block-based Discrete Cosine Transform (DCT)

Using DCT in JPEG The first coefficient B(0,0) is the DC component, the average intensity The top-left coeffs represent low frequencies, the bottom right – high frequencies

Image compression using DCT Quantize More coarsely for high frequencies (which also tend to have smaller values) Many quantized high frequency values will be zero Encode Can decode with inverse dct Filter responses Quantization table Quantized values

JPEG Compression Summary Subsample color by factor of 2 People have bad resolution for color Split into blocks (8x8, typically), subtract 128 For each block Compute DCT coefficients for Coarsely quantize Many high frequency components will become zero Encode (e.g., with Huffman coding) http://en.wikipedia.org/wiki/YCbCr http://en.wikipedia.org/wiki/JPEG

Block size in JPEG Block size small block large block faster correlation exists between neighboring pixels large block better compression in smooth regions It’s 8x8 in standard JPEG

JPEG compression comparison 89k 12k

Image gradient The gradient of an image: The gradient points in the direction of most rapid change in intensity The gradient direction is given by: how does this relate to the direction of the edge? The edge strength is given by the gradient magnitude

Effects of noise Consider a single row or column of the image Plotting intensity as a function of position gives a signal How to compute a derivative? Where is the edge?

Solution: smooth first Where is the edge? Look for peaks in

Derivative theorem of convolution This saves us one operation:

Laplacian of Gaussian Consider Where is the edge? operator Where is the edge? Zero-crossings of bottom graph

2D edge detection filters Laplacian of Gaussian Gaussian derivative of Gaussian How many 2nd derivative filters are there? There are four 2nd partial derivative filters. In practice, it’s handy to define a single 2nd derivative filter—the Laplacian is the Laplacian operator:

Try this in MATLAB g = fspecial('gaussian',15,2); imagesc(g); colormap(gray); surfl(g) gclown = conv2(clown,g,'same'); imagesc(conv2(clown,[-1 1],'same')); imagesc(conv2(gclown,[-1 1],'same')); dx = conv2(g,[-1 1],'same'); imagesc(conv2(clown,dx,'same')); lg = fspecial('log',15,2); lclown = conv2(clown,lg,'same'); imagesc(lclown) imagesc(clown + .2*lclown)