Presentation is loading. Please wait.

Presentation is loading. Please wait.

The Frequency Domain : Computational Photography

Similar presentations


Presentation on theme: "The Frequency Domain : Computational Photography"— Presentation transcript:

1 The Frequency Domain 15-463: Computational Photography
Somewhere in Cinque Terre, May 2005 15-463: Computational Photography Alexei Efros, CMU, Fall 2012 Many slides borrowed from Steve Seitz

2 Salvador Dali “Gala Contemplating the Mediterranean Sea, which at 30 meters becomes the portrait of Abraham Lincoln”, 1976 Salvador Dali, “Gala Contemplating the Mediterranean Sea, which at 30 meters becomes the portrait of Abraham Lincoln”, 1976 Salvador Dali, “Gala Contemplating the Mediterranean Sea, which at 30 meters becomes the portrait of Abraham Lincoln”, 1976

3

4

5 A nice set of basis Teases away fast vs. slow changes in the image.
This change of basis has a special name…

6 Jean Baptiste Joseph Fourier (1768-1830)
...the manner in which the author arrives at these equations is not exempt of difficulties and...his analysis to integrate them still leaves something to be desired on the score of generality and even rigour. had crazy idea (1807): Any univariate function can be rewritten as a weighted sum of sines and cosines of different frequencies. Don’t believe it? Neither did Lagrange, Laplace, Poisson and other big wigs Not translated into English until 1878! But it’s (mostly) true! called Fourier Series there are some subtle restrictions Laplace Legendre Lagrange

7 A sum of sines Our building block:
Add enough of them to get any signal f(x) you want! How many degrees of freedom? What does each control? Which one encodes the coarse vs. fine structure of the signal?

8 Fourier Transform f(x) F(w) F(w) f(x)
We want to understand the frequency w of our signal. So, let’s reparametrize the signal by w instead of x: f(x) F(w) Fourier Transform For every w from 0 to inf, F(w) holds the amplitude A and phase f of the corresponding sine How can F hold both? Complex number trick! We can always go back: F(w) f(x) Inverse Fourier Transform

9 Time and Frequency example : g(t) = sin(2pf t) + (1/3)sin(2p(3f) t)

10 Time and Frequency example : g(t) = sin(2pf t) + (1/3)sin(2p(3f) t) =

11 Frequency Spectra example : g(t) = sin(2pf t) + (1/3)sin(2p(3f) t) = +

12 Frequency Spectra Usually, frequency is more interesting than the phase

13 Frequency Spectra = + =

14 Frequency Spectra = + =

15 Frequency Spectra = + =

16 Frequency Spectra = + =

17 Frequency Spectra = + =

18 Frequency Spectra =

19 Frequency Spectra

20 FT: Just a change of basis
M * f(x) = F(w) * = .

21 IFT: Just a change of basis
M-1 * F(w) = f(x) = * .

22 Finally: Scary Math

23 Finally: Scary Math …not really scary: is hiding our old friend:
So it’s just our signal f(x) times sine at frequency w phase can be encoded by sin/cos pair

24 Extension to 2D in Matlab, check out: imagesc(log(abs(fftshift(fft2(im)))));

25 Fourier analysis in images
Intensity Image Fourier Image

26 Signals can be composed
+ = More:

27 Man-made Scene

28 Can change spectrum, then reconstruct

29 Low and High Pass filtering

30 The Convolution Theorem
The greatest thing since sliced (banana) bread! The Fourier transform of the convolution of two functions is the product of their Fourier transforms The inverse Fourier transform of the product of two Fourier transforms is the convolution of the two inverse Fourier transforms Convolution in spatial domain is equivalent to multiplication in frequency domain!

31 2D convolution theorem example
|F(sx,sy)| f(x,y) * h(x,y) |H(sx,sy)| g(x,y) |G(sx,sy)|

32 Filtering Why does the Gaussian give a nice smooth image, but the square filter give edgy artifacts? Gaussian Box filter Things we can’t understand without thinking in frequency 32

33 Gaussian

34 Box Filter

35 Fourier Transform pairs

36 Low-pass, Band-pass, High-pass filters
High-pass / band-pass:

37 Edges in images

38 What does blurring take away?
original

39 What does blurring take away?
smoothed (5x5 Gaussian)

40 High-Pass filter smoothed – original

41 Band-pass filtering Gaussian Pyramid (low-pass images)
Laplacian Pyramid (subband images) Created from Gaussian pyramid by subtraction

42 Laplacian Pyramid Need this! Original image How can we reconstruct (collapse) this pyramid into the original image?

43 Why Laplacian? Gaussian delta function Laplacian of Gaussian

44 Project 2: Hybrid Images
A. Oliva, A. Torralba, P.G. Schyns, “Hybrid Images,” SIGGRAPH 2006 Gaussian Filter! Gaussian unit impulse Laplacian of Gaussian Laplacian Filter! Project Instructions: 44

45 Clues from Human Perception
Early processing in humans filters for various orientations and scales of frequency Perceptual cues in the mid frequencies dominate perception When we see an image from far away, we are effectively subsampling it Early Visual Processing: Multi-scale edge and blob filters

46 Frequency Domain and Perception
Campbell-Robson contrast sensitivity curve 46

47 Da Vinci and Peripheral Vision

48 Leonardo playing with peripheral vision

49 Unsharp Masking - = = + a

50 Freq. Perception Depends on Color
G B

51 Lossy Image Compression (JPEG)
Block-based Discrete Cosine Transform (DCT)

52 Using DCT in JPEG The first coefficient B(0,0) is the DC component, the average intensity The top-left coeffs represent low frequencies, the bottom right – high frequencies

53 Image compression using DCT
Quantize More coarsely for high frequencies (which also tend to have smaller values) Many quantized high frequency values will be zero Encode Can decode with inverse dct Filter responses Quantization table Quantized values

54 JPEG Compression Summary
Subsample color by factor of 2 People have bad resolution for color Split into blocks (8x8, typically), subtract 128 For each block Compute DCT coefficients for Coarsely quantize Many high frequency components will become zero Encode (e.g., with Huffman coding)

55 Block size in JPEG Block size small block large block
faster correlation exists between neighboring pixels large block better compression in smooth regions It’s 8x8 in standard JPEG

56 JPEG compression comparison
89k 12k

57 Image gradient The gradient of an image:
The gradient points in the direction of most rapid change in intensity The gradient direction is given by: how does this relate to the direction of the edge? The edge strength is given by the gradient magnitude

58 Effects of noise Consider a single row or column of the image
Plotting intensity as a function of position gives a signal How to compute a derivative? Where is the edge?

59 Solution: smooth first
Where is the edge? Look for peaks in

60 Derivative theorem of convolution
This saves us one operation:

61 Laplacian of Gaussian Consider Where is the edge?
operator Where is the edge? Zero-crossings of bottom graph

62 2D edge detection filters
Laplacian of Gaussian Gaussian derivative of Gaussian How many 2nd derivative filters are there? There are four 2nd partial derivative filters. In practice, it’s handy to define a single 2nd derivative filter—the Laplacian is the Laplacian operator:

63 Try this in MATLAB g = fspecial('gaussian',15,2);
imagesc(g); colormap(gray); surfl(g) gclown = conv2(clown,g,'same'); imagesc(conv2(clown,[-1 1],'same')); imagesc(conv2(gclown,[-1 1],'same')); dx = conv2(g,[-1 1],'same'); imagesc(conv2(clown,dx,'same')); lg = fspecial('log',15,2); lclown = conv2(clown,lg,'same'); imagesc(lclown) imagesc(clown + .2*lclown)


Download ppt "The Frequency Domain : Computational Photography"

Similar presentations


Ads by Google