What have we leaned so far? Camera structure Eye structure Project 1: High Dynamic Range Imaging.

Slides:



Advertisements
Similar presentations
Stereo Vision Reading: Chapter 11
Advertisements

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923.
Gratuitous Picture US Naval Artillery Rangefinder from World War I (1918)!!
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image Where does the.
Recap from Previous Lecture Tone Mapping – Preserve local contrast or detail at the expense of large scale contrast. – Changing the brightness within.
Lecture 8: Stereo.
Stereo.
776 Computer Vision Jan-Michael Frahm, Enrique Dunn Spring 2012.
Last Time Pinhole camera model, projection
CS6670: Computer Vision Noah Snavely Lecture 17: Stereo
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2005 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
More on Stereo. Outline Fast window-based correlationFast window-based correlation DiffusionDiffusion Energy minimizationEnergy minimization Graph cutsGraph.
Computer Vision : CISC 4/689 Adaptation from: Prof. James M. Rehg, G.Tech.
Stereo & Iterative Graph-Cuts Alex Rav-Acha Vision Course Hebrew University.
Stereo Binocular Stereo Calibration (finish up) Next Time Motivation
Stereopsis Mark Twain at Pool Table", no date, UCR Museum of Photography.
The plan for today Camera matrix
3D from multiple views : Rendering and Image Processing Alexei Efros …with a lot of slides stolen from Steve Seitz and Jianbo Shi.
CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.
Lecture 10: Stereo and Graph Cuts
Stereo and Structure from Motion
Stereo Computation using Iterative Graph-Cuts
Lecture 11: Stereo and optical flow CS6670: Computer Vision Noah Snavely.
CSE473/573 – Stereo Correspondence
Announcements PS3 Due Thursday PS4 Available today, due 4/17. Quiz 2 4/24.
Stereo Guest Lecture by Li Zhang
Project 1 artifact winners Project 2 questions Project 2 extra signup slots –Can take a second slot if you’d like Announcements.
Midterm went out on Tuesday (due next Tuesday) Project 3 out today Announcements.
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2006 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
VZvZ r t b image cross ratio Measuring height B (bottom of object) T (top of object) R (reference point) ground plane H C scene cross ratio  scene points.
Stereo matching Class 10 Read Chapter 7 Tsukuba dataset.
CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.
What have we leaned so far? Camera structure Eye structure Project 1: High Dynamic Range Imaging.
Computer Vision Spring ,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30am – 11:50am Lecture #15.
Stereo Readings Trucco & Verri, Chapter 7 –Read through 7.1, 7.2.1, 7.2.2, 7.3.1, 7.3.2, and 7.4, –The rest is optional. Single image stereogram,
Camera Calibration & Stereo Reconstruction Jinxiang Chai.
Announcements Project 1 artifact winners Project 2 questions
Structure from images. Calibration Review: Pinhole Camera.
Lecture 12 Stereo Reconstruction II Lecture 12 Stereo Reconstruction II Mata kuliah: T Computer Vision Tahun: 2010.
Recap from Monday Image Warping – Coordinate transforms – Linear transforms expressed in matrix form – Inverse transforms useful when synthesizing images.
Epipolar geometry Epipolar Plane Baseline Epipoles Epipolar Lines
Stereo Vision Reading: Chapter 11 Stereo matching computes depth from two or more images Subproblems: –Calibrating camera positions. –Finding all corresponding.
Stereo Readings Szeliski, Chapter 11 (through 11.5) Single image stereogram, by Niklas EenNiklas Een.
Stereo Many slides adapted from Steve Seitz.
Project 2 code & artifact due Friday Midterm out tomorrow (check your ), due next Fri Announcements TexPoint fonts used in EMF. Read the TexPoint.
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image image 1image.
Computer Vision, Robert Pless
Computer Vision Stereo Vision. Bahadir K. Gunturk2 Pinhole Camera.
Bahadir K. Gunturk1 Phase Correlation Bahadir K. Gunturk2 Phase Correlation Take cross correlation Take inverse Fourier transform  Location of the impulse.
Lecture 16: Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.
Digital Image Processing
776 Computer Vision Jan-Michael Frahm Spring 2012.
Solving for Stereo Correspondence Many slides drawn from Lana Lazebnik, UIUC.
Paper presentation topics 2. More on feature detection and descriptors 3. Shape and Matching 4. Indexing and Retrieval 5. More on 3D reconstruction 1.
A global approach Finding correspondence between a pair of epipolar lines for all pixels simultaneously Local method: no guarantee we will have one to.
Project 2 due today Project 3 out today Announcements TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAA.
Correspondence and Stereopsis Original notes by W. Correa. Figures from [Forsyth & Ponce] and [Trucco & Verri]
Project 2 artifacts—vote now!! Project 3 questions? Start thinking about final project ideas, partners Announcements.
Project 3 code & artifact due Tuesday Final project proposals due noon Wed (by ) –One-page writeup (from project web page), specifying: »Your team.
Energy minimization Another global approach to improve quality of correspondences Assumption: disparities vary (mostly) smoothly Minimize energy function:
CSE 185 Introduction to Computer Vision Stereo 2.
Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.
Noah Snavely, Zhengqi Li
CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo.
Stereo and Structure from Motion
Announcements Midterms graded (handed back at end of lecture)
Thanks to Richard Szeliski and George Bebis for the use of some slides
What have we learned so far?
Computer Vision Stereo Vision.
Stereo vision Many slides adapted from Steve Seitz.
Presentation transcript:

What have we leaned so far? Camera structure Eye structure Project 1: High Dynamic Range Imaging

What have we learned so far? Image Filtering Image Warping Camera Projection Model Project 2: Panoramic Image Stitching

What have we learned so far? Projective Geometry Single View Modeling Shading Model Project 3: Photometric Stereo

Today 3D modeling from two images – Stereo

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923

Stereograms online UCR stereographs The Art of Stereo Photography History of Stereo Photography Double Exposure Stereo Photography 3D Photography links National Stereoscopic Association Books on Stereo Photography A free pair of red-blue stereo glasses can be ordered from Rainbow Symphony IncRainbow Symphony Inc

Stereo scene point optical center image plane

Stereo Basic Principle: Triangulation Gives reconstruction as intersection of two rays Requires –calibration –point correspondence

Stereo correspondence Determine Pixel Correspondence Pairs of points that correspond to same scene point Epipolar Constraint Reduces correspondence problem to 1D search along conjugate epipolar lines Java demo: epipolar plane epipolar line

Epipolar Line Example courtesy of Marc Pollefeys

Stereo image rectification

reproject image planes onto a common plane parallel to the line between optical centers pixel motion is horizontal after this transformation two homographies (3x3 transform), one for each input image reprojection  C. Loop and Z. Zhang. Computing Rectifying Homographies for Stereo Vision. IEEE Conf. Computer Vision and Pattern Recognition, 1999.Computing Rectifying Homographies for Stereo Vision

Epipolar Line Example courtesy of Marc Pollefeys

Epipolar Line Example courtesy of Marc Pollefeys

Stereo matching algorithms Match Pixels in Conjugate Epipolar Lines Assume brightness constancy This is a tough problem Numerous approaches –A good survey and evaluation:

Basic stereo algorithm For each epipolar line For each pixel in the left image compare with every pixel on same epipolar line in right image pick pixel with minimum match cost Improvement: match windows

Basic stereo algorithm For each pixel For each disparity –For each pixel in window »Compute difference Find disparity with minimum SSD

Reverse order of loops For each disparity For each pixel –For each pixel in window »Compute difference Find disparity with minimum SSD at each pixel

Incremental computation Given SSD of a window, at some disparity Image 1 Image 2

Incremental computation Want: SSD at next location Image 1 Image 2

Incremental computation Subtract contributions from leftmost column, add contributions from rightmost column Image 1 Image

Selecting window size Small window: more detail, but more noise Large window: more robustness, less detail Example:

Selecting window size 3 pixel window 20 pixel window

Non-square windows Compromise: have a large window, but higher weight near the center Example: Gaussian Example: Shifted windows

Ordering constraint Order of matching features usually the same in both images But not always: occlusion

Dynamic programming Treat feature correspondence as graph problem Left image features Right image features Cost of edges = similarity of regions between image features

Dynamic programming Find min-cost path through graph Left image features Right image features

Dynamic Programming Results

Energy minimization Another approach to improve quality of correspondences Assumption: disparities vary (mostly) smoothly Minimize energy function: E data + E smoothness E data : how well does disparity match data E smoothness : how well does disparity match that of neighbors – regularization

Stereo as energy minimization Matching Cost Formulated as Energy “data” term penalizing bad matches “neighborhood term” encouraging spatial smoothness

Energy minimization If data and energy terms are nice (continuous, smooth, etc.) can try to minimize via gradient descent, etc. In practice, disparities only piecewise smooth Design smoothness function that doesn’t penalize large jumps too much Example: V(  )=min(|  |, K)

Energy minimization Hard to find global minima of non-smooth functions Many local minima Provably NP-hard Practical algorithms look for approximate minima (e.g., simulated annealing)

Energy minimization via graph cuts Labels (disparities) d1d1 d2d2 d3d3 edge weight

Graph Cost Matching cost between images Neighborhood matching term Goal: figure out which labels are connected to which pixels d1d1 d2d2 d3d3 Energy minimization via graph cuts

d1d1 d2d2 d3d3 Graph Cut Delete enough edges so that –each pixel is (transitively) connected to exactly one label node Cost of a cut: sum of deleted edge weights Finding min cost cut equivalent to finding global minimum of energy function

Computing a multiway cut With 2 labels: classical min-cut problem Solvable by standard flow algorithms –polynomial time in theory, nearly linear in practice More than 2 terminals: NP-hard [Dahlhaus et al., STOC ‘92] Efficient approximation algorithms exist Within a factor of 2 of optimal Computes local minimum in a strong sense –even very large moves will not improve the energy Yuri Boykov, Olga Veksler and Ramin Zabih, Fast Approximate Energy Minimization via Graph Cuts, International Conference on Computer Vision, September 1999.Fast Approximate Energy Minimization via Graph Cuts

Move examples Starting point Red-blue swap move Green expansion move

The swap move algorithm 1. Start with an arbitrary labeling 2. Cycle through every label pair (A,B) in some order 2.1 Find the lowest E labeling within a single AB-swap 2.2 Go there if it’s lower E than the current labeling 3. If E did not decrease in the cycle, we’re done Otherwise, go to step 2 Original graph A B AB subgraph (run min-cut on this graph) B A

The expansion move algorithm 1. Start with an arbitrary labeling 2. Cycle through every label A in some order 2.1 Find the lowest E labeling within a single A- expansion 2.2 Go there if it’s lower E than the current labeling 3. If E did not decrease in the cycle, we’re done Otherwise, go to step 2

Stereo results ground truth scene Data from University of Tsukuba

Results with window correlation normalized correlation (best window size) ground truth

Results with graph cuts ground truthgraph cuts (Potts model E, expansion move algorithm)

Depth from disparity f xx’ baseline z CC’ X f input image (1 of 2) [Szeliski & Kang ‘95] depth map 3D rendering

Real-time stereo Used for robot navigation (and other tasks) Several software-based real-time stereo techniques have been developed (most based on simple discrete search) Nomad robot Nomad robot searches for meteorites in Antartica

Camera calibration errors Poor image resolution Occlusions Violations of brightness constancy (specular reflections) Large motions Low-contrast image regions Stereo reconstruction pipeline Steps Calibrate cameras Rectify images Compute disparity Estimate depth What will cause errors?

Spacetime Stereo Li Zhang, Noah Snavely, Brian Curless, Steven Seitz CVPR 2003, SIGGRAPH 2004

Stereo

? ? ?

Marker-based Face Capture The Polar Express, 2004 “The largest intractable problem with ‘The Polar Express’ is that the motion-capture technology used to create the human figures has resulted in a film filled with creepily unlifelike beings.” New York Times Review, Nov 2004

Stereo

Frame-by-Frame Stereo W  H = 15  15 Window A Pair of Videos 640  Each Inaccurate & Jittering

3D Surface Spacetime Stereo

Time 3D Surface

Spacetime Stereo Time 3D Surface

Spacetime Stereo 3D Surface Time

Spacetime Stereo Surface Motion Time

Spacetime Stereo Surface Motion Time=0

Spacetime Stereo Surface Motion Time=1

Spacetime Stereo Surface Motion Time=2

Spacetime Stereo Surface Motion Time=3

Spacetime Stereo Surface Motion Time=4

Surface Motion Matching Volumetric Window Affine Window Deformation Key ideas: Spacetime Stereo Time

Spacetime Stereo Time

Spacetime Stereo Time

Spacetime Stereo

A Pair of Videos 640  Each Spacetime Stereo W  H  T = 9  5  5 Window

Frame-by-Frame vs. Spacetime Stereo Spacetime Stereo W  H  T = 9  5  5 Window Frame-by-Frame W  H = 15  15 Window Spatially More Accurate Temporally More Stable

Video Projectors Color Cameras Black & White Cameras Spacetime Face Capture System

System in Action

Input Videos (640  480, 60fps)

Spacetime Stereo Reconstruction

Creating a Face Database

[Zhang et al. SIGGRAPH’04] …

Application 1: Expression Synthesis [Zhang et al. SIGGRAPH’04] … A New Expression:

Application 2: Facial Animation [Zhang et al. SIGGRAPH’04] …

Keyframe Animation