Depth and Motion Discontinuities Stan Birchfield Ph.D. oral defense Stanford University January 1999.

Slides:



Advertisements
Similar presentations
Gratuitous Picture US Naval Artillery Rangefinder from World War I (1918)!!
Advertisements

Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image Where does the.
MASKS © 2004 Invitation to 3D vision Lecture 7 Step-by-Step Model Buidling.
Qualifying Exam: Contour Grouping Vida Movahedi Supervisor: James Elder Supervisory Committee: Minas Spetsakis, Jeff Edmonds York University Summer 2009.
Graph-Based Image Segmentation
Forward-Backward Correlation for Template-Based Tracking Xiao Wang ECE Dept. Clemson University.
Lecture 8: Stereo.
Silhouettes in Multiview Stereo Ian Simon. Multiview Stereo Problem Input: – a collection of images of a rigid object (or scene) – camera parameters for.
Last Time Pinhole camera model, projection
Contents Description of the big picture Theoretical background on this work The Algorithm Examples.
Computer Vision : CISC 4/689 Adaptation from: Prof. James M. Rehg, G.Tech.
Stereo & Iterative Graph-Cuts Alex Rav-Acha Vision Course Hebrew University.
Robust Lane Detection and Tracking
Introduction to Computer Vision 3D Vision Topic 9 Stereo Vision (I) CMPSCI 591A/691A CMPSCI 570/670.
The plan for today Camera matrix
CS 223b 1 More on stereo and correspondence. CS 223b 2 =?f g Mostpopular For each window, match to closest window on epipolar line in other image. (slides.
Optical flow and Tracking CISC 649/849 Spring 2009 University of Delaware.
Lecture 10: Stereo and Graph Cuts
Stereo Computation using Iterative Graph-Cuts
Accurate, Dense and Robust Multi-View Stereopsis Yasutaka Furukawa and Jean Ponce Presented by Rahul Garg and Ryan Kaminsky.
Stereo matching Class 10 Read Chapter 7 Tsukuba dataset.
Computer Vision - A Modern Approach Set: Segmentation Slides by D.A. Forsyth Segmentation and Grouping Motivation: not information is evidence Obtain a.
Structure from images. Calibration Review: Pinhole Camera.
Michael Bleyer LVA Stereo Vision
Lecture 12 Stereo Reconstruction II Lecture 12 Stereo Reconstruction II Mata kuliah: T Computer Vision Tahun: 2010.
Graph Cut 韋弘 2010/2/22. Outline Background Graph cut Ford–Fulkerson algorithm Application Extended reading.
Interactive Graph Cuts for Optimal Boundary & Region Segmentation of Objects in N-D Images (Fri) Young Ki Baik, Computer Vision Lab.
Stereo Vision Reading: Chapter 11 Stereo matching computes depth from two or more images Subproblems: –Calibrating camera positions. –Finding all corresponding.
3D Sensing and Reconstruction Readings: Ch 12: , Ch 13: , Perspective Geometry Camera Model Stereo Triangulation 3D Reconstruction by.
Auditory and Visual Spatial Sensing Stan Birchfield Department of Electrical and Computer Engineering Clemson University.
Stereo Many slides adapted from Steve Seitz.
CS 4487/6587 Algorithms for Image Analysis
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image image 1image.
Computer Vision, Robert Pless
December 9, 2014Computer Vision Lecture 23: Motion Analysis 1 Now we will talk about… Motion Analysis.
Lec 22: Stereo CS4670 / 5670: Computer Vision Kavita Bala.
1 Markov Random Fields with Efficient Approximations Yuri Boykov, Olga Veksler, Ramin Zabih Computer Science Department CORNELL UNIVERSITY.
Computer Vision Stereo Vision. Bahadir K. Gunturk2 Pinhole Camera.
Computer Vision Lecture #10 Hossam Abdelmunim 1 & Aly A. Farag 2 1 Computer & Systems Engineering Department, Ain Shams University, Cairo, Egypt 2 Electerical.
Elliptical Head Tracking Using Intensity Gradients and Color Histograms Stan Birchfield Stanford University Autodesk Advanced Products Group
Bahadir K. Gunturk1 Phase Correlation Bahadir K. Gunturk2 Phase Correlation Take cross correlation Take inverse Fourier transform  Location of the impulse.
Image-Based Segmentation of Indoor Corridor Floors for a Mobile Robot
Image-Based Segmentation of Indoor Corridor Floors for a Mobile Robot Yinxiao Li and Stanley T. Birchfield The Holcombe Department of Electrical and Computer.
Lecture 16: Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.
Non-Ideal Iris Segmentation Using Graph Cuts
CSE 185 Introduction to Computer Vision Feature Matching.
Solving for Stereo Correspondence Many slides drawn from Lana Lazebnik, UIUC.
Course14 Dynamic Vision. Biological vision can cope with changing world Moving and changing objects Change illumination Change View-point.
Jeong Kanghun CRV (Computer & Robot Vision) Lab..
Vision Sensors for Stereo and Motion Joshua Gluckman Polytechnic University.
1Ellen L. Walker 3D Vision Why? The world is 3D Not all useful information is readily available in 2D Why so hard? “Inverse problem”: one image = many.
A global approach Finding correspondence between a pair of epipolar lines for all pixels simultaneously Local method: no guarantee we will have one to.
Project 2 due today Project 3 out today Announcements TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAA.
John Morris Stereo Vision (continued) Iolanthe returns to the Waitemata Harbour.
Image-Based Rendering Geometry and light interaction may be difficult and expensive to model –Think of how hard radiosity is –Imagine the complexity of.
John Morris These slides were adapted from a set of lectures written by Mircea Nicolescu, University of Nevada at Reno Stereo Vision Iolanthe in the Bay.
Person Following with a Mobile Robot Using Binocular Feature-Based Tracking Zhichao Chen and Stanley T. Birchfield Dept. of Electrical and Computer Engineering.
Instructor: Mircea Nicolescu Lecture 5 CS 485 / 685 Computer Vision.
Photoconsistency constraint C2 q C1 p l = 2 l = 3 Depth labels If this 3D point is visible in both cameras, pixels p and q should have similar intensities.
Motion Segmentation at Any Speed Shrinivas J. Pundlik Department of Electrical and Computer Engineering, Clemson University, Clemson, SC.
Computational Vision CSCI 363, Fall 2012 Lecture 17 Stereopsis II
Energy minimization Another global approach to improve quality of correspondences Assumption: disparities vary (mostly) smoothly Minimize energy function:
Correspondence and Stereopsis. Introduction Disparity – Informally: difference between two pictures – Allows us to gain a strong sense of depth Stereopsis.
Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.
A Plane-Based Approach to Mondrian Stereo Matching
Approximate Models for Fast and Accurate Epipolar Geometry Estimation
Range Imaging Through Triangulation
Multiway Cut for Stereo and Motion with Slanted Surfaces
Computer Vision Stereo Vision.
Optical flow and keypoint tracking
Presentation transcript:

Depth and Motion Discontinuities Stan Birchfield Ph.D. oral defense Stanford University January 1999

PHOTOMETRIC GEOMETRIC Discontinuities intensity, color, texture,... DepthMotion camera surface camera surface

Fundamental to image understanding Object boundaries (well-defined) Assumption of continuity Occlusion Importance of Discontinuities camera 1camera 2 object

Motivation segmentation classification tracking camera control video retrieval detection IMAGESLOW-LEVEL FILTERSHIGH-LEVEL TASKS

Evidence for Discontinuities LOCAL –T-junctions [Parida et al. 1997, Ruzon & Tomasi 1999] –Occlusion [Toh & Forrest 1990, Wixson 1993] –Multi-modal depth or motion [Spoerri & Ullman 1987, Little & Gillett 1990] –Changes in intensity, color, texture, … [Canny 1986, Wang & Binford 1994] GLOBAL –“Continuity of discontinuities” [Marr & Poggio 1979] –Changes in depth or motion field [ Thompson et al. 1982, Birchfield & Tomasi 1998]

Complications from Sampling Amount of discontinuity Curvature ?

Finding and Using Discontinuities Depth discontinuities from stereo Maximum flow for stereo and motion Motion discontinuities from a sequence Using discontinuities to track heads

Depth Discontinuities from a Stereo Pair Use dynamic programming to minimize: intensity L R disparity lamp wall pixel dissimilarity discontinuity penalty epipolar constraint

Discontinuity Penalty u(l ) i penalty occlusion length u(3) = 3u(1) u(3) < 3u(1) BAD GOOD l i L R L R BAD disparity pixel intensity

Stereo Algorithm Matches pixels directly with convex penalty function (no preprocessing or windows) Handles large untextured regions Solves sampling problem Handles slanted surfaces Is fast

Handling Untextured Regions Our constraint: Depth discontinuities lie to a particular side of intensity edge Discontinuities in left (right) scanline lie to the left (right) of intensity edge Assumption: Depth discontinuities are accompanied by intensity edge Naïve constraint: Depth discontinuities lie near intensity edge Intensity edges (x-derivative with low threshold) With naïve constraint With our constraint

Discontinuity Lies to the Left of an Edge near object far object left camera right camera Obvious matches Physical setup Matches violate assumption Matches are consistent with assumption edge depth discontinuity

The Problem of Image Sampling Absolute difference Our measure Left scanlineRight scanline

A Dissimilarity Measure That is Insensitive to Image Sampling d(x L,x R ) xLxL xRxR d(x L,x R ) = min{d(x L,x R ),d(x R,x L )}Our dissimilarity measure:

Analysis of Dissimilarity Measure Concave/convex regions: measure is guaranteed to work Inflection points: measure works in practice when lens is defocused to remove aliasing (inflection points  linear {convex, concave}) intensity function

Analysis of Dissimilarity Measure our measure absolute difference maximum average T c = 1 fcfc cutoff frequency As a function of lens defocus At a hypothesized disparity of 10 pixels aliasing { defocus

Speeding Computation D isparity map occlusion depth discontinuity Pruning bad nodes Standard Ours maximum disparity time Fast postprocessing TWO STEPS: 1. Reliable overruns unreliable 2. Background overruns foreground RIGHT LEFT

Handling Slanted Surfaces Independent scanlines: no coherence Postprocessing: forbid propagation if disparity changes by just one level disparity column

Results

More Results [Images from JISCT data set]

Finding and Using Discontinuities Depth discontinuities from stereo Maximum flow for stereo and motion Motion discontinuities from a sequence Using discontinuities to track heads

Motion Discontinuities from a Sequence Similar to stereo (matching pixels to fit piecewise-smooth function) But different (2D search, non-rigid transformation, many images) How far can we get using sparse features?

Motion Video

Difficulties of Image Sequences Instantaneous velocity from sampled positions Accumulation of evidence Frame of reference, motion model time position ? feature position time feature background threshold position time feature 2 feature 1 background threshold

Image Strain

Motion Algorithm Select and track features Group Trace boundaries between groups

Finding and Using Discontinuities Depth discontinuities from stereo Maximum flow for stereo and motion Motion discontinuities from a sequence Using discontinuities to track heads

Why Maximum Flow? Our stereo algorithm (Dynamic programming) –Good results on difficult images –Fast –But, suboptimal (processes rows, then columns independently) Newer algorithms (Maximum flow) –Set up graph, find maximum flow --- minimum cut yields disparity or motion map –Able to look at whole image

Minimizing a 2D Cost Function Minimize: disparity 2D: GLOBAL disparity pixel ? 1D: Global u(l ) p,q Discontinuity penalty: l p,q minimum cut = disparity surface solves LOCAL Local (GOOD) (BAD) 

Maximum Flow for Stereo and Motion Global algorithm [Roy & Cox 1998, Ishikawa & Geiger 1998] Local algorithm [Boykov et al. 1998] penalty BAD discontinuity amount GOOD

Challenges for Maximum Flow … and slant of table With our intensity variation constraint and large penalty With small penaltyWith large penalty … and slant of surfaces

Finding and Using Discontinuities Depth discontinuities from stereo Maximum flow for stereo and motion Motion discontinuities from a sequence Using discontinuities to track heads

Problem TILT PAN ZOOM CHALLENGES: * rotation * multiple people * zoom APPLICATIONS: * video conferencing * distance learning

Previous Methods 3D RotationMoving people in background Skin color: N? Background subtraction: YN Template: NY Contour: YN

Head Tracking Algorithm MODEL (x,y)  Ellipse: vertical aspect ratio = 1.2  state s = (x,y,  ) ellipse in frame t OUTPUT color module Model Current Intersection SkinHair LOCAL SEARCH gradient module gradient ellipse normal ellipse in frame t INPUT frame t-1

Evaluation of Head Tracker 1. Tracks head in real time on standard hardware 2. Insensitive to - full 360-degree out-of-plane rotation - arbitrary camera movement (including zoom) - multiple moving people - severe but brief occlusion - hair/skin color, hair length, facial hair, glasses

Headtracker Video

Finding and Using Discontinuities Depth discontinuities from stereo Maximum flow for stereo and motion Motion discontinuities from a sequence Using discontinuities to track heads

Contributions Precise, unified definitions and relationship Pixel-to-pixel stereo algorithm [ICCV 1998] –good results on difficult images intensity variation constraint dissimilarity measure [PAMI 1998] preserving slant –fast (pruning search nodes, postprocessor) Motion discontinuities (sparse features, image strain, grouping algorithm) Maximum flow techniques (edge weights for global algorithm, smoothness, left border ; applied to motion) Elliptical head tracker [Asilomar 1997, CVPR 1998] –advanced state of the art by tracking people undergoing 3D rotation in front of a dynamic background

Appendix: Extra slides

Comparison of Our Algorithm with Maxflow Our algorithm Local maxflow algorithm

Depth Discontinuity Proposition 1: A depth discontinuity is a point in the image plane whose projection ray grazes a surface in the world. object image plane focal point Depth discontinuities

Definition of Jump The jump of a discontinuity is the smallest  >0 such that there is a  >0 for which, if ||x-x 0 ||  x f(x) x0x0   jump

Comparison with Intensity Edges Original imageIntensity edgesDepth discontinuities More closely tied to actual object boundaries Not distracted by change in albedo (reflectance) change in lighting (e.g., shadows)

Comparison with Layers Original imageMotion discontinuities

Comparison with Segmentation A plausible segmentation Depth discontinuities Original image Well-defined (not task-dependent) Curves are not necessarily closed

Epipolar Constraint Left cameraRight camera world point center of projection epipolar plane epipolar line

Comparison of Dissimilarity Measure with Other Methods absolute difference our measure spurious matches subpixel resolution Computing time

Propagating Information Between Scanlines Calculate reliabilities & threshold Disparities: [ ] T Reliabilities: [ ] T unreliable < reliable < very reliable Propagate very reliable when –neighbor unreliable, OR –disparity(neighbor) > disparity(pixel) + 2 Repeat in x-direction Fast (30% increase instead of 800%)

Features on Freethrow

Maximum Flow Finds greatest shipping rate from source to sink w/o violating capacity constraints Flow network G=(V,E) is directed graph with edge capacities c(u,v) > 0, source s, and sink t Flow f:E R Value of flow |f| =  f(s,v) Find f* = arg max |f| Cut (S,T) is partition of V s.t. s in S and t in T capacity of cut c(S,T) Max-flow min-cut theorem: There exists a cut (S,T) s.t. |f*| = c(S,T)

Blowup of Meter Global algorithm Local algorithm

Maximum Flow and Occlusion [from University of Tsukuba, Japan]

Maximum Flow and Left Image Border