Special Applications
If we assume a specific application, many image-based rendering tools can be improved
– The Lumigraph assumed the special domain of orbiting small objects
Two applications stand out:
– Architecture, because people wish to capture models of cities
– Faces, because there is no other good way to do it, and pictures of faces are essential to various fields (movies, advertising, and so on)

Hybrid Image/Geometry for Architecture
Most buildings:
– Are made up of common, simple architectural elements (boxes, domes, …)
– Have many implicit constraints, like parallel lines and right angles
We can exploit the simple geometry and constraints to simplify the image-based rendering problem
Hybrid approaches build simple geometric models from images, then texture them with the same images

Façade (Debevec, Taylor, Malik 1996)
Start with a sparse set of images of a building (from one to tens of images)
With an interactive photogrammetric modeling program, a user builds an approximate 3D model of the building
Generate view-dependent texture maps from the images
Use model-based stereo to reconstruct additional detail (doorways, window ledges, …)
Render from any view (assuming the images see all the surfaces)

Photogrammetric Modeling
User specifies which parametric blocks make up the model, and the constraints between them
User marks edges on the model and the corresponding edges in images
The system determines camera locations and model parameters using a minimization algorithm
Result: a 3D model of the approximate geometry
– The blocks used determine the accuracy of the model
– Details can be left out – later stages will catch them

View-Dependent Textures
The images can be projected onto the 3D model to determine which parts of the images correspond to which parts of the model
– Hardware projective texture mapping can make this very fast
More than one image may see a point on the model
– Blend the image values
– Use weights that favor images from cameras closer to the viewing direction (alpha blending in hardware)
Some points may be seen in no image – use hole filling (or creative photography)
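The weighting idea in the bullets above can be sketched in a few lines. This is a hypothetical cosine-falloff scheme (function name and falloff are illustrative, not Façade's exact weighting):

```python
import numpy as np

def view_dependent_weights(view_dir, camera_dirs):
    """Blend weights for per-camera textures: cameras whose capture
    direction is closer to the novel view direction get more weight."""
    view_dir = view_dir / np.linalg.norm(view_dir)
    weights = []
    for cam in camera_dirs:
        cam = cam / np.linalg.norm(cam)
        # Cosine of the angle between the novel view and the capture
        # direction, clamped so cameras facing away contribute nothing
        weights.append(max(np.dot(view_dir, cam), 0.0))
    w = np.array(weights)
    return w / w.sum() if w.sum() > 0 else w

# A novel view straight down the z axis, two capture cameras
w = view_dependent_weights(np.array([0.0, 0.0, 1.0]),
                           [np.array([0.0, 0.0, 1.0]),    # aligned with view
                            np.array([1.0, 0.0, 1.0])])   # 45 degrees off
```

The normalized weights can then drive hardware alpha blending of the projected textures.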

Model-Based Stereo
Blocks do not capture all the details, such as sunken doorways
– This introduces errors in the new renderings
– View-dependent texture mapping helps if there is a view close to the right direction
The approximate model gives major hints to an automatic shape-from-stereo algorithm
– Find correspondences between points in different images
– Add depth information to the texture
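The correspondence step can be illustrated with a toy block-matching search along a row. Façade's model-based stereo first warps one image through the approximate model, which this sketch omits; the function and images are illustrative only:

```python
import numpy as np

def best_disparity(left, right, x, y, patch=1, max_disp=4):
    """For pixel (x, y) in the left image, find the horizontal shift of
    the best-matching patch in the right image, by minimizing the
    sum of squared differences over candidate disparities."""
    ref = left[y - patch:y + patch + 1, x - patch:x + patch + 1]
    best, best_err = 0, np.inf
    for d in range(max_disp + 1):
        cand = right[y - patch:y + patch + 1,
                     x - d - patch:x - d + patch + 1]
        err = np.sum((ref - cand) ** 2)
        if err < best_err:
            best, best_err = d, err
    return best

# Toy pair: a single bright feature shifted 2 pixels between views
left = np.zeros((9, 9)); left[4, 5] = 1.0
right = np.zeros((9, 9)); right[4, 3] = 1.0
d = best_disparity(left, right, x=5, y=4)
```

The recovered disparity is then converted to depth and stored with the texture.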

Other Systems
Other systems do a fully automated job
– Find correspondences from "stable" features
– Compute 3D locations of some points/edges
– Fit planes to the data
– Feed the results back to improve the fit
Works well in architectural environments (where flat surfaces abound)
Must deal with occlusion – in most environments, buildings hide each other in different views
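The plane-fitting step has a standard least-squares solution: the plane normal is the singular vector for the smallest singular value of the mean-centered points. A minimal sketch (function name and data are illustrative):

```python
import numpy as np

def fit_plane(points):
    """Least-squares plane through 3D points.
    Returns (centroid, unit normal); the normal is the right singular
    vector associated with the smallest singular value."""
    pts = np.asarray(points, dtype=float)
    centroid = pts.mean(axis=0)
    # Rows of Vt are orthonormal directions, sorted by singular value
    _, _, Vt = np.linalg.svd(pts - centroid)
    return centroid, Vt[-1]

# Noiseless points lying on the plane z = 0
pts = [[0, 0, 0], [1, 0, 0], [0, 1, 0], [1, 1, 0], [2, 1, 0]]
c, n = fit_plane(pts)
```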

Rendering Faces
Producing realistic, expressive animated faces is (probably) the holy grail of rendering
It is hard for many reasons:
– Skin has extremely complex interactions with light
– The face deforms due to a vast number of muscles and complex tissues
– Must manage hair (including facial hair), teeth, tongue, eyes, and ears
– The mapping from emotions to facial expressions is complex
– Most of all, viewers are exceptionally good at spotting problems

Overview
A broad range of approaches:
– Physiologically- and physically-based models
– Pure image-based approaches
The current state of the art depends on the application:
– Capturing and then reproducing (in 3D) a fixed sequence
– Producing static 3D models of faces, real and imagined
– Dubbing speech with correct mouth and facial motions
– Producing 3D facial animations
Reference: Parke & Waters, "Computer Facial Animation"

Why Image-Based?
Rendering realistic-looking skin from first principles is currently intractable
Taking pictures of a person and applying them to a model is much simpler
Some pure image-based models may avoid an accurate geometric model entirely
The only plausible current technology for capturing moving faces is video (or film)
Key technology: Cyberware head scanners capture the geometry and texture of a static face

Geometry, Physics and Images (Lee, Terzopoulos and Waters 1995)
Use a Cyberware scanner to get cylindrical depth information and textures
Fit a pre-defined mesh to the data
– Use landmarks like the corners of the eyes, mouth, etc.
Attach muscles and estimate skeletal structure
Animate using a spring-mass model
Render the 3D model with the captured texture
Add separate geometry for teeth, eyes, hair, and shoulders
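The spring-mass animation step can be sketched as one symplectic-Euler update over the mesh's springs. This is a generic sketch under simple assumptions (uniform mass, linear springs); it ignores the muscle forces and layered tissue of the actual system:

```python
import numpy as np

def spring_mass_step(pos, vel, rest_len, edges, k, damping, mass, dt):
    """One symplectic-Euler step of a spring-mass mesh.
    pos, vel: (N, 3) arrays; edges: list of (i, j) vertex index pairs."""
    force = np.zeros_like(pos)
    for (i, j), L0 in zip(edges, rest_len):
        d = pos[j] - pos[i]
        length = np.linalg.norm(d)
        # Hooke's law along the spring direction
        f = k * (length - L0) * d / length
        force[i] += f
        force[j] -= f
    force -= damping * vel
    vel = vel + dt * force / mass      # update velocity first...
    pos = pos + dt * vel               # ...then position (symplectic)
    return pos, vel

# Two masses joined by one stretched spring relax toward rest length 1
pos = np.array([[0.0, 0.0, 0.0], [2.0, 0.0, 0.0]])
vel = np.zeros_like(pos)
for _ in range(400):
    pos, vel = spring_mass_step(pos, vel, [1.0], [(0, 1)],
                                k=5.0, damping=0.5, mass=1.0, dt=0.01)
```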

Video Rewrite (Bregler, Covell, Slaney 1997)
Aim: take video of a person and make them say something else, with correct mouth and facial movements
– For example, do much better film dubbing
Works at the level of phonemes and visemes: the audio and visual subunits of speech
– A training phase associates visemes with phonemes
– A reconstruction phase synthesizes new video by inserting a reconstructed viseme sequence into an existing video background

Training Overview
Take video and audio of someone speaking, and break it into pieces
– Uses Hidden Markov Models (HMMs) for audio segmentation
– Phonemes are standard in the speech community
– Different models for men and women
Use vision techniques to estimate and correct for changes in head pose
Output is a set of triphones (three phonemes in a row) with associated viseme sequences

Synthesis Overview
Take new speech and break it into phonemes (using the training HMM)
Search for appropriate sets of visemes
Adjust the timing of the visemes to match the audio timing
Blend neighbors together to get seamless mouth motion
Remove the mouth and jaw from the background video
Estimate pose, morph the visemes, and blend them into the background
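The timing-adjustment step amounts to resampling a viseme's frames to the target audio duration. A toy nearest-neighbor version (the real system morphs between frames rather than duplicating them):

```python
def retime(frames, target_len):
    """Stretch or squeeze a frame sequence to target_len frames by
    nearest-neighbor resampling of the original frame indices."""
    n = len(frames)
    return [frames[min(n - 1, round(i * (n - 1) / max(target_len - 1, 1)))]
            for i in range(target_len)]

# A 3-frame viseme stretched to fill 5 frames of audio
stretched = retime(["open", "mid", "closed"], 5)
```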

Making Faces (Guenter, Grimm, Wood, Malvar, Pighin 1997)
Aim: take video of someone, and generate a representation that allows reconstruction from any view
Take video, with multiple cameras, of a person speaking with colored dots glued all over their face
Track correspondences between dots to determine dot motion over time
Generate a 3D model by scanning the head (with the dots)

Making Faces (cont.)
For each frame, compute model vertex motion from the (known) locations of the dots
– Two-stage fit: first determine motion for a grid of points, then determine vertex motion
To render, simply texture-map the 3D fitted model
– Textures come from the video streams
– Dots must be removed from the textures, and the resulting holes filled in
– Blend the textures into one view-independent texture
Brute force, and not real time

Facial Expressions From Photographs (Pighin, Hecker, Lischinski, Szeliski, Salesin 1998)
Aim: generate static 3D models of various facial expressions, then morph between them to animate
Five pictures of each expression for each face
Fit a standard 3D model to the pictures
– Uses hand-marked features (lots of them)
– For good morphing, marked features should correspond to facial structures (eyes, mouth, eyebrows, etc.)
– Allows textures to be mapped onto vertices
– Use view-dependent textures when rendering

Facial Expressions From Photographs (cont.)
The outcome of fitting is a mapping from facial expression to deformations of the model
Blend different deformations to get different expressions
– Simple transitions to go from one expression to another
– Different blends for different regions to get combinations of expressions (e.g. a forced smile)
Deformations can be mapped from one person to another
Animations are generated by specifying a sequence of expressions
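Region-weighted blending of deformations can be sketched directly. The vertices, masks, and deformation offsets below are made up purely for illustration:

```python
import numpy as np

# Hypothetical 4-vertex face; each deformation is a per-vertex offset
neutral = np.zeros((4, 3))
smile = np.array([[0.0, 0.0, 0.0], [0.0, 0.0, 0.0],
                  [0.1, 0.0, 0.0], [-0.1, 0.0, 0.0]])
worried_brow = np.array([[0.0, -0.05, 0.0], [0.0, -0.05, 0.0],
                         [0.0, 0.0, 0.0], [0.0, 0.0, 0.0]])

# Region masks: vertices 0-1 are the brow, vertices 2-3 the mouth
brow = np.array([1.0, 1.0, 0.0, 0.0])[:, None]
mouth = np.array([0.0, 0.0, 1.0, 1.0])[:, None]

# "Forced smile": a smiling mouth combined with a worried brow
forced_smile = neutral + mouth * smile + brow * worried_brow
```

Per-region weights let incompatible expressions coexist on one face, which a single global blend cannot do.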

A Morphable Model (Blanz, Vetter 1999)
Create a space of possible face geometries and textures
– Points in the space are vectors of vertex locations and colors at vertices
Apply statistical techniques to describe probable faces, and to generate completely new faces
– Start with 200 faces, with corresponding shape (S_i) and texture (T_i) vectors
– Compute the mean vector and covariance matrix
– Assume a joint normal distribution
– Possible to evaluate the probability of a given face
– Possible to sample from the distribution to get artificial faces
– The covariance can be diagonalized to find principal components
– The distribution guides the fitting process for new real faces
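The statistical machinery above is ordinary PCA on stacked face vectors. A sketch with random stand-in data (real S_i and T_i vectors would come from registered scans; the dimensions here are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-in "scanned faces": each row stacks vertex coordinates
# (and could also stack per-vertex colors) into one long vector
n_faces, dim = 200, 30
faces = rng.normal(size=(n_faces, dim))

# Mean face, and principal components of the deviations from it
mean_face = faces.mean(axis=0)
deviations = faces - mean_face
# SVD of the deviations diagonalizes the covariance directly
U, s, Vt = np.linalg.svd(deviations, full_matrices=False)
components = Vt                       # rows are principal directions
std_devs = s / np.sqrt(n_faces - 1)   # spread along each direction

# Sample an artificial face: draw coefficients from the fitted normal
# distribution and move along the principal directions
coeffs = rng.normal(size=len(std_devs)) * std_devs
new_face = mean_face + coeffs @ components
```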

A Morphable Model: Facial Types
Aim: characterize particular facial types
– Male vs. female, fat vs. thin, and many others
Approach: find the "direction" in which special faces vary from the neutral face
– Mark some faces with the given feature (e.g. all males)
– Look at where those faces lie in the space compared to the neutral face
– This implies which way to move from the neutral face to get that feature
– Moving along that axis goes, for instance, from androgynous to male and on to extremely male
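The attribute "direction" is just the difference between the mean of the labeled faces and the overall mean face. A sketch on synthetic vectors with the attribute planted along one known axis (data and function are illustrative):

```python
import numpy as np

rng = np.random.default_rng(1)

# Stand-in face vectors with an "attribute" baked into component 0
faces = rng.normal(size=(100, 10))
labels = faces[:, 0] > 0              # faces marked as having the attribute

mean_face = faces.mean(axis=0)
# Direction in face space along which the labeled faces differ
# from the average face
attribute_axis = faces[labels].mean(axis=0) - mean_face

def adjust(face, amount):
    """Move a face along the attribute axis; positive amounts
    exaggerate the attribute, negative amounts remove it."""
    return face + amount * attribute_axis

neutral = mean_face
exaggerated = adjust(neutral, +2.0)
```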

A Morphable Model (cont.)
Match a face to the model using iterative search
– Find which point in the space corresponds to a given face
– Start with a human-specified guess
– Repeatedly move the point according to the difference between a rendered image and the photograph
Add new faces to the model using 3D scans
– First find the closest face in the existing model
– Improve the model point using optical flow, applied iteratively and hierarchically in difficult cases

Acquiring the Reflectance Field (Debevec, Hawkins, Tchou, Duiker, Sarokin, Sagar 2000)
Aim: determine how a face's appearance changes under different lighting
– All previous work assumes fixed lighting
First consider re-rendering from fixed views:
– Use a light stage to capture images, from fixed views, of a face lit from various directions
– To render under novel lighting: weight each captured image by the strength of the incoming illumination from its direction, then composite the weighted images
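The fixed-view relighting step is a weighted sum, justified by the linearity of light transport. A sketch with toy 2x2 "images" (array shapes and function name are illustrative):

```python
import numpy as np

def relight(basis_images, light_intensities):
    """Re-render a face under novel lighting by compositing light-stage
    basis images, each weighted by how strongly the new environment
    illuminates from that capture direction."""
    basis = np.asarray(basis_images, dtype=float)    # (n_lights, H, W)
    w = np.asarray(light_intensities, dtype=float)   # (n_lights,)
    # Linearity of light transport: the image under a sum of lights
    # is the sum of the images under each light
    return np.tensordot(w, basis, axes=1)

# Two toy basis images: the face lit from two different directions
img_a = np.array([[1.0, 0.0], [0.0, 0.0]])
img_b = np.array([[0.0, 1.0], [0.0, 0.0]])
out = relight([img_a, img_b], [0.5, 2.0])
```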

Acquiring the Reflectance Field (cont.)
For rendering with novel lighting and novel views:
– Use a model for how skin reflects light: specular scattering from the surface oil layer, plus diffuse reflection from subsurface scattering
– Fit some parameters using data from the forehead and polarizing filters
– Separate the diffuse and specular components using color matching
– Generate a 3D model using structured light
– Project images onto the model and re-render using the skin illumination model
– Blend images from both original camera locations

Faces Summary
We have reasonable models for:
– Capturing a head and playing it back
– Dubbing video
– Generating new, static heads, and new views of those heads
– Rendering a known static head under different lighting
– Showing different facial expressions on a known head
There's still a way to go: aim for an artificial bearded man giving a speech under dynamic theatrical lighting, in real time (recreating Shakespeare)