1 Reconstructing head models from photograph for individualized 3D-audio processing Matteo Dellepiane, Nico Pietroni, Nicolas Tsingos, Manuel Asselot,

Slides:



Advertisements
Similar presentations
Progressive Perceptual Audio Rendering of Complex Scenes Thomas Moeck - Nicolas Bonneel - Nicolas Tsingos - George Drettakis - Isabelle Viaud-Delmon -
Advertisements

The fundamental matrix F
For Internal Use Only. © CT T IN EM. All rights reserved. 3D Reconstruction Using Aerial Images A Dense Structure from Motion pipeline Ramakrishna Vedantam.
1 Image Authentication by Detecting Traces of Demosaicing June 23, 2008 Andrew C. Gallagher 1,2 Tsuhan Chen 1 Carnegie Mellon University 1 Eastman Kodak.
Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo.
Face Alignment with Part-Based Modeling
Temporally Coherent Completion of Dynamic Shapes Hao Li, Linjie Luo, Daniel Vlasic, Pieter Peers, Jovan Popović, Mark Pauly, Szymon Rusinkiewicz ACM Transactions.
Multiple People Detection and Tracking with Occlusion Presenter: Feifei Huo Supervisor: Dr. Emile A. Hendriks Dr. A. H. J. Stijn Oomes Information and.
Inter-Surface Mapping John Schreiner, Arul Asirvatham, Emil Praun (University of Utah) Hugues Hoppe (Microsoft Research)
1 Face Synthesis M. L. Gavrilova. 2 Outline Face Synthesis From Modeling to Synthesis Facial Expression Synthesis Conclusions.
6/3/20151 Voice Transformation : Speech Morphing Gidon Porat and Yizhar Lavner SIPL – Technion IIT December
A new approach for modeling and rendering existing architectural scenes from a sparse set of still photographs Combines both geometry-based and image.
Exchanging Faces in Images SIGGRAPH ’04 Blanz V., Scherbaum K., Vetter T., Seidel HP. Speaker: Alvin Date: 21 July 2004.
Advanced Computer Vision Introduction Goal and objectives To introduce the fundamental problems of computer vision. To introduce the main concepts and.
SIGGRAPH Course 30: Performance-Driven Facial Animation Section: Markerless Face Capture and Automatic Model Construction Part 2: Li Zhang, Columbia University.
RECOGNIZING FACIAL EXPRESSIONS THROUGH TRACKING Salih Burak Gokturk.
A Study of Approaches for Object Recognition
Advanced Computer Graphics (Spring 2005) COMS 4162, Lecture 21: Image-Based Rendering Ravi Ramamoorthi
Face Recognition Based on 3D Shape Estimation
HK UST * Hong Kong University of Science and Technology HK UST Modeling Hair from Multiple Views Y. Wei, E. Ofek, L. Quan and H. Shum.
Active Appearance Models Computer examples A. Torralba T. F. Cootes, C.J. Taylor, G. J. Edwards M. B. Stegmann.
CSCE 641: Computer Graphics Image-based Rendering Jinxiang Chai.
David Luebke Modeling and Rendering Architecture from Photographs A hybrid geometry- and image-based approach Debevec, Taylor, and Malik SIGGRAPH.
Computer vision: models, learning and inference
Projective Texture Atlas for 3D Photography Jonas Sossai Júnior Luiz Velho IMPA.
Modeling And Visualization Of Aboriginal Rock Art in The Baiame Cave
1 Intelligent Robotics Research Centre (IRRC) Department of Electrical and Computer Systems Engineering Monash University, Australia Visual Perception.
Automatic Registration of Color Images to 3D Geometry Computer Graphics International 2009 Yunzhen Li and Kok-Lim Low School of Computing National University.
Digital Face Replacement in Photographs CSC2530F Project Presentation By: Shahzad Malik January 28, 2003.
Light Using Texture Synthesis for Non-Photorealistic Shading from Paint Samples. Christopher D. Kulla, James D. Tucek, Reynold J. Bailey, Cindy M. Grimm.
Image-based Plant Modeling Zeng Lanling Mar 19, 2008.
Interactively Modeling with Photogrammetry Pierre Poulin Mathieu Ouimet Marie-Claude Frasson Dép. Informatique et recherche opérationnelle Université de.
Computer Graphics 2 In the name of God. Outline Introduction Animation The most important senior groups Animation techniques Summary Walking, running,…examples.
A Camera-Projector System for Real-Time 3D Video Marcelo Bernardes, Luiz Velho, Asla Sá, Paulo Carvalho IMPA - VISGRAF Laboratory Procams 2005.
Computer Vision Why study Computer Vision? Images and movies are everywhere Fast-growing collection of useful applications –building representations.
Object Recognition in Images Slides originally created by Bernd Heisele.
Access Control Via Face Recognition. Group Members  Thilanka Priyankara  Vimalaharan Paskarasundaram  Manosha Silva  Dinusha Perera.
Presented by Matthew Cook INFO410 & INFO350 S INFORMATION SCIENCE Paper Discussion: Dynamic 3D Avatar Creation from Hand-held Video Input Paper Discussion:
Machine Learning for Computer graphics Aaron Hertzmann University of Toronto Bayesian.
Lec 22: Stereo CS4670 / 5670: Computer Vision Kavita Bala.
12/7/10 Looking Back, Moving Forward Computational Photography Derek Hoiem, University of Illinois Photo Credit Lee Cullivan.
Segmental Brush Stroke Synthesis by Example Ryoichi Ando and Reiji Tsuruno Kyushu Univ, Japan. Came from Here !
An Automated Method for Geometric Reconstruction of Vertebrae from Clinical CT Scans Yifei Dai 10/19/2006.
Temporally Coherent Completion of Dynamic Shapes AUTHORS:HAO LI,LINJIE LUO,DANIEL VLASIC PIETER PEERS,JOVAN POPOVIC,MARK PAULY,SZYMON RUSINKIEWICZ Presenter:Zoomin(Zhuming)
Autonomous Robots Vision © Manfred Huber 2014.
Mobile Image Processing
Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.
Bundle Adjustment A Modern Synthesis Bill Triggs, Philip McLauchlan, Richard Hartley and Andrew Fitzgibbon Presentation by Marios Xanthidis 5 th of No.
Computer Vision, CS766 Staff Instructor: Li Zhang TA: Yu-Chi Lai
COS429 Computer Vision =++ Assignment 4 Cloning Yourself.
University of Washington v The Hebrew University * Microsoft Research Synthesizing Realistic Facial Expressions from Photographs Frederic Pighin Jamie.
Enhancing Image-Based Aging Approaches Olivier Clément Eric Paquette.
Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.
Image-Based Rendering Geometry and light interaction may be difficult and expensive to model –Think of how hard radiosity is –Imagine the complexity of.
Animation Animation is about bringing things to life Technically: –Generate a sequence of images that, when played one after the other, make things move.
What does it mean to us?.  History  Biometrics Defined  Modern Day Applications  Spoofing  Future of Biometrics.
Face Recognition Technology By Catherine jenni christy.M.sc.
A Framework for Perceptual Studies in Photorealistic Augmented Reality Martin Knecht 1, Andreas Dünser 2, Christoph Traxler 1, Michael Wimmer 1 and Raphael.
A facial recognition system is a computer application for automatically identifying or verifying a person from a digital image or a video frame from.
Deformation Modeling for Robust 3D Face Matching Xioguang Lu and Anil K. Jain Dept. of Computer Science & Engineering Michigan State University.
Acquiring, Stitching and Blending Diffuse Appearance Attributes on 3D Models C. Rocchini, P. Cignoni, C. Montani, R. Scopigno Istituto Scienza e Tecnologia.
Automating the Polarizing Filter
Recent developments on micro-triangulation
CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo.
Automatic Digitizing.
Image Based Modeling and Rendering (PI: Malik)
“man, how you define computer
Noah Snavely.
CMSC 426: Image Processing (Computer Vision)
Science of Crime Scenes
Presentation transcript:

1 Reconstructing head models from photograph for individualized 3D-audio processing Matteo Dellepiane, Nico Pietroni, Nicolas Tsingos, Manuel Asselot, Roberto Scopigno

2 Overview What is HRTF HRTF from head 3D model Our solution Results and future work

3 HRTF: what, why, how... WHAT: HRTF (Head Related Transfer Function) describes how a given sound wave input (parameterized as frequency and source location) is filtered by the diffraction and reflection properties of the head before the sound reaches the ear. HRTF depends mainly on the shape of the ears and (secondarily) of the face. So it’s individual and it can vary a lot from one subject to the other

4 HRTF: what, why, how... WHY: In order to perform realistic 3D sound rendering it’s necessary to know in advance the HRTF of the user, and apply it as a “filter”. Main applications: everything related to realistic rendering, for example videogames

5 HRTF: what, why, how... HOW: There are two main methods to calculate HRTF: Anechoic chamber measurements Scattering calculation from 3D models

6 HRTF: what, why, how... Anechoic chamber measurements: reference method precise you need anechoic chamber not easy to perform.

7 HRTF: what, why, how... Scattering calculation: Finite element methods very precise veeeery slow needs watertight and very “clean” models

8 HRTF: what, why, how... Scattering calculation: “Instant GPU sound scattering” [Tsingos et al.] accurate enough very fast indipendent from model topology only “first bounce” calculated

9 Our Target How to provide 3D information? BEST SCENARIO: the user provides some material, and receives his individual HRTF “by mail”. This HRTF is applied for every videogame or application he uses, in order to provide best realism. Needs: Robust method Precise reconstruction Least possible intervention by the user

10 HRTF from head 3D model How to provide 3D information? 1)Measures 2)3D Scanning very precise (nearly 0.3 mm error) 3D scanner technology is still expensive  humans are not the best subjects for scanning

11 How to get 3D model of Head&Ears? 3D SCANNING “Improved 3D Head Reconstruction System based on Combining Shape-From-Silhouette with Two- Stage Stereo Algorithm” [Fujimura et al.] or using 3D laser scanning Accurate and automatic Very resembling results. Precise 29 digital cameras and 3 projectors!!!

12 How to get 3D model of Head&Ears? IMAGE-BASED TECHNIQUES: “A morphable model for synthesis of 3D faces” [Blanz et al.] Accurate and automatic Very resembling results. No estimation of ears shape. Scaling issue.

13 How to get 3D model of Head&Ears? IMAGE BASED: “Photo-realistic 3D Head Modeling Using Multi-view Images” They perform two-pass bundle adjustments to reconstruct a photo-realistic 3D head model: compute several feature points of a target 3D head and use them to modify a generic head. Reconstruct the whole head from few photos Automatic Low accuracy No estimation of ears shape. Scaling issue.

14 Our Solution: overview

Gathering the 3d model Photos + keypoints Features extraction Dummy selection Rigid alignement Single-view morph Merging to a multi-view morph Ears Morph Texturing

photouploader NEEDS: 3 photos 6/7 key-points per image Global scaling factor (nose length)

Visual Computing Lab, Vienna Meeting, 9-11 May Features extraction Regarding face, some work has been done, and can be used to extract features for the face and head. [Au et al, Synthesizing 3D Facial Models from Photograph] Regarding ear, most of the work on ear biometry is in the field of detection and security, so the extracted features are not directly related to ear shape. Moreover, proposed biometric measures are not easy to be used due to great differences between ears, and difficulty in finding reference points. [Jeges,Mate, Model based human ear localization and feature extraction]

18 Features extraction Contours extraction algorithm using snakes.

19 Dummy selection Choose the dummy that best fits (using morphing) extracted ears.

20 Dummy morphing Overview: INPUT: 3 extracted masks of head photos (dx,sx,front) aligned with the dummy head. one cropped image for each of ears and his extracted masks. OUTPUT: Recontructed colored Dummy model with particular attention to ear reconstruction. Placement of virtual microphones for HRTF calculation.

21 Morphing to target shape We use a variant of “Texture Design Using a Simplicial Complex of Morphable Textures” [Wojciech Matusik] Overlap the edge-detection of input image with the edge detection of rendered geometry. Calculate image Warping by minimizing an energy function in order to align features. Project warpings back to 3D model

22 Head Morphing Same approach with some difference: We choose to use the image mask for robustness. Adding keypoints matching to energy function.

23 Dummy reconstruction Symmetrization:

24 Geometric validation Scanned vs reconstructed

25 Preliminary assessment of HRTF Scanned vs reconstructed

26 Video

27 Conclusions AUTOMATIC HRTF calculation. robust. Low input from the user. Quick (5 min for reconstruction + 4 hours scattering calculation)

28 Future work Improve avatar generation. GPU morphing implementation. Improvement of Dummy library. Perception tests

29 Reconstructing head models from photograph for individualized 3D-audio processing Matteo Dellepiane, Nico Pietroni, Nicolas Tsingos, Manuel Asselot, Roberto Scopigno

30 QUESTIONS?