Computer Vision Lab Seoul National University Keyframe-Based Real-Time Camera Tracking Young Ki BAIK Vision seminar : Mar. 5. 2010 Computer Vision Lab.

Slides:



Advertisements
Similar presentations
3D Model Matching with Viewpoint-Invariant Patches(VIP) Reporter :鄒嘉恆 Date : 10/06/2009.
Advertisements

DONG XU, MEMBER, IEEE, AND SHIH-FU CHANG, FELLOW, IEEE Video Event Recognition Using Kernel Methods with Multilevel Temporal Alignment.
RGB-D object recognition and localization with clutter and occlusions Federico Tombari, Samuele Salti, Luigi Di Stefano Computer Vision Lab – University.
For Internal Use Only. © CT T IN EM. All rights reserved. 3D Reconstruction Using Aerial Images A Dense Structure from Motion pipeline Ramakrishna Vedantam.
산업 및 시스템 공학과 통신시스템 및 인터넷보안연구실 김효원 Optimizing Tree Reconfiguration for Mobile Target Tracking in Sensor Networks Wensheng Zhang and Guohong Cao.
Parallel Tracking and Mapping for Small AR Workspaces Vision Seminar
Kiyoshi Irie, Tomoaki Yoshida, and Masahiro Tomono 2011 IEEE International Conference on Robotics and Automation Shanghai International Conference Center.
Pose Estimation TUE. Kim Kyungkoo Active Grasp.
Robust Object Tracking via Sparsity-based Collaborative Model
Segmentation-Free, Area-Based Articulated Object Tracking Daniel Mohr, Gabriel Zachmann Clausthal University, Germany ISVC.
A KLT-Based Approach for Occlusion Handling in Human Tracking Chenyuan Zhang, Jiu Xu, Axel Beaugendre and Satoshi Goto 2012 Picture Coding Symposium.
Instructor: Mircea Nicolescu Lecture 13 CS 485 / 685 Computer Vision.
Effective Image Database Search via Dimensionality Reduction Anders Bjorholm Dahl and Henrik Aanæs IEEE Computer Society Conference on Computer Vision.
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
Object retrieval with large vocabularies and fast spatial matching
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
1 Abstract This paper presents a novel modification to the classical Competitive Learning (CL) by adding a dynamic branching mechanism to neural networks.
A Study of Approaches for Object Recognition
Distinctive image features from scale-invariant keypoints. David G. Lowe, Int. Journal of Computer Vision, 60, 2 (2004), pp Presented by: Shalomi.
Effective Gaussian mixture learning for video background subtraction Dar-Shyang Lee, Member, IEEE.
MULTIPLE MOVING OBJECTS TRACKING FOR VIDEO SURVEILLANCE SYSTEMS.
A Performance and Energy Comparison of FPGAs, GPUs, and Multicores for Sliding-Window Applications From J. Fowers, G. Brown, P. Cooke, and G. Stitt, University.
IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 11, NOVEMBER 2011 Qian Zhang, King Ngi Ngan Department of Electronic Engineering, the Chinese university.
Scale-Invariant Feature Transform (SIFT) Jinxiang Chai.
FLANN Fast Library for Approximate Nearest Neighbors
REALTIME OBJECT-OF-INTEREST TRACKING BY LEARNING COMPOSITE PATCH-BASED TEMPLATES Yuanlu Xu, Hongfei Zhou, Qing Wang*, Liang Lin Sun Yat-sen University,
Face Recognition and Retrieval in Video Basic concept of Face Recog. & retrieval And their basic methods. C.S.E. Kwon Min Hyuk.
(Fri) Young Ki Baik Computer Vision Lab.
Webcam-synopsis: Peeking Around the World Young Ki Baik (CV Lab.) (Fri)
Yuping Lin and Gérard Medioni.  Introduction  Method  Register UAV streams to a global reference image ▪ Consecutive UAV image registration ▪ UAV to.
Distinctive Image Features from Scale-Invariant Keypoints By David G. Lowe, University of British Columbia Presented by: Tim Havinga, Joël van Neerbos.
Factor Graphs Young Ki Baik Computer Vision Lab. Seoul National University.
Final Exam Review CS485/685 Computer Vision Prof. Bebis.
Reading Notes: Special Issue on Distributed Smart Cameras, Proceedings of the IEEE Mahmut Karakaya Graduate Student Electrical Engineering and Computer.
Mutual Information-based Stereo Matching Combined with SIFT Descriptor in Log-chromaticity Color Space Yong Seok Heo, Kyoung Mu Lee, and Sang Uk Lee.
“Low-Power, Real-Time Object- Recognition Processors for Mobile Vision Systems”, IEEE Micro Jinwook Oh ; Gyeonghoon Kim ; Injoon Hong ; Junyoung.
KinectFusion : Real-Time Dense Surface Mapping and Tracking IEEE International Symposium on Mixed and Augmented Reality 2011 Science and Technology Proceedings.
Orderless Tracking through Model-Averaged Posterior Estimation Seunghoon Hong* Suha Kwak Bohyung Han Computer Vision Lab. Dept. of Computer Science and.
A General Framework for Tracking Multiple People from a Moving Camera
Dynamic 3D Scene Analysis from a Moving Vehicle Young Ki Baik (CV Lab.) (Wed)
Particle Filters.
Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic and Andrew Zisserman.
Zhenbao Liu 1, Shaoguang Cheng 1, Shuhui Bu 1, Ke Li 2 1 Northwest Polytechnical University, Xi’an, China. 2 Information Engineering University, Zhengzhou,
Handover and Tracking in a Camera Network Presented by Dima Gershovich.
Tao Lin Chris Chu TPL-Aware Displacement- driven Detailed Placement Refinement with Coloring Constraints ISPD ‘15.
Vision-based human motion analysis: An overview Computer Vision and Image Understanding(2007)
Visual SLAM Visual SLAM SPL Seminar (Fri) Young Ki Baik Computer Vision Lab.
CSCE 643 Computer Vision: Extractions of Image Features Jinxiang Chai.
Computer Science and Engineering Parallelizing Defect Detection and Categorization Using FREERIDE Leonid Glimcher P. 1 ipdps’05 Scaling and Parallelizing.
Fields of Experts: A Framework for Learning Image Priors (Mon) Young Ki Baik, Computer Vision Lab.
Real-Time Simultaneous Localization and Mapping with a Single Camera (Mono SLAM) Young Ki Baik Computer Vision Lab. Seoul National University.
Peter Henry1, Michael Krainin1, Evan Herbst1,
3D reconstruction from uncalibrated images
Spatiotemporal Saliency Map of a Video Sequence in FPGA hardware David Boland Acknowledgements: Professor Peter Cheung Mr Yang Liu.
Visual Odometry David Nister, CVPR 2004
Using Cross-Media Correlation for Scene Detection in Travel Videos.
Vision-based SLAM Enhanced by Particle Swarm Optimization on the Euclidean Group Vision seminar : Dec Young Ki BAIK Computer Vision Lab.
Updating Designed for Fast IP Lookup Author : Natasa Maksic, Zoran Chicha and Aleksandra Smiljani´c Conference: IEEE High Performance Switching and Routing.
1 An Efficient Optimal Leaf Ordering for Hierarchical Clustering in Microarray Gene Expression Data Analysis Jianting Zhang Le Gruenwald School of Computer.
Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Jian-Lin Kuo Author : Aristidis Likas Nikos Vlassis Jakob J.Verbeek 國立雲林科技大學 National Yunlin.
CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching.
Presenter: Jae Sung Park
1 Munther Abualkibash University of Bridgeport, CT.
CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching Link: singhashwini.mesinghashwini.me.
Capturing, Processing and Experiencing Indian Monuments
Real-Time Human Pose Recognition in Parts from Single Depth Image
Geometric Hashing: An Overview
Online Graph-Based Tracking
for Vision-Based Navigation
Resource Allocation for Distributed Streaming Applications
Presentation transcript:

Computer Vision Lab Seoul National University Keyframe-Based Real-Time Camera Tracking Young Ki BAIK Vision seminar : Mar Computer Vision Lab.

Computer Vision Lab Seoul National University Reference Keyframe-Based Real-Time Camera Tracking - ICCV Zilong Dong, Guofeng Zhang, Jiaya Jia Hujun Bao

Computer Vision Lab Seoul National University Outline Introduction System overview Offline map building Online camera tracking Demonstration Conclusion Opitimal keyframe selection Keyframe recognition using vocabulary tree How to use multiple thread

Computer Vision Lab Seoul National University What is the purpose? Real-time camera tracking Condition Known internal camera parameters The map of environment is given… 3D Map measurement Input frame 6-DOF camera pose estimation!! 3D camera pose

Computer Vision Lab Seoul National University System overview Real-time camera tracking Novel keyframe selection algorithm How to build a map How to estimate the camera pose Offline mapping using the conventional MVG method Online camera tracking Efficient key frame recognition method Parallel computing framework

Computer Vision Lab Seoul National University Offline mapping stage reference images Image Acquisition Video data for the target environment

Computer Vision Lab Seoul National University Offline mapping stage Image Acquisition Feature Extraction and Matching SIFT descriptors

Computer Vision Lab Seoul National University Offline mapping stage Image Acquisition Conventional MVG Feature Extraction and Matching 3D Map Building

Computer Vision Lab Seoul National University Offline mapping stage Keyframe set Image AcquisitionKeyframe selection Feature Extraction and Matching 3D Map Building Optimal Keyframe Selection For efficiency of map management

Computer Vision Lab Seoul National University Offline mapping stage Image Acquisition Vocabulary tree Feature Extraction and Matching 3D Map Building Optimal Keyframe Selection Vocabulary Tree Construction For fast keyframe selection in tracking stage… Contribution!!

Computer Vision Lab Seoul National University Why keyframe? Which one is good for a map? To solve the scalability problem of the map All referencesKeyframe A small number of …

Computer Vision Lab Seoul National University Optimal Keyframe Selection Previous methods select manually… using heuristic method… Proposed method select optimal keyframe Containing many salient features as possible Reducing non-distinctiveness in matching Features should be distributed evenly in keyframes

Computer Vision Lab Seoul National University Optimal Keyframe Selection Cost minimization Problem formulation… to find optimal keyframes Completeness TermRedundancy Term weight Saliency of features Distribution of features Distinctiveness

Computer Vision Lab Seoul National University Optimal Keyframe Selection Completeness Term Definition Reference images Keyframes

Computer Vision Lab Seoul National University Optimal Keyframe Selection Completeness Term Definition Feature clusters Reference image set 2D image position

Computer Vision Lab Seoul National University Optimal Keyframe Selection Completeness Term Definition Set of superior features ~ Long-term surviving (or representative) features in reference images

Computer Vision Lab Seoul National University Optimal Keyframe Selection Completeness Term Saliency ~ The larger s(χ) means the higher saliency! DoG map

Computer Vision Lab Seoul National University Optimal Keyframe Selection Completeness Term Distribution The smaller d (χ) → Features are well distributed !! Feature density 31x31 window

Computer Vision Lab Seoul National University Optimal Keyframe Selection Completeness Term =3 : controls the sensitivity of feature density… : the superior feature set in the keyframe set F

Computer Vision Lab Seoul National University Optimal Keyframe Selection Redundancy Term Normalization The number of keyframes which contain feature χ Small E r (F) indicates low redundancy!

Computer Vision Lab Seoul National University Keyframe selection Algorithm Let = 0 Initialization ~

Computer Vision Lab Seoul National University Keyframe selection Algorithm ~ Add Keyframe

Computer Vision Lab Seoul National University Keyframe selection Algorithm ~ Add Keyframe

Computer Vision Lab Seoul National University Keyframe selection Algorithm ~ Check

Computer Vision Lab Seoul National University Keyframe selection Algorithm ~ Add Keyframe

Computer Vision Lab Seoul National University Keyframe selection Algorithm ~ Add Keyframe

Computer Vision Lab Seoul National University Keyframe selection Algorithm ~ Check

Computer Vision Lab Seoul National University Vocabulary Tree Construction For fast keyframe recognition… Root node List Branch = 8, Tree depth = 5 All F Mean patch All feature descriptor

Computer Vision Lab Seoul National University Vocabulary Tree Construction To build each node… Root node List Clustering by K-means method Branch = 8, Tree depth = 5

Computer Vision Lab Seoul National University Vocabulary Tree Construction To build each node… Root node List Clustering by K-means method List Computing mean patch Building keyframe list Branch = 8, Tree depth = 5

Computer Vision Lab Seoul National University Vocabulary Tree Construction Finally… Root node List Clustering by K-means method List Computing mean patch Building keyframe list List Branch = 8, Tree depth = 5

Computer Vision Lab Seoul National University Online tracking stage SIFT Feature Extraction SIFT feature Input image

Computer Vision Lab Seoul National University Online tracking stage SIFT Feature Extraction Vocabulary tree Keyframe Recognition

Computer Vision Lab Seoul National University Online tracking stage SIFT Feature Extraction Keyframe Recognition Keyframe-based matching Input imageSelected F

Computer Vision Lab Seoul National University Online tracking stage SIFT Feature Extraction Keyframe Recognition Keyframe-based matching Camera Pose Estimation 3-point algorithm 3-D map 2-D point

Computer Vision Lab Seoul National University Keyframe recognition With SIFT patches of input image… Root node List Matching input patch & mean patches Vocabulary tree Key frame voting memory

Computer Vision Lab Seoul National University Keyframe recognition With SIFT patches of input image… Root node List Key frame voting memory Move patch And Update voting memory

Computer Vision Lab Seoul National University Keyframe recognition With SIFT patches of input image… Root node List Key frame voting memory Select keyframe with largest voting value

Computer Vision Lab Seoul National University Processing time Process time per frame with a single thread… Frame rate is around 6fps!!!

Computer Vision Lab Seoul National University Processing time Using multiple-thread with pipeline SIFT Feature Extraction Time axis Select Keyframe Matching SIFT Feature Extraction Pose Estimation Thread#1Thread#2 Thread#3Thread#4 Select Keyframe Matching Pose Estimation 20 fps First frame is time consuming!

Computer Vision Lab Seoul National University Demonstration

Computer Vision Lab Seoul National University Conclusion Real-time camera tracking is presented !! The optimal keyframe selection algorithm Real-time tracking is presented! Minimizing the energy formulation! Fast keyframe recognition using vocabulary tree Multiple thread with pipeline structure Limitation Incremental updating of VT is not considered… Proposed process is only available for known environment.

Computer Vision Lab Seoul National University Q&AQ&A