Digital Days 29/6/2001 ISTORAMA: A Content-Based Image Search Engine and Hierarchical Triangulation of 3D Surfaces. Dr. Ioannis Kompatsiaris Centre for.

Slides:



Advertisements
Similar presentations
Aggregating local image descriptors into compact codes
Advertisements

November 12, 2013Computer Vision Lecture 12: Texture 1Signature Another popular method of representing shape is called the signature. In order to compute.
3D Graphics Rendering and Terrain Modeling
Extended Gaussian Images
3D Shape Histograms for Similarity Search and Classification in Spatial Databases. Mihael Ankerst,Gabi Kastenmuller, Hans-Peter-Kriegel,Thomas Seidl Univ.
Automatic Feature Extraction for Multi-view 3D Face Recognition
Texture Segmentation Based on Voting of Blocks, Bayesian Flooding and Region Merging C. Panagiotakis (1), I. Grinias (2) and G. Tziritas (3)
Image Indexing and Retrieval using Moment Invariants Imran Ahmad School of Computer Science University of Windsor – Canada.
Multiresolution Analysis of Arbitrary Meshes Matthias Eck joint with Tony DeRose, Tom Duchamp, Hugues Hoppe, Michael Lounsbery and Werner Stuetzle Matthias.
Xianfeng Gu, Yaling Wang, Tony Chan, Paul Thompson, Shing-Tung Yau
December 5, 2013Computer Vision Lecture 20: Hidden Markov Models/Depth 1 Stereo Vision Due to the limited resolution of images, increasing the baseline.
A Novel Scheme for Video Similarity Detection Chu-Hong Hoi, Steven March 5, 2003.
CS CS 175 – Week 4 Mesh Decimation General Framework, Progressive Meshes.
A New Block Based Motion Estimation with True Region Motion Field Jozef Huska & Peter Kulla EUROCON 2007 The International Conference on “Computer as a.
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
1 Displaced Subdivision Surfaces Aaron Lee Princeton University Henry Moreton Nvidia Hugues Hoppe Microsoft Research.
ADVISE: Advanced Digital Video Information Segmentation Engine
Curve Analogies Aaron Hertzmann Nuria Oliver Brain Curless Steven M. Seitz University of Washington Microsoft Research Thirteenth Eurographics.
Visualization and graphics research group CIPIC January 30, 2003Multiresolution (ECS 289L) - Winter MAPS – Multiresolution Adaptive Parameterization.
Visual Querying By Color Perceptive Regions Alberto del Bimbo, M. Mugnaini, P. Pala, and F. Turco University of Florence, Italy Pattern Recognition, 1998.
Automatic Key Video Object Plane Selection Using the Shape Information in the MPEG-4 Compressed Domain Berna Erol and Faouzi Kossentini, Senior Member,
Frederic Payan, Marc Antonini
Tracking Video Objects in Cluttered Background
Visualization and graphics research group CIPIC Feb 18, 2003Multiresolution (ECS 289L) - Winter Progressive Meshes (SIGGRAPH ’96) By Hugues Hoppe.
CS292 Computational Vision and Language Visual Features - Colour and Texture.
A Novel 2D To 3D Image Technique Based On Object- Oriented Conversion.
Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.
A fuzzy video content representation for video summarization and content-based retrieval Anastasios D. Doulamis, Nikolaos D. Doulamis, Stefanos D. Kollias.
ECS 289L A Survey of Mesh-Based Multiresolution Representations Ken Joy Center for Image Processing and Integrated Computing Computer Science Department.
Introduction --Classification Shape ContourRegion Structural Syntactic Graph Tree Model-driven Data-driven Perimeter Compactness Eccentricity.
AdvisorStudent Dr. Jia Li Shaojun Liu Dept. of Computer Science and Engineering, Oakland University 3D Shape Classification Using Conformal Mapping In.
Content-based Retrieval of 3D Medical Images Y. Qian, X. Gao, M. Loomes, R. Comley, B. Barn School of Engineering and Information Sciences Middlesex University,
Technology and Historical Overview. Introduction to 3d Computer Graphics  3D computer graphics is the science, study, and method of projecting a mathematical.
 Coding efficiency/Compression ratio:  The loss of information or distortion measure:
1 Faculty of Information Technology Generic Fourier Descriptor for Shape-based Image Retrieval Dengsheng Zhang, Guojun Lu Gippsland School of Comp. & Info.
Digital Image Processing Lecture 20: Representation & Description
Dynamic Meshing Using Adaptively Sampled Distance Fields
BACKGROUND LEARNING AND LETTER DETECTION USING TEXTURE WITH PRINCIPAL COMPONENT ANALYSIS (PCA) CIS 601 PROJECT SUMIT BASU FALL 2004.
Presented By Greg Gire Advised By Zoë Wood California Polytechnic State University.
Multiscale Moment-Based Painterly Rendering Diego Nehab and Luiz Velho
2004, 9/1 1 Optimal Content-Based Video Decomposition for Interactive Video Navigation Anastasios D. Doulamis, Member, IEEE and Nikolaos D. Doulamis, Member,
COLOR HISTOGRAM AND DISCRETE COSINE TRANSFORM FOR COLOR IMAGE RETRIEVAL Presented by 2006/8.
December 4, 2014Computer Vision Lecture 22: Depth 1 Stereo Vision Comparing the similar triangles PMC l and p l LC l, we get: Similarly, for PNC r and.
Applied Mathematics 1 Do we have to see everything - all the time? …or is it at all possible? Dr. ing. Rune Aasgaard.
1 Mpeg-4 Overview Gerhard Roth. 2 Overview Much more general than all previous mpegs –standard finished in the last two years standardized ways to support:
Wavelet-based Coding And its application in JPEG2000 Monia Ghobadi CSC561 final project
Semi-regular 3D mesh progressive compression and transmission based on an adaptive wavelet decomposition 21 st January 2009 Wavelet Applications in Industrial.
1 Compressing Triangle Meshes Leila De Floriani, Paola Magillo University of Genova Genova (Italy) Enrico Puppo National Research Council Genova (Italy)
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
2005/12/021 Content-Based Image Retrieval Using Grey Relational Analysis Dept. of Computer Engineering Tatung University Presenter: Tienwei Tsai ( 蔡殿偉.
Event retrieval in large video collections with circulant temporal encoding CVPR 2013 Oral.
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
Segmentation of Vehicles in Traffic Video Tun-Yu Chiang Wilson Lau.
1 Multi-resolution Tetrahedral Meshes Leila De Floriani Department of Computer and Information Sciences University of Genova, Genova (Italy)
Subject Name: Computer Graphics Subject Code: Textbook: “Computer Graphics”, C Version By Hearn and Baker Credits: 6 1.
1/12/ Multimedia Data Mining. Multimedia data types any type of information medium that can be represented, processed, stored and transmitted over.
Yixin Chen and James Z. Wang The Pennsylvania State University
Content Based Color Image Retrieval vi Wavelet Transformations Information Retrieval Class Presentation May 2, 2012 Author: Mrs. Y.M. Latha Presenter:
Outline ● Introduction – What is the problem ● Generate stochastic textures ● Improve realism ● High level approach - Don't just jump into details – Why.
1 Faculty of Information Technology Enhanced Generic Fourier Descriptor for Object-Based Image Retrieval Dengsheng Zhang, Guojun Lu Gippsland School of.
Outline ● Introduction – What is the problem ● Generate stochastic textures ● Improve realism ● High level approach - Don't just jump into details – Why.
Similarity Measurement and Detection of Video Sequences Chu-Hong HOI Supervisor: Prof. Michael R. LYU Marker: Prof. Yiu Sang MOON 25 April, 2003 Dept.
CIVET seminar Presentation day: Presenter : Park, GilSoon.
11/25/03 3D Model Acquisition by Tracking 2D Wireframes Presenter: Jing Han Shiau M. Brown, T. Drummond and R. Cipolla Department of Engineering University.
An Image Database Retrieval Scheme Based Upon Multivariate Analysis and Data Mining Presented by C.C. Chang Dept. of Computer Science and Information.
Introduction to Graphics Modeling
Nonparametric Semantic Segmentation
Improving Retrieval Performance of Zernike Moment Descriptor on Affined Shapes Dengsheng Zhang, Guojun Lu Gippsland School of Comp. & Info Tech Monash.
JPEG Still Image Data Compression Standard
Wavelet-based Compression of 3D Mesh Sequences
Presentation transcript:

Digital Days 29/6/2001 ISTORAMA: A Content-Based Image Search Engine and Hierarchical Triangulation of 3D Surfaces. Dr. Ioannis Kompatsiaris Centre for Research and Technology Hellas Informatics and Telematics Institute Thermi-Thessaloniki, Greece

Digital Days 29/6/2001 Outline Introduction Istorama architecture K-Means with Connectivity Constraint Algorithm (KMCC) Demo Object/model based coding Adaptive Triangulation and Progressive transmission Reduced pyramid - quincunx sampling Experimental results Conclusions

Digital Days 29/6/2001 Need for efficient image search Huge number of images or databases of images Highly visual and graphical nature of the Web Text descriptors are not always efficient Greater flexibility with “content-based” access Queries which are more natural to humans

Digital Days 29/6/2001 Proposed approach Usually a description, a “signature” or a set indexes is created for the whole image Images usually contain different objects Proposed approach: the image is first separated into objects (segmentation) Descriptors are created for each object The user can search for a specific object contained in images

Digital Days 29/6/2001 ISTORAMA architecture Server World Wide Web Data Base JDBC Java Data Base Connection User PHP Crawler - Spider Indexing - Retrieval Algorithms

Digital Days 29/6/2001 The K-Means with Connectivity Constraint Algorithm (KMCC) I Based on K-Means algorithm K-Means does not take into account spatial information In KMCC, the spatial proximity of each region is also taken into account by defining a new spatial center and by integrating the K-Means with a component labeling procedure KAutomatic correction of the number of regions K

Digital Days 29/6/2001 The K-Means with Connectivity Constraint Algorithm (KMCC) II Step1 K-Means is performed  Step2 Spatial centers are calculated Step3 Generalised distance LStep 4 Component labeling  L connected regions

Digital Days 29/6/2001 The K-Means with Connectivity Constraint Algorithm (KMCC) III

Digital Days 29/6/2001 Object descriptors Color, texture and spatial characteristics Color: histogram, 8 bins Spatial: (centroid), Shape: area, eccentricity where λ 1, λ 2 are the two first eigenvalues

Digital Days 29/6/2001 Experimental Results (Synthetic)

Digital Days 29/6/2001 Experimental Results (Synthetic)

Digital Days 29/6/2001 Experimental Results

Digital Days 29/6/2001 Experimental Results (Claire) Facial region Moving object Original sequence Frames 1-10 Segmentation

Digital Days 29/6/2001 Experimental results (Claire)

Digital Days 29/6/2001 Experimental results (table-tennis) Original sequence Frames 1-10

Digital Days 29/6/2001 Experimental results (table-tennis) Segmentation Moving objects

Digital Days 29/6/2001 Experimental results (Akiyo+Foreman) Facial region Original sequence Frames 1-10 Original sequence Frames 1-10

Digital Days 29/6/2001 Conclusions K-means with spatial proximity algorithm Multiple features segmentation Higher order segmentation Correspondence of objects between consequent frames Max-min criterion for automatic regularisation parameters

Digital Days 29/6/2001 Future work Use of texture Indexing of video Integration with text descriptors

Digital Days 29/6/2001 Triangular meshes of high quality are used in: Computer Aided Design 3D representation of objects (e.g. archaeological artifacts) Animation and visual simulation Entertainment (computer games) Digital Terrain Modelling Introduction

Digital Days 29/6/2001 Object/model-based coding

Digital Days 29/6/2001 Object/model-based coding

Digital Days 29/6/2001 Object/model-based coding

Digital Days 29/6/2001 Compression of finely detailed surfaces is necessary for: computation storage transmission display efficiency Adaptive triangulation

Digital Days 29/6/2001 Early, coarse approximations are refined though additional bits Progressive transmission

Digital Days 29/6/2001 Vertices removal and retriangulation [Schroeder] [Cohen] General mesh optimization process/function [Hoppe] Multiresolution analysis (MRA) [Lounsbery] Wavelets [Schroeder] [Gross] Progressive transmission [Schroeder] [Hoppe] Generalized triangle mesh representation [Deering] Background

Digital Days 29/6/2001 Properties of the algorithm Efficient compression of the wireframe information Simplification of the wireframe by adaptive triangulation Progressive transmission of the wireframe information Prioritised transmission of the wireframe Straightforward correspondence between successive scales

Digital Days 29/6/2001 Input surfaces Surface represented as a parametric function in the parametric space determined by the position of a set of control points or nodes It allows for arbitrary, possibly closed wire-frame surfaces to be defined.

Digital Days 29/6/2001 Input surfaces The filters are applied to the 2D parametric representation of the surface as though it were a 2D image with intensity equal to Such surfaces include also: depth images estimated from stereo pairs and every surface that is homomorphic to a plane, cylinder or torus

Digital Days 29/6/2001 Block diagram of the proposed procedure

Digital Days 29/6/2001 Reduced pyramid with quincunx sampling matrix

Digital Days 29/6/2001 Corresponding triangulation

Digital Days 29/6/2001 Optimal filters Optimal filters are determined by their Fourier transform: where is the power spectral density. Alternatively may be determined by the equation:

Digital Days 29/6/2001 Optimum bit allocation bits/vertex is assumed to be transmitted bits/vertex are allocated to each level using is the sum of error variances

Digital Days 29/6/2001 Error prioritization The prediction errors corresponding to all predicted vertices are calculated and sorted with the vertices corresponding to higher errors being put first on the list Higher Errors Lower Errors

Digital Days 29/6/2001 Entropy estimation Entropy coding is used The number of bits needed for error transmission is the entropy of the errors Using the quincunx sampling geometry at the receiver, there is no need to transmit the exact co-ordinates of the position of each transmitted vertex The final cost of the transmission is the sum of the error entropy and the position entropy

Digital Days 29/6/2001 Adaptive Triangulation Procedure Synthesis stage of the QMVINT pyramid The vertex along with the vertices used to predict it are added to the mesh Handling of cracks Triangulation of the next vertex

Digital Days 29/6/2001 Adaptive triangulation procedure

Digital Days 29/6/2001 Adaptive triangulation procedure

Digital Days 29/6/2001 Experimental results Original dense depth map and surface of the “Venus” data

Digital Days 29/6/2001 Experimental results 2569 vertices and 4006 triangles at level 2 MSE = 1.30

Digital Days 29/6/2001 Experimental results 7661 vertices and triangles at level 1 MSE = 1.30

Digital Days 29/6/2001 Experimental results vertices and triangles at level 0 MSE = 0.12

Digital Days 29/6/2001 Experimental results

Digital Days 29/6/2001 Conclusions Hierarchical representation of 3D surfaces using 3D adaptive triangular wireframes The variance of the error transmitted is minimised and therefore results to optimal compression of the wireframe information It produces a hierarchy where coarse meshes are as similar to their finer versions as is possible

Digital Days 29/6/2001 Conclusions The triangulation algorithm is integrated with a bit allocation procedure The number of nodes and triangles of the wireframe as well as the information needed for the transmission or storage of the wireframe are reduced simultaneously using a unified approach (QMVINT filtering) Precise correspondence between triangles at each level is achieved

Digital Days 29/6/2001 Future work Expansion and application directly to 3D surfaces Estimation of filters