An MPEG-7 Based Semantic Album for Home Entertainment Presented by Chen-hsiu Huang 2003/08/12 Presented by Chen-hsiu Huang 2003/08/12.

Slides:



Advertisements
Similar presentations
National Technical University of Athens Department of Electrical and Computer Engineering Image, Video and Multimedia Systems Laboratory
Advertisements

A Human-Centered Computing Framework to Enable Personalized News Video Recommendation (Oh Jun-hyuk)
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Change Detection C. Stauffer and W.E.L. Grimson, “Learning patterns of activity using real time tracking,” IEEE Trans. On PAMI, 22(8): , Aug 2000.
Simple Face Detection system Ali Arab Sharif university of tech. Fall 2012.
Detecting Categories in News Video Using Image Features Slav Petrov, Arlo Faria, Pascal Michaillat, Alex Berg, Andreas Stolcke, Dan Klein, Jitendra Malik.
1 Overview of Image Retrieval Hui-Ying Wang. 2/42 Reference Smeulders, A. W., Worring, M., Santini, S., Gupta, A.,, and Jain, R “Content-based.
COLORCOLOR A SET OF CODES GENERATED BY THE BRAİN How do you quantify? How do you use?
Facial feature localization Presented by: Harvest Jang Spring 2002.
Ghunhui Gu, Joseph J. Lim, Pablo Arbeláez, Jitendra Malik University of California at Berkeley Berkeley, CA
1 Content-Based Retrieval (CBR) -in multimedia systems Presented by: Chao Cai Date: March 28, 2006 C SC 561.
 Introduction  Principles of context – aware saliency  Detection of context – aware saliency  Result  Application  Conclusion.
ICME 2008 Huiying Liu, Shuqiang Jiang, Qingming Huang, Changsheng Xu.
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
Real-time Embedded Face Recognition for Smart Home Fei Zuo, Student Member, IEEE, Peter H. N. de With, Senior Member, IEEE.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
Object-based Image Representation Dr. B.S. Manjunath Sitaram Bhagavathy Shawn Newsam Baris Sumengen Vision Research Lab University of California, Santa.
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
Content-Based Image Retrieval (CBIR) Student: Mihaela David Professor: Michael Eckmann Most of the database images in this presentation are from the Annotated.
February, Content-Based Image Retrieval Saint-Petersburg State University Natalia Vassilieva Il’ya Markov
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Visual Attention More information in visual field than we can process at a given moment Solutions Shifts of Visual Attention related to eye movements Some.
Computer Vision Introduction to Image formats, reading and writing images, and image environments Image filtering.
T.Sharon 1 Internet Resources Discovery (IRD) Introduction to MMIR.
Visual Attention Jeremy Wyatt. Where to look? Many visual processes are expensive Humans don’t process the whole visual field How do we decide what to.
Presented by Zeehasham Rasheed
Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
김덕주 (Duck Ju Kim). Problems What is the objective of content-based video analysis? Why supervised identification has limitation? Why should use integrated.
Eigenfaces for Recognition Student: Yikun Jiang Professor: Brendan Morris.
Content-Based Video Retrieval System Presented by: Edmund Liang CSE 8337: Information Retrieval.
A Tutorial on Object Detection Using OpenCV
A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.
Multimedia Databases (MMDB)
Content-Based Image Retrieval
Nonparametric Part Transfer for Fine-grained Recognition Presenter Byungju Kim.
DIEGO AGUIRRE COMPUTER VISION INTRODUCTION 1. QUESTION What is Computer Vision? 2.
Welcome Students Research Experience for Teachers (R.E.T.) Teachers seek out promising students from high school to connect to the engineering program.
An MPEG-7 Based Content- aware Album System for Consumer Photographs 2003/12/18 Chen-Hsiu Huang, Chih-Hao Shen, Chun-Hsiang Huang and Ja-Ling Wu Communication.
Photo Album by User
Jan SedmidubskyApril 7, 2014Face Recognition Technology Faculty of Informatics Masaryk University Brno, Czech Republic 1/11.
PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.
Content-Based Image Retrieval Using Fuzzy Cognition Concepts Presented by Tienwei Tsai Department of Computer Science and Engineering Tatung University.
INTERACTIVELY BROWSING LARGE IMAGE DATABASES Ronald Richter, Mathias Eitz and Marc Alexa.
Street Smarts: Visual Attention on the Go Alexander Patrikalakis May 13, XXX.
MMDB-9 J. Teuhola Standardization: MPEG-7 “Multimedia Content Description Interface” Standard for describing multimedia content (metadata).
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
+ Speed Up Texture Classification in Clothing Retrieval System 電機三 吳瑋凌.
Module 4 – Tahirih Morrison Site Map Sketch – on paper brainstorming Site Map Sketch – on computer Navigation Sketch Site Map Navigation Map Rationale.
IBM - CVUT Student Research Projects IBM Presence detection Milan Stezka
3D Face Recognition Using Range Images Literature Survey Joonsoo Lee 3/10/05.
 Mentor : Prof. Amitabha Mukerjee Learning to Detect Salient Objects Team Members - Avinash Koyya Diwakar Chauhan.
Frank Bergschneider February 21, 2014 Presented to National Instruments.
Introduction to MPEG  Moving Pictures Experts Group,  Geneva based working group under the ISO/IEC standards.  In charge of developing standards for.
Shadow Detection in Remotely Sensed Images Based on Self-Adaptive Feature Selection Jiahang Liu, Tao Fang, and Deren Li IEEE TRANSACTIONS ON GEOSCIENCE.
Ontology-based Automatic Video Annotation Technique in Smart TV Environment Jin-Woo Jeong, Hyun-Ki Hong, and Dong-Ho Lee IEEE Transactions on Consumer.
MPEG 7 &MPEG 21.
OpenCV C++ Image Processing
Presented By Bhargav (08BQ1A0435).  Images play an important role in todays information because A single image represents a thousand words.  Google's.
Digital Video Library - Jacky Ma.
Technologies: for Enhancing Broadcast Programmes with Bridgets
Visual Information Retrieval
Introduction Multimedia initial focus
Multimedia Content-Based Retrieval
Personalized Social Image Recommendation
Content-based Image Retrieval
Video Summarization by Spatial-Temporal Graph Optimization
Image Segmentation Techniques
A User Attention Based Visible Watermarking Scheme
Ying Dai Faculty of software and information science,
Introduction to Artificial Intelligence Lecture 24: Computer Vision IV
Presentation transcript:

An MPEG-7 Based Semantic Album for Home Entertainment Presented by Chen-hsiu Huang 2003/08/12 Presented by Chen-hsiu Huang 2003/08/12

Introduction  The digital album should not only process the meaningless 0/1 bits but also realize the semantic information from media.  It could be much better if computer knows which region is more important, which photo has any important person in it, or which photos have close relation with current browsing one.  The face detection & recognition technology have developed for years, can we query our daily photos by face?

Core Functionalities  Query image by face –Face detection & recognition  Photo Focus identification –Smart Thumbnail  Photo Similarity –Relative photos  Photo Grammar –Not yet done

Query Images by Face  Steps to achieve query by face: –Find out the faces in photos –Build the face databases –Training face databases –Recognize faces in photos –Query images by Face PS: We use Intel OpenCV as face detection & recognition module

Photo Focus  For photos with people, human faces are surely our focus when viewing.  The user attention model has applied to find some saliency points: –Red: Intensity based –Green: Color based –Blue: Skin color based –Texture based

Smart Thumbnail  Direct Scaling –Traditional way of creating thumbnail  Focus based –Cropping the focus region first, then scaling –Better then direct scaling, but not so good  Adaptive selection –For each face & saliency points, a weighting function was applied to calculate its importance. –User can select the cropping ratio, the cropping region is adaptive decided according to the weighting value.

Focus Based Adaptive Selection Direct Scale

Photo Similarity  Color Layout Descriptor –It is designed to efficiently represent spatial distribution of colors  Dominant Color Descriptor –The representative colors in an image or image region  By using the human faces information and MPEG-7 descriptors, we can calculate the similarities between images.

System Diagram Face detection & reorganization User attention model Saliency Map MPEG-7 Visual Descriptors Query by Face Photo Focus & Smart Thumbnail Photo Similarity Photo Grammar Evaluation We can get more semantic meanings from low level features by combining those kernel modules.

In the Future  The album should be able to cope with different type photos.  The album system can be improved both systematic side and component side.  The album should be able to process other media type such as audio and video.  The album syntax should be fully conform to the MPEG-7 standard.

The End  Any recommendation is welcomed.  Thank you.

Adaptive Selection  For all the visual objects (faces, saliency points), calculate its importance by:  When adaptive selection, sort those visual objects by importance, dropping the least import object to achieve the goal cropping ratio.