Download presentation
Presentation is loading. Please wait.
1
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007
2
CS335 Principles of Multimedia Systems Introduction With the increase of multimedia content on the web, we need methods to find the image, audio or video. Text only schemes have limitations –A lot of manual work. –Inaccurate description. Content based media retrieval studies how to –find media data using classification and recognition methods based on “features” in audio, image and videos. –For example, we can use an exemplar (like a sample picture) to find other similar ones. Applications: –Digital library, personal image album and audio folders, satellite image processing and medical applications.
3
CS335 Principles of Multimedia Systems An Image Retrieval Example (Viper) The query input.
4
CS335 Principles of Multimedia Systems An Image Retrieval Example (Viper) The query output.
5
CS335 Principles of Multimedia Systems User feedback.
6
CS335 Principles of Multimedia Systems Refined results. Better?
7
CS335 Principles of Multimedia Systems Another query for paintings.
8
CS335 Principles of Multimedia Systems Painting Search Result The shortlist returned from the search.
9
CS335 Principles of Multimedia Systems Content Based Media Retrieval The input –A text description. –One or more exemplar images, audio or video clips. –A sketch, e.g., a dark background with an orange disk in the the middle used to search for sunset scenes. –Or the combination or them. The output –A shortlist of images, or audio video clips. –The instances of the event you want to find in videos or audios. –Can be structured into a web page or other documents. –Usually allows user feedback to improve the result. The basic task in content based media retrieval is comparing and searching multimedia data.
10
CS335 Principles of Multimedia Systems How Do We Evaluate the Performance? Precision and Recall –Precision = (# of relevant items) / (# of items retrieved) –Recall = (# of relevant items) / (Total # of related items in the dataset) The procedure of drawing a Recall-Precision Curve: –Compute the relevance score for each item in the database. –Sort the list. –Assume the sorted list is like r r r n n r r r n n … and we have total 6 relevant items in the database
11
CS335 Principles of Multimedia Systems The Recall-Precision Curve 1/6 2/6 3/6 4/6 5/6 1 1 Precision Recall Short list is like: r r r n n r r r n n … Q: Why do not we just use a single value instead of a curve?
12
CS335 Principles of Multimedia Systems The “Best” Recall-Precision Curve 1 Precision Recall 1/(# of relevant items) (# of relevant items)/ (# of total items) 1
13
CS335 Principles of Multimedia Systems Image Retrieval Methods To find images in a database, we have to compare images quantitatively based on “features”. We can compare the images as a whole using features like: –Color, textures and their spatial layouts. We can also segment images into regions and use similar features in object detection. In some recent systems, people use salient features such as SIFT (Scale Invariant Transform) like features, learning and pattern recognition methods.
14
CS335 Principles of Multimedia Systems Color Histogram Methods Color only schemes tend to find many unrelated images. http://amazon.ece.utexas.edu/~qasim/qdialog_IMGDATA2_v1_Birds_Swans.html
15
CS335 Principles of Multimedia Systems Improve Color Histogram Methods If we can separate the foreground with background the result will be improved. Foreground Background
16
CS335 Principles of Multimedia Systems Improve Color Histogram Methods Their spatial relations also help to find the right object. Color Blob 2 Color Blob 1
17
CS335 Principles of Multimedia Systems Finding Shapes Finding similar shapes is a very useful tool in managing large number of images. Chamfer matching is a standard method to compare the similarity of shapes. General Hough Transform can also be used to find shapes in images.
18
CS335 Principles of Multimedia Systems Shape Context Shape context is another widely used feature in shape retrieval. C ij is the distance of shape contexts h i and h j
19
CS335 Principles of Multimedia Systems Improve Matching Efficiency Fast pruning in matching –Reprehensive shape contexts –Shapemes Greg Mori, Serge Belongie, and Jitendra Malik, Shape Contexts Enable Efficient Retrieval of Similar Shapes, CVPR, 2001
20
CS335 Principles of Multimedia Systems Example Results Reprehensive shape contexts in shape matching
21
CS335 Principles of Multimedia Systems Current Trends and Challenges We now show a more “recent” work L. Fei-Fei, R. Fergus, and P. Perona. A Bayesian approach to unsupervised One-Shot learning of Object categories. ICCV 2003. The goal is to detect whether an object appears in an image.
22
CS335 Principles of Multimedia Systems SIFT features are used. The good features are In fact learned from Small set of training images.
23
CS335 Principles of Multimedia Systems Motor bike Results.
24
CS335 Principles of Multimedia Systems Competitions about Object Recognition http://www.pascal- network.org/challenges/VOC/voc2007/
25
CS335 Principles of Multimedia Systems Retrieve Other Multimedia Data Audio retrieval –Find a audio clip in a large database. Video retrieval –Find a specific video clip. –Find a video short that has specific person or action. –Browsing video …
26
CS335 Principles of Multimedia Systems Data Structures in Media Retrieval In multimedia data retrieval we often need to find the “nearest Neighbor” in the database from the exemplar. We can abstract each media object as a feature vector. Our goal is to organize the database so that we can locate the most similar vector as quickly as possible. Q: Think of some data structures that help to improve the searching.
27
CS335 Principles of Multimedia Systems K-d Tree A 2D k-d tree a b c d e f a b c df
28
CS335 Principles of Multimedia Systems Summary Content based multimedia retrieval is still not mature. Many problems still need to be solved. There is no single method that solves all the problems. We need better object detection and classification schemes. Other related problems like multimedia data mining are also attracting more and more interest.
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.