Understanding The Semantics of Media Chapter 8 Camilo A. Celis.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 4 – Digital Image Representation Klara Nahrstedt Spring 2009.
Toward Automatic Music Audio Summary Generation from Signal Analysis Seminar „Communications Engineering“ 11. December 2007 Patricia Signé.
Dimensionality Reduction PCA -- SVD
Image Indexing and Retrieval using Moment Invariants Imran Ahmad School of Computer Science University of Windsor – Canada.
1 Content-Based Retrieval (CBR) -in multimedia systems Presented by: Chao Cai Date: March 28, 2006 C SC 561.
What is missing? Reasons that ideal effectiveness hard to achieve: 1. Users’ inability to describe queries precisely. 2. Document representation loses.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Toward Semantic Indexing and Retrieval Using Hierarchical Audio Models Wei-Ta Chu, Wen-Huang Cheng, Jane Yung-Jen Hsu and Ja-LingWu Multimedia Systems,
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
DIMENSIONALITY REDUCTION BY RANDOM PROJECTION AND LATENT SEMANTIC INDEXING Jessica Lin and Dimitrios Gunopulos Ângelo Cardoso IST/UTL December
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
Multimedia and Text Indexing. Multimedia Data Management The need to query and analyze vast amounts of multimedia data (i.e., images, sound tracks, video.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
ADVISE: Advanced Digital Video Information Segmentation Engine
Visualizating the Non-Visual: Spatial Analysis and Interaction with Information from Text Documents J.A. Wise, J.J. Thomas, K. Pennock, D. Lantrip, M.
Expectation Maximization Method Effective Image Retrieval Based on Hidden Concept Discovery in Image Database By Sanket Korgaonkar Masters Computer Science.
Multimedia Search and Retrieval Presented by: Reza Aghaee For Multimedia Course(CMPT820) Simon Fraser University March.2005 Shih-Fu Chang, Qian Huang,
Dimension reduction : PCA and Clustering Slides by Agnieszka Juncker and Chris Workman.
LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.
1/ 30. Problems for classical IR models Introduction & Background(LSI,SVD,..etc) Example Standard query method Analysis standard query method Seeking.
IR Models: Latent Semantic Analysis. IR Model Taxonomy Non-Overlapping Lists Proximal Nodes Structured Models U s e r T a s k Set Theoretic Fuzzy Extended.
SLIDE 1IS 240 – Spring 2007 Prof. Ray Larson University of California, Berkeley School of Information Tuesday and Thursday 10:30 am - 12:00.
Visual Information Retrieval Chapter 1 Introduction Alberto Del Bimbo Dipartimento di Sistemi e Informatica Universita di Firenze Firenze, Italy.
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
ICME 2004 Tzvetanka I. Ianeva Arjen P. de Vries Thijs Westerveld A Dynamic Probabilistic Multimedia Retrieval Model.
A fuzzy video content representation for video summarization and content-based retrieval Anastasios D. Doulamis, Nikolaos D. Doulamis, Stefanos D. Kollias.
Information Retrieval in Practice
Content-Based Video Retrieval System Presented by: Edmund Liang CSE 8337: Information Retrieval.
TEMPORAL VIDEO BOUNDARIES -PART ONE- SNUEE KIM KYUNGMIN.
Document (Text) Visualization Mao Lin Huang. Paper Outline Introduction Visualizing text Visualization transformations: from text to pictures Examples.
Multimedia and Time-series Data
MediaEval Workshop 2011 Pisa, Italy 1-2 September 2011.
RuleML-2007, Orlando, Florida1 Towards Knowledge Extraction from Weblogs and Rule-based Semantic Querying Xi Bai, Jigui Sun, Haiyan Che, Jin.
Multimedia Databases (MMDB)
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Multimedia Information Retrieval
Query Processing In Multimedia Databases Dheeraj Kumar Mekala Devarasetty Bhanu Kiran.
Indices Tomasz Bartoszewski. Inverted Index Search Construction Compression.
Understanding the Semantics of Media Lecture Notes on Video Search & Mining, Spring 2012 Presented by Jun Hee Yoo Biointelligence Laboratory School of.
Generic text summarization using relevance measure and latent semantic analysis Gong Yihong and Xin Liu SIGIR, April 2015 Yubin Lim.
Latent Semantic Indexing: A probabilistic Analysis Christos Papadimitriou Prabhakar Raghavan, Hisao Tamaki, Santosh Vempala.
Authors: Rosario Sotomayor, Joe Carthy and John Dunnion Speaker: Rosario Sotomayor Intelligent Information Retrieval Group (IIRG) UCD School of Computer.
LATENT SEMANTIC INDEXING Hande Zırtıloğlu Levent Altunyurt.
Music Information Retrieval Information Universe Seongmin Lim Dept. of Industrial Engineering Seoul National University.
SINGULAR VALUE DECOMPOSITION (SVD)
PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.
Gene Clustering by Latent Semantic Indexing of MEDLINE Abstracts Ramin Homayouni, Kevin Heinrich, Lai Wei, and Michael W. Berry University of Tennessee.
1 CSC 594 Topics in AI – Text Mining and Analytics Fall 2015/16 6. Dimensionality Reduction.
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
1 Applications of video-content analysis and retrieval IEEE Multimedia Magazine 2002 JUL-SEP Reporter: 林浩棟.
Modern information retreival Chapter. 02: Modeling (Latent Semantic Indexing)
1 Latent Concepts and the Number Orthogonal Factors in Latent Semantic Analysis Georges Dupret
V. Clustering 인공지능 연구실 이승희 Text: Text mining Page:82-93.
Web Search and Text Mining Lecture 5. Outline Review of VSM More on LSI through SVD Term relatedness Probabilistic LSI.
Hierarchical Segmentation: Finding Changes in a Text Signal Malcolm Slaney and Dulce Ponceleon IBM Almaden Research Center.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Link Distribution on Wikipedia [0407]KwangHee Park.
A Patent Document Retrieval System Addressing Both Semantic and Syntactic Properties Liang Chen*,Naoyuki Tokuda+, Hisahiro Adachi+ *University of Northern.
Natural Language Processing Topics in Information Retrieval August, 2002.
VISUAL INFORMATION RETRIEVAL Presented by Dipti Vaidya.
MULTIMEDIA DATA MODELS AND AUTHORING
Singular Value Decomposition and its applications
Visual Information Retrieval
Introduction Multimedia initial focus
Multimedia Information Retrieval
Image Segmentation Techniques
Multimedia Information Retrieval
Midterm Exam Closed book, notes, computer Similar to test 1 in format:
Presentation transcript:

Understanding The Semantics of Media Chapter 8 Camilo A. Celis

Questions 1. What kind of application does SVD has? How is it used in this paper? 2. What does MPESAR stands for? What does this system do? 3. How does MPESAR generally works?

Contents Understanding the problem Analysis Tools Segmenting Video Semantic Retrieval

Contents Understanding the problem  Different Approaches  Segmentation Literature  Semantic Retrieval Literature Analysis Tools Segmenting Video Semantic Retrieval

Understanding the problem Semantic: (N) the study of meaning. changes in meaning and form.  the study of linguistic development by classifying and examining changes in meaning and form. Rapid growth of media: personal media, social media...  Low price  Social preassure We are not understanding media.

Different Approaches Increasing number of methods to retrieve information from media [Aner and Kender] Finds a background in a video shot, and then clusters shorts into physical scenes by noting shots with common background. QBIC (IMB) Allow to search for images based on the colors and images in an image. Known as query by-example. Where is the semantics of the media? "The most important information is in the WORDS!"

Segmentation Literature Extension of others work.  Latent semantic indexing. (LSI) - Allows to summarize the semantic content of a document and measure similarities.  Visualization and segmentation algorithm based on wavelet analysis of text documents. (time and frequency)  Scaled-space ideas to segmentation problem. Multi-dimensional signals.

Semantic Retrieval Literature Multimedia retrieval systems. (audio and video)  Mixtures of probability expert for semantic-audio retrieval (MPESAR) is a sophisticated model connection words and media. - Consider the acoustic and semantics similarity of sounds, allowing user to retrieve sounds without searching on the an exact word.  "MPESAR algorithm is appropriate for mapping one type of media to another."

Contents Understanding the problem Analysis Tools Segmenting Video Semantic Retrieval

Contents Understanding the problem Analysis Tools  SVD Principles  Color Space  Word Space Segmenting Video Semantic Retrieval

Analysis Tools Common tools and mathematics used to analyze multimedia signals. Two type of transformations, which reduce raw text and video signals into meaningful spaces.  Preprocessing the data  * Mapping from a one dimensional signal (speech) into a multidimensional signal (video).

Analysis Tools: SVD SVD (Singular Value Decomposition) principles: Factorization of real or complex matrixes. Noise reduction.  Semantic and video data are expressed as vector-value function of time.  Collect data from an entire video and put the data into a matrix X. (Columns of X represent the signal at different times)  Using SVD, rewrite the matrix X in terms of 3 matrices U,S,V.

Analysis Tools: Color Space Color changes are useful metrics for finding the boundary between shots.  Collect a histogram of colors of each frame. (512 histograms bins)  Convert all the tree intensities RGB intensities (0-255) to a single histogram bin, by finding the log base 2, of the intensity value  Pack the tree colors into a 9-bit number using floor() to covert to an integer.

Analysis Tools: Word Space Latent Semantic indexing (LSI), uses a SVD in direct analogy to the color analysis. Analyse the audio data by collection a histogram of the words in a transcript of the video.  Only one document to study. Consider sentences of the document, which define a semantic space. Issues? Synonomous and Polysemy. SVD captures both relationships.

Contents Understanding the problem Analysis Tools Segmenting Video Semantic Retrieval

Contents Understanding the problem Analysis Tools Segmenting Video  Temporal Properties Video  Segmentation overview  Scalar Space  Combined Image and Audio Data  Hierarchical Segmentation Results Semantic Retrieval

Segmenting Video Indexing by combining two major sources of data  i mages  words Describe the semantic path of a vide's transcript as a signal, from the initial sentence to the conclusion. Instead of trying to find similarities (segments) see audio-visual content as a signal and look for large changes in this signal.

Scale Space Used to find boundaries in a signal. Analyse a signal with many different kernels that vary in size of the temporal neighborhood that is included in the analysis at each point in time. Look for changes in the signal over time. (Do so by calculating the derivate of the signal with respect to time)

Overall From hierarchial segmentation and compare it with other forms of segmentation. A simple description of a video is possible by unifying the representations. Combine 2 well known technique to find boundaries in a video. Reduce dimensionality (SVD) and put all in the same format and its application on color and word data.

Combining color, words and scale space The result is a 20-dimensional vector function of time and scale.

Scale Space representations:

Scale Space

Results: Autocorrelation

Results: Grouping correlation

Results(cont.) Representations of the semantic information in the Headline News video in scale space. The top image shows the cosine of the angular change of the semantic trajectory with different amounts of low-pass filtering. The middle plot shows the peaks of the scale-space derivative The bottom plot shows the peaks traced back to their original starting point. These peaks represent topic boundaries.

Results: Shot Boundary Segmentation

Results:

Segmentation in Perspective New framework for combining into a unified representation and for segmentation from multiple types of information from a video. Described hierarchial segmentation (Unexactedly) good amount of information in the color. This method is also applicable with other type of information. (musical key, audio emotion, etc)

Contents Understanding the problem Analysis Tools Segmenting Video Semantic Retrieval

Contents Understanding the problem Analysis Tools Segmenting Video Semantic Retrieval  The algorithm  Testing  Conclusions

Semantic Retrieval: MEPSAR Connecting sounds to words and vice-versa.  Queries with sounds and words Learn about the connections between semantic space and acoustic space.

Algorithm Semantic Features  Uses PORTER stemmer to remove common suffixes from words, and deletes common words before further processing.  Partition the space into overlapping clusters of regions. Acoustic Features  Signal processing and machine learning calculations endeavors to capture the sound.  MFCC(mel-frequency cepstral coefficient) Analyse speech sounds. Used to reduce the audio signal  GMM captures the long-term characteristics of each sound.

Semantic Retrieval Acoustic signal processing chain Building MPESAR models

Testing Audio to semantic testing procedure.

Retrieval Results Histogram of true label ranks based on likehood from audio-to- semantic test. Histogram of true label ranks based on likehood from semantic-to-audio test.

Questions 1. What kind of application does SVD has? How is it used? The SVD has also applications in digital signal processing, e.g., as a method for noise reduction. It allows to summarize different kind of video data and combine the results into a common representation. 2. What does MPESAR stands for? What does this system do? (Mixture of Probability Expert for Semantic-Audio Retrieval) Learns the connections between a semantic space and an acoustic space. -Ex) Given a description of a word, the system finds audio signal that best fits the word. 3. How does MPESAR generally works? Semantic space maps words into a high-dimentional probabilistic space. Acoustic space describes sounds by a multidimensional vector. A many to many connection.

Thank you

Questions 1. What kind of application does SVD has? How is it used? The SVD has also applications in digital signal processing, e.g., as a method for noise reduction. It allows to summarize different kind of video data and combine the results into a common representation. 2. What does MPESAR stands for? What does this system do? (Mixture of Probability Expert for Semantic-Audio Retrieval) Learns the connections between a semantic space and an acoustic space. -Ex) Given a description of a word, the system finds audio signal that best fits the word. 3. How does MPESAR generally works? Semantic space maps words into a high-dimensional probabilistic space. Acoustic space describes sounds by a multidimensional vector. A many to many connection.

Q&A