Video Classification By: Maryam S. Mirian

Slides:

Advertisements

Similar presentations

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki

Advertisements

Trajectory Analysis of Broadcast Soccer Videos Computer Science and Engineering Department Indian Institute of Technology, Kharagpur by Prof. Jayanta Mukherjee.

DONG XU, MEMBER, IEEE, AND SHIH-FU CHANG, FELLOW, IEEE Video Event Recognition Using Kernel Methods with Multilevel Temporal Alignment.

Automatic Video Shot Detection from MPEG Bit Stream Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC.

SmartPlayer: User-Centric Video Fast-Forwarding K.-Y. Cheng, S.-J. Luo, B.-Y. Chen, and H.-H. Chu ACM CHI 2009 (international conference on Human factors.

Automatic Soccer Video Analysis and Summarization

Computer Science Engineering Lee Sang Seon.  Introduction  Basic notions for temporal video boundaries  Micro-Boundaries  Macro-Boundaries  Mega-Boundaries.

Visual Event Detection & Recognition Filiz Bunyak Ersoy, Ph.D. student Smart Engineering Systems Lab.

Personalized Abstraction of Broadcasted American Football Video by Highlight Selection Noboru Babaguchi (Professor at Osaka Univ.) Yoshihiko Kawai and.

SOMM: Self Organizing Markov Map for Gesture Recognition Pattern Recognition 2010 Spring Seung-Hyun Lee G. Caridakis et al., Pattern Recognition, Vol.

Broadcast News Parsing Using Visual Cues: A Robust Face Detection Approach Yannis Avrithis, Nicolas Tsapatsoulis and Stefanos Kollias Image, Video & Multimedia.

Content-based Video Indexing, Classification & Retrieval Presented by HOI, Chu Hong Nov. 27, 2002.

ICME 2008 Huiying Liu, Shuqiang Jiang, Qingming Huang, Changsheng Xu.

Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.

1 CS 430: Information Discovery Lecture 22 Non-Textual Materials 2.

ADVISE: Advanced Digital Video Information Segmentation Engine

Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,

Face Detection: a Survey Speaker: Mine-Quan Jing National Chiao Tung University.

Multimedia Search and Retrieval Presented by: Reza Aghaee For Multimedia Course(CMPT820) Simon Fraser University March.2005 Shih-Fu Chang, Qian Huang,

LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.

On the Use of Computable Features for Film Classification Zeeshan Rasheed,Yaser Sheikh Mubarak Shah IEEE TRANSCATION ON CIRCUITS AND SYSTEMS FOR VIDEO.

Visual Information Retrieval Chapter 1 Introduction Alberto Del Bimbo Dipartimento di Sistemi e Informatica Universita di Firenze Firenze, Italy.

Presented by Zeehasham Rasheed

A fuzzy video content representation for video summarization and content-based retrieval Anastasios D. Doulamis, Nikolaos D. Doulamis, Stefanos D. Kollias.

Support Vector Machine based Logo Detection in Broadcast Soccer Videos Hossam M. Zawbaa Cairo University, Faculty of Computers and Information; ABO Research.

Multimedia Data Mining Arvind Balasubramanian Multimedia Lab (ECSS 4.416) The University of Texas at Dallas.

Information Retrieval in Practice

Learning to classify the visual dynamics of a scene Nicoletta Noceti Università degli Studi di Genova Corso di Dottorato.

TEMPORAL VIDEO BOUNDARIES -PART ONE- SNUEE KIM KYUNGMIN.

Bridge Semantic Gap: A Large Scale Concept Ontology for Multimedia (LSCOM) Guo-Jun Qi Beckman Institute University of Illinois at Urbana-Champaign.

Multimedia Databases (MMDB)

Multimedia Information Retrieval and Multimedia Data Mining Chengcui Zhang Assistant Professor Dept. of Computer and Information Science University of.

Player Action Recognition in Broadcast Tennis Video with Applications to Semantic Analysis of Sport Game Guangyu Zhu, Changsheng Xu Qingming Huang, Wen.

Information Systems & Semantic Web University of Koblenz ▪ Landau, Germany Semantic Web - Multimedia Annotation – Steffen Staab

A Novel Framework for Semantic Annotation and Personalized Retrieval of Sports Video IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 10, NO. 3, APRIL 2008.

Tactic Analysis in Football Instructors: Nima Najafzadeh Mahdi Oraei Spring

Spatio-temporal constraints for recognizing 3D objects in videos Nicoletta Noceti Università degli Studi di Genova.

Understanding The Semantics of Media Chapter 8 Camilo A. Celis.

1 CS 430: Information Discovery Lecture 22 Non-Textual Materials: Informedia.

Levi Smith.  Reading papers  Getting data set together  Clipping videos to form the training and testing data for our classifier  Project separation.

1 Data Mining for Surveillance Applications Suspicious Event Detection Dr. Bhavani Thuraisingham April 2006.

PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.

Case Study 1 Semantic Analysis of Soccer Video Using Dynamic Bayesian Network C.-L Huang, et al. IEEE Transactions on Multimedia, vol. 8, no. 4, 2006 Fuzzy.

Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources Rong Yan Alexander G. Hauptmann School of Computer Science Carnegie Mellon.

1 Broadcast News Segmentation using Metadata and Speech-To-Text Information to Improve Speech Recognition Sebastien Coquoz, Swiss Federal Institute of.

MMDB-9 J. Teuhola Standardization: MPEG-7 “Multimedia Content Description Interface” Standard for describing multimedia content (metadata).

GENDER AND AGE RECOGNITION FOR VIDEO ANALYTICS SOLUTION PRESENTED BY: SUBHASH REDDY JOLAPURAM.

Semantic Extraction and Semantics-Based Annotation and Retrieval for Video Databases Authors: Yan Liu & Fei Li Department of Computer Science Columbia.

Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.

1/12/ Multimedia Data Mining. Multimedia data types any type of information medium that can be represented, processed, stored and transmitted over.

CSSE463: Image Recognition Day 11 Due: Due: Written assignment 1 tomorrow, 4:00 pm Written assignment 1 tomorrow, 4:00 pm Start thinking about term project.

Journal of Visual Communication and Image Representation

1 CS 430 / INFO 430 Information Retrieval Lecture 17 Metadata 4.

Data Mining for Surveillance Applications Suspicious Event Detection Dr. Bhavani Thuraisingham.

Pattern Recognition NTUEE 高奕豪 2005/4/14. Outline Introduction Definition, Examples, Related Fields, System, and Design Approaches Bayesian, Hidden Markov.

Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.

Data Mining for Surveillance Applications Suspicious Event Detection

Visual Information Retrieval

Automatic Video Shot Detection from MPEG Bit Stream

IMAGE PROCESSING RECOGNITION AND CLASSIFICATION

Multimedia Content Based Retrieval

Presenter: Ibrahim A. Zedan

Multimedia Content-Based Retrieval

Semantic Video Classification

Video-based human motion recognition using 3D mocap data

Data Mining for Surveillance Applications Suspicious Event Detection

Automatic Generation of Personalized Music Sports Video ACM MM’2005

Multimedia Information Retrieval

Ying Dai Faculty of software and information science,

Data Mining for Surveillance Applications Suspicious Event Detection

Presentation transcript:

Video Classification By: Maryam S. Mirian For: Multimedia & Pattern Recognition Joint Courses Project

Outline What is Video Classification? Straightforward or Difficult? What is its Applications? What are its methods? Review of Video Classification Methods What is my own Project, exactly?

What is Video Classification? Classify a Video (Shot) into one of Nc predefined Classes: Indoor / outdoor News / Sports …

Is Video Classification Difficult? Why? YES, Because: Data Stream is a Multi-dimensional signal. It has a subjective nature.

Classification

Required Steps for Classification Object Observations Feature Extraction Feature Reduction Classification Class Labels Using Methods like: PCA, LDA The most Important and the most difficult part

Methods of Classification Bayesian Classification kNN Classification Neural Classification MLP RBF Classification based on Support Vector Machines Rule-based Classification

Bayesian Decision Making So, x belongs to w2

Methods of Classification Bayesian Classification kNN Classification Neural Classification MLP RBF Classification based on Support Vector Machines Rule-based Classification

While 3 Black Neighbor, so X should be Black! kNN Decision Making k = 5, 2 Red Neighbor While 3 Black Neighbor, so X should be Black!

Methods of Classification Bayesian Classification kNN Classification Neural Classification MLP RBF Classification based on Support Vector Machines Rule-based Classification

MLP Classifier

Video Content Analysis

Applications of Automatic video classification Automatic Video segmentation content based retrieval browsing and retrieving digitized video identifying close-up video frames before running a computationally expensive face recognizer. effective management of ever-increasing amount of broadcast news video: personalization of news video.

Classify Shot or Video? One effective way to organize the video is to segment the video into small, single-story units and classify these units according to their semantics. A shot represents a contiguous sequence of visually similar frames. It is a syntactical representation and does not usually convey any coherent semantics to the users.

Looking @ Video Classification

Ide et al. [1998] Problem Domain: News video Features: Videotext motion face segmented the video into shots used clustering techniques classify each shot into 1 of 5 classes: Speech/report, Anchor, Walking, Gathering, and Computer graphics shots. Quite simple but seems effective for this restricted class of problems.

Huang et al. [1999] Problem Domain: TV Programs Features: news report weather forecast Commercials basketball games football games Features: Audio Color motion

Chen and Wong [2001] Problem Domain: Features: news video: News Weather Reporting Commercials Basketball Football Features: Motion Color text caption cut rate used a rule-based approach

Looking @ Lekha Chaisorn et.al [2002] in More Details

Basic Ideas Proposes a two-level, multi-modal framework. The video is analyzed at the shot and story unit (or scene) levels. At the shot level, a Decision Tree to classify the shot into one of 13 pre-defined categories is employed. At the scene level, the HMM (Hidden Markov Models) analysis is used to eliminate shot classification errors Results indicate that a high accuracy of over 95 % for shot classification can be achieved. The use of HMM analysis helps to improve the accuracy of the shot classification and achieve over 89% accuracy on story segmentation.

Predefined Classes

Features in Shot Level Low-level Visual Content Feature Color Histogram Temporal Features Background scene change Speaker change Audio Motion activity Shot duration High-level Object-based features Face Shot type Videotext Centralized Videotext

Feature vector of a shot Si = (a, m, d, f, s, t, c) a the class of audio, a ∈{ t=speech, m=music, s=silence, n =noise, tn = speech + noise, tm= speech + music, mn=music+noise} m the motion activity, m ∈{l=low, m=medium, h=high} d the shot duration, d ∈{s=short, m=medium, l=long} f the number of faces, Ν ∈ f s the shot type, s ∈{c= closed-up, m=medium, l=long, u=unknown} t the number of lines of text in the scene, Ν ∈ t c set to “true” if the videotexts present are centralized, c ∈{t=true, f=false}

Decision Tree for Shot Classification

Reading these papers, I decided about My own Project….

About Problem Domain… Sport Classification seems OK Interesting Enough It is helpful for Sports-Lovers

About Extracting features…. Features used in video analysis: color,texture,shape,motion vector… Criteria of choosing features : they should have similar statistical behavior across time Color histogram: simple and robust Motion vectors:invariance to color and light

So, My Own Project is Design a Classifier Test the Approach Sports Video Classifications : Football, Basketball, ….(Those Well-defined sports, I can find Video On!) Steps I should take: Finding or Gathering a Video Collection Shot Detection Feature Extraction : Key Frame (s) Extraction: Selecting Middle Shot I-Frame Use of Clustering … Motion Vector–based Features Straight Lines Detection Design a Classifier Test the Approach

Looking @Ekin,Tekalp[2003] one Research on Football Video Classification

Features Cinematic Object-based result from common video composition and production rules. shot types, camera motions and replays. Object-based Described by their spatial, e.g., color, texture, and shape, and spatio-temporal features, such as object motions and interactions

Robust Dominant Color Region Detection A soccer field has one distinct dominant color (a tone of green) that may vary from stadium to stadium, and also due to weather and lighting conditions within the same stadium. The statistics of this dominant color, in the HSI space, are learned by the system at start-up, and then automatically updated to adapt to temporal variations.

Shot classification Long Shot In-Field Medium Shot Close-Up Shot A long shot displays the global view of the field. In-Field Medium Shot a whole human body is usually visible. Close-Up Shot shows the above-waist view of one person Out of Field Shot The audience, coach, and other shots

How Extend to Shot from a Frame? Due to the computational simplicity they find the class of every frame in a shot and assign the shot class to the label of the majority of frames.

Decision Schema based on G The first stage uses G value and two thresholds, TcloseUp and Tmedium to determine the frame view label.

Soccer Eevent Detection Goal Detection Referee Detection Controversial calls, such as red-yellow cards and penalties Penalty Box Detection

Goal Detection Occurrence of a goal is generally followed by a special pattern of cinematic features. A goal event leads to a break in the game. one or more close-up views of the actors of the goal event. show one or more replay(s) the restart of the game is usually captured by a long shot.

Referee Detection Assumed that there is, a single referee in a: medium out of field close-up shot So no search for a referee in a long shot

Penalty Box Detection Field lines in a long view can be used to localize the view and/or register the current frame on the standard field model

Interesting Summaries Goal summaries summaries with Referee and Penalty box objects

Adaptation of Parameters Tcolor in dominant color region detection TcloseUp and Tmedium in shot classification referee color statistics The training stage can be performed in a very short time to find Mean and Variance of a Normal pdf.

Results for High-Level Analysis and Summarization Goal detection results

Results for High-Level Analysis and Summarization(2) Referee detection results

Results for High-Level Analysis and Summarization(3) Penalty box detection results

References Automatic soccer video analysis and summarization, in Symp. Electronic Imaging: Science and Technology: Storage and Retrieval for Image and Video Databases IV, IS&T/SPI03, Jan. 2003, CA. “The Segmentation and Classification of Story Boundaries In News Video”, Proceeding of 6th IFIP working conference on Visual Database Systems- VDB6 2002, Australia 2002 Pattern Classification, by Duda, Hart, and Stork, 2000

Thanks for Your Attention Any Question or Comment?