Automatic Video Shot Detection from MPEG Bit Stream

Slides:



Advertisements
Similar presentations
Automatic Video Shot Detection from MPEG Bit Stream Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC.
Advertisements

Automated Shot Boundary Detection in VIRS DJ Park Computer Science Department The University of Iowa.
Taxonomic classification for web- based videos Author: Yang Song et al. (Google) Presenters: Phuc Bui & Rahul Dhamecha.
-1/20- MPEG 4, H.264 Compression Standards Presented by Dukhyun Chang
B. Prabhakaran1 Multimedia Metadata Multimedia information needs to be “interpreted” Popular example: “A picture is worth thousand words” Who will “write”
MPEG-4 Objective Standardize algorithms for audiovisual coding in multimedia applications allowing for Interactivity High compression Scalability of audio.
A presentation by Modupe Omueti For CMPT 820:Multimedia Systems
Discussion on Video Analysis and Extraction, MPEG-4 and MPEG-7 Encoding and Decoding in Java, Java 3D, or OpenGL Presented by: Emmanuel Velasco City College.
Content-based Video Indexing, Classification & Retrieval Presented by HOI, Chu Hong Nov. 27, 2002.
1 Adaptive slice-level parallelism for H.264/AVC encoding using pre macroblock mode selection Bongsoo Jung, Byeungwoo Jeon Journal of Visual Communication.
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
ADVISE: Advanced Digital Video Information Segmentation Engine
Issues for Multimedia Privacy & Security ---- Video Content Privacy Protection, Copyright Protection & Database Access Control Jianping Fan Dept of Computer.
1 Motion in 2D image sequences Definitely used in human vision Object detection and tracking Navigation and obstacle avoidance Analysis of actions or.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
T.Sharon 1 Internet Resources Discovery (IRD) Video IR.
Efficient MPEG Compressed Video Analysis Using Macroblock Type Information Soo-Chang Pei, Yu-Zuong Chou IEEE TRANSACTIONS ON MULTIMEDIA, DECEMBER,1999.
Traditional Database Indexing Techniques for Video Database Indexing Jianping Fan Department of Computer Science University of North Carolina at Charlotte.
LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.
Automatic Key Video Object Plane Selection Using the Shape Information in the MPEG-4 Compressed Domain Berna Erol and Faouzi Kossentini, Senior Member,
Decision Trees for Error Concealment in Video Decoding Song Cen and Pamela C. Cosman, Senior Member, IEEE IEEE TRANSACTION ON MULTIMEDIA, VOL. 5, NO. 1,
Visual Information Retrieval Chapter 1 Introduction Alberto Del Bimbo Dipartimento di Sistemi e Informatica Universita di Firenze Firenze, Italy.
Presented by Zeehasham Rasheed
Stockman MSU Fall Computing Motion from Images Chapter 9 of S&S plus otherwork.
1 Motion in 2D image sequences Definitely used in human vision Object detection and tracking Navigation and obstacle avoidance Analysis of actions or.
A fuzzy video content representation for video summarization and content-based retrieval Anastasios D. Doulamis, Nikolaos D. Doulamis, Stefanos D. Kollias.
Object Tracking for Retrieval Application in MPEG-2 Lorenzo Favalli, Alessandro Mecocci, Fulvio Moschetti IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR.
Video Streaming via Transcoding Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC 28223
Multimedia Databases (MMDB)
Multimedia Information Retrieval
Adaptive Multi-path Prediction for Error Resilient H.264 Coding Xiaosong Zhou, C.-C. Jay Kuo University of Southern California Multimedia Signal Processing.
80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.
IBM QBIC: Query by Image and Video Content Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC 28223
Performance Characterization of Video-Shot-Change Detection Methods U. Gargi, R. Kasturi, S. Strayer Presented by: Isaac Gerg.
Compression video overview 演講者:林崇元. Outline Introduction Fundamentals of video compression Picture type Signal quality measure Video encoder and decoder.
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
MPEG-4 Systems Introduction & Elementary Stream Management
Vamsi Krishna Vegunta University of Texas, Arlington
Implementation, Comparison and Literature Review of Spatio-temporal and Compressed domains Object detection. By Gokul Krishna Srinivasan Submitted to Dr.
Image Classification for Automatic Annotation
Bachelor of Engineering In Image Processing Techniques For Video Content Extraction Submitted to the faculty of Engineering North Maharashtra University,
Content-Based Image Retrieval QBIC Homepage The State Hermitage Museum db2www/qbicSearch.mac/qbic?selLang=English.
Soon Joo Hyun Database Systems Research and Development Lab. US-KOREA Joint Workshop on Digital Library t Introduction ICU Information and Communication.
Attila Kiss, Tamás Németh, Szabolcs Sergyán, Zoltán Vámossy, László Csink Budapest Tech Recognition of a Moving Object in a Stereo Environment Using a.
Instructor : Dr. K. R. Rao Presented by : Vigneshwaran Sivaravindiran
Introduction to MPEG  Moving Pictures Experts Group,  Geneva based working group under the ISO/IEC standards.  In charge of developing standards for.
Ontology-based Automatic Video Annotation Technique in Smart TV Environment Jin-Woo Jeong, Hyun-Ki Hong, and Dong-Ho Lee IEEE Transactions on Consumer.
Course Project Lists for ITCS6157 Jianping Fan. Project Implementation Lists Automatic Image Clustering You can download 1,000,000 images from You can.
Visual Information Retrieval
ITCS 6157/8157: Visual Database
CS644 Advanced Topics in Networking
Presenter: Ibrahim A. Zedan
Multimedia Content-Based Retrieval
JPEG Image Coding Standard
MPEG-1 Video Coding Standard
Personalized Social Image Recommendation
Video Compression - MPEG
Security Issues for Visual Data: Copyright and Access Control
Project Implementation for ITCS4122
Watermarking for Image Authentication ( Fragile Watermarking )
Research Topic Error Concealment Techniques in H.264/AVC for Wireless Video Transmission Vineeth Shetty Kolkeri EE Graduate,UTA.
Image Segmentation Techniques
CIS679: MPEG MPEG.
Fast Decision of Block size, Prediction Mode and Intra Block for H
DC Image Extraction and Shot Segmentation
MPEG4 Natural Video Coding
Multimedia Content Description Interface
Multimedia Information Retrieval
A Block Based MAP Segmentation for Image Compression
Presentation transcript:

Automatic Video Shot Detection from MPEG Bit Stream Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC 28223 jfan@uncc.edu http://www.cs.uncc.edu/~jfan

Why we need video shots? Indexing a. Text Retrieval: Keyword Extraction Reverse File Indexing Document Storage

Why we need video shots? Indexing b. Database Query: Entity Extraction B-Tree Indexing Database Storage

Why we need video shots? Indexing c. Image Retrieval: Object Indexing Ontology Indexing Database Storage

Why we need video shots? MPEG Video Sequence Video shot frames

Why we need video shots? Story Unit in News Video Each story unit may consist of multiple connected video shots!

??? Why we need video shots? Indexing Shot Indexing Video Shots in Storage

Video shot =?= keyword in video? ??? Shot is used as basic unit for video indexing! Storage Database Networks Query Processing Server

1. Why we need shot detection? Potential Search on Video Shots: Give me some videos which consist of similar video shots! b. Give me all the videos which are related to this video shot!

2. Content Structure of Video Sequence Video shot frames Shot-Based Video Content Interpretation!

3. Coding Structure in MPEG Video Video sequence MPEG encoding GOP ? shot

Interesting Questions If you are leader of MPEG around 1994, how can you design new MPEG standards which can be used for video indexing?

New MPEG Standard with Video Shot Video sequence Video shot GOP GOP GOP GOP is Video Shot! frames

4. What’s mean of video shot? Why We Have Shots: (a) Human Editing; (b) Content Change & Camera Motion Shot Types: (a) Scene Cuts; (b) Fade in & Fade out; © Dissolves & Wipes

4. What’s mean of video shot? Scene Cuts: Sudden change of video content or focus Shot 1 time

4. What’s mean of video shot? b. Fade: gradual transition between a screen and a constant image (fade out).

4. What’s mean of video shot? c. Dissolve: gradual transition from one screen to another, the first screen fade & the second one fade out.

4. What’s mean of video shot? Dissolve

4. What’s mean of video shot? d. Wipe: a line moves across the screen & new appears behind the line

4. What’s mean of video shot? Wipe

Conclusion: Why we have video shots? Video Sequence Video shot frames Big change between two continuous video frames!

How can we detect video shots? Why we can define them as cuts, wipe, ….? Because they are changed between neighboring video frames!!!! How can we measure the changes or similarities between the neighboring video frames? How can make decision on whether they are change or not?

5. How to detect video shot? Major Components for Shot Detection: Visual Representation of Video Frames: Color, Texture… Difference Calculation for Neighboring Video Frames Threshold for Decision Making: How Large is Enough?

5. How to detect video shot? How to measure statistical property of video frames? Color Histogram

5. How to detect video shot? How to measure statistical property of video frames? Color Histogram

5. How to detect video shot? How to measure statistical property of video frames? Texture Histogram

5. How to detect video shot?

Color Histogram Difference 5. How to detect video shot? How we can do this more efficient on MPEG videos? MPEG videos a. Easy but not smart Automatic Threshold Determination MPEG Decoder Color Histogram Difference Scene Cut Frame Output

Scene Cut

5. How to detect video shot? Effects on color histogram difference by scene cuts:

5. How to detect video shot? How to measure the change of video content? Color Histogram Difference

5. How to detect video shot? How we can select the threshold automatically? J. Fan, D.K.Y. Yau, W.G. Aref, A. Rezgui, ``Adaptive motion- compensated video coding scheme towards content-based bit rate allocation”, Journal of Electronic Imaging, vol.9, no.4, pp. 521-533, 2000. This algorithm can adapt the thresholds to different videos. b. J. Fan, et al., ``ClassView: Hierarchical video shot classification and retrieval”, IEEE Trans. on Multimedia, 2004. This algorithm can adapt thresholds to different video shots.

5. How to detect video shot? Relationships among continuous frames can defined as: scene cuts versus non-scene cuts 1 non-scene cut scene cut probability T 1

5. How to detect video shot? How to obtain T automatically? Entropy for non-scene cut frames:

Interesting Questions What will happen on MPEG bit stream if the current video frame is a shot boundary? I B P How can we use such properties for shot detection?

5. How to detect video shot? b. Complex but smart approach If scene cuts happen on I frames in MPEG Video? MPEG encoding GOP How to calculate color histograms of I frames efficiently? Only decode DC coefficient!!!

5. How to detect video shot? b. Complex but smart approach What will happen on MPEG bit stream if there is a shot boundary? 4Mb/s 1.5Mb/s high Picture Quality low

If content change happens on P frames in MPEG video? Most macroblocks can not find their correspondences!! Only decode the block coding types in P frames!!! Reference frame( I or P) Current P frame

If content change happens on B frames in MPEG video? Only decode the coding types in B frames!!!! Most macroblocks in B current frame can only be predicted by backward way!! Future Reference Previous Reference Current B frame

6. Shot Detection via Edge Extraction Edges indicate the content structure of video frames!

6. Shot Detection via Edge Extraction a. Easy but not smart MPEG videos Automatic Threshold Determination MPEG Decoder Color Edge Detection Scene Cut Frame Output

6. Shot Detection via Edge Extraction J. Fan, W.G. Aref, M.-S. Hacid and A.K. Elmagarmid, ``An automatic isotropic color edge detection technique”, Pattern Recognition Letters, vol.22, pp.1419-1429, 2001. 1 2 1 1 -1 2 1 1 2 2 -2 1 -1 -1 1 -1 -1 -2 -1 -2 1 -1 -2 -1

6. Shot Detection via Edge Extraction b. complex but smart Only decode only part of MPEG videos 2550 0 0 0 0 0 0 255 255 255 255 255 255 255 255 0 0 0 0 0 0 0 0 255 255 255 255 255 255 255 255 DCT 0 0 0 0 0 0 0 0 255 255 255 255 255 255 255 255 255 255 255 255 255 255 255 255 0 0 0 0 0 0 0 0 255 255 255 255 255 255 255 255 0 0 0 0 0 0 0 0 255 255 255 255 255 255 255 255 0 0 0 0 0 0 0 0 255 255 255 255 255 255 255 255 255 255 255 255 255 255 255 255 0 0 0 0 0 0 0 0 What you find from this figure?

6. Shot Detection via Edge Extraction 1. Calculate the directional edge histogram Horizontal edges Vertical edges Northwest diagonal edges Northeast diagonal edges 2. Calculate the differences of the directional edge histogram 3. Threshold to obtain the scene cuts

7. Complex Shot Detection Effects on color histogram different by Fades & Dissolves

7. Complex Shot Detection Wipe Modeling Dissolve Modeling

8. Camera Motion Detection Zoom in

8. Camera Motion Detection b. Zoom out

8. Camera Motion Detection c. Left Pan

8. Camera Motion Detection d. Right Pan

Why We Need Keyframe Extraction?

Why We Need Keyframe Extraction?

Keyframe Extraction What are keyframes? Different from others! How to measure the significance of video frames? How to make the decision?

9. Background Modeling for Shot Detection

9. Background Modeling for Shot Detection

9. Background Modeling for Shot Detection

9. Background Modeling for Shot Detection

9. Background Modeling for Shot Detection

9. Background Modeling for Shot Detection

10. Shot-Based Video Content Representation Statistical Information Keyframe-Based Information Color histogram & variance; (b) edge histogram & variance © motion histogram & variance; Color histogram; (b) edge histogram © motion histogram

11. Shot-Oriented Video Semantics Interpretation

Who cares?

12. Who works on this topic? 1. IBM research center 2. Intel 3. Microsoft Redmond & Beijing 4. Kodak at Rochester 5. HP at Polo Alto 6. Google 7. Yahoo!

12. Requirement If you are asked to work on video shot detection from compressed video stream, what you should know (basic requirement)? MPEG video coding standard: how to decode and read MPEG file b. Knowledge on video shot analysis: Color, motion, edge, fade, dissolve, … c. Skills on C++

Design a system for automatic shot detection from MPEG video streams Project Introduction Design a system for automatic shot detection from MPEG video streams Interface design: read in MPEG files and display shot boundaries, or even decision process!