Text Detection in Video Min Cai 2002.3.13. Background  Video OCR: Text detection, extraction and recognition  Detection Target: Artificial text  Text.

Slides:



Advertisements
Similar presentations
Object Recognition from Local Scale-Invariant Features David G. Lowe Presented by Ashley L. Kapron.
Advertisements

By: Mani Baghaei Fard.  During recent years number of moving vehicles in roads and highways has been considerably increased.
CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 4 – Digital Image Representation Klara Nahrstedt Spring 2009.
November 12, 2013Computer Vision Lecture 12: Texture 1Signature Another popular method of representing shape is called the signature. In order to compute.
Computer Vision Lecture 16: Texture
Image Segmentation Image segmentation (segmentace obrazu) –division or separation of the image into segments (connected regions) of similar properties.
Digital Image Processing In The Name Of God Digital Image Processing Lecture3: Image enhancement M. Ghelich Oghli By: M. Ghelich Oghli
Lecture 07 Segmentation Lecture 07 Segmentation Mata kuliah: T Computer Vision Tahun: 2010.
電腦視覺 Computer and Robot Vision I Chapter2: Binary Machine Vision: Thresholding and Segmentation Instructor: Shih-Shinh Huang 1.
Computer Vision Lecture 16: Region Representation
Image Segmentation Region growing & Contour following Hyeun-gu Choi Advisor: Dr. Harvey Rhody Center for Imaging Science.
Segmentation and Region Detection Defining regions in an image.
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
6/9/2015Digital Image Processing1. 2 Example Histogram.
EE 7730 Image Segmentation.
Real-time Embedded Face Recognition for Smart Home Fei Zuo, Student Member, IEEE, Peter H. N. de With, Senior Member, IEEE.
A New Approach for Video Text Detection and Localization M. Cai, J. Song and M.R. Lyu VIEW Technologies The Chinese University of Hong Kong.
Prénom Nom Document Analysis: Segmentation & Layout Analysis Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Processing Digital Images. Filtering Analysis –Recognition Transmission.
Segmentation Divide the image into segments. Each segment:
Face Detection: a Survey Speaker: Mine-Quan Jing National Chiao Tung University.
Quadtrees, Octrees and their Applications in Digital Image Processing
LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.
Chinese Character Recognition for Video Presented by: Vincent Cheung Date: 25 October 1999.
The Segmentation Problem
Image Analysis Preprocessing Arithmetic and Logic Operations Spatial Filters Image Quantization.
Smart Traveller with Visual Translator. What is Smart Traveller? Mobile Device which is convenience for a traveller to carry Mobile Device which is convenience.
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
E.G.M. PetrakisBinary Image Processing1 Binary Image Analysis Segmentation produces homogenous regions –each region has uniform gray-level –each region.
Scale-Invariant Feature Transform (SIFT) Jinxiang Chai.
Thresholding Thresholding is usually the first step in any segmentation approach We have talked about simple single value thresholding already Single value.
Spatial-based Enhancements Lecture 3 prepared by R. Lathrop 10/99 updated 10/03 ERDAS Field Guide 6th Ed. Ch 5: ;
Machine Vision ENT 273 Image Filters Hema C.R. Lecture 5.
Information Extraction from Cricket Videos Syed Ahsan Ishtiaque Kumar Srijan.
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
Graph-based Segmentation. Main Ideas Convert image into a graph Vertices for the pixels Vertices for the pixels Edges between the pixels Edges between.
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
CS 6825: Binary Image Processing – binary blob metrics
University of Kurdistan Digital Image Processing (DIP) Lecturer: Kaveh Mollazade, Ph.D. Department of Biosystems Engineering, Faculty of Agriculture,
Texture. Texture is an innate property of all surfaces (clouds, trees, bricks, hair etc…). It refers to visual patterns of homogeneity and does not result.
Quadtrees, Octrees and their Applications in Digital Image Processing.
Chapter 10 Image Segmentation.
G52IVG, School of Computer Science, University of Nottingham 1 Edge Detection and Image Segmentation.
Oct 21, 2008IIP20081 Image Segmentation of Historical Handwriting from Palm Leaf Manuscripts Olarik Surinta Mahasarakham University Thailand.
December 9, 2014Computer Vision Lecture 23: Motion Analysis 1 Now we will talk about… Motion Analysis.
CS654: Digital Image Analysis
Image Processing Part II. 2 Classes of Digital Filters global filters transform each pixel uniformly according to the function regardless of its location.
Autonomous Robots Vision © Manfred Huber 2014.
Lecture # 19 Image Processing II. 2 Classes of Digital Filters Global filters transform each pixel uniformly according to the function regardless of.
Text From Corners: A Novel Approach to Detect Text and Caption in Videos Xu Zhao, Kai-Hsiang Lin, Yun Fu, Member, IEEE, Yuxiao Hu, Member, IEEE, Yuncai.
Digital Image Processing
Suspicious Behavior in Outdoor Video Analysis - Challenges & Complexities Air Force Institute of Technology/ROME Air Force Research Lab Unclassified IED.
Wonjun Kim and Changick Kim, Member, IEEE
Scene Text Extraction Using Focus of Mobile Camera Egyul Kim, SeongHun Lee, JinHyung Kim Artificial Intelligence & Pattern Recognition Lab, KAIST, Korea.
Machine Vision Edge Detection Techniques ENT 273 Lecture 6 Hema C.R.
Edge Segmentation in Computer Images CSE350/ Sep 03.
Instructor: Mircea Nicolescu Lecture 5 CS 485 / 685 Computer Vision.
Digital Image Processing CSC331
Image Quality Measures Omar Javed, Sohaib Khan Dr. Mubarak Shah.
Course 3 Binary Image Binary Images have only two gray levels: “1” and “0”, i.e., black / white. —— save memory —— fast processing —— many features of.
Color Image Segmentation Mentor : Dr. Rajeev Srivastava Students: Achit Kumar Ojha Aseem Kumar Akshay Tyagi.
Portable Camera-Based Assistive Text and Product Label Reading From Hand-Held Objects for Blind Persons.
Graph-based Segmentation
Course : T Computer Vision
OCR Reading.
Data Mining Soongsil University
Presenter: Ibrahim A. Zedan
A New Approach to Track Multiple Vehicles With the Combination of Robust Detection and Two Classifiers Weidong Min , Mengdan Fan, Xiaoguang Guo, and Qing.
Computer Vision Lecture 16: Texture II
Digital Image Processing
Presentation transcript:

Text Detection in Video Min Cai

Background  Video OCR: Text detection, extraction and recognition  Detection Target: Artificial text  Text detection: Detect the region from Single frame Refine the region by combining consecutive frames

Existing Work Feature ExtractionText Detection based on feature ColorConnected-component TextureTexture-Segmentation EdgeTop-Down Bottom-Up

Connected-component-based methods  Basic idea Treat text as an uniform color (color level) and classify each pixel as text or non-text according to the color value. Combine connected text-pixels into connected components. Group collinear connected components into a text string.  Advantage Can detect an arbitrary orientation text ---- with similar color and in a simple background.  Disadvantage Sensitive to color variance Lossy compression of video introduces color bleeding Complex background

Texture Segmentation method  Basic idea Treat text as a type of texture Use texture segmentation algorithms to detect text Gabor Filter Gaussian derivatives  Advantage Can segment text areas & graphic areas in a simple background efficiently. It is usually used in document analysis.  Disadvantage Time-consuming Cannot handle well a text embedded in various background.

Bottom-Up method  Basic idea A seed region is defined as a small region with high edge density. Grow a seed region into successively larger components until all seed regions are reached on the image.  Advantage It is a generic method to detect a homogeneous object of various shape. That is, it can detect not only a rectangular object, but also other shapes.  Disadvantage Sensitive to noise. Can not handle the large range of font-size. Sensitive to the stroke density (different language).

Top-Down method  Basic idea Based on run-length smoothing algorithm Analyze horizontal and vertical projection profiles  Advantage Can detect the boundary of horizontal alignment text string quickly and correctly Noise insensitive  Disadvantage Cannot handle diagonal alignment text. One pass of horizontal & vertical projection cannot handle the complex layout.

Analysis (1)  A certain contrast against background Artificial text strings are designed to be read easily  A certain stroke density  Text strings always appear horizontally  Spatial cohesion Characters of the same text string are of similar heights, orientation and spacing  Size constraint Text strings have certain size restriction  A text string appears in multiple consecutive frames and the similar position.

Analysis (2) ProblemsResolutions How to extract more useful edge?Local Thresholding How to highlight text areas?Text area recovery How to detect text regions fast and correctlyHow to detect text regions fast and correctly? Coarse-To-Fine detection

Single Threshold

Local threshold (1)  Use a small kernel (red) to scan the whole image.  In a bigger window (gray) surrounding the kernel, calculate the local threshold corresponding to its local histogram. a. Window move MINMAXT-local Count Edge strength 0 Low half High half b. Local threshold selection

Local threshold (2)

Text-like area recovery (1) Before recoveryAfter recovery

Text-like area recovery (2) Before recoveryAfter recovery

High pass filter

 Using Top-down scheme to detect text-like areas Coarse-to-Fine detection Horizontal project Vertical project Can divide? The first region from the array Add to Processing array Initial: Add the whole Image to processing array Add to result array Yes No

Detect text-like areas b. Coarse vertical projection 1) 2) 3) 4)

Refinement  Combine the neighboring text areas with similar height  Using size constraints to remove unsatisfied areas

Multi-frame analysis  Text region matching Find all the regions corresponding to the same text  Text region enhancement Enhance the text image quality by multi-frame integration  Repetitive text elimination Only record the text at its first emergence.

Thank you! End