Preliminary Transformations
Multi Oriented Text Recognition In Digital Images
Presented by: Mona Saudagar
Under the guidance of: Prof. S. V. Jain

Introduction  Among all the contents in images, text information has inspired great interests, since it can be easily understood by both human and computer  Text in the image contains useful information which helps to acquire the overall idea behind the image.  Lot of existing text detection and recognition systems are considered for horizontal or near horizontal texts but detecting texts of random orientations from images has become an increasingly important and yet challenging task.

Introduction  Detecting texts of random orientations from images is a challenging problem due to the variety of fonts, sizes, styles, orientations, alignment effects, reflections, shadows, the complexity of image background.  As a result, normal document OCR does not give accurate recognition text due to the above mentioned factors.

Applications
Extracting and recognizing text from various types of images is useful in text-based applications such as:
- Video and image database retrieval
- Image annotation
- Data mining
- Detection of vehicle license plates
- Automatic detection of street names, locations, traffic warnings, and names of commercial goods

Text information extraction system
1. Input image
2. Text detection: is there any text in the image?
3. Text localization: where is the text?
4. Text extraction: how can the text be separated from the background?
5. Output image (textual content)

According to the literature on text recognition, there are three classes of methods:
(i) Methods that recognize segmented text using their own proposed features with classifiers
(ii) Methods that binarize and recognize text without segmenting text lines, using multiple-hypotheses frameworks
(iii) Methods that enhance the text through binarization to improve the recognition rate

Hidden Markov Model (HMM) Approach
The approach consists of four phases:
1. Binarization of the text line
2. Path of the sliding window
3. Feature extraction
4. Hidden Markov models

1. Binarization of Text Line
- The Wavelet-Gradient-Fusion method converts the text line image into a binary image.
- It fuses the horizontal, vertical, and diagonal information obtained from the wavelet and the gradient of the text line image to enhance the text information.
- An unsupervised clustering algorithm is applied to row-wise and column-wise pixels separately to extract possible text information.

Figure: (a) Input text line image; (b) Horizontal wavelet; (c) Horizontal gradient; (d) Fusion-1 of (b) and (c); (e) Vertical wavelet; (f) Vertical gradient; (g) Fusion-2 of (e) and (f); (h) Diagonal wavelet; (i) Diagonal gradient; (j) Fusion-3 of (h) and (i); (k) Fusion of Fusion-1, Fusion-2, and Fusion-3
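The binarization idea can be sketched roughly as follows, for the horizontal direction only. Several choices here are assumptions made for illustration and are not taken from the slides: a one-level Haar-style pair difference stands in for the wavelet detail, an absolute forward difference stands in for the gradient, fusion is done by adding the two normalized maps, the clustering step is a two-cluster 1-D k-means, and the row-wise and column-wise results are combined by union. All function names are hypothetical.

```python
def normalize(values):
    """Scale a list of non-negative numbers to the range [0, 1]."""
    top = max(values)
    return [v / top if top else 0.0 for v in values]


def horizontal_fusion(row):
    """Fuse a Haar-style detail signal with a gradient along one pixel row."""
    n = len(row)
    # Haar-style detail: half the difference of each non-overlapping pixel
    # pair, written to both pixels so the result keeps the row length.
    detail = [0.0] * n
    for i in range(0, n - 1, 2):
        d = abs(row[i] - row[i + 1]) / 2.0
        detail[i] = detail[i + 1] = d
    # Gradient: absolute forward difference (the last pixel reuses the
    # previous value so the lengths match).
    grad = [abs(row[i + 1] - row[i]) for i in range(n - 1)]
    grad.append(grad[-1] if grad else 0.0)
    # Assumed fusion rule: sum of the two normalized maps.
    return [a + b for a, b in zip(normalize(detail), normalize(grad))]


def two_means(values, iterations=20):
    """Cluster 1-D values into two groups; True marks the high cluster."""
    low, high = min(values), max(values)
    if low == high:
        return [False] * len(values)
    for _ in range(iterations):
        labels = [abs(v - high) < abs(v - low) for v in values]
        highs = [v for v, is_high in zip(values, labels) if is_high]
        lows = [v for v, is_high in zip(values, labels) if not is_high]
        low, high = sum(lows) / len(lows), sum(highs) / len(highs)
    return labels


def binarize(image):
    """Cluster fused rows and columns separately, then take their union."""
    fused = [horizontal_fusion(row) for row in image]
    by_row = [two_means(row) for row in fused]
    by_col = [two_means(list(col)) for col in zip(*fused)]
    return [[1 if by_row[r][c] or by_col[c][r] else 0
             for c in range(len(image[0]))] for r in range(len(image))]
```

Calling `binarize` on a small grayscale image given as a list of rows returns a 0/1 image of the same size in which strong horizontal intensity changes are marked as candidate text pixels. A fuller version would repeat the fusion for the vertical and diagonal directions and fuse the three results.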

2. Path of Sliding Window
- For feature analysis, a fixed-width sliding window is placed at the leftmost position of the curved text line and moved to successive positions in steps.
- At each position, the sliding window is subdivided into rectangular cells.
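A minimal sketch of the window stepping and cell subdivision, assuming positions are measured along the text line (that is, along the curved path or after the line has been straightened) and that the grid is 4 × 4. The function names and the numbers in the usage note are hypothetical.

```python
def window_positions(line_width, window_width, step):
    """Left edges of a fixed-width window moved left to right in steps."""
    return list(range(0, line_width - window_width + 1, step))


def split_into_cells(left, top, width, height, rows=4, cols=4):
    """Divide one window into rows x cols cells given as (left, top, w, h)."""
    cell_w, cell_h = width / cols, height / rows
    return [(left + c * cell_w, top + r * cell_h, cell_w, cell_h)
            for r in range(rows) for c in range(cols)]
```

For a text line 20 pixels long, `window_positions(20, 8, 4)` yields the left edges 0, 4, 8, and 12, and `split_into_cells(0, 0, 8, 16)` yields 16 cells of 2 × 4 pixels each.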

3. Feature Extraction
Two types of features are used:
- Marti-Bunke features: the sliding window is 1 pixel wide and moves from left to right; at each position, 9 geometrical features are extracted.
- LGH features: based on the local gradient histogram. A sliding window traverses the image, and each window is subdivided into 4 × 4 regular cells. A histogram of gradient orientations is computed from all pixels in each cell. With 8 orientations per histogram, concatenating the 16 histograms yields a feature vector of 128 features (16 × 8 = 128).
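The LGH computation for a single window can be sketched as below. This is an illustration under stated assumptions, not the exact feature definition: gradients are approximated with central differences, each pixel votes for one orientation bin weighted by its gradient magnitude, border pixels are skipped, and no smoothing or normalization is applied. The function name `lgh_features` is hypothetical.

```python
import math


def lgh_features(window, grid=4, bins=8):
    """Concatenate per-cell gradient-orientation histograms for one window."""
    height, width = len(window), len(window[0])
    hist = [[0.0] * bins for _ in range(grid * grid)]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            # Central differences approximate the horizontal and vertical
            # gradient components.
            gx = window[y][x + 1] - window[y][x - 1]
            gy = window[y + 1][x] - window[y - 1][x]
            magnitude = math.hypot(gx, gy)
            if magnitude == 0:
                continue
            # Map the orientation to [0, 2*pi) and pick one of the bins.
            angle = math.atan2(gy, gx) % (2 * math.pi)
            b = min(int(angle / (2 * math.pi) * bins), bins - 1)
            # Locate the cell of the grid that contains this pixel.
            cell = (y * grid // height) * grid + (x * grid // width)
            hist[cell][b] += magnitude  # assumed magnitude-weighted vote
    # 16 cells x 8 bins = 128 values with the default settings.
    return [v for cell_hist in hist for v in cell_hist]
```

With the defaults, the returned list always has 4 × 4 × 8 = 128 entries, matching the feature dimension stated above.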

4. Hidden Markov Models
Text recognition is performed using HMMs. The basic models in this approach are character models.
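One way to picture recognition with character models is to score a feature sequence against each character's HMM and keep the best-scoring character. The sketch below assumes discrete HMMs over quantized feature symbols and uses the forward algorithm; the slides do not state the emission model or the decoding procedure, and the two toy models are invented purely for illustration.

```python
def forward_likelihood(observations, start, trans, emit):
    """Probability of a symbol sequence under one discrete HMM."""
    states = range(len(start))
    # Initialization: start probability times emission of the first symbol.
    alpha = [start[s] * emit[s][observations[0]] for s in states]
    # Induction: propagate through the transitions, then emit each symbol.
    for symbol in observations[1:]:
        alpha = [sum(alpha[p] * trans[p][s] for p in states) * emit[s][symbol]
                 for s in states]
    return sum(alpha)


def recognize(observations, models):
    """Return the label of the character model that best explains the data."""
    return max(models,
               key=lambda label: forward_likelihood(observations, *models[label]))


# Hypothetical two-state, left-to-right models over the symbols {0, 1}:
# (start probabilities, transition matrix, emission matrix).
TOY_MODELS = {
    "I": ([1.0, 0.0], [[0.5, 0.5], [0.0, 1.0]], [[0.9, 0.1], [0.8, 0.2]]),
    "O": ([1.0, 0.0], [[0.5, 0.5], [0.0, 1.0]], [[0.2, 0.8], [0.9, 0.1]]),
}
```

With these toy models, `recognize([0, 0, 0], TOY_MODELS)` selects "I" and `recognize([1, 0, 0], TOY_MODELS)` selects "O". A practical system would work in log space to avoid underflow on long sequences and would typically use continuous emission densities over the feature vectors.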

Performance
Character and word recognition accuracy of the HMM method (%):

Dataset     Character   Word
ICDAR
MSRA-TD
NUS

Why is the accuracy not high?
- The HMM method produces some wrong character recognition results.
- Analysis of the recognition results shows that the classification errors are mainly caused by ambiguous characters, such as {L, I}, {O, D}, {h, n}, and {e, c}.
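This kind of error analysis can be sketched as a tally of misrecognized character pairs, checked against the ambiguous sets listed above. The function name and the sample strings in the usage note are hypothetical and are not results from the presentation.

```python
from collections import Counter

# Ambiguous character sets named in the error analysis.
AMBIGUOUS_PAIRS = [{"L", "I"}, {"O", "D"}, {"h", "n"}, {"e", "c"}]


def confusion_summary(truth, predicted):
    """Count misrecognized pairs and how many fall in the ambiguous sets."""
    errors = Counter((t, p) for t, p in zip(truth, predicted) if t != p)
    ambiguous = sum(n for (t, p), n in errors.items()
                    if {t, p} in AMBIGUOUS_PAIRS)
    return errors, ambiguous
```

For the made-up pair `confusion_summary("HOLDenX", "HDIDcnY")`, four characters are misrecognized and three of those errors fall in the ambiguous sets.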

Conclusion  Due to the variety of orientation and complex backgrounds, text reading from natural scene images is still an unsolved problem.  Though we have large no of algorithms and methods for text extraction from image but none of them provide a adequate output because of deviation in text.

Future Scope  As we find classification errors mainly caused by ambiguous characters we can make further improvements by making features of characters more robust.  Available techniques can also be extended to work with touching and broken characters.

References  S. Roy, P. P. Roy, P. Shivakumara, G. Louloudis, C. L. Tan, HMM-based Multi Oriented Text Recognition in Natural Scene Image© 2013 IEEE  Lukas Neumann Jirı Matas, Real-Time Scene Text Localization and Recognition©2012 IEEE  S. Grover, K. Arora, S. K. Mitra, Text Extraction from Document Images using Edge Information IEEE Indian Council Conference© 2009
