Optical Character Recognition

Slides:



Advertisements
Similar presentations
Patient information extraction in digitized X-ray imagery Hsien-Huang P. Wu Department of Electrical Engineering, National Yunlin University of Science.
Advertisements

Applications of one-class classification
Optical Character Recognition for Handwritten Characters
Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
Image Analysis Phases Image pre-processing –Noise suppression, linear and non-linear filters, deconvolution, etc. Image segmentation –Detection of objects.
Document Image Processing
CDS 301 Fall, 2009 Image Visualization Chap. 9 November 5, 2009 Jie Zhang Copyright ©
July 27, 2002 Image Processing for K.R. Precision1 Image Processing Training Lecture 1 by Suthep Madarasmi, Ph.D. Assistant Professor Department of Computer.
Chapter 9: Morphological Image Processing
1 Probabilistic Artificial Neural Network For Recognizing the Arabic Hand Written Characters Khalaf khatatneh, Ibrahiem El Emary,and Basem Al- Rifai Journal.
Each pixel is 0 or 1, background or foreground Image processing to
CS771 Machine Learning : Tools, Techniques & Application Gaurav Krishna Y Harshit Maheshwari Pulkit Jain Sayantan Marik
Features for handwriting recognition. | 2 The challenge “Rappt JD 10 Feb no 175, om machtiging om af”
Handwritten Character Recognition Using Artificial Neural Networks Shimie Atkins & Daniel Marco Supervisor: Johanan Erez Technion - Israel Institute of.
Aula 5 Alguns Exemplos PMR5406 Redes Neurais e Lógica Fuzzy.
Processing Digital Images. Filtering Analysis –Recognition Transmission.
Traffic Sign Recognition Jacob Carlson Sean St. Onge Advisor: Dr. Thomas L. Stewart.
Chaincode Generation Contour separation extracted by algorithm Image Chaincode contour Represented as an array of coordinates and corresponding slopes.
Computer Vision Basics Image Terminology Binary Operations Filtering Edge Operators.
Facial Features Extraction Amit Pillay Ravi Mattani Amit Pillay Ravi Mattani.
E.G.M. PetrakisBinary Image Processing1 Binary Image Analysis Segmentation produces homogenous regions –each region has uniform gray-level –each region.
Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.
Presented by: Kamakhaya Argulewar Guided by: Prof. Shweta V. Jain
IIS for Image Processing Michael J. Watts
1 Template-Based Classification Method for Chinese Character Recognition Presenter: Tienwei Tsai Department of Informaiton Management, Chihlee Institute.
Chapter 9.  Mathematical morphology: ◦ A useful tool for extracting image components in the representation of region shape.  Boundaries, skeletons,
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
COMPARISON OF IMAGE ANALYSIS FOR THAI HANDWRITTEN CHARACTER RECOGNITION Olarik Surinta, chatklaw Jareanpon Department of Management Information System.
Yang, Luyu.  Postal service for sorting mails by the postal code written on the envelop  Bank system for processing checks by reading the amount of.
Digital Image Processing Chapter 9: Morphological Image Processing 5 September 2007 Digital Image Processing Chapter 9: Morphological Image Processing.
Digital Image Processing CCS331 Relationships of Pixel 1.
Morphological Image Processing
Image Processing and Pattern Recognition Jouko Lampinen.
OCR a survey Csink László Problems to Solve Recognize good quality printed text Recognize good quality printed text Recognize neatly written handprinted.
Automated Target Recognition Using Mathematical Morphology Prof. Robert Haralick Ilknur Icke José Hanchi Computer Science Dept. The Graduate Center of.
Handwritten Recognition with Neural Network Chatklaw Jareanpon, Olarik Surinta Mahasarakham University.
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
Handwritten Hindi Numerals Recognition Kritika Singh Akarshan Sarkar Mentor- Prof. Amitabha Mukerjee.
PROJECT PROPOSAL DIGITAL IMAGE PROCESSING TITLE:- Automatic Machine Written Document Reader Project Partners:- Manohar Kuse(Y08UC073) Sunil Prasad Jaiswal(Y08UC124)
Reporter: 資訊所 P Yung-Chih Cheng ( 鄭詠之 ).  Introduction  Data Collection  System Architecture  Feature Extraction  Recognition Methods  Results.
Presented By Lingzhou Lu & Ziliang Jiao. Domain ● Optical Character Recogntion (OCR) ● Upper-case letters only.
Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology.
DIGITAL IMAGE PROCESSING Instructors: Dr J. Shanbehzadeh Mostafa Mahdijo Mostafa Mahdijo ( J.Shanbehzadeh.
Mobile Image Processing
Face Image-Based Gender Recognition Using Complex-Valued Neural Network Instructor :Dr. Dong-Chul Kim Indrani Gorripati.
CS654: Digital Image Analysis
1 Machine Vision. 2 VISION the most powerful sense.
By Pushpita Biswas Under the guidance of Prof. S.Mukhopadhyay and Prof. P.K.Biswas.
CS654: Digital Image Analysis
Scanned Documents INST 734 Module 10 Doug Oard. Agenda Document image retrieval  Representation Retrieval Thanks for David Doermann for most of these.
Morphological Image Processing Robotics. 2/22/2016Introduction to Machine Vision Remember from Lecture 12: GRAY LEVEL THRESHOLDING Objects Set threshold.
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
Lecture 3 Template Matching Edge Detection. 2 Processes for Assignment 1  Understand Image Format  Pre Processing - Gaussian, Mean Filter to clean up.
Preliminary Transformations Presented By: -Mona Saudagar Under Guidance of: - Prof. S. V. Jain Multi Oriented Text Recognition In Digital Images.
Chapter 6 Skeleton & Morphological Operation. Image Processing for Pattern Recognition Feature Extraction Acquisition Preprocessing Classification Post.
Automatic Script Identification. Why do we need Script Identification OCRs are generally language dependent. Document layout analysis is sometimes language.
Arabic Handwriting Recognition Thomas Taylor. Roadmap  Introduction to Handwriting Recognition  Introduction to Arabic Language  Challenges of Recognition.
1 Convolutional neural networks Abin - Roozgard. 2  Introduction  Drawbacks of previous neural networks  Convolutional neural networks  LeNet 5 
1 A Statistical Matching Method in Wavelet Domain for Handwritten Character Recognition Presented by Te-Wei Chiang July, 2005.
License Plate Recognition of A Vehicle using MATLAB
Leaves Recognition By Zakir Mohammed Indiana State University Computer Science.
IMAGE PROCESSING Tadas Rimavičius.
COMP 9517 Computer Vision Binary Image Analysis 4/15/2018
Features for handwriting recognition
HIT and MISS.
IIS for Image Processing
Extracting Old Persian Cuneiform Font Out of
Convolutional neural networks Abin - Roozgard.
Computer and Robot Vision I
Presentation transcript:

Optical Character Recognition By: mahdi sheida Guide: Dr. A. Osareh fall-2010

Objective of OCR Glossary of Image Processing Terms identification of characters from a graphic file and converting them into a regular text file : APPENDIX A Glossary of Image Processing Terms

OCR Applications: Reading important handwritten & machine-printed characters: postal zip code passport number employee code Bank cheque Ancient texts

OCR System Text in graphic file pre-processing Feature extraction classification

pre-processing Pre-processing aims to produce data that are easy for the OCR systems to operate accurately. The main objectives of pre-processing are : Binarization Noise Reduction Segmentation Normalization

Binarization Document image binarization (thresholding) refers to the conversion of a gray-scale image into a binary image.

Noise Reduction Noise reduction improves the quality of the document. Two main approaches: Filtering (masks) Morphological Operations (erosion, dilation, etc)

Segmentation Text Line Detection (histogram projections) Word Extraction (vertical projections, connected component analysis) همان روش پیدا کردن isolated connected component ها برای سگمنت کردن کاراکترها هم قابل استفاده است. ولی این مشکل را دارد که کاراکتر هایی که overlap دارندرا به درستی سگمنت نمیکند. مثلا یک کاراکتر به دوتا تبدیل میشه m به n و r

Normalization Normalization provides a tremendous reduction in data size, thinning extracts the shape information of the characters Fix size for bounding box(template matching) مثلا برای حالت الفبای ماشین نویسی شده،می توانیم دیتاستی داشته باشیم با فونت های مختلف که ابعادشان فرق می کند به این ترتیب تکنیک thinig مناسب می تواند اسکلت کاراکتر را بیرون بکشد به طوریکه مشابهت یک کاراکتر در فونت های مختلف، بیشتر نمایان شود. نرمال کردن سایز، مناسب template matching مطمئن نیستم:( یعنی کاراکترها scale(size) invariant می شوند. به این ترتیب فیچرها نسبت به تغییرات روی سایز حساسیت نشان نمی دهند.

OCR System Text in graphic file pre-processing Feature extraction classification

Feature Extraction goal of feature extraction: to extract a set of features, which maximizes the recognition rate with the least amount of elements. Statistical features: Representation of a character image by statistical distribution of points Zoning Projections profiles

Invariants In order to recognize many variations of the same character, features that are invariant to certain transformations on the character need to be used. برای آنکه کاراکتری با شکل های مختلف را بتوانیم به درستی شناسایی کنیم، فیچرها باید روی یک سری از تبدیلاتی که روی کاراکتر صورت میگیرد ثابت. حالت rotate invariant (مستقل از چرخش) حالت scaled invariant (مستقل از تغییر سایز) حالت skew invarient (مستقل از تغییر شیب واریب)مناسب کاراکترهای دست نویس و چند فونته(مثلا ترکیب italic و Arial) است. original rotated scaled skewed

Zoning The character image is divided into N×M zones. From each zone features are extracted to form the feature vector. The goal of zoning is to obtain the local characteristics instead of global characteristics مناسب تفکیک کاراکترهایی که در یک ناحیه توزیع بیشتری نسبت به سمت دیگر دارند. این فیچر حتما باید همراه با فیچرهای دیگر استفاده بشود تا نتیجه مناسب بدهد.

Zoning-Direction Based on the contour of the character image For each zone the contour is followed and a directional histogram is obtained by analyzing the adjacent pixels in a 3x3 neighborhood

Projection Histograms The basic idea behind using projections is that character images, which are 2-D signals, can be represented as 1-D signal. Scale-invariant Depend on Rotation and skew No information about character shape در این روش فیچرها نسبت به تغییر سایز مستقل می شوند.scale invariant اطلاعات مهم راجع به شکل کاراکتر

Profiles The profile counts the number of pixels (distance) between the bounding box of the character image and the edge of the character. describe well the external shapes of characters and allow to distinguish between a great number of letters, such as “p” and “q”. Dependent on rotation

OCR System Text in graphic file pre-processing Feature extraction classification

Classification Template Matching Statistical approaches Neural Net Hidden Markov Modeling KNN Support Vector Machines Neural Net Template matching نیازی به مرحله استخراج ویژگی ندارد و خود تصویر کاراکتر به عنوان بردار ویژگی فرض می شود. در مرحله تشخیص، یک similarity measure بین template و تصویر کاراکتر در نظر گرفته می شود، تصویر با تمپلیت های کلاس های مختلف مقایسه می شه و نتایج میزان مشابهت بدست آمده با تمام تمپلیت ها با یک حد آستانه مقایسه میشه، هر کدوم که از آستانه بیشتر بود، تصویر به همون کلاس اختصاص پیدا می کنه ایرادات: Scale , rotate invariant نیست نویز تاثیر زیادی روی خرابی نتایج دارد. راه حل: استفاده از چند template به ازای هر کلاس. مشکل: هزینه محاسباتی زیاد

1 Title: Feature Extraction and MLP Neural Network Classifier for Farsi Digit Recognition

Farsi Digit Recognition with MLP Dataset: training :18000 normalized (40*40) binary images testing : 2000 normalized (40*40) binary images Classifier: MLP

Feature Extraction Method A: Statistical Approach: compare number of black pixels on the upper and lower halves and also left and right halves of the digit image. این فیچر به درد ارقام تایپ شده می خورد اما در ارقامی که دستی نوشته شده اند بهتر است با فیچر های دیگر ترکیب شود چون در دستی ها مرکز رقم و تصویر بر هم منطبق نیستند.

Feature Extraction Method B: The Number Of Intersections متد مناسبی برای تفکیک اعدادی مثل 2 و 3 است (Part1) (Part2)

Feature Extraction Method C. Elastic Meshing Feature Extraction: decomposes the whole binary character image into four directional sub-images:

Classifier 2 layer MLP

Results approach Number of neurons in hidden layer Recognition percentage of system for testing data Statistical Approach 3 90.13% Number Of Intersections 9 94.43% Elastic Meshing 5 93.68% Combination of 3 methods 16 97.62%

2 Title: Hand Written Character Recognition Using Twelve Directional Feature Input and Neural Network

Pre-processing & Dataset Noise removing Skeletonization Normalization(32*32 pixel) Dataset: combination of Hindi – English and special characters Train: 500 handwritten Characters Test: 250 handwritten Character

Feature extraction A: Conventional Feature Extraction (0&1) B: Gradient Feature Extraction (using sobel mask) (Horizontal component) (Vertical component)

Feature extraction C: Directional Features نگاشت مقادیر گرادیان هر پیکسل روی یکی از دوازده جهت به وسیله این فرمول محاسبه می شود. که حالت گسسته ی آن را در اسلاید بعدی می بینید.

Feature extraction C: Directional Features

Classifier Back propagation neural Network Learning rate: 0.2 Input layer: 1024 1 hidden layer with 32 nerouns Output layer: 5 (3Hindi, 1English , 1 special)

Results Input To MLP Number Of Hidden neurons iteration Training time Classification Performance On Testset Pixel input 12 50 970 946 75 Gradient 764 246 90 12 Directional 421 221 97 به این دلیل دقت تشخیص در مدل 12 جهته بالاتر رفته که در این تکنیک استخراج ویژگی، اطلاعات بیشتری مد نظر قرار گرفته است.

References: A. K. Jain, T. Taxt, “Feature extraction methods for character recognition--A Survey”, Elsevier Science Ltd, Vol. 29, No. 4, pp. 641-662, 1996. H. Fujisawa, “Forty years of research in character and document recognition an industrial perspective”, Elsevier, Vol.41, pp.2435 -2446, 2008 M. Moradi, M. Poornima, “Feature Extraction and MLP Neural Network Classifier for Farsi Handwritten Digit Recognition”,IEEE Computer Society, 2009 D. Singh, S. Kr. Singh, “Hand Written Character Recognition Using TwelveDirectional Feature Input and Neural Network “, Internatinal Jornal of Computer Applications , Vol.1,2010.

Thanks for attention