1 A Statistical Matching Method in Wavelet Domain for Handwritten Character Recognition Presented by Te-Wei Chiang July, 2005.

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Bayesian network classification using spline-approximated KDE Y. Gurwicz, B. Lerner Journal of Pattern Recognition.
Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
Chessmen Position Recognition Using Artificial Neural Networks Jun Hou Dec. 8, 2003.
Combining Inductive and Analytical Learning Ch 12. in Machine Learning Tom M. Mitchell 고려대학교 자연어처리 연구실 한 경 수
Recognition of Fragmented Characters Using Multiple Feature-Subset Classifiers Recognition of Fragmented Characters Using Multiple Feature-Subset Classifiers.
Multiple Criteria for Evaluating Land Cover Classification Algorithms Summary of a paper by R.S. DeFries and Jonathan Cheung-Wai Chan April, 2000 Remote.
Chapter 1: Introduction to Pattern Recognition
Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
Handwritten Thai Character Recognition Using Fourier Descriptors and Robust C-Prototype Olarik Surinta Supot Nitsuwat.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
Prénom Nom Document Analysis: Fundamentals of pattern recognition Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
Introduction to machine learning
Handwritten Character Recognition using Hidden Markov Models Quantifying the marginal benefit of exploiting correlations between adjacent characters and.
An Automatic Segmentation Method Combined with Length Descending and String Frequency Statistics for Chinese Shaohua Jiang, Yanzhong Dang Institute of.
Facial Feature Detection
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
Case Studies Dr Lee Nung Kion Faculty of Cognitive Sciences and Human Development UNIVERSITI MALAYSIA SARAWAK.
Attention Deficit Hyperactivity Disorder (ADHD) Student Classification Using Genetic Algorithm and Artificial Neural Network S. Yenaeng 1, S. Saelee 2.
Presented by: Kamakhaya Argulewar Guided by: Prof. Shweta V. Jain
Classification with Hyperplanes Defines a boundary between various points of data which represent examples plotted in multidimensional space according.
Image Recognition and Processing Using Artificial Neural Network Md. Iqbal Quraishi, J Pal Choudhury and Mallika De, IEEE.
1 Template-Based Classification Method for Chinese Character Recognition Presenter: Tienwei Tsai Department of Informaiton Management, Chihlee Institute.
1 An Efficient Classification Approach Based on Grid Code Transformation and Mask-Matching Method Presenter: Yo-Ping Huang Tatung University.
Wavelet-Based Multiresolution Matching for Content-Based Image Retrieval Presented by Tienwei Tsai Department of Computer Science and Engineering Tatung.
Image recognition using analysis of the frequency domain features 1.
Presented by Tienwei Tsai July, 2005
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
COMPARISON OF IMAGE ANALYSIS FOR THAI HANDWRITTEN CHARACTER RECOGNITION Olarik Surinta, chatklaw Jareanpon Department of Management Information System.
Image Classification 영상분류
Visual Inspection Product reliability is of maximum importance in most mass-production facilities.  100% inspection of all parts, subassemblies, and.
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
BARCODE IDENTIFICATION BY USING WAVELET BASED ENERGY Soundararajan Ezekiel, Gary Greenwood, David Pazzaglia Computer Science Department Indiana University.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Handwritten Recognition with Neural Network Chatklaw Jareanpon, Olarik Surinta Mahasarakham University.
1 Pattern Recognition Pattern recognition is: 1. A research area in which patterns in data are found, recognized, discovered, …whatever. 2. A catchall.
BEHAVIORAL TARGETING IN ON-LINE ADVERTISING: AN EMPIRICAL STUDY AUTHORS: JOANNA JAWORSKA MARCIN SYDOW IN DEFENSE: XILING SUN & ARINDAM PAUL.
2005/12/021 Content-Based Image Retrieval Using Grey Relational Analysis Dept. of Computer Engineering Tatung University Presenter: Tienwei Tsai ( 蔡殿偉.
Chapter 3: Maximum-Likelihood Parameter Estimation l Introduction l Maximum-Likelihood Estimation l Multivariate Case: unknown , known  l Univariate.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 07: BAYESIAN ESTIMATION (Cont.) Objectives:
1 CONTEXT DEPENDENT CLASSIFICATION  Remember: Bayes rule  Here: The class to which a feature vector belongs depends on:  Its own value  The values.
Presented By Lingzhou Lu & Ziliang Jiao. Domain ● Optical Character Recogntion (OCR) ● Upper-case letters only.
Content-Based Image Retrieval Using Block Discrete Cosine Transform Presented by Te-Wei Chiang Department of Information Networking Technology Chihlee.
1 An Efficient Classification Approach Based on Grid Code Transformation and Mask-Matching Method Presenter: Yo-Ping Huang.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
GENDER AND AGE RECOGNITION FOR VIDEO ANALYTICS SOLUTION PRESENTED BY: SUBHASH REDDY JOLAPURAM.
Scanned Documents INST 734 Module 10 Doug Oard. Agenda Document image retrieval  Representation Retrieval Thanks for David Doermann for most of these.
An Image Retrieval Approach Based on Dominant Wavelet Features Presented by Te-Wei Chiang 2006/4/1.
Preliminary Transformations Presented By: -Mona Saudagar Under Guidance of: - Prof. S. V. Jain Multi Oriented Text Recognition In Digital Images.
Content-Based Image Retrieval Using Color Space Transformation and Wavelet Transform Presented by Tienwei Tsai Department of Information Management Chihlee.
BAYESIAN LEARNING. 2 Bayesian Classifiers Bayesian classifiers are statistical classifiers, and are based on Bayes theorem They can calculate the probability.
Computer Vision Lecture 7 Classifiers. Computer Vision, Lecture 6 Oleh Tretiak © 2005Slide 1 This Lecture Bayesian decision theory (22.1, 22.2) –General.
License Plate Recognition of A Vehicle using MATLAB
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Bayes Rule Mutual Information Conditional.
Optical Character Recognition
Feature learning for multivariate time series classification Mustafa Gokce Baydogan * George Runger * Eugene Tuv † * Arizona State University † Intel Corporation.
Voice Activity Detection Based on Sequential Gaussian Mixture Model Zhan Shen, Jianguo Wei, Wenhuan Lu, Jianwu Dang Tianjin Key Laboratory of Cognitive.
Neural Network Architecture Session 2
Chapter 3: Maximum-Likelihood Parameter Estimation
Presented by Li-Jen Kao July, 2005
Pattern Recognition Sergios Theodoridis Konstantinos Koutroumbas
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Outline Background Motivation Proposed Model Experimental Results
Coarse Classification via Discrete Cosine Transform and Quantization
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Presentation transcript:

1 A Statistical Matching Method in Wavelet Domain for Handwritten Character Recognition Presented by Te-Wei Chiang July, 2005

2 Outline Introduction The Proposed Classification Approach Feature Extraction Statistical Mask-Matching Approach Experimental Results Conclusion

3 1 Introduction Paper documents -> Computer codes OCR(Optical Character Recognition)

4 Chinese handwriting recognition is very difficult due to three factors: the character set is very large, the structure of a Chinese character is quite complex, and many Chinese characters have similar shapes. optical character recognition (OCR) technique has been introduced as a practical approach for converting paper documents into computer codes.

5 The three best known approaches for OCR are: statistical approach structural approach neural networks (NNs)

6 their design has to be broken down into subproblems such as preprocessing, feature extraction, classification and postprocessing

7 divide and conquer For brevity, we consider the design of OCR systems in terms of two subproblems: (1)feature extraction and (2)classification. In this paper, wavelet transform is used for feature extraction.

8 2.The Proposed Classification Approach Our experimental system is operated in two phases: training and classification.

9 Figure 1. The framework of our classification approach.

10 3.Feature Extraction Based on the requirement of reliable and general features, wavelet transform is first applied to extract statistical features.

Wavelet transform Mallat' s pyramid algorithm

12 Figure 2. Figure 3.

Mask generation We know that the border bits are the most unreliable; the bits at the edge of a character image are often subject to writing and scanning noise. We can see this by superimposing a number of images of the “ same ” character and calculating the fraction of time that a given bit is black.

14

15 4. Statistical Mask-Matching Approach Most commonly used optimization methods in statistical approach are based on Bayes ’ theorem. Our mask- matching approach is also derived from Bayes ’ theorem.

Bayes classificaton In statistical pattern recognition, we recognize that features may be measured with error and that some of the features are useful for identification of the class while others are not. Our goals are then to obtain useful sets of features and to use these features such that the identification is as accurate as possible.

17 If there is an object that is to be classified on the basis of a feature x, into M possible classes (c1, c2, …, cM), then the probability of x in class i when x is observed can be described by P(ci|x). From the “ theorem on compound probabilities ”, we obtain

18 In our situation, x is the feature and y represents the class variable ci. Substituting for x and y in (5), we obtain the probability that the class is i when the feature x is observed.

Measures for mask matching we have to define a measure to indicate the degree of matching between a sample character and a mask. Suppose the wavelet-based character images and the masks are of the same size (N x N bitmap).

20 The black bits are those bits with value 1 in the bitmaps, and white bits are those with value 0. Let NNN×b(p) be the number of black bits in bitmap p, and Mb(p, q) be the number of black bits with the same positions in both bitmap p and bitmap q. Then, the degree of matching between an unknown character x and the mask of class i, m i, can be defined by:

21

Statistical mask-matching

23 Finally, to decide the expected class of the input pattern x, the following decision rules are used:

24 5 Experimental Results samples (about 640 categories) extracted from one of the famous handwritten rare books, Kin-Guan bible. Each character image was transformed into a 48×48 bitmap.

25 6 Conclusions This paper presents a wavelet-based statistical mask-matching approach for recognizing handwritten characters in Chinese paleography. After generating the mask for each prototype character and calculating some prior probabilities in advance, we can obtain the probability of a class being present when the mask of the class is matched in a certain degree. In our preliminary experimental results, the recognition rate is about 80 percent for a unique candidate, and 89 percent for multichoice with 10 candidates.

26 Future works Since features of different types complement one another in classification performance, by using features of different types simultaneously, classification accuracy could be improved. A In order to alleviate the load of the character recognition, a coarse classification scheme needs to be involved in our system.