Published by Hamza Gorbet
Modified about 1 year ago
Document Analysis: Document Image Processing
Prof. Rolf Ingold, University of Fribourg
Master course, spring semester 2008
© Prof. Rolf Ingold
Outline
- Image acquisition
- Image enhancement
- Foreground / background separation
  - Binarization
  - Color clustering
- Skew detection and correction
  - Skew estimation
  - Deskewing
- Text normalization
Image acquisition
- Document images are acquired by
  - drum scanners
  - flatbed scanners
  - high-resolution digital cameras
  - specialized book scanners
- or extracted from
  - 3D scene images
  - video sequences
Image quality
- Various types of document images
  - binary images (fax)
  - gray-level images (256 levels)
  - RGB images (24 bits or more)
- at different resolutions
  - 200 dpi (low fax quality)
  - [...] dpi (standard resolution for office automation), [...] Mpixels for A4 format
  - 600 dpi or higher for special applications
- Images may be degraded
  - distorted, non-planar
  - noisy, with artifacts (JPEG)
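The link between scanning resolution and image size can be checked with a little arithmetic. The sketch below assumes an A4 page of 210 mm x 297 mm and 25.4 mm per inch; the function name `pixels_for_page` is hypothetical and only the two resolutions that are legible on the slide are used.

```python
MM_PER_INCH = 25.4


def pixels_for_page(width_mm, height_mm, dpi):
    """Return (width_px, height_px, megapixels) for a page scanned at `dpi`."""
    width_px = round(width_mm / MM_PER_INCH * dpi)
    height_px = round(height_mm / MM_PER_INCH * dpi)
    return width_px, height_px, width_px * height_px / 1e6


# A4 format (assumed 210 mm x 297 mm) at the resolutions named on the slide.
for dpi in (200, 600):
    w, h, mp = pixels_for_page(210, 297, dpi)
    print(f"{dpi} dpi: {w} x {h} pixels, about {mp:.1f} Mpixels")
```

Doubling the resolution quadruples the pixel count, which is why most later processing steps are worth running on down-sampled images when possible.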
Document image examples
[Figure: example images at 200 dpi and at 400 dpi]
Overview of document image processing
- Image preprocessing is an initial step of document analysis
  - it aims at preparing the image for further processing
- The most important initial steps are
  - image enhancement
  - binarization, i.e., foreground / background separation
  - skew correction
- More specialized techniques are used locally
  - text size normalization
  - slant correction
  - ...
Image enhancement
- Classical image filtering algorithms are applied
  - to reduce or remove color information
  - to enhance the contrast between foreground and background
  - to correct irregular illumination
  - to strengthen contours
  - to smooth contours
  - to remove salt-and-pepper noise
  - to thin or thicken strokes
  - ...
- Image enhancement is often combined with segmentation or shape analysis
Foreground / background separation
- Document image analysis requires separating the foreground (ink) from the background (paper)
- Foreground / background separation is trivial for simple document classes
  - binarization is determined by an appropriate threshold
- Problems arise in the following situations
  - non-uniform background (mixed colors and "reverse video")
  - textured backgrounds
  - halftoning artifacts
  - non-uniformly illuminated documents
  - degraded documents (bad inking, old paper, holes, ...)
  - paper transparency, ink bleeding through
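The slide does not say how the "appropriate threshold" is chosen for simple documents. One common choice, used here as an assumption and not necessarily the author's, is Otsu's criterion: pick the gray level that maximizes the between-class variance of the histogram. The names `otsu_threshold` and `binarize` are hypothetical, and 8-bit integer gray levels are assumed.

```python
def otsu_threshold(pixels, levels=256):
    """Return the gray level t that maximizes the between-class variance.

    Pixels with value <= t form the dark class, the others the bright class.
    """
    hist = [0] * levels
    for v in pixels:
        hist[v] += 1
    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_score = 0, -1.0
    w_dark, sum_dark = 0, 0
    for t in range(levels):
        w_dark += hist[t]
        sum_dark += t * hist[t]
        w_bright = total - w_dark
        if w_dark == 0 or w_bright == 0:
            continue  # one class is empty, no separation possible
        mean_dark = sum_dark / w_dark
        mean_bright = (sum_all - sum_dark) / w_bright
        # Between-class variance, up to the constant factor 1 / total**2.
        score = w_dark * w_bright * (mean_dark - mean_bright) ** 2
        if score > best_score:
            best_t, best_score = t, score
    return best_t


def binarize(image, t):
    """Map a gray-level image (list of rows) to 1 = ink, 0 = paper."""
    return [[1 if v <= t else 0 for v in row] for row in image]


# Toy page: bright paper (200) with one dark stroke (30).
page = [[200, 200, 30, 200],
        [200, 30, 30, 200],
        [200, 200, 30, 200]]
t = otsu_threshold([v for row in page for v in row])
binary = binarize(page, t)
```

A single global threshold like this one only works when the histogram is clearly bimodal, which is exactly what fails in the problem cases listed above.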
Binarization in presence of dithering
- When the background is dithered, a low-pass filter should first be applied to smooth it
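A minimal sketch of this idea, assuming a simple 3 x 3 mean (box) filter as the low-pass filter; the slide does not name a specific filter, and the threshold value 128 and the toy image are illustrative only.

```python
def box_filter(image, radius=1):
    """Mean filter over a (2*radius+1)^2 window, clipped at the image borders."""
    h, w = len(image), len(image[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            window = [image[j][i]
                      for j in range(max(0, y - radius), min(h, y + radius + 1))
                      for i in range(max(0, x - radius), min(w, x + radius + 1))]
            out[y][x] = sum(window) / len(window)
    return out


def threshold(image, t):
    """1 = foreground (darker than t), 0 = background."""
    return [[1 if v < t else 0 for v in row] for row in image]


# Toy image: dithered background alternating 255 / 100, plus a solid dark block.
dithered = [[255 if (x + y) % 2 == 0 else 100 for x in range(8)] for y in range(8)]
for y in range(2, 6):
    for x in range(2, 6):
        dithered[y][x] = 0

direct = threshold(dithered, 128)                # dark dither dots become "ink"
smoothed = threshold(box_filter(dithered), 128)  # dither pattern averaged away
```

Thresholding the raw image marks every dark dither dot as foreground, while the smoothed image keeps only the solid block. The price is some blurring of stroke edges, so the filter radius should stay small relative to the stroke width.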
Niblack's method
- Niblack's method uses a local threshold [formula missing], where
  - μ_{x,y} and σ_{x,y} are respectively the mean and the standard deviation of the gray levels in an N x N neighborhood around pixel (x, y)
  - k is a constant between 0 and 1 (suggested value 0.2)
  - R is the range of gray levels
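The threshold formula itself did not survive on this slide. The sketch below uses the form most often cited for Niblack's method, T = μ + k·σ, which is an assumption and may differ from the variant shown in the lecture (in particular it does not use R). In this form, dark text on a bright background needs a negative k, so the default below is k = -0.2. The function names are hypothetical.

```python
import math


def local_stats(image, x, y, n):
    """Mean and standard deviation in the n x n window centred on (x, y).

    The window is clipped at the image borders.
    """
    h, w = len(image), len(image[0])
    r = n // 2
    window = [image[j][i]
              for j in range(max(0, y - r), min(h, y + r + 1))
              for i in range(max(0, x - r), min(w, x + r + 1))]
    mean = sum(window) / len(window)
    var = sum((v - mean) ** 2 for v in window) / len(window)
    return mean, math.sqrt(var)


def niblack_binarize(image, n=15, k=-0.2):
    """Return 1 where a pixel is darker than its local threshold mean + k * std."""
    h, w = len(image), len(image[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            mean, std = local_stats(image, x, y, n)
            out[y][x] = 1 if image[y][x] < mean + k * std else 0
    return out


# Toy page: bright paper (200) with a dark vertical stroke (50) in column 2.
page = [[200, 200, 50, 200, 200] for _ in range(5)]
binary = niblack_binarize(page, n=5)
```

Because the threshold follows the local mean, this kind of method tolerates uneven illumination, but in windows that contain no text it tends to turn background noise into foreground.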
Sauvola's method
- Sauvola et al. have proposed a variant which assumes dark text on a bright background [formula missing], where R = 128 and k = 0.5
- Problems remain when this hypothesis does not hold (even after reversing)
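The formula is missing from this slide as well. The sketch below uses the commonly cited form of Sauvola's threshold, T = μ · (1 + k · (σ / R − 1)), with the slide's values R = 128 and k = 0.5; treating this as the formula shown in the lecture is an assumption. The function names are hypothetical.

```python
import math


def local_stats(image, x, y, n):
    """Mean and standard deviation in the n x n window centred on (x, y)."""
    h, w = len(image), len(image[0])
    r = n // 2
    window = [image[j][i]
              for j in range(max(0, y - r), min(h, y + r + 1))
              for i in range(max(0, x - r), min(w, x + r + 1))]
    mean = sum(window) / len(window)
    var = sum((v - mean) ** 2 for v in window) / len(window)
    return mean, math.sqrt(var)


def sauvola_binarize(image, n=15, k=0.5, R=128):
    """Return 1 where a pixel is darker than mean * (1 + k * (std / R - 1))."""
    h, w = len(image), len(image[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            mean, std = local_stats(image, x, y, n)
            t = mean * (1 + k * (std / R - 1))
            out[y][x] = 1 if image[y][x] < t else 0
    return out


# Toy page: bright paper (200) with a dark vertical stroke (50) in column 2.
page = [[200, 200, 50, 200, 200] for _ in range(5)]
binary = sauvola_binarize(page, n=5)

# A window without text has std = 0, so the threshold drops to mean * (1 - k)
# and a flat background stays background.
blank = sauvola_binarize([[200] * 5 for _ in range(5)], n=5)
```

The threshold is pulled well below the local mean wherever the contrast is low, which suppresses background noise but also explains why the method fails when text is brighter than its surroundings.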
Binarization in case of colored background
[Figure: binarization by global thresholding and by Sauvola's method]
Comparison of binarization techniques
[Figure panels: Original image, Fisher, Fisher (wind.), Yanowitz B., Niblack, Sauvola et al.; from F. Lebourgeois, INSA, Lyon]
Color clustering
- For richly colored documents
  - checks, forms, ...
  - geographic maps
  - historical documents
  - advertising
- foreground / background separation is performed by color clustering
- Color clustering may be achieved automatically
  - k-means
  - Gaussian mixtures
  - ...
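As an illustration of the k-means option, here is a minimal sketch that clusters RGB pixels given as tuples. The farthest-point initialisation is an assumption chosen to keep the example deterministic (random initialisation is also common), and the names `dist2` and `kmeans_colors` are hypothetical.

```python
def dist2(a, b):
    """Squared Euclidean distance between two colours."""
    return sum((u - v) ** 2 for u, v in zip(a, b))


def kmeans_colors(pixels, k, iterations=10):
    """Cluster RGB tuples into k groups; return (centers, labels)."""
    # Deterministic farthest-point initialisation: start with the first pixel,
    # then repeatedly add the pixel farthest from all centers chosen so far.
    centers = [pixels[0]]
    while len(centers) < k:
        centers.append(max(pixels, key=lambda p: min(dist2(p, c) for c in centers)))

    labels = []
    for _ in range(iterations):
        # Assignment step: each pixel goes to its nearest center.
        labels = [min(range(k), key=lambda c: dist2(p, centers[c])) for p in pixels]
        # Update step: each center becomes the mean colour of its members.
        for c in range(k):
            members = [p for p, lab in zip(pixels, labels) if lab == c]
            if members:
                centers[c] = tuple(sum(ch) / len(members) for ch in zip(*members))
    return centers, labels


# Toy data: cream paper, blue ink and a red stamp.
pixels = [(250, 245, 230), (248, 240, 228), (20, 20, 120), (25, 30, 130),
          (200, 30, 40), (210, 25, 35), (252, 247, 231)]
centers, labels = kmeans_colors(pixels, k=3)
```

Each resulting label can then be treated as a separate layer, for example paper as background and each ink colour as its own foreground plane. The number of clusters k has to be chosen in advance, which is one reason Gaussian mixtures are listed as an alternative.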
Skew detection and correction
- Most document image recognition algorithms need text that is perfectly aligned horizontally and vertically
- Very often, acquisition systems are not accurate enough
- Skew correction requires two steps
  - skew estimation (with a precision < 1 degree)
  - image deskewing (rotation by a small angle)
- For book reading systems, page curvature requires more sophisticated image correction algorithms
Skew estimation
- Many different methods have been proposed for skew estimation of printed documents
  - margin detection
  - by white stream analysis
  - by projection profile analysis
  - Hough transforms
    - at pixel level
    - of centers of connected components
  - linear regression of centers of connected components
- Most methods can be applied to down-sampled images
- Skew detection for handwriting is more difficult, but less useful
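One way to realise skew estimation by projection profile analysis is sketched below, under stated assumptions: foreground pixels are given as (x, y) coordinates, each candidate angle is tested by shearing the pixels back and building a row profile, and the sharpest profile (largest sum of squared bin counts) wins. The sharpness criterion, the candidate grid and the function names are illustrative choices, not taken from the slides.

```python
import math


def profile_sharpness(points, angle_deg):
    """Sum of squared row counts after undoing a skew of `angle_deg`."""
    t = math.tan(math.radians(angle_deg))
    bins = {}
    for x, y in points:
        row = round(y - x * t)  # row index once the assumed skew is removed
        bins[row] = bins.get(row, 0) + 1
    return sum(count * count for count in bins.values())


def estimate_skew(points, candidates):
    """Return the candidate angle (degrees) with the sharpest row profile."""
    return max(candidates, key=lambda a: profile_sharpness(points, a))


# Synthetic page: three text baselines skewed by 2 degrees (y axis points down).
true_t = math.tan(math.radians(2.0))
points = [(x, round(y0 + x * true_t)) for y0 in (20, 40, 60) for x in range(200)]

candidates = [i * 0.5 for i in range(-10, 11)]  # -5.0 ... +5.0 in 0.5 degree steps
skew = estimate_skew(points, candidates)
```

When the candidate matches the true skew, every baseline collapses into a single profile bin, so the profile is as peaked as it can be. A coarse search on a down-sampled image followed by a finer search around the best angle keeps the cost low.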
Projection profiles
- Projection profiles are simple histograms that accumulate the pixels along each row or each column
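A minimal sketch of both profiles for a binary image stored as a list of rows (1 = ink). The helper `nonzero_runs`, which reads text-line extents off the row profile, is a hypothetical addition for illustration.

```python
def horizontal_profile(binary):
    """Number of foreground pixels in each row."""
    return [sum(row) for row in binary]


def vertical_profile(binary):
    """Number of foreground pixels in each column."""
    return [sum(col) for col in zip(*binary)]


def nonzero_runs(profile):
    """Return (first, last) index pairs of the maximal non-empty runs."""
    runs, start = [], None
    for i, v in enumerate(profile):
        if v and start is None:
            start = i
        elif not v and start is not None:
            runs.append((start, i - 1))
            start = None
    if start is not None:
        runs.append((start, len(profile) - 1))
    return runs


page = [
    [0, 0, 0, 0, 0, 0],
    [1, 1, 0, 1, 1, 1],
    [1, 0, 0, 1, 0, 1],
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
]
rows = horizontal_profile(page)
cols = vertical_profile(page)
text_lines = nonzero_runs(rows)
```

On a well-aligned page the row profile shows one peak per text line separated by empty gaps; skew smears these peaks, which is what profile-based skew estimation exploits.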
Hough Transform
- The Hough transform is a global transformation mapping the image space (x, y) to a parameter space (ρ, θ)
- each pixel is accumulated on the family of lines passing through it, defined in polar coordinates, i.e. ρ = x cos θ + y sin θ
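A minimal accumulator sketch using the usual polar line parameterization ρ = x cos θ + y sin θ, where θ is the direction of the line's normal. The bin sizes (1 pixel in ρ, 1 degree in θ), the restricted θ range and the function name are assumptions made for this example.

```python
import math
from collections import Counter


def hough_accumulate(points, thetas_deg):
    """Vote for every (rho, theta) line passing through each point."""
    acc = Counter()
    for x, y in points:
        for theta in thetas_deg:
            t = math.radians(theta)
            rho = round(x * math.cos(t) + y * math.sin(t))
            acc[(rho, theta)] += 1
    return acc


# Synthetic baseline skewed by 3 degrees (y axis points down).
slope = math.tan(math.radians(3))
points = [(x, round(20 + x * slope)) for x in range(100)]

# Only look at near-horizontal lines: normals within 10 degrees of 90.
acc = hough_accumulate(points, range(80, 101))
(rho, theta), votes = acc.most_common(1)[0]
skew = theta - 90  # a horizontal line has its normal at theta = 90 degrees
```

The strongest accumulator cell corresponds to the line supported by the most pixels. Since a horizontal line has θ = 90°, the offset of the peak from 90° gives the skew angle, within the angular bin size.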
Skew estimation by Hough transform
- The Hough transform makes it possible to estimate the skew angle
Deskewing of document image
- Deskewing requires an image rotation
  - rotating color or gray-level images requires re-sampling
  - rotating binary images has several pitfalls
    - it introduces distortions and noise
    - it is not reversible (except for Pythagorean angles)
- Deskewing can also be approximated by combining two affine transforms
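The re-sampling step for gray-level images can be sketched as a rotation by inverse mapping with bilinear interpolation. This is one standard way to do it, not necessarily the one used in the lecture; the rotation about the image centre, the white fill value and the sign convention (positive angles turn the content clockwise on screen, with the y axis pointing down) are assumptions.

```python
import math


def rotate_gray(image, angle_deg, background=255):
    """Rotate a gray-level image about its centre, re-sampling bilinearly."""
    h, w = len(image), len(image[0])
    cx, cy = (w - 1) / 2, (h - 1) / 2
    a = math.radians(angle_deg)
    c, s = math.cos(a), math.sin(a)
    out = [[background] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            dx, dy = x - cx, y - cy
            # Inverse mapping: source position that lands on output pixel (x, y).
            sx = c * dx + s * dy + cx
            sy = -s * dx + c * dy + cy
            if not (0 <= sx <= w - 1 and 0 <= sy <= h - 1):
                continue  # source falls outside the image: keep background
            x0, y0 = int(sx), int(sy)
            x1, y1 = min(x0 + 1, w - 1), min(y0 + 1, h - 1)
            fx, fy = sx - x0, sy - y0
            # Bilinear interpolation between the four neighbouring pixels.
            top = (1 - fx) * image[y0][x0] + fx * image[y0][x1]
            bottom = (1 - fx) * image[y1][x0] + fx * image[y1][x1]
            out[y][x] = round((1 - fy) * top + fy * bottom)
    return out


# Toy gray-level image with a diagonal gradient.
page = [[10 * (x + y) for x in range(5)] for y in range(5)]
same = rotate_gray(page, 0)         # zero angle leaves the image unchanged
deskewed = rotate_gray(page, -2.0)  # undo an estimated skew of +2 degrees
```

Interpolating gray levels is what avoids the jagged strokes produced by pixel-based rotation of binary images, which is why deskewing is best done before binarization.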
Rotation of binary images
- Pixel-based rotations of binary images introduce distortions
  - this artifact can be avoided by connected component replacement
Rotation of binary images (2)
- Better results are obtained by rotating the original gray-level image (before binarization)
Normalization of character size
- For text recognition, normalization of character sizes is often required
- Size normalization can be achieved
  - by bounding boxes of isolated characters
  - by baseline, ascenders and descenders
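A minimal sketch of the bounding-box variant, assuming an isolated binary character (1 = ink) that is cropped to its bounding box and rescaled to a fixed square by nearest-neighbour sampling. Ignoring the aspect ratio and using nearest-neighbour sampling are simplifying assumptions; the function names are hypothetical.

```python
def bounding_box(binary):
    """Return (x0, y0, x1, y1), the inclusive bounding box of the foreground."""
    rows = [y for y, row in enumerate(binary) if any(row)]
    cols = [x for x in range(len(binary[0])) if any(row[x] for row in binary)]
    return min(cols), min(rows), max(cols), max(rows)


def normalize_size(binary, size):
    """Crop to the bounding box and rescale to size x size (nearest neighbour)."""
    x0, y0, x1, y1 = bounding_box(binary)
    bw, bh = x1 - x0 + 1, y1 - y0 + 1
    return [[binary[y0 + (j * bh) // size][x0 + (i * bw) // size]
             for i in range(size)]
            for j in range(size)]


# Toy glyph occupying a 2 x 2 box inside a larger image.
glyph = [[0, 0, 0, 0, 0],
         [0, 1, 1, 0, 0],
         [0, 1, 0, 0, 0],
         [0, 0, 0, 0, 0]]
box = bounding_box(glyph)
normalized = normalize_size(glyph, 4)
```

Stretching every character to the same box makes narrow and wide characters look alike, which is one reason the baseline, ascender and descender approach is listed as an alternative.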
Normalization techniques for handwriting
- For handwriting, additional normalization may be applied
  - size normalization for ascenders and descenders
  - slant correction
- Slant estimation is performed by averaging the direction of the median of straight vertical segments
Run Length Smearing Algorithm (RLSA)
- The Run Length Smearing Algorithm (RLSA) replaces white runs by black runs if their length is smaller than a given threshold
  - it can be applied horizontally or vertically
- RLSA is often useful for segmentation
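A minimal sketch of RLSA on a binary image stored as a list of rows (1 = black, 0 = white). Only white runs enclosed between two black pixels are filled; leaving runs that touch the image border untouched is an assumption, since the slide does not specify it. The function names are hypothetical.

```python
def rlsa_row(row, threshold):
    """Fill white runs shorter than `threshold` that lie between two black pixels."""
    out = list(row)
    last_black = None
    for i, v in enumerate(row):
        if v == 1:
            if last_black is not None and 0 < i - last_black - 1 < threshold:
                for j in range(last_black + 1, i):
                    out[j] = 1
            last_black = i
    return out


def rlsa(binary, threshold, vertical=False):
    """Apply RLSA along rows (default) or along columns."""
    if vertical:
        columns = [rlsa_row(list(col), threshold) for col in zip(*binary)]
        return [list(row) for row in zip(*columns)]
    return [rlsa_row(row, threshold) for row in binary]


row = [1, 0, 0, 1, 0, 0, 0, 0, 1, 0]
smeared = rlsa_row(row, 3)

image = [[1, 0, 1],
         [0, 0, 0],
         [1, 0, 1]]
horizontal = rlsa(image, 2)
vertical = rlsa(image, 2, vertical=True)
```

With a horizontal threshold slightly larger than the inter-character gap, the characters of a word or line merge into one blob, and connected components of the smeared image can then serve as candidate blocks for segmentation.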