Retriving the Exact Text Line from Handwritten Document based on An Energy Minimization Framework for Indian Script Languages

S. Dhivyaprabha; Dr.G. Jagajothi

Retriving the Exact Text Line from Handwritten Document based on An Energy Minimization Framework for Indian Script Languages

S. Dhivyaprabha, Dr.G. Jagajothi

Abstract

In this project, we present algorithm for extracting text-lines from handwritten document images. Our algorithm is based on the novel approach for content aware image resizing. We adopted the signed distance transform to generate the energy map, where extreme points indicate the layout of text-lines. Dynamic programming is then used to compute the minimum energy left-to right paths, which pass along the “middle“of the text lines. Each path intersects a set of components, which determine the extracted text-line and estimate its height. The estimated height determines the text-line’s region, which guides splitting touching components among consecutive lines. Unassigned components that fall within the region of a text-line are added to the components list of the line. The components between two consecutive lines are processed when the two lines are extracted and assigned to the closest text-line, based on the attributes of extracted lines, the sizes and positions of components. Our experimental results on Tamil, Hindi, and English historical documents show that our approach manage to separate multi-skew text blocks into lines at high success rates.

Keywords

Handwritten Document, State Estimation in Document Images, Text-Line Extraction

Full Text:

PDF

References

L. O’Gorman, “The document spectrum for page layout analysis,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 15, no. 11, pp. 1162–1173, Nov. 1993.

K. Kise and M. Iwata, “Segmentation of page images using the area Voronoi diagram,” Comput. Vis. Image Understand., vol. 70, no. 3, pp. 370–382, Jun. 1998.

Y. Xiao and H. Yan, “Text region extraction in a document image based on the Delaunay tessellation,” Pattern Recognit., vol. 36, no. 3, pp. 799–809, Mar. 2003.

S. Bukhari, F. Shafait, and T. Breuel, “Segmentation of curled textlines using active contours,” in Proc. 8th IAPR Int. Workshop DAS, Sep. 2008, pp. 270–277.

S. S. Bukhari, F. Shafait, and T. M. Breuel, “Coupled snakelet model for curled textline segmentation of camera-captured document images,” in Proc. Int. Conf. Doc. Anal. Recognit., Jul. 2009, pp. 61–65.

H. I. Koo and N. I. Cho, “State estimation in a document image and its application in text block identification and text line extraction,” in Proc. ECCV (2), 2010, pp. 421–434.

D. Mumford, and J. Shah, “Optimal approximation by piecewise smooth functionals and associated variational problems,” Comm. Pure Appl. Math, ol. 42, pp.577-685, 1989.

S. Osher and J. Sethian, “Fronts propagating with curvaturedependent speed: Algorithm based on the Hamilton-Jacobi formulation,” Journal of Computational Physics, 79 pp12-49, 1988.

T. F. Chan and L. A. Vese, “Active contours without edges,” IEEE Trans on Image Processing, vol. 10, no. 2, pp.266-277, 2001.

Yi Li, Yefeng Zheng, David Doermann, and Stefan Jaeger, “A New Algorithm for Detecting Text Line in Handwritten Documents,” Proceedings of the Tenth International Workshop on Frontiers in Handwriting Recognition, Oct. 2006, La Baule.

G. Aubert and P. Kornprobst, “Mathematical Problem in Image Processing: Partial Differential Equations and the Calculus of Variations,” New York: Springer, vol. 147, 2002.

A. Tsai, A. Yezzi, and Alan S. Willsky, “Curve evolution implementation of the Mumford-Shah functional for image segmentation, denoising, interpolation, and magnification,” IEEE Trans. on Image Processing, vol. 10, no. 8, pp.1169- 1186, 2001.

J. A. Sethian, Level Set methods: evolving interfaces in geometry, fluid mechanics, Cambridge Univ. Press, 1996

S. Gao and T. D. Bui, “Image segmentation and selective smoothing by using Mumford-Shah model,” IEEE Transactions on Image Processing, vol. 14, no. 10, pp.1537- 1549, 2005.

K. Kise, A. Sato, and M. Iwata, “Segmentation of page images using the area Voronoi diagram,” Computer Vision and Image Understanding, vol. 70, no. 3, pp. 370–382, June 1998.

K. Y. Wong, R. G. Casey, and F. M. Wahl, “Document analysis system,” IBM Journal of Research and Development, vol. 26, no. 6, pp. 647–656, 1982.

H. S. Baird, “Background structure in document images,” in Document Image Analysis, H. Bunke, P. Wang, and H. S. Baird, Eds. World Scientific, Singapore, 1994, pp. 17–34.

T. M. Breuel, “Two geometric algorithms for layout analysis,” in Document Analysis Systems, Princeton, NY, Aug. 2002, pp. 188–199.

I. Guyon, R. M. Haralick, J. J. Hull, and I. T. Phillips, “Data sets for OCR and document image understanding research,” in Handbook of character recognition and document image analysis, H. Bunke and P. Wang, Eds. World Scientific, Singapore, 1997, pp. 779–799.

Y. Wang, R. Haralick, and I. Phillips, “Document zone content classification and its performance evaluation,” Pattern Recognition, vol. 39, no. 1, pp. 57–73, Jan. 2006.

Refbacks

There are currently no refbacks.

This work is licensed under a Creative Commons Attribution 3.0 License.

Username
Password
Remember me