Line Segmentation of Handwritten Kannada Text
Abstract
Text line segmentation is an essential pre-processing
stage for off-line handwriting recognition in many Optical Character Recognition (OCR) systems. It is an important step, as inaccurately segmented text lines will cause errors in the subsequent recognition stage. Text line segmentation of handwritten documents is still one of the most complicated problems in developing a reliable OCR. The nature of handwriting styles makes the process of text line segmentation very challenging. This paper provides a means of text line segmentation using Vertical Strip Projection based method. In this paper, the document is divided into vertical strips of appropriate width. Strip-wise horizontal histograms are then computed and the relationship of the peak valley points is used for line segmentation.
Though the global horizontal projection method works efficiently for printed documents, it becomes highly unreliable when used in case of unconstrained handwritten text because of many factors like variable skew, touching and overlapping lines. The proposed method provides much improved and reliable results when compared to global horizontal profile method in case of skewed and variable sloping
lines. Segmentation accuracy of 83% is achieved for handwritten Kannada text using the proposed method.
Keywords
Full Text:
PDFReferences
Zaidi Razak and Khansa Zulkiflee, “ Off –Line Handwriting Text Line
Segmentation : A Review “, IJCSNS ,pp 12-20, vol 8(7) ,2008.
U. Pal and S. Datta, “Segmentation of Bangla unconstrained
Handwritten Text “ , Proceedings of the 7th International Conference,
ICDAR, pp.1128-1132, 2003.
Tripathy N, Pal U. “Handwriting segmentation of unconstrained Oriya
text”, International Conference on Frontiers in handwriting Recognition,
IWFHR-9 2004, pp. 306 – 311.
A. Zahour, B. Taconet, P. Mercy and S. Ramdane, “.Arabic Handwritten
Text line extraction”, in Proc. 6th ICDAR.01, 2001, pp.281-285.
5]. Garg N.K., Kaur L., Jindal M.K., “A New Method for Line
Segmentation of Handwritten Hindi Text” Seventh International
Conference on Information technology, 2010 , pp. 392 – 397.
Y. Li, Y. Zheng, D. Doermann, and S. Jaeger, “A new Algorithm for
detecting text line in handwritten documents”, in Proc. 10th IWFHR.06,
, p. 35-40.
Likforman-Sulem, L., Hanimyan, A., Faure, C., “A Hough based
algorithm for extracting text lines in handwritten documents”, Third
International Conference on Document Analysis and Recognition 1995,
pp: 774 – 777.
Chaudhuri, B.B.; Bera, S., “Handwritten Text Line Identification in
Indian Scripts” , 10th International Conference on Document Analysis
and Recognition, 2009, pp: 636 – 640.
C. Weliwitage, A. L. Harvey and A. B. Jennings, “Handwritten
Document Offline Text Line Segmentation”, Proceedings of Digital
Imaging Computing: Techniques and Applications, pp. 184-187, 2005.
Koppula, V.K., Atul, N. Garain, U. “Robust Text Line, Word And
Character Extraction from Telugu Document Image”, 2nd International
Conference on Emerging Trends in Engineering and Technology
(ICETET), 2009pp: 269 - 272 .
D.Sarkar and R.Ghosh. “A bottom up approach of line segmentation
from handwritten text”, www.vecc.gov.in.
Rodos Santos, R.P., Clemente, G.S.; Tsang Ing Ren, Cavalcanti, G.D.C.,
“Text Line Segmentation Based on Morphology and Histogram
Projection”, 10th International Conference on document Analysis and
Recognition, 2009. ICDAR, pp. 651 – 655.
V. N. Manjunath Aradhya, C. Naveena “Text Line Segmentation of
Unconstrained Kannada Script” ICCCS’11 February 12-14, 2011,
Rourkela.
Rajiv. K.Sharma and Dr. AmardeepSingh,” Segmentation of
Handwritten Text in Gurumuki Script”, International Journal of Image
Processing, Vol 2(3), 2011,pp 12-17.
N.Tripathy and U.Pal ,” Handwriting Segmentation of Unconstrained
Oriya Text”, Sadhana, vol 31(6) 2006,pp755-769,2006.
.Plamondan .R and Srihari,” On Line and Off line Handwriting
Recognition : A Comprehensive Survey. IEEE transactions on PAMI
,vol 22(1),2000,pp 63-84.
Refbacks
- There are currently no refbacks.
This work is licensed under a Creative Commons Attribution 3.0 License.