Open Access Open Access  Restricted Access Subscription or Fee Access

Skew Estimation In Document Images Using Wigner-Ville Time-Frequency Functions

Shikha Tripathi, Arvind Pawan, Aditi Bansal

Abstract


In this paper a robust technique for skew detection and estimation based on the application of Cohen’s class distribution using Wigner-Ville time-frequency function, to the projection profile histogram of document images has been proposed. Major strength of the paper lies in choice of the criterion function for skew detection.Instead of considering the average/maximum intensity of Wigner-Ville distribution we use the total intensity to enable detection of very minor changes in the skew angle. Complexity has been substantially reduced by using lesser number of image rotations and considering only a small part of the document. Also rough skew is estimated using morphological means, which further reduces complexity to a large extent. The proposed technique provides good accuracy with a resolution of 0.1 degrees, which is less than the previously reported methods. A wide range of skews varying from +89 to -89 degrees and scripts that include 1 and 2-column pages, text-book pages, handwritten documents, scanned images, newspapers and magazines have been considered for testing. The existing methods, although fairly accurate, have limited range of skews and scripts. It is also found to be less computationally intensive compared to other time -frequency transform based methods.


Keywords


Cohen’s class distribution, projection profiles; Skew detection; Wigner-Ville Distributions.

Full Text:

PDF

References


S. N. Srihari and V. Govindaraju, “Analysis of textual images using the Hough transform,” Machine Vision and Applications, Vol.2, no.3,Springer (1989) 141-153.

Y. Lee, “Method of detecting the skew angle of a printed business form,”Eastman Kodak Company, U.S. Patent 5,054,098, October 1, 1991.

G. S. D. Farrow and C. S. Xydeas, “Detecting skew in digitized images,”Int. Computers Ltd., London, European Patent App. 485,051, May 13,1992

Y. Kurosu, Y. Yokoyama, K. Nishikawa, H. Masuzaki and M. Fujinawa,“Method for determining the amount of skew of image, method for correcting the same, and image data processing system,” Hitachi, Ltd.,U.S. Patent 5,181,260, January 19, 1993.

D. X. Le and G. R. Thoma, “Document skew angle detection algorithm,”SPIE Conference on Visual Information Processing, vol. 1961, (1993) 251-262.

Y. K. Ham, H. K. Chung, I. K. Kim and R. H. Park, “Automated analysis of mixed documents consisting of printed korean alphanumeric texts and graphic images”, Optical Engineering 33, 6 (1994) 1845-1853.

D. S. Le, G. R. Thoma and H. Wechsler, “Automated page orientation and skew angle determination for binary document images,” Pattern Recognition 27, 10 (1994) 1325- 1344.

B. B. Fast and D. R. Allen, “OCR image pre-processor for detecting and reducing skew of the image of textual matter of a printed document,” U.S.Patent 5,594,817, January 14, 1997.

B. Yu and A. K. Jain, “A robust and fast skew detection algorithm for generic documents,” Pattern Recognition, 29, no. 10, (1996) 1599-1630.

W. Postl, “Detection of linear oblique structures and skew scan in digitized documents,” Proceedings of the 8th International Conference on Pattern Recognition, Paris, France, (1986) 687-689.

D. S. Bloomberg and G. Kopec, “Method and apparatus for identification and correction of document skew”, Xerox Corporation, U.S. Patent 5,187,753, February 16, 1993.

D. S. Bloomberg and G. Kopec, “Method and apparatus for identification of document skew,” Xerox Corporation, U.S. Patent 5,355,420, October11, 1994.

Y. Ishitani, “Document skew detection based on local region complexity,” Proceedings of the Second International Conference on Document Analysis and Recognition, Tsukuba Science City, Japan,October 20-22, (1993) 49-52.

D. S. Bloomberg, G. E. Kopec and L. Dasari, “Measuring document image skew and orientation,” Document Recognition II (SPIE vol. 2422),San Jose, CA, February 6-7 1995, pp. 302-316.

T. Akiyama and N. Hagita, “Automated entry system for printed documents,” Pattern Recognition, Elsevier Science Inc., 23, 11 (1990),pp. 1141-1154.

A. Hashizume, P.S. Yeh, and A. Rosenfeld. “A method of detecting the orientation of aligned components”, Pattern Recognition Letters,4:125{132, 1986.

L. O'Gorman. “The Document Spectrum for Page Layout Analysis”, IEEE Transactions on Pattern Analysis and Machine Intelligence,15(11):1162{1173, 1993.

R. Smith. “A Simple and E±cient Skew Detection Algorithm via Text Row Accumulation”, Proc. of the 3th International Conference on Document Analysis and Recognition, pages 1145{1148, Montreal, Canada, August 1995.

M. Hase and Y. Hoshino, “Periodic characteristics in printed document,”Transactions of Japanese Institute of Electronics and Communications Engineers, v. J65-D, No. 2, (1982) 298-299.

M.Sarfraz, A Zidouri, S.A.shabab, “A novel approach for skew estimation of Document images in OCR System”, Proceedings of the Computer Graphics Imaging and Vision: New Trends(CGIV05), (2005) 175-180.

S. Chen and R.M. Haralick. “An Automatic Algorithm for Text Skew Estimation in Document Images Using Recursive Morphological Transforms”, Proc. of the first IEEE International Conference on Image Processing, pages 139{143, Austin, Texas, 1994.

J. Sauvola and M. PietikÄainen. “Skew Angle Detection Using Texture Direction Analysis”, Proc. of the 9th Scandinavian Conference on Image Analysis, pages 1099{1106, Uppsala, Sweden, June 1995.

C. Sun and D. Si. “Skew and Slant Correction for Document Images Using Gradient Direction”, Proc. of the 4th International Conference on Document Analysis and Recognition, pages 142{146, Ulm, Germany,1997.

H. Yan. “Skew Correction of Document Images Using Interline Cross-Correlation”, CVGIP: Graphical Models and Image Processing,55(6), (1993)538-543.

B. Gatos, N. Papamarkos, and C. Chamzas. “Skew Detection and Text Line Position Determination in Digitized Documents”, Pattern Recognition, 30(9):1505{1519, 1997}

J.J.Hull, S.L.Taylor, “Document Image Skew Detection: Survey and Annotated Bibliography”, World Scientific, (1998) 40-64.

R. Cattoni, T. Coianiz, S. Messelodi, C. M. Modena, “Geometric Layout Analysis Technique for Document Image Understanding: A Review”,ITC-IRST, ITC-IRST, Trento, Italy 1998.

Cohen. L, “Generalized phase space distribution functions”, J. Math.Phys. 7, (1966) 781-786.

Cohen. L, “Time Frequency Distribution a review”, Proceedings of the IEEE, vol 77, Issue 7 (1989) 941-981.

E.Kavallieratou,N.Fakotakis, G.Kokkinakis, “Skew Angle Estimation in Document processing using Cohen ‘s Class Distribution” Pattern Recognition Letters, Elsevier, vol 20, (1999) 1305-1311.

W.Postl, “Detection of linear oblique structures and skew scan in digitized documents”, Proc. Of 8th international conference on Pattern Recognition, (1986) 687-689.

G. Ciardiello, G. Scafuro, M.T. Degrandi, M.R. Spada, and M.P. Roccotelli, “An experimental system for office document handling and text recognition”, Proc. of the 9th International Conference on Pattern Recognition, volume 2, (1988) 739-743.

Y. Ishitani, “Document Skew Detection Based on Local Region Complexity”, Proc. of the 2nd International Conference on Document Analysis and Recognition, IEEE Computer Society, (1993) 49-52.

S.N. Srihari, V. Govindaraju. “Analysis of Textual Images Using the Hough Transform”, Machine Vision and Applications, 2(3), (1989)141-153.

Manjunath Aradhya VN, Hemantha K G, Shivakumara, “Skew Detection Technique for Binary Document Images based on Hough Transform”,International Journal of Information Technology, vol. 3, (2006) 194-200.

U. Pal, B.B. Chaudhuri, “An improved document skew angle estimation technique”, Pattern Recognition Letters, 17(8), (1996) 899-904.

Y. Min, S.-B. Cho, Y.Lee, “A Data Reduction Method for Efficient Document Skew Estimation Based on Hough Transformation”, Proc. of the 13th International Conference on Pattern Recognition, IEEE Press,(1996) 732-736.

P Shivakumara, et al., “A novel Technique for Estimation of Skew in Binary Text Document Images based on Linear Regression Analysis”,Sadhana, Vol. 30, Part 1, (2005) 69-85.

A.Hashizume, P.S.Yeh, A.Rosenfeld, “A method of detecting the orientation of aligned components”, Pattern Recognition Letters, 4:(1986) 125-132.

N. Liolios, et al, “Improved Document Skew Detection Based on Text Line Connected-Component Clustering”, Proc. of Intl. Conference on Image Processing, vol. 1, ( 2001) 1098-1011.

R. Smith, “A Simple and Efficient Skew Detection Algorithm via Text Row Accumulation”, Proc. of the 3rd International Conference on Document Analysis and Recognition, (1995) 1145-1148.

L. Jubisa Stankovic and Vladimir Katlovnik, “The Wigner Distribution of noisy signals with adaptive time frequency varying window”, IEEE Transactions on Signal Processing, Vol. 47, No. 4, (1999) 1099-1108.

L. Jubisa Stankovic, “A method for improved distribution concentration in the time frequency analysis of multi component signals using the L-Wigner distribution”, IEEE Transactions on Signal Processing, Vol. 43, No. 5, (1995) 1262-1268.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.