Performance Evaluation of PCA based Speech Enhancement Algorithm with Different Noise Estimation Method

Sangita Bavkar; Shashikant Sahare

Abstract

This paper presents a speech enhancement algorithm for noisy speech signals based on principal component analysis (PCA). PCA is a general subspace approach used to enhance speech distorted by noise. It relies on eigenvalue analysis: the eigenvalues of the noisy speech signal are classified into clean-speech eigenvalues and noise eigenvalues, and the enhanced signal is estimated by retaining only the clean-speech eigenvalues. The paper evaluates the performance of the PCA-based speech enhancement algorithm with different noise estimation methods. Objective measures and informal listening tests show that the proposed method works efficiently with the improved minima controlled recursive averaging (IMCRA) method. The system achieves better noise reduction with negligible residual musical noise when tested on sentences corrupted by noise.
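The eigenvalue classification described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `pca_enhance`, the window length `dim`, the threshold factor `margin`, and the synthetic sinusoid standing in for speech are all assumptions made for illustration, and the noise variance is taken as known rather than estimated with a method such as IMCRA.

```python
import numpy as np


def pca_enhance(noisy, noise_var, dim=32, margin=2.0):
    """Hypothetical helper: keep only principal components whose
    eigenvalue exceeds margin * noise_var, then rebuild the signal."""
    n = len(noisy) - dim + 1
    # Trajectory matrix: row i holds samples i .. i + dim - 1.
    X = np.lib.stride_tricks.sliding_window_view(noisy, dim)
    mean = X.mean(axis=0)
    Xc = X - mean
    cov = Xc.T @ Xc / n

    # Eigenvalue analysis of the covariance of the noisy signal.
    eigvals, eigvecs = np.linalg.eigh(cov)

    # Classify eigenvalues: above the threshold counts as signal,
    # the rest is treated as noise and discarded (assumed rule).
    keep = eigvals > margin * noise_var
    V = eigvecs[:, keep]

    # Project onto the retained subspace and restore the mean.
    Y = Xc @ V @ V.T + mean

    # Average overlapping windows to return to a 1-D signal.
    out = np.zeros(len(noisy))
    count = np.zeros(len(noisy))
    for j in range(dim):
        out[j:j + n] += Y[:, j]
        count[j:j + n] += 1
    return out / count, int(keep.sum())


# Synthetic stand-in for speech: a sinusoid in white Gaussian noise.
rng = np.random.default_rng(0)
t = np.arange(4000)
clean = np.sin(2 * np.pi * 0.05 * t)
noise_var = 0.25
noisy = clean + rng.normal(0.0, np.sqrt(noise_var), size=t.size)

enhanced, k = pca_enhance(noisy, noise_var)
mse_noisy = np.mean((noisy - clean) ** 2)
mse_enhanced = np.mean((enhanced - clean) ** 2)
print(f"components kept: {k}")
print(f"MSE noisy: {mse_noisy:.4f}, MSE enhanced: {mse_enhanced:.4f}")
```

In this sketch the quality of the result depends directly on `noise_var`, which is why the choice of noise estimation method matters for a subspace approach of this kind.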

Keywords

Eigenvalue Analysis, Noise Estimation, Noise Variance, Principal Component Analysis (PCA), Speech Enhancement, Subspace Signal

This work is licensed under a Creative Commons Attribution 3.0 License.
