Open Access Open Access  Restricted Access Subscription or Fee Access

Stationary Bionic Wavelet Transform and Teager Energy Operator for Speech Enhancement

Mourad Talbi, Anis Ben Aicha, Lotfi Salhi, Adnane Cherif

Abstract


In this paper, we have proposed a new speech enhancement technique based on the time adaptation of stationary bionic wavelet thresholds. The time dependence is introduced by approximating the Teager Energy (TE) of the stationary bionic wavelet coefficients. The Stationary Bionic Wavelet Transform (SBWT) was used in this work, in order to have a perfect reconstruction of the speech signal. Experimental results based on Mean Square Error  (MSE) between the original speech signal and the reconstructed speech signal,    show that the SBWT is better than the Bionic Wavelet Transform (BWT). This fact can improve the speech enhancement system based on thresholding in the bionic wavelet domain. We have also used the pause detection to introduce more improvement of the noise level estimation in the stationary bionic wavelet subbands. We have also used the time-space adapted thresholds in order to improve the procedure of stationary bionic wavelet thresholding and avoiding severs degradations of the enhanced speech signal. The proposed technique was evaluated by comparing it to others techniques such as the technique based on Wiener filtering and the techniques based on thresholding in wavelet and bionic wavelet domains and the technique based on undecimated wavelet packet-perceptual filterbanks and MMSE-STSA estimation. The two thresholding techniques are based on time adaptation of wavelet thresholds too. The obtained results from the SNR, SSNR, ISd and PESQ computation show that our proposed technique outperforms the others techniques.


Keywords


Speech Enhancement, Stationary Bionic Wavelet Transform, Teager Energy Operator, Thresholding.

Full Text:

PDF

References


D. Van Compernolle, ‘‘Speech enhancement for applications in communication and recognition,’’ Revue HF Tijdschrift (Special Issue: Speech Processing for Telecommunication), pp. 99-108, 1993.

Y. Ephraim, D. Malah, ‘‘Speech enhancement using a minimum mean square error short time spectral amplitude estimator,’’ IEEE Trans. Acoust. Speech Signal Process, vol. 32, pp. 1109–1121, 1984.

Y. Ephraim, D. Malah, ‘‘Speech enhancement using a minimum mean square error log spectral amplitude estimator,’’ IEEE Trans. Acoust. Speech Signal Process, vol.33, pp. 443–445, 1985.

D. Malah, R. V. Cox, A. J. Accardi, ‘‘Tracking speechpresence uncertainty to improve speech enhancement in nonstationary noise environments,’’ in Proc. ICASSP, 1999, pp. 789-792.

D. Donoho, ‘‘Nonlinear wavelet methods for recovering signals, images, and densities from indirect and noisy data,’’ in Proc. Symp. Appl. Math, 1993, pp. 173–205.

D. Donoho, ‘‘De-noising by soft-thresholding,’’ IEEE Trans. Inform. Theory, vol. 41, pp. 613–627, 1995.

D. Donoho, I. Johnstone, ‘‘Ideal spatial adaptation by wavelet shrinkage,’’ Biometrika, vol. 81, pp. 425–455, 1994.

D. Donoho, I. Johnstone, ‘‘Adapting to unknown smoothness via wavelet shrinkage,’’ J. Amer. Stat. Assoc, pp. 1200–1224, 1995.

I. Johnstone, B. Silverman, ‘‘Wavelet threshold estimators for data with correlated noise,’’ J. Roy. Statist. Soc., vol. 59, pp. 319–351, 1997.

B. Vidakovic, C. Lozoya, ‘‘ On time-dependant wavelet denoising,’’ IEEE Trans. Signal Process., vol. 46, pp. 2549–2554, 1998.

M. Bahoura , J. Rouat, ‘‘Wavelet speech enhancement based on time-scale adaptation,’’ Speech Communication, vol. 48, pp. 1620-1637, 2006.

M. Bahoura, J. Rouat, J., ‘‘Wavelet speech enhancement using the Teager energy operator,’’ IEEE Signal Process. Lett., vol. 8, pp. 10–12, 2001a.

M. Bahoura, J. Rouat, ‘‘New approach for wavelet speech enhancement,’’ in Proc. Eurospeech, 2001, pp.1937–1940.

J. Yao, Y. T. Zhang, ‘‘Bionic wavelet transform: A new time-frequency method based on an auditory model,’’ IEEE Transactions on Biomedical Engineering, vol. 48, pp. 856-863, 2001.

J. Yao, Y. T. Zhang, ‘‘The application of bionic wavelet transform to speech processing in cochlear implants using neural network simulations,’’ IEEE Transactions on Biomedical Engineering, vol. 49, pp. 1299-1309, 2002.

M. T. Johnson, X. Yuan, Y. Ren, ‘‘Speech signal enhancement through adaptive wavelet thresholding,’’ Speech Communication, vol. 49, pp. 123-133, 2007.

F. Chen, Y. T. Zhang, ‘‘A new implementation of discrete bionic wavelet transform: Adaptive tiling,’’ Digital Signal Processing, vol. 16, pp. 233-246, 2006.

X. Yuan, ‘‘Auditory Model-Based Bionic Wavelet Transform for Speech Enhancement,’’ Master’s dissertaion, Marquette University, Milwaukee, WI, USA, 2003.

C. Giguere, ‘‘Speech Processing using a Wave Digital Filter Model of the Auditory Periphery,’’ Ph.D. dissertation, University of Cambridge, UK. 1993.

C. Giguere, P. C. Woodland, ‘‘A computational modelof the auditory periphery for speech and hearing research,’’ J. Acoust. Soc. Amer., vol. 95, pp.331–342, 1994.

M. Talbi, L. Salhi, S. Abid, A. Cherif, ‘‘Recurrent Neural Network and Bionic Wavelet Transform for Speech enhancement,’’ Int. J. Signal and Imaging Systems Engineering, vol. 3, 2010.

M. A. Akhaee, F. A. Marvasti, ‘‘A new method for noise reduction in the wavelet domain,’’ in Proc. IST, 2005.

H. Taşmaz, E. Erçelebi, ‘‘Speech enhancement based on undecimated wavelet packet-perceptual filterbanks and MMSE-STSA estimation in various noise environments,’’ Digital Signal Processing, vol. 16, pp. 797-812, 2008.

A. W. Rix, J. G. Beerends, M. P. Holler, A. P. Hekstra, ‘‘Perceptual evaluation of speech quality (pesq)- a new method for speech quality assessment of telephone networks and codecs,’’ in Proc. ICASSP, vol. 2, 2001, pp. 749-752.

Y. Hu, P. C. Loizou, ‘‘Evaluation of objective measures for speech enhancement,’’ IEEE Trans, Speech, Audio Processing, vol. 16, pp. 229-238, 2008.

ITU-T P.862, ‘‘Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs,’’ ITU Recommendation P.862, 2001.

E. Zavarehei, S. Vaseghi, Q. Yan, ‘‘Inter-frame modeling of DFT trajectories of speech and noise for speech enhancement using Kalman filters,’’ Speech Communication, vol. 48, pp. 1545-1555, 2006.

Philipos C. Loizou, Speech Enhancement Theory and Practice, Taylor & Francis, 2007.

Y. Ephraim, D. Malah, ‘‘Speech enhancement using a minimum mean square error short time spectral amplitude estimator,’’ IEEE Trans. Acoust. Speech Signal Process. , vol. 32, pp. 1109–1121, 1984.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.