Speech Signal Compression and Reconstruction Using Inverse Technique

Hala Shawky; A. Nassar; M. Abdelnaby; F. E. Abd El-Samie

Speech Signal Compression and Reconstruction Using Inverse Technique

Hala Shawky, A. Nassar, M. Abdelnaby, F. E. Abd El-Samie

Abstract

Speech compression is a process of compressing speech signal to reduce its size for transfer. This paper proposed a new technique to compress the speech signal. This technique is called the decimation process. It is opposite of interpolation. This process reduces the sampling rate and thus save time, storage capacity, and cost. Decimation contains two stages, processes of lowpass filtering followed by downsampling. The benefit of using a filter is to avoid aliasing effect. The reconstruction of the original speech signal can be performed using inverse interpolation techniques such as maximum entropy and regularization theory. Finally, we assess the quality of the reconstructed signal using quality metrics such as signal-to-noise ratio (SNR), signal to noise ratio segmental (SNRseg), spectral distortion (SD) and log-likelihood ratio (LLR).

Keywords

Decimation; Interpolation; Maximum Entropy; Regularization Theory

Full Text:

PDF

References

Kondoz AM (2004) Digital speech: coding for low bit rate communication systems, 2nd ed. Wiley, New York. ISBN:0-470-87008-7

ITU-T, Pulse Code Modulation (PCM) of Voice Frequencies, Technical Report G.711, International Telecommunications Union, Geneva, 1993

ITU-T, 5-, 4-, 3- and 2-bits per Sample Embedded Adaptive Differential Pulse Code Modulation (ADPCM), Technical Report G.727, International Telecommunications Union, Geneva, 1990.

W. Y. V. Leung and P. J. Bones “Statistical interpolation of sampled images,” Optical Engineering, Vol. 40, 2001, pp. 547-553.

J. H. Shin, J. H. Jung, and J. K. Paik, “Regularized iterative image interpolation and its application to spatially scalable coding,” IEEE Transactions on Consumer Electronics, Vol. 44, 1998, pp. 1042-1047.

G. W. Wornell, “Emerging applications of multirate signal processing and wavelets in digital communications,” in Proceedings of the IEEE, vol. 84, 1996, pp. 586–603.

M. Unser, A. Aldroubi, and M. Eden “B-Spline signal processing: Part I- Theory” IEEE Trans. Signal Processing, vol. 41, no.2, pp. 821-833, February 1993

H. C. Anderws and B. R. Hunt, Digital Image Restoration, Prentice-Hall, Englewood Cliffs, New Jersey, 1977.

N. B. Karayiannis and A. N. Venetsanopoulos, “Regularization theory in image restoration: the stabilizing functional approach,” IEEE Transactions on Acoustics, Speech and Signal Processing, Vol. 38, 1990, pp. 1155-1179.

Kang, M G., and A. K. Katsaggelos, "Simultaneous Multichannel Image Restoration and Estimation of the Regularization Parameters", IEEE Trans. Image Processing, vol. 6, issue 5, pp. 774-778, 05/1997.

S. E. El-Khamy, M. M. Hadhoud, M. I. Dessouky, B. M. Salam and F. E. A. El-Samie, “Sectioned implementation of regularized image interpolation,” in Proceedings of the IEEE International Midwest Symposium on Circuits and Systems, Vol. 2, 2003, pp. 656-659.

Refbacks

There are currently no refbacks.

This work is licensed under a Creative Commons Attribution 3.0 License.

Username
Password
Remember me