Open Access Open Access  Restricted Access Subscription or Fee Access

Speech Signal Compression and Reconstruction Using Inverse Technique

Hala Shawky, A. Nassar, M. Abdelnaby, F. E. Abd El-Samie

Abstract


Speech compression is a process of compressing speech signal to reduce its size for transfer. This paper proposed a new technique to compress the speech signal. This technique is called the decimation process. It is opposite of interpolation. This process reduces the sampling rate and thus save time, storage capacity, and cost. Decimation contains two stages, processes of lowpass filtering followed by downsampling. The benefit of using a filter is to avoid aliasing effect. The reconstruction of the original speech signal can be performed using inverse interpolation techniques such as maximum entropy and regularization theory. Finally, we assess the quality of the reconstructed signal using quality metrics such as signal-to-noise ratio (SNR), signal to noise ratio segmental (SNRseg), spectral distortion (SD) and log-likelihood ratio (LLR).

Keywords


Decimation; Interpolation; Maximum Entropy; Regularization Theory

Full Text:

PDF

References


Kondoz AM (2004) Digital speech: coding for low bit rate communication systems, 2nd ed. Wiley, New York. ISBN:0-470-87008-7

ITU-T, Pulse Code Modulation (PCM) of Voice Frequencies, Technical Report G.711, International Telecommunications Union, Geneva, 1993

ITU-T, 5-, 4-, 3- and 2-bits per Sample Embedded Adaptive Differential Pulse Code Modulation (ADPCM), Technical Report G.727, International Telecommunications Union, Geneva, 1990.

W. Y. V. Leung and P. J. Bones “Statistical interpolation of sampled images,” Optical Engineering, Vol. 40, 2001, pp. 547-553.

J. H. Shin, J. H. Jung, and J. K. Paik, “Regularized iterative image interpolation and its application to spatially scalable coding,” IEEE Transactions on Consumer Electronics, Vol. 44, 1998, pp. 1042-1047.

G. W. Wornell, “Emerging applications of multirate signal processing and wavelets in digital communications,” in Proceedings of the IEEE, vol. 84, 1996, pp. 586–603.

M. Unser, A. Aldroubi, and M. Eden “B-Spline signal processing: Part I- Theory” IEEE Trans. Signal Processing, vol. 41, no.2, pp. 821-833, February 1993

H. C. Anderws and B. R. Hunt, Digital Image Restoration, Prentice-Hall, Englewood Cliffs, New Jersey, 1977.

N. B. Karayiannis and A. N. Venetsanopoulos, “Regularization theory in image restoration: the stabilizing functional approach,” IEEE Transactions on Acoustics, Speech and Signal Processing, Vol. 38, 1990, pp. 1155-1179.

Kang, M G., and A. K. Katsaggelos, "Simultaneous Multichannel Image Restoration and Estimation of the Regularization Parameters", IEEE Trans. Image Processing, vol. 6, issue 5, pp. 774-778, 05/1997.

S. E. El-Khamy, M. M. Hadhoud, M. I. Dessouky, B. M. Salam and F. E. A. El-Samie, “Sectioned implementation of regularized image interpolation,” in Proceedings of the IEEE International Midwest Symposium on Circuits and Systems, Vol. 2, 2003, pp. 656-659.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.