Open Access Open Access  Restricted Access Subscription or Fee Access

Speech Recognition using Dynamic Time Warping

R. Shashikant, Daulappa G. Bhalke

Abstract


In this paper, we discuss an speech recognition based on dynamic time-warping for image retrieval. The DTW-algorithm originates from the field of speech recognition. We claim that the results of a recognizer based on the DTW-algorithm template matching are more “intuitive" to humans than the results of other recognizers. We use MFCC for feature extraction. It is proposed that higher recognition rates can be achieved using MFCC features with DTW. ASM FCC coefficient increases the accuracy also increases.

Keywords


DTW; Speech Recognition; Endpoint Detection; Training; Search Path.

Full Text:

PDF

References


Tiberius Zaharia, Svetlana Segarceanu, Marius Cotescu, Alexandru Spataru “Quantized Dynamic Time Warping (DTW) algorithm”IEEE.2010.

Guangyu Kang “Variable sliding window DTW speech identification algorithm” IEEE.2009.

Oscar, Luis Mengibar-Pozo,Andrzej,Pacut "A new algorithm for signature verification system based on DTW” IEEE .2008

Ivan Kraljevski, Slavcho Chungurski, Zoran Gacovski, Sime Arsenovski Perceived templete matchingquality estimation using DTW algorithm 16th Telecommunications forum TELFOR 2008.

Heng-Da Cheng et al. , A VLSI architecture for dynamic time-wrap recognition of handwritten symbols, IEEE ASSP, Vol. 34, N.3, Jun 1986.

G. R. Quenot et al. , A Dynamic Programming Processor for Templete matchingRecognition, IEEE JSSC, Vol. 24, N(F9)q.20, April 1989.

Stan Salvador, Chan, FastDTW: Toward Accurate Dynamic Time Warping in Linear Time and Space, IEEE Transactions on Biomedical. Engineering, vol. 43, no. 4

Tiberius Zaharia, Svetlana Segarceanu, Marius Cotescu, Alexandru Spataru Department of Applied Electronics and Information Engineering POLITEHNICA University of Bucharest.

Lindasalwa Muda, Mumtaj Begam and I. Elamvazuthi, “Voice Recognition algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques “Journal of computing , Volume 2, issue 3, March 2010, ISSN 2151-9617

Md. Rashidul Hasan, Mustafa Jamil, Md. Golam Rabbani Md. Saifur Rahman,“ Speaker Identification using Mel frequency cepstral coefficients ” ICECE 2004, 28-30 December 2004, Dhaka, Bangladesh

Mark D. Skowronski and John G. Harris,” Improving the“IEEE Intl Symposium on Circuits and Systems, Bangkok, Thailand, vol IV, pp 281-284, May 25 - 28, 2003, ISBN: 0-7803-7761-31

A. Waibel , B. Yegnanarayana , “ Comparative Study of Nonlinear Time Warping Techniques in Isolated Word Speech Recognition Systems “IEEE transaction on Acoustics , speech and signal processing Vol. . ASSP-31, NO. 6, December 1983

Mahmoud I. Abdalla and Hanaa S. Ali “Wavelet-Based Mel- Frequency Cepstral Coefficients for Speaker Identification using Hidden Markov Models “Journal of teleco

Cao Maojun, Shang Fuhua, etc .. Improved DTW algorithm in real·time speech recognition ell, Science, Technology and Engineering, 2010,10 (7) : 1652· 1655

Wu Xiaojie, HU Zhanyi, WU Yihong. Segmental·DTW·based Unsupervised Segmentation behavior [J], Journal of Software, 2008,19 (9): 2285·2292

Liu Xuanhe, Song Tingxin. Speech Recognition and Control Application Technology [M]. Beijing: Science Press, 2008 (3) :42·44.

Ni Xunbo, Zhao Debin, Jiang Fengsong ,etc. Analysis of relationship between Viterbi algorithm and DTW _ application in non·specific language recognition [J], Computer Research and Development, 2010,47 (2) :305·315


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.