Voice Based Dictionary Using DTW and MFCC

Parvin I Kinikar; S. A. Pardeshi

Voice Based Dictionary Using DTW and MFCC

Parvin I Kinikar, S. A. Pardeshi

Abstract

Digital processing of voice recognition algorithms is required for highly accurate and fast voice recognition technology. The voice or speech signal contains an information .To represent the voice signal, digital signal processes such as feature extraction technique and feature matching techniques are introduced. In this paper voice based dictionary has been developed, in voice based dictionary words are accessed through spoken alphabet from collected database or dictionary. The developed voice based dictionary system uses silence removal algorithm which remove silence from recorded alphabets, feature extraction technique i.e mel- frequency cepstral coefficients, gives good discrimination of the speech signal. Speech recognition approach i.e. dynamic time warping algorithm, measures similarity between the stored template and test template for speech recognition

Keywords

Automatic Speech Recognition (ASR), Silence Removal Algorithm, Mel-Frequency Cepstral Coefficients (MFCC), Dynamic Time Warping (DTW).

Full Text:

PDF

References

LRabinar,BHJuang,B Yegnanrayana,”Fundamental of Speech Recognition”Pearson 2012.

L Rabinar, R W Schafer,”Digital processing of speech signals”, Pearson 2011. Vimla C, Dr.V .Radha,Rreview on Speech Recognition Challenges and Approaches”,World of computer science and information technology journal(WCSIT),ISSN:2221-0741 Vol.2,No.1,1-7,2012

Guiling Li,Yuanzhen Wang, Min Li,ZongdaWu,”similarity match in time series streams under dynamic time warping distance”, International conference on computer science and software engineering,978-0-7695-3336-0/08 IEEE DOI 10.1109/CSSE.2008.1117.

Jian-Kui Guo,Qing Wang, Zhehhua Huang, Shengli Sun, Yang-Yong Zhu, “ Estimating Similarity over Data Streams Based on Dynamic Time Waroing” ,Fourth international conference on fuzzy system and knowledge discovery (FSKD 2007),0-7695-2874-0/07 IEEE

D.Nov‟akl,D.Cuesta-Frau,T.Al ani,M.Aboy, ”Speech Recognition Methods Applied to Biomedical Signal Processing ”, 26th annual international conference of the IEEE EMBS San Francisco,CA,USA.september 1-5,2004.

Md Afzal Hossan,Sheeraz Memon,Mark A Gregory,”A Novel Approach for MFCC Feature Extraction ”,978-1-4244-7907-8/10 IEEE.

Wei Han,Cheong-Fat Chan,Chiu-Sing CHOY AND Kong-Pang Pun.”An efficient MFCC extraction method in speech recognition”, 0-7803-9390-2/06 IEEE 145 ISCAS

Santosh K Gaikwad,Bharti W.Gawali and Pravin Yannawar.”A Review on Speech recognition technique”,international journal of computer applications (0975-8887)Volume 10-No.3,November 2010.

Wiqas Ghai and Navdeep Singh,”Literature review on automatic speech recognition ”,international journal of computer applications(0975-8887) volume41-No.8,march 2012.

Titus Felix Furtuna,academy of economics studies Bucharest,”Dynamic programming algorithms in speech recognition”,revista information economica nr 2(46)/2008.

Mr.D.G.Bhalke,Dr.C.B.Rama,Rao,DR.D.S Bormane,”Dynamic Time Wraping Technique for Musical Instrument Recognition for Musical instrument recognition for isolated notes”,processing of ICETECT 2011,978-1-4244-7926-9/11 IEEE.

Jian-Kui Guo, Qing Wang, Zhenhua Huang, Shengli Sun, Yang-Yong Zhu “Estimating Similarity Over Data Streams Based on Dynamic Time Warping*” Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007)0-7695-2874-0/07$25.00©2007IEEE

Muda Lindasalwa, Begam Mumtaj and Elamvazuthi I.,“ Voice Recognition algorithm using mel frequency cepstral coefficient (MFCC) and dynamic time wraping (DTW) techniques. Journal Of Computing, Volume 2, Issue 3, pp 138-143, ISSN 2151-9617,March 2010.

Paulraj M P1, Sazali Bin Yaacob1, Ahamad Nazri2 and Sathees Kumar1,“Classification of Vowel Sounds Using MFCC and Feed Forward Neural Network” ,5th International Colloquium on Signal

Processing & Its Applications (CSPA), pp 60 -63, ISBN: 978-1-4244-4152-5, March 2009.

Christopher Hale, CamQuynh Nguyen,“Voice Command Recognition Using Fuzzy logic”, Motorola, Austin, Texas 78735, pp 608-613,ISBN no: 0-7803-2636-9.

Mahdi Shaneh and Azizollah Taheri, “Voice Command Recognition System Based on MFCC and VQ Algorithms” ,World Academy of Science, Engineering and Technology 57 2009, pp 534-538.

Holmes, J. & Holmes, W. (2001), Speech Synthesis and Recognition, 2th ed., Tailor & Francis, London.

Xing Fan and John H. L. Hansen, Fellow, IEEE “Speaker Identification within Whispered Speech Audio Streams” IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 5, JULY 2011

Ben Milner and Xu Shao, “SPEECH RECONSTRUCTION FROM MEL-FREQUENCY CEPSTRAL COEFFICIENTS USING A SOURCE-FILTER MODEL” School of Information Systems, University of East Anglia, Norwich, UK.

Refbacks

There are currently no refbacks.

Username
Password
Remember me