Open Access Open Access  Restricted Access Subscription or Fee Access

Hierarchical Multi-class Audio Classification using Support Vector Machine

Shrinivas P. Mahajan, Aparna S. Joshi

Abstract


Automatic audio classification is an important part in audio indexing, content-based audio retrieval and online audio distribution. The efficiency of an audio classification or categorization depends on the ability to capture proper audio features and to accurately classify each feature set corresponding to its own class. In this paper a novel hierarchical audio classification algorithm using Support Vector Machine(SVM) is proposed to classify the  input digital audio stream into five audio classes namely male speech,female speech, music, silence and environmental sound. An audio database “Swarsanchay” consists of recordings of 1000 audio clips of all the above mentioned audio classes with different noisy conditions was created for experimentation. Out of which five hundred clips were used for training and remaining five hundred audio clips were used for testing. The proposed multi class audio classifier correctly identified 475 audio clips giving an overall classification accuracy of 95%. During our investigation of feature extraction we identified best ranking features pertinent to class having better discriminatory power amongst the classes. The best classification accuracy of 95% is obtained when best ranking feature set is used for hierarchical multiclass classification.


Keywords


Audio Classification, Audio Features, Feature Extraction, Support Vector Machines

Full Text:

PDF

References


J. Foote,” Content-based retrieval of music and audio”. In Multimedia Storage and Archiving Systems II, Proc. of SPIE, volume 3229, pp 138-147,1997.

E. Wold, T. Blum, and J. Wheaton. “Content-based Classification, Search and Retrieval of Audio” IEEE Multimedia, 3(3), pp.27-36, 1996.

L. Lu, H. Zhang, H. Jiang, “Content analysis for audio Classification and segmentation”, IEEE Trans. Speech Audio Process, vol. 10, no. 7, pp.504–516, Oct. 2002.

S. Z. Li, “Content-based audio classification and retrieval using the nearest feature line method”, IEEE Transactions on Speech and Audio Processing, vol. 8, pp. 619-625, 2000.

C. C. Lin, S. H. Chen, T. K. Truong, Y. Chang, “Audio classification and categorization based on wavelets and support vector machine”, IEEE Transactions on Speech and Audio Processing, vol. 13, pp. 644-651, 2005.

P. Dhanalakshmi , S. Palanivel, V. Rama lingam’,” Classification of audio signals using SVM and RBFNN”, Expert systems with applications 2008./j.eswa.2008.06.126

Shirazi j, Shaharoh G, “Audio classification based on Sinusoidal model: A new feature.” IEEE Region 10 conference, TENCON 2008

V. N.Vapnik, The Nature of Statistical Learning Theory, Springer, 1995.

http/kernelmachines.org

Chansheng Xu, Namunu C. Maddage, and Xi Shao” Automatic music classification and summarization” IEEE speech and audio processing, vol.13, no.3, 2005

C. J. C. Burges., “A tutorial on support vector machines for pattern recognition,” Data Mining and Knowledge Discovery, 1998.

E.L. Allwein, R.E.Schapire, and Y. Singer, “Reducing multi class to binary: A unifying approach for margin classifiers,” Journal of Machine Learning, 2002.

West K., Cox S, “Features and Classifiers for the automatic classification of musical audio signals”, Proc. ISMIR, pages 531-536,2004.

Schölkopf, B. and Smola, A., Learning with Kernels, MIT Press, Cambridge, MA.

K.E. M. Klein, G. Petrucci, and P. Kabal, “Speech/music discrimination for multimedia applications,” in ICASSP, June 2000.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.