Open Access Open Access  Restricted Access Subscription or Fee Access

A Study on Feature Selection using Machine Learning Techniques

V. Arul Kumar, L. Arockiam

Abstract


Feature selection has become an emerging research area in the field of pattern recognition and machine learning. It is one of the most important processes in Knowledge Discovery. The data set contains irrelevant, redundant and noisy data, which can be preprocessed using feature selection technique. Through feature selection technique the relevant features are identified for the mining process. Feature selection is one of the factors to classify the data without any misclassification and address the performance of the model. In this study, an attempt is made to review the different feature selection techniques in machine learning scheme.

Keywords


Feature Selection, Supervised Learning, Unsupervised Learning, Semi Supervised Learning.

Full Text:

PDF

References


AshaGowdaKaregowda, M.A.Jayaram, A.S. Manjunath, “Feature Subset Selection Problem using Wrapper Approach in Supervised Learning”, International Journal of Computer Applications,Volume 1,No. 7, 2010, pp.13-17, ISSN:0975 – 8887.

Sven F.Crone, NikolaosKourentzes, “Feature selection for time series prediction–A combined filter and wrapper approach for neural networks”, Journal of Neurocomputing, Volume 73, Issues 10-12, June-2010,pp. 1923-1936, ISSN: 0925-2312.

Pawel Smialowski1,Dmitrij Frishman1,and Stefan Kramer, "Pitfalls of supervised feature selection", Published by Oxford University Press, December 9 , 2010.

Subramanian Appavu Alias Balamurugan, RamasamyRajaram, “Effective and Efficient Feature Selection for Large-scale Data Using Bayes' Theorem”, International Journal of Automation and Computing, Volume6, Issue 1, Feb 2009, pp. 62-71, DOI: 10.1007/s11633-009-0062-2.

Yuanhong Li, Ming Dong, and Jing Hua, “A Gaussian Mixture Model To Detect Clusters Embedded In Feature Subspace”, Journal of Communications in Information and Systems, Volume 7, Number. 4, 2007, pp. 337-352.

Peng Liu, Naijun Wu, Jiaxian Zhu, Junjie Yin, and Wei Zhang, “A Unified Strategy of Feature Selection”,The Second International Conference onAdvanced Data Mining and Applications(ADML 2006), China, August 2006, pp. 457 – 464.

Frederico Coelho, Antonio Padua Braga, and Michel Verleysen, “Multi-Objective Semi-Supervised Feature Selection and Model Selection Based on Pearson’s Correlation Coefficient”, Springer LNCS 6419, 2010, pp. 509–516.

IanisseQuinzán, José M. Sotoca, FilibertoPla, “Clustering-based Feature Selection in Semi-supervised Problems”, Ninth International Conference on Intelligent Systems Design and Applications, Italy, 2009, pp. 535-540, ISBN: 978-0-7695-3872-3/09

Jidong Zhao, Ke Lu, Xiaofei He, "Locality sensitive semi-supervised feature selection", Journal of Neurocomputing, Volume 71, Issues 10-12, June-2008, pp. 1842-1849, ISSN: 0925-2312.

M. Ramaswami and R. Bhaskaran, "A Study on Feature Selection Techniques in Educational Data Mining", Journal of Computing Volume 1, Issue 1, December 2009, pp.7-11,ISSN: 2151-9617.

Zhu Zhang,"Mining relational data from text: From strictly supervised to weakly supervised learning", Journal of Information System, Volume 33, issues 3, May 2008, pp 300-314, doi:10.1016/j.is.2007.10.002

Huan Liu,Hiroshi Motoda, Rudy Setiono and Zheng Zhao, "Feature Selection: An Ever Evolving Frontier in Data Mining",Journal of Machine Learning Research, volume 10, june 2010, Hyderabad, pp. 4-13, ISSN: 1938-7228

Le Song, Alex Smola, Karsten M. Borgwardt, Justin Bedo, "Supervised Feature Selection via Dependence Estimation", procedings International conference of Machine Learning(ICML), June 2007, USA.

Seoung Bum Kim, Panaya Rattakorn, “Unsupervised feature selection using weighted principal components”, International journal of Expert Systems with Applications, Volume 38 Issue 5, May, 2011, pp 5704-5710, DOI:10.1016/j.eswa.2010.10.063.

Daoqiang Zhanga,Songcan Chena, Zhi-Hua Zhoub, “ Constraint Score:A new filter method for feature selection with pairwise constraints” , Elsevier: The Journal of the Pattern Recognition Society, October 2007, DOI: 10.1016/j.patcog.2007.10.009.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.