Open Access Open Access  Restricted Access Subscription or Fee Access

Mean Centroid K-Means Clustering (MCKM) Based on Boundary Region Analysis for Share Market Database

M. Aruna, Dr. S. Sugumaran, Dr. V. Srinivasan

Abstract


In many research areas it’s always found that it is very difficult to cluster the databases which come under the close region of clusters. This work proposes a novel methodology for performing a structured cluster analysis of share market data. In this work clustering is done using the Mean Centroid K-Means Clustering (MCKM) algorithm and complex regions are selected which are closed to two or more clusters and this selected database is again carefully examined by each of the attribute and then finally clustered to produce more accuracy. In the proposed MCKM Clustering algorithm, centroid value is chosen based on the mean value of the data points. Instead of using random centroid values, mean value of the data points are used as the centroid for clustering. Proposed work has used stock returns at different times along with their valuation ratios from the stocks of National Stock Exchange (NSE) for the India's leading stock exchange. Results of the analysis shows that MCKM analysis builds the most compact clusters when compared to K-Means clustering and self organizing maps (SOM) for share reduced features data. The results are measured by using Silhouette index and Davies–Bouldin index.


Keywords


Boundary Region Analysis, Precision Clusters, Share Market Database, Large database, Reduced Dataset, Clustering, Mean Centroid K-Means Clustering (MCKM) Algorithm

Full Text:

PDF

References


B. Neelima, C.K Jha, & K.B. Sandeep KB 2012, ‘Application of Neural Network in Analysis of Stock Market Prediction’, International Journal of Computer Science & Engineering Technology, vol. 3, no. 4, pp. 61-68.

C.A. Astudillo, J. Poblete, M. Resta, and B.J. Oommen, 2016, A Cluster Analysis of Stock Market Data Using Hierarchical SOMs. In Australasian Joint Conference on Artificial Intelligence (pp. 101-112). Springer, Cham.

C.Y. Chiu, Y. F. Chen, I.T. Kuo, & C.K. He, (2009). An intelligent market segmentation system using k-means and particle swarm optimization. Expert Systems with Applications, 36, 4558–4565.

F. Ren, Y.N. Lu, S.P. Li, X.F. Jiang, L.X. Zhong, and T. Qiu, 2017. Dynamic portfolio strategy using clustering approach. PloS one, 12(1), p.e0169299.

G.A.V. Pai, T. Michel, Evolutionary optimization of constrained k-means clustered assets for diversification in small portfolios. IEEE Trans Evolut Comput. 2009; 13:1030–1053.

I.H. Witten, E. Frank, M.A. Hall, and C.J. Pal, 2016. Data Mining: Practical machine learning tools and techniques. Morgan Kaufmann.

Indian Stock Exchange 2003, Stock Board information. Available from [20 June 2003].

Indian Stock Exchange 2005, Share Market Information. Available from [10 May 2005].

J.B. Kedarnath, & A.T. Nur, 2007, ‘Relationship between entropy and test data compression’, IEEE Transaction on Computer Aided Design of Integrated Circuits and Systems, vol. 26, no. 2, pp. 386-395.

J. Richard, & S. Qiang, 2007, ‘Fuzzy-Rough Sets Assisted Attribute Selection’, IEEE Transactions on Fuzzy Systems, vol. 15, no. 1, pp. 73-89.

M.S. Babu, N. Geethanjali, and B. Satyanarayana, 2012. Clustering approach to stock market prediction. International Journal of Advanced Networking and Applications, 3(4), pp.1281-1291.

P. Berkhin, 2006. A survey of clustering data mining techniques. In Grouping multidimensional data (pp. 25-71). Springer, Berlin, Heidelberg.

S.H. Liao, S.Y. Chou, Data mining investigation of co-movements on the Taiwan and China stock markets for future investment portfolio. Expert Syst Appl. 2013; 40:1542–1554.

S.R. Nanda, B. Mahanty, and M.K. Tiwari, 2010. Clustering Indian stock market data for portfolio management. Expert Systems with Applications, 37(12), pp.8793-8798.

T.D. Chaudhuri, and I. Ghosh, 2016. Using clustering method to understand indian stock market volatility. arXiv preprint arXiv:1604.05015.

V.D. Ta, and C.M. Liu, 2016, Stock market analysis using clustering techniques: the impact of foreign ownership on stock volatility in Vietnam. In Proceedings of the Seventh Symposium on Information and Communication Technology (pp. 99-106). ACM.

V.Srinivasan ‘’Fuzzy Classification to Classify the Income Category Based on Entropy’’ Journal of Computer science and Technology Vol 11 No.2 October 2011.

V. Srinivasan, & M. Aruna, ‘A fuzzy fast classification for share market database with lower and upper bounds’ American journal of Applied Science, vol.12, sep 2012, PP.1934-1939.

X.W. Ai, T. Hu, X. Li, and H. Xiong, 2010, Clustering high-frequency stock data for trading volatility analysis. International Conference on Machine Learning and Applications (ICMLA), pp. 333-338.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.