Open Access Open Access  Restricted Access Subscription or Fee Access

Uncertain Data Classification using RBF Neural Networks

G.V. Suresh, Dr.A.yesu Babu, Usman Ali Sheik

Abstract


-Data uncertainty is often found in real-world applications due to reasons such as imprecise measurement, outdated sources, or sampling errors. Numerous factors lead to data uncertainty including data acquisition device error, approximate measurement, sampling fault, transmission latency, data integration error and so on. In this paper, we focus on one commonly encountered type of data uncertainty the exact data value is unavailable and we only know the probability distribution of the data. An intuitive method of handling this type of uncertainty is to represent the uncertain range by its expectation value, and then process it as certain data. This method, although simple and straightforward, may cause valuable information loss. In this paper, we extend the conventional neural networks classifier by applying Radial basis function so that it can take not only certain data but also uncertain probability distribution as the input. RBF’s have very attractive properties such as localization, functional approximation and cluster modelling. These properties made them attractive in many applications.

Keywords


Radial Basis Function, Uncertainty, Classification.

Full Text:

PDF

References


Aggarwal, C.C., Yu, P.: A framework for clustering uncertain data streams. In: IEEE International Conference on Data Engineering, ICDE (2008)

Cormode, G., McGregor, A.: Approximation algorithms for clustering uncertain data. In: Principle of Data base System, PODS (2008)

Kriegel, H., Pfeifle, M.: Density-based clustering of uncertain data. In: ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), pp. 672–677 (2005)

Singh, S., Mayfield, C., Prabhakar, S., Shah, R.,Hambrusch, S. :Indexing categorical data with uncertainty. In: IEEE International Conference on Data Engineering (ICDE), pp. 616–625 (2007)

Aggarwal, C.C.: On Density Based Transforms for uncertain Data Mining. In: IEEE International Conference on Data Engineering, ICDE (2007)

Aggarwal, C.C.: A Survey of Uncertain Data Algorithms and Applications. IEEE Transactions on Knowledge and Data Engineering 21(5) (2009)

Agrawal, R., Imielinski, T., Swami, A.N.: Database mining: A performance perspective.IEEE Transactions on Knowledge& Data Engineering (1993)

Aggarwal, C.C.: Managing and Mining Uncertain Data. Springer, Heidelberg (2009)

UNN: A Neural Network for Uncertain Data Classification - 449Jiaqi Ge, Yuni Xia, and Chandima Nadungodage Advances in Knowledge Discovery and Data Mining 14th Pacific-Asia Conference, PAKDD 2010Hyderabad, India, June 21-24, 2010

Bi, J., Zhang, T.: Support Vector Machines with Input Data Uncertainty. In: Proceeding of Advances in Neural Information Processing Systems (2004)

Qin, B., Xia, Y., Li, F.: DTU: A Decision Tree for Classifying Uncertain Data. In: Theera-munkong, T., Kijsirikul, B., Cercone, N., Ho, T.-B. (eds.) PAKDD 2009. LNCS, vol. 5476, pp. 4–15. Springer, Heidelberg (2009)

Asuncion, A., Newman, D.: UCI machine learning repository(2007),http://www.ics.uci.edu/mlearn/MLRepository.html

H.-P. Kriegel and M. Pfeifle, “Hierarchical density-based clustering of uncertain data,” in ICDM. IEEE Computer Society, 2005, pp. 689–692.

B. Kao, S. D. Lee, D. W. Cheung, W.-S. Ho, and K. F.Chan, “Clustering uncertain data using Voronoi diagrams,” in ICDM. IEEE Computer Society, 2008, pp. 333–342

.G. Cormode and A. McGregor, “Approximation algorithms for clustering uncertain data,” in PODS, M. Lenzerini and D. Lembo, Eds. ACM, 2008, pp. 191–200.

S. D. Lee, B. Kao, and R. Cheng, “Reducing UK-means to k-means,” in ICDM Workshops. IEEE Computer Society,2007, pp. 483–488.

C. C. Aggarwal and P. S. Yu, “Outlier detection with uncertain data,” in SDM. SIAM, 2008, pp. 483–493.

C. K. Chui and B. Kao, “A decremental approach for mining frequent itemsets from uncertain data,” in PAKDD, vol. 5012. Springer, 2008, pp. 64–75.

Q. Zhang, F. Li, and K. Yi, “Finding frequent items in probabilistic data,” in SIGMOD Conference, J. T.-L. Wang,Ed. ACM, 2008, pp. 819–832.

J. Pei, B. Jiang, X. Lin, and Y. Yuan, “Probabilistic skylines on uncertain data,” in VLDB. ACM, 2007, pp. 15–26.

Cheng, R., Kalashnikov, D., and Prabhakar, S. “Evaluating Probabilistic Queries over Imprecise Data,”Proceedings of the ACM SIGMOD International Conference on Management of Data, June 2003.

Cheng, R., Kalashnikov, D., and Prabhakar, S. “Querying Imprecise Data in Moving Object Environments,”IEEE Transactions on Knowledge and Data Engineering, 16(9) (2004) 1112-1127.

Sato, M., Sato, Y., and Jain, L. Fuzzy Clustering Models and Applications. Physica-Verlag, Heidelberg (1997)

Hamdan, H. and Govaert, G. “Mixture Model Clustering of Uncertain Data,” IEEE International Conference on Fuzzy Systems (2005) 879-884.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.