On Measuring the Reliability of K-Mediods with Obstacles, Facilitators Constraints and Edge Detection on Spatial Clustering
Abstract
Clustering spatial data is a well-known problem that has been extensively studied. Grouping similar data in large 2-dimensional spaces to find hidden patterns or meaningful sub-groups has many applications such as satellite images, geographic information systems, medical image analysis, marketing, computer visions, etc. Spatial clustering has been an active research area in Spatial Data Mining (SDM). Many methods on spatial clustering have been proposed in the literature, but few of them have taken into account constraints that may be present in the data clustering. In this paper, we discuss the problem of spatial clustering with obstacles constraints and propose a novel spatial clustering using edge detection method and K-Mediods, which objective is to cluster the spatial data (images) with the constraints and also comparing the result with the various constraints based clustering algorithms in terms of number of clusters and its execution time.The Edge detection based K-Mediods algorithms can not only given attention to higher speed and stronger global optimum search, but also get down to the obstacles and facilitator constraints and practicalities of spatial clustering. Taking into account these constraints during the clustering process is costly and the modeling of the constraints is paramount for good performance. The results on real datasets shown that the Edge detection based spatial clustering with the constraints are performs better than the existing constraint based clustering.
Keywords
Full Text:
PDFReferences
Ankerst. M, Breunig. M. M, Kriegel, H. P and Sander. J OPTICS: ordering points to identify the clustering structure. In ACM-SIGMOD Int. Conf. Management of Data (SIG- MOD’ 99), pages 49–60, 1999.
Bradley. P. S. , Fayyad. U. M., and Reina. C. Scaling clustering algorithms to large databases. In Knowledge Discovery and Data Mining, pages 9–15, 1998.
Estivill-Castro. V and. Lee “I Autoclust+: Automatic clustering of point-data sets in the presence of obstacles”. In In- ternational Workshop on Temporal and Spatial and Spatio Temporal Data Mining (TSDM2000), pages 133–146, 2000.
Ester. M, Kriegel. H. P, and Sander. J, “Spatial Data Mining: A Database Approach, ” Proc. 5th Int'l Symp. on Large Spatial Databases, Berlin, 1997, pp. 48-66.
Ester. M, Kriegel. H, Sander. J, and Xu. X, “A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise, ” Proc. of 2nd KDD, Portland, 1996, pp. 226-231
Hinneburg. A and Keim. D. A. An efficient approach to clustering in large multimedia databases with noise. In Knowledge Discovery and Data Mining, pages 58–65, 1998.
Koperski. K and Han, J “Discovery of Spatial Association Rules in Geographic Information Databases, ” Proc. 4th Int'l Symp. on Large Spatial Databases, Portland, Maine, 1995, pp. 47-66.
Karypis. G, Han. E, and Kumar. V. Chameleon: A hierarchical clustering algorithm using dynamic modeling. In IEEE Computer, pages 68–75, 1999.
Kaufman. L and Rousseeuw. P. J, Finding Groups in Data: An Introduction to Cluster Analysis, Wiley, 1990.
Koperski. K and Han, J “Discovery of Spatial Association Rules in Geographic Information Databases, ” Proc. 4th Int'l Symp. on Large Spatial Databases, Portland, Maine, 1995, pp. 47-66.
Matheus C. J. ; Chan P. K. ; and Piatetsky-Shapiro G. 1993. “Systems for Knowledge Discovery in Databases, ” IEEE Transactions on Knowledge and Data Engineering 5(6): 903-913
Ng. R. T and Han. J. Efficient and effective clustering methods for spatial data mining. In Proc. of VLDB Conf., pages 144–155.
Ralambondrainy. H A conceptual version of the k-means algorithm. Pattern Recognition Letters, 16(11):1147–1157, 1995.
Sheikholeslami, G Chatterjee, S and. Zhang. A Wavecluster: “A multi-resolution clustering approach for very large spatial databases, ” 1998
Shekhar. S and. Chawla,. S “Spatial Databases: A Tour, ” Prentice Hall, 2003
Tung. A. K. H, Han. J, Lakshmanan. L. V. S, and Ng. T. V. “Constraint- Based Clustering in Large Databases, ” In Proceedings of the International Conference on Database Theory (ICDT'01) [C], London, U. K., 2001. pp. 405-419.
Tung. A. K. H, Hou. J, and Han. J. “Spatial Clustering in the Presence of Obstacles”, In Proceedings of International Conference on Data Engineering (ICDE'01) [C], Heidelberg, Germany, April, 2001. pp. 359-367.
Wang. X and Hamilton H. J. “DBRS: A Density-Based Spatial Clustering Method with Random Sampling. ” In Proceedings of the 7th PAKDD [C], Seoul, Korea, 2003. pp. 563- 575
Zahn. C. Graph-theoretical methods for detecting and de- scribing gestalt clusters. In IEEE Transactions on Comput- ers, pages 20:68–86, 1971.
Zaïane O. R and Lee. C. H “Clustering Spatial Data When Facing Physical Constraints”. In Proceedings of the IEEE International Conference on Data Mining (ICDM'02) [C], Maebashi City, Japan, 2002. pp. 737-740.
Refbacks
- There are currently no refbacks.
This work is licensed under a Creative Commons Attribution 3.0 License.