Open Access Open Access  Restricted Access Subscription or Fee Access

Discovering Frequent Patterns from XML Data

M. Kalyan Chakravarthi, R. Srinivas

Abstract


Currently many websites in the internet are built with the help of XML technology. XML is used to process semi-structured as well as structured data. Also XML was designed to transport and store huge volumes of data. Compatibility between different platforms can be easily achieved with the help of XML. To manage information stored in XML and to extract information in efficient way. Many techniques have been proposed to speed up the performance of XML data. The processing is known as “XML mining”. In this paper, based on the XML data structure, we are going to analyze the similar kind of patterns and propose the data mining technique about extracting the similar patterns from XML data. In this paper to speed up the performance we are going use FP-growth algorithm for mining similar patterns about the XML data structure. The proposed method using FP-growth algorithm applied to XML query subtrees surpass Apriori algorithm.

Keywords


XML Data, FP-Growth Method, Data Mining.

Full Text:

PDF

References


Ling Chen, Sourav S. Bhowmick and Liang-Tien Chia,” Mining Maximal Frequently Changing Subtree Patterns from XML documents”,2003

A. Termier, M.-C. Rousset, and M. Sebag. Mining XML data with frequent trees. In DBFusion Workshop’02, pages 87–96.

Zhang Wei, Liao Hongzhi, Zhao Na, “Research on the FP growth algorithm about association rule mining”, ISBIM 2008, December 19,2008, pp. 315-318.

Yunlong Song,”Research on Application of Data Mining Based on FP-Growth Algorithm for Digital library”,2011.

L.H.Rusu, W.Rahayu, D.Taniar, "Mining Changes from Versions of Dynamic XML Documents", KDXD 2006, LNCS3915, pp.3-12

Chen Qi,”XML-based Data Mining Design and Implementation”,2010

L.Chen, S.S.Bhowmick, L.T.Chia, ”FRACTURE-Mining: Mining Frequently and Concurrently Mutating Structures from Historical XML Documents”, Elsevier Science Journal: Data & Knowledge Engineering Volume: 59 Issue: 2, 2006, pp. 320-347

FAN Ya-qin,” XML in Web Data Mining Application”,2010

Li Juan,” Research of An Association Rule Mining Algorithm Based on FP tree”,2010

Cristal Karina Galindo Dur´an, Mihaela Juganaru-Mathieu,” Specification Design for an XML Mining Configurable Application”,2011

D. Braga, A. Campi, M. Klemettinen, and P. L. Lanzi. Mining association rules from XML data. In Proceedings of the 4th International Conference on Data Warehousing and Knowledge Discovery (DaWaK 2002), September 4-6, Aixen-Provence, France 2002.

R. Agrawal, T. Imielinski, and A. Swami. Mining association rules between sets of items in large databases. In P. Buneman and S. Jajodia, editors, SIGMOD93, pages 207–216, Washington, D.C., USA, May 1993

R. Agrawal and R. Srikant. Fast algorithms for mining association rules in large databases. In J. B. Bocca, M. Jarke, and C. Zaniolo, editors, Proceedings of 20th International Conference on Very Large Data Bases, pages 487–499, Santiago, Chile, September 12-15 1994

Jiawei Han,Jian Pei,“Mining Frequent Patterns by Pattern-Growth: Methodology and Implications”,2000

Evgeny Kharlamov,Pierre Senellart,“Modeling, Querying, and Mining Uncertain XML Data”,2011


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.