An Efficient Way of Frequent Embedded Subtree Mining on Biological Data
|
Title | An Efficient Way of Frequent Embedded Subtree Mining on Biological Data |
Authors | |
Abstract | Data mining provides biological research a useful information analyzing tool. The key factors which influence the performance of biological data mining approaches are the large-scale of biological data and the high similarities among patterns mined. In this paper, we present an efficient algorithm named IRTM for mining frequent subtrees embedded in biological data. We also advance a string encoding method for representing the trees, and a scope-list for extending all substrings for frequency test. The IRTM algorithm adopts vertically mining approach, and uses some pruning techniques to further reduce the computational time and space cost. Experimental results show that IRTM algorithm can achieve significantly performance improvement over previous works. |
Publisher | ACADEMY PUBLISHER |
Date | 2011-12-01 |
Source | Journal of Computers Vol 6, No 12 (2011): Special Issue: Selected Best Papers of ICFMD2010 |
Rights | Copyright © ACADEMY PUBLISHER - All Rights Reserved.To request permission, please check out URL: http://www.academypublisher.com/copyrightpermission.html. |