An Efficient Method of Web Sequential Pattern Mining Based on Session Filter and Transaction Identification
|
Title | An Efficient Method of Web Sequential Pattern Mining Based on Session Filter and Transaction Identification |
Authors | |
Abstract | Web sequential pattern mining is an important way to analyze the access behavior of web users. In this paper, we present an efficient method of web sequential pattern mining based on session filter and transaction identification. Different from traditional mining methods, we categorize the user sessions into human user sessions, crawler sessions and resource-download user sessions. Then we filter out the non-human user sessions, leaving the human user sessions for sequential pattern mining. With the purpose of mining users’ meaningful sequential patterns, we identify users’ transactions from the user sessions, and do the sequential pattern mining based on transaction level. We present a method of transaction identification based on users’ access path tree. It can find out all the transactions effectively. We also make some improvements on PrefixSpan algorithm, which can reduce the memory space it takes and avoid generating duplicate projections. The experimental results of our mining method are very satisfactory. |
Publisher | ACADEMY PUBLISHER |
Date | 2010-09-02 |
Source | Journal of Networks Vol 5, No 9 (2010): Special Issue: Recent Advances in Computer Science and Engineering - Track on N |
Rights | Copyright © ACADEMY PUBLISHER - All Rights Reserved.To request permission, please check out URL: http://www.academypublisher.com/copyrightpermission.html. |