Logo Goletty

An Efficient Method of Web Sequential Pattern Mining Based on Session Filter and Transaction Identification
Journal Title Journal of Networks
Journal Abbreviation jnw
Publisher Group Academy Publisher
Website http://ojs.academypublisher.com
PDF (368 kb)
   
Title An Efficient Method of Web Sequential Pattern Mining Based on Session Filter and Transaction Identification
Authors Gao, Guozhu; Wu, Haiyan; Zhu, Jingjun
Abstract Web sequential pattern mining is an important way to analyze the access behavior of web users. In this paper, we present an efficient method of web sequential pattern mining based on session filter and transaction identification. Different from traditional mining methods, we categorize the user sessions into human user sessions, crawler sessions and resource-download user sessions. Then we filter out the non-human user sessions, leaving the human user sessions for sequential pattern mining. With the purpose of mining users’ meaningful sequential patterns, we identify users’ transactions from the user sessions, and do the sequential pattern mining based on transaction level. We present a method of transaction identification based on users’ access path tree. It can find out all the transactions effectively. We also make some improvements on PrefixSpan algorithm, which can reduce the memory space it takes and avoid generating duplicate projections. The experimental results of our mining method are very satisfactory.
Publisher ACADEMY PUBLISHER
Date 2010-09-02
Source Journal of Networks Vol 5, No 9 (2010): Special Issue: Recent Advances in Computer Science and Engineering - Track on N
Rights Copyright © ACADEMY PUBLISHER - All Rights Reserved.To request permission, please check out URL: http://www.academypublisher.com/copyrightpermission.html. 

 

See other article in the same Issue


Goletty © 2024