Logo

Sequential pattern mining using PrefixSpan with pseudoprojection and separator database

Saputra, D. and Rambli, D.R.A. and Foong, Oi Mean (2008) Sequential pattern mining using PrefixSpan with pseudoprojection and separator database. In: International Symposium on Information Technology 2008, ITSim, 26 August 2008 through 29 August 2008, Kuala Lumpur.

This is the latest version of this item.

[img] PDF (Sequential pattern mining using PrefixSpan with pseudoprojection and separator database)
Restricted to Registered users only

11Kb

Abstract

Sequential pattern mining is a new branch of data mining science that solves inter-transaction pattern mining problems. A comprehensive performance study has been reported that PrefixSpan, one of its algorithms, outperforms GSP, SPADE, as well as FreeSpan in most cases, and PrefixSpan integrated with pseudoprojection technique is the fastest among those tested algorithms. Nevertheless, Pseudoprojection technique, which requires maintaining and visiting the in-memory sequence database frequently until all patterns are found, consumes a considerable amount of memory and induces the algorithm to undertake redundant and unnecessary checks to this copy of original database into memory when the candidate patterns are examined. In this paper, we propose Separator Database to improve PrefixSpan with pseudoprojection through early removal of uneconomical in-memory sequence database. The experimental results show that Separator Database improves PrefixSpan with pseudoprojection. Future research includes exploring the use of Separator Database in PrefixSpan with pseudoprojection to improve mining constrained sequential patterns. © 2008 IEEE.

Item Type:Conference or Workshop Item (Paper)
Uncontrolled Keywords:Decision support systems; Information management; Information technology; Mining; Separation; Separators; Candidate patterns; Comprehensive performances; Future researches; Memory sequences; Prefixspan; Sequential Pattern minings; Sequential patterns; Transaction patterns; Database systems
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Departments / MOR / COE:Departments > Computer Information Sciences
ID Code:222
Deposited By: Foong Oi Mean
Deposited On:25 Feb 2010 07:49
Last Modified:19 Jan 2017 08:26

Available Versions of this Item

Repository Staff Only: item control page

Document Downloads

More statistics for this item...