Mining top-K sequential rules

Philippe Fournier-Viger*, S. Tseng

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

41 Scopus citations

Abstract

Mining sequential rules requires specifying parameters that are often difficult to set (the minimal confidence and minimal support). Depending on the choice of these parameters, current algorithms can become very slow and generate an extremely large amount of results or generate too few results, omitting valuable information. This is a serious problem because in practice users have limited resources for analyzing the results and thus are often only interested in discovering a certain amount of results, and fine-tuning the parameters can be very time-consuming. In this paper, we address this problem by proposing TopSeqRules, an efficient algorithm for mining the top-k sequential rules from sequence databases, where k is the number of sequential rules to be found and is set by the user. Experimental results on real-life datasets show that the algorithm has excellent performance and scalability.

Original languageEnglish
Title of host publicationAdvanced Data Mining and Applications - 7th International Conference, ADMA 2011, Proceedings
Pages180-194
Number of pages15
EditionPART 2
DOIs
StatePublished - 28 Dec 2011
Event7th International Conference on Advanced Data Mining and Applications, ADMA 2011 - Beijing, China
Duration: 17 Dec 201119 Dec 2011

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 2
Volume7121 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference7th International Conference on Advanced Data Mining and Applications, ADMA 2011
Country/TerritoryChina
CityBeijing
Period17/12/1119/12/11

Fingerprint

Dive into the research topics of 'Mining top-K sequential rules'. Together they form a unique fingerprint.

Cite this