Efficiently mining high average-utility itemsets with an improved upper-bound strategy

Guo Cheng Lan, Tzung Pei Hong*, S. Tseng

*此作品的通信作者

研究成果: Article同行評審

63 引文 斯高帕斯(Scopus)

摘要

Utility mining has recently been discussed in the field of data mining. A utility itemset considers both profits and quantities of items in transactions, and thus its utility value increases with increasing itemset length. To reveal a better utility effect, an average-utility measure, which is the total utility of an itemset divided by its itemset length, is proposed. However, existing approaches use the traditional average-utility upper-bound model to find high average-utility itemsets, and thus generate a large number of unpromising candidates in the mining process. The present study proposes an improved upper-bound approach that uses the prefix concept to create tighter upper bounds of average-utility values for itemsets, thus reducing the number of unpromising itemsets for mining. Results from experiments on two real databases show that the proposed algorithm outperforms other mining algorithms under various parameter settings.

原文English
頁(從 - 到)1009-1030
頁數22
期刊International Journal of Information Technology and Decision Making
11
發行號5
DOIs
出版狀態Published - 9月 2012

指紋

深入研究「Efficiently mining high average-utility itemsets with an improved upper-bound strategy」主題。共同形成了獨特的指紋。

引用此