Mining Compact High Utility Itemsets Without Candidate Generation

Cheng Wei Wu*, Philippe Fournier-Viger, Jia Yuan Gu, Vincent S. Tseng


研究成果: Chapter同行評審

13 引文 斯高帕斯(Scopus)


Though the research topic of high utility itemset (HUI) mining has received extensive attention in recent years, current algorithms suffer from the crucial problem that too many HUIs tend to be produced. This seriously degrades the performance of HUI mining in terms of execution and memory efficiency. Moreover, it is very hard for users to discover meaningful information in a huge number of HUIs. In this paper, we address this issue by proposing a promising framework with a novel algorithm named CHUI (Compact High Utility Itemset)-Mine to discover closed $$^{+}$$ HUIs and maximal HUIs, which are compact representations of HUIs. The main merits of CHUI-Mine lie in two aspects: First, in terms of efficiency, unlike existing algorithms that tend to produce a large amount of candidates during the mining process, CHUI-Mine computes the utility of itemsets directly without generating candidates. Second, in terms of losslessness, unlike current algorithms that provide incomplete results, CHUI-Mine can discover the complete closed $$^{+}$$ or maximal HUIs with no miss. A comprehensive investigation is also presented to compare the relative advantages of different compact representations in terms of computational cost and compactness. To our best knowledge, this is the first work addressing the issue of mining compact high utility itemsets in terms of closed $$^{+}$$ and maximal HUIs without candidate generation. Experimental results show that CHUI-Mine achieves a massive reduction in the number of HUIs and is several orders of magnitude faster than benchmark algorithms.

主出版物標題Studies in Big Data
發行者Springer Science and Business Media Deutschland GmbH
出版狀態Published - 2019


名字Studies in Big Data


深入研究「Mining Compact High Utility Itemsets Without Candidate Generation」主題。共同形成了獨特的指紋。