TY - JOUR
T1 - Reducing database scans for on-shelf utility mining
AU - Lan, Guo Cheng
AU - Hong, Tzung Pei
AU - Tseng, Vincent Shin-Mu
PY - 2011/3
Y1 - 2011/3
N2 - Utility mining has recently been an emerging topic in the field of data mining. It finds out high-utility itemsets by considering both the profits and quantities of items in transactions. In real applications, however, utility mining may have a bias if items are not always on shelf. On-shelf utility mining is then proposed, which considers not only individual profit and quantity of each item in a transaction but also common on-shelf time periods of a product combination. In the past, a two-phase on-shelf utility mining was proposed to discover the desired patterns in on-shelf utility mining. It, however, adopted the level-wise mining way to find the patterns. To speed up the execution efficiency, a three-scan mining approach is thus proposed in the paper to efficiently discover high on-shelf utility itemsets. The proposed approach utilizes an itemset-generation mechanism to prune redundant candidates early and to systematically check the itemsets from transactions. At last, the experimental results on synthetic datasets show the proposed approach has a better performance than the previous one.
AB - Utility mining has recently been an emerging topic in the field of data mining. It finds out high-utility itemsets by considering both the profits and quantities of items in transactions. In real applications, however, utility mining may have a bias if items are not always on shelf. On-shelf utility mining is then proposed, which considers not only individual profit and quantity of each item in a transaction but also common on-shelf time periods of a product combination. In the past, a two-phase on-shelf utility mining was proposed to discover the desired patterns in on-shelf utility mining. It, however, adopted the level-wise mining way to find the patterns. To speed up the execution efficiency, a three-scan mining approach is thus proposed in the paper to efficiently discover high on-shelf utility itemsets. The proposed approach utilizes an itemset-generation mechanism to prune redundant candidates early and to systematically check the itemsets from transactions. At last, the experimental results on synthetic datasets show the proposed approach has a better performance than the previous one.
KW - Data mining
KW - High-utility itemsets
KW - On-shelf data
KW - Utility mining
UR - http://www.scopus.com/inward/record.url?scp=79953778432&partnerID=8YFLogxK
U2 - 10.4103/0256-4602.78090
DO - 10.4103/0256-4602.78090
M3 - Article
AN - SCOPUS:79953778432
SN - 0256-4602
VL - 28
SP - 103
EP - 112
JO - IETE Technical Review (Institution of Electronics and Telecommunication Engineers, India)
JF - IETE Technical Review (Institution of Electronics and Telecommunication Engineers, India)
IS - 2
ER -