Compact prediction tree: A lossless model for accurate sequence prediction

Ted Gueniche, Philippe Fournier-Viger, S. Tseng

研究成果: Conference contribution同行評審

58 引文 斯高帕斯(Scopus)

摘要

Predicting the next item of a sequence over a finite alphabet has important applications in many domains. In this paper, we present a novel prediction model named CPT (Compact Prediction Tree) which losslessly compress the training data so that all relevant information is available for each prediction. Our approach is incremental, offers a low time complexity for its training phase and is easily adaptable for different applications and contexts. We compared the performance of CPT with state of the art techniques, namely PPM (Prediction by Partial Matching), DG (Dependency Graph) and All-K-th-Order Markov. Results show that CPT yield higher accuracy on most datasets (up to 12% more than the second best approach), has better training time than DG and PPM, and is considerably smaller than All-K-th-Order Markov.

原文English
主出版物標題Advanced Data Mining and Applications - 9th International Conference, ADMA 2013, Proceedings
頁面177-188
頁數12
版本PART 2
DOIs
出版狀態Published - 2013
事件9th International Conference on Advanced Data Mining and Applications, ADMA 2013 - Hangzhou, 中國
持續時間: 14 12月 201316 12月 2013

出版系列

名字Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
號碼PART 2
8347 LNAI
ISSN(列印)0302-9743
ISSN(電子)1611-3349

Conference

Conference9th International Conference on Advanced Data Mining and Applications, ADMA 2013
國家/地區中國
城市Hangzhou
期間14/12/1316/12/13

指紋

深入研究「Compact prediction tree: A lossless model for accurate sequence prediction」主題。共同形成了獨特的指紋。

引用此