Improving missing-value estimation in microarray data with Collaborative Filtering based on rough-set theory

Bo Wen Wang, S. Tseng

Research output: Article › peer-review

11 Citations (Scopus)

Abstract

Data mining techniques have been used to extract useful knowledge from DNA microarray gene expression data in order to discover relations between novel diseases and their related genes. However, DNA microarray gene expression data often contain missing values, which must be handled to prevent them from significantly affecting analysis results. Hence, a number of missing-value imputation approaches have been proposed. In this paper, an intelligent imputation approach named the CFBRST (Collaborative Filtering Based on Rough-Set Theory) method is proposed to impute missing values more accurately than existing approaches. Experimental results on real microarray gene expression datasets reveal that the proposed approach can effectively improve missing-value estimation. The collaborative filtering (CF) approach is often used in recommender systems due to its excellent performance. The proposed CFBRST method is based on the CF method and rough-set theory. The CFBRST method is compared with the k-nearest neighbor (k-NN) imputation algorithm. Experimental results show that the CFBRST method is more accurate than the k-NN approach on yeast cDNA microarray datasets, especially when the percentage of missing values is high.
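For orientation, the k-NN imputation baseline mentioned in the abstract can be sketched roughly as follows. This is a generic illustration and not the CFBRST method or the authors' implementation: the function name `knn_impute`, the root-mean-square distance over shared observed columns, and the inverse-distance weighting are all assumptions made for this example.

```python
import math


def knn_impute(matrix, k=2):
    """Fill None entries in a genes x conditions matrix.

    Hypothetical helper for illustration only: each missing value is
    estimated from the k most similar gene rows that have an observed
    value in the same column.
    """
    imputed = [row[:] for row in matrix]  # leave the input untouched
    for i, row in enumerate(matrix):
        for j, value in enumerate(row):
            if value is not None:
                continue
            candidates = []
            for n, other in enumerate(matrix):
                if n == i or other[j] is None:
                    continue
                # Compare the two genes only on columns observed in both.
                shared = [c for c in range(len(row))
                          if row[c] is not None and other[c] is not None]
                if not shared:
                    continue
                dist = math.sqrt(
                    sum((row[c] - other[c]) ** 2 for c in shared) / len(shared)
                )
                candidates.append((dist, other[j]))
            candidates.sort(key=lambda pair: pair[0])
            nearest = candidates[:k]
            if nearest:
                # Inverse-distance weights (assumed); epsilon avoids division by zero.
                weights = [1.0 / (d + 1e-9) for d, _ in nearest]
                total = sum(w * v for w, (_, v) in zip(weights, nearest))
                imputed[i][j] = total / sum(weights)
    return imputed


# Toy expression matrix: rows are genes, columns are conditions.
expr = [
    [1.0, 2.0, None],
    [1.0, 2.0, 3.0],
    [1.0, 2.0, 3.0],
    [9.0, 9.0, 9.0],
]
filled = knn_impute(expr, k=2)
```

In this toy matrix the first gene matches the second and third exactly on the observed columns, so its missing value is estimated from those two rows rather than from the dissimilar fourth row.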

Original language: English
Pages (from-to): 2157-2172
Number of pages: 16
Journal: International Journal of Innovative Computing, Information and Control
Volume: 8
Issue number: 3 B
Publication status: Published - 1 Mar 2012
