TY - JOUR
T1 - Efficient mining of multilevel gene association rules from microarray and gene ontology
AU - Tseng, S.
AU - Yu, Hsieh Hui
AU - Yang, Shih Chiang
PY - 2009/9
Y1 - 2009/9
N2 - Some recent studies have shown that association rules can reveal the interactions between genes that might not have been revealed using traditional analysis methods like clustering. However, the existing studies consider only the association rules among individual genes. In this paper, we propose a new data mining method named MAGO for discovering the multilevel gene association rules from the gene microarray data and the concept hierarchy of Gene Ontology (GO). The proposed method can efficiently find out the relations between GO terms by analyzing the gene expressions with the hierarchy of GO. For example, with the biological process in GO, some rules like Process A (up) → Process B (up) cab be discovered, which indicates that the genes involved in Process B of GO are likely to be up-regulated whenever those involved in Process A are up-regulated. Moreover, we also propose a constrained mining method named CMAGO for discovering the multilevel gene expression rules with user-specified constraints. Through empirical evaluation, the proposed methods are shown to have excellent performance in discovering the hidden multilevel gene association rules.
AB - Some recent studies have shown that association rules can reveal the interactions between genes that might not have been revealed using traditional analysis methods like clustering. However, the existing studies consider only the association rules among individual genes. In this paper, we propose a new data mining method named MAGO for discovering the multilevel gene association rules from the gene microarray data and the concept hierarchy of Gene Ontology (GO). The proposed method can efficiently find out the relations between GO terms by analyzing the gene expressions with the hierarchy of GO. For example, with the biological process in GO, some rules like Process A (up) → Process B (up) cab be discovered, which indicates that the genes involved in Process B of GO are likely to be up-regulated whenever those involved in Process A are up-regulated. Moreover, we also propose a constrained mining method named CMAGO for discovering the multilevel gene expression rules with user-specified constraints. Through empirical evaluation, the proposed methods are shown to have excellent performance in discovering the hidden multilevel gene association rules.
KW - Association rules mining
KW - Data mining
KW - Gene expression analysis
KW - Gene ontology
KW - Microarray
KW - Multi-level association rules
UR - http://www.scopus.com/inward/record.url?scp=68849087940&partnerID=8YFLogxK
U2 - 10.1007/s10796-009-9156-1
DO - 10.1007/s10796-009-9156-1
M3 - Article
AN - SCOPUS:68849087940
SN - 1387-3326
VL - 11
SP - 433
EP - 447
JO - Information Systems Frontiers
JF - Information Systems Frontiers
IS - 4
ER -