TY - JOUR
T1 - A DIAMOND method of inducing classification rules for biological data
AU - Li, Han-Lin
AU - Huang, Yao Huei
PY - 2011/8
Y1 - 2011/8
N2 - Identifying the classification rules for patients, based on a given dataset, is an important role in medical tasks. For example, the rules for estimating the likelihood of survival for patients undergoing breast cancer surgery are critical in treatment planning. Many well-known classification methods (as decision tree methods and hyper-plane methods) assume that classes can be separated by a linear function. However, these methods suffer when the boundaries between the classes are non-linear. This study presents a novel method, called DIAMOND, to induce classification rules from datasets containing non-linear interactions between the input data and the classes to be predicted. Given a set of objects with some classes, DIAMOND separates the objects into different cubes, and assigns each cube to a class. Via the unions of these cubes, DIAMOND uses mixed-integer programs to induce classification rules with better rates of accuracy, support and compact. This study uses three practical datasets (Iris flower, HSV patients, and breast cancer patients) to illustrate the advantages of DIAMOND over some current methods.
AB - Identifying the classification rules for patients, based on a given dataset, is an important role in medical tasks. For example, the rules for estimating the likelihood of survival for patients undergoing breast cancer surgery are critical in treatment planning. Many well-known classification methods (as decision tree methods and hyper-plane methods) assume that classes can be separated by a linear function. However, these methods suffer when the boundaries between the classes are non-linear. This study presents a novel method, called DIAMOND, to induce classification rules from datasets containing non-linear interactions between the input data and the classes to be predicted. Given a set of objects with some classes, DIAMOND separates the objects into different cubes, and assigns each cube to a class. Via the unions of these cubes, DIAMOND uses mixed-integer programs to induce classification rules with better rates of accuracy, support and compact. This study uses three practical datasets (Iris flower, HSV patients, and breast cancer patients) to illustrate the advantages of DIAMOND over some current methods.
KW - Classification rules
KW - Cubes
KW - DIAMOND
KW - Integer program
UR - http://www.scopus.com/inward/record.url?scp=79960624270&partnerID=8YFLogxK
U2 - 10.1016/j.compbiomed.2011.05.002
DO - 10.1016/j.compbiomed.2011.05.002
M3 - Article
C2 - 21742322
AN - SCOPUS:79960624270
SN - 0010-4825
VL - 41
SP - 587
EP - 599
JO - Computers in Biology and Medicine
JF - Computers in Biology and Medicine
IS - 8
ER -