Using dual response surface methodology as a benchmark to process multi-class imbalanced data

Lee-Ing Tong, Kuei Hu Chang*, Ping Yi Wu, Yung-Chia Chang

*此作品的通信作者

研究成果: Article同行評審

1 引文 斯高帕斯(Scopus)

摘要

Constructing a classification model for the multi-class data is a critical problem in many areas. In practical applications, data in multiple classes are often imbalanced which might result in a classification model with high overall accuracy rate but with low accuracy rate for the minority class. However, minority class is usually the more important one compared to other classes in practice. This study integrates dual response surface methodology, logistic regression analysis, and desirability function to develop an optimal re-sampling strategy for classifying multi-class imbalanced data to effectively improve the low classification accuracy rate of the minority class(es) while still maintain a certain accuracy rate for the majority class(es). Three data-sets drawn from KEEL Database were used in the numerical experiments. The results showed that the proposed method can effectively improve the low classification accuracy rate of the minority class in contrast to the previous work.

原文English
頁(從 - 到)147-158
頁數12
期刊Journal of Industrial and Production Engineering
34
發行號2
DOIs
出版狀態Published - 17 2月 2017

指紋

深入研究「Using dual response surface methodology as a benchmark to process multi-class imbalanced data」主題。共同形成了獨特的指紋。

引用此