TY - JOUR
T1 - Application of eXtreme gradient boosting trees in the construction of credit risk assessment models for financial institutions
AU - Chang, Yung-Chia
AU - Chang, Kuei Hu
AU - Wu, Guan Jhih
PY - 2018/12/1
Y1 - 2018/12/1
N2 - The majority of the studies on credit risk assessment models for financial institutions during recent years focus on the improvement of imbalanced data or on the enhancement of classification accuracy with multistage modeling. Whilst multistage modeling and data pre-processing can boost accuracy somewhat, the heterogeneous nature of data may affects the classification accuracy of classifiers. This paper intends to use the classifier, eXtreme gradient boosting tree (XGBoost), to construct a credit risk assessment model for financial institutions. Cluster-based under-sampling is deployed to process imbalanced data. Finally, the area under the receiver operative curve and the accuracy of classifications are the assessment indicators, in the comparison with other frequently used single-stage classifiers such as logistic regression, self-organizing algorithms and support vector machine. The results indicate that the XGBoost classifier used by this paper achieve better results than the other three and can serve as a superior tool for the development of credit risk models for financial institutions.
AB - The majority of the studies on credit risk assessment models for financial institutions during recent years focus on the improvement of imbalanced data or on the enhancement of classification accuracy with multistage modeling. Whilst multistage modeling and data pre-processing can boost accuracy somewhat, the heterogeneous nature of data may affects the classification accuracy of classifiers. This paper intends to use the classifier, eXtreme gradient boosting tree (XGBoost), to construct a credit risk assessment model for financial institutions. Cluster-based under-sampling is deployed to process imbalanced data. Finally, the area under the receiver operative curve and the accuracy of classifications are the assessment indicators, in the comparison with other frequently used single-stage classifiers such as logistic regression, self-organizing algorithms and support vector machine. The results indicate that the XGBoost classifier used by this paper achieve better results than the other three and can serve as a superior tool for the development of credit risk models for financial institutions.
KW - Credit risk assessment model
KW - eXtreme gradient boosting tree
KW - Receiver operative curve
KW - Support vector machine
UR - http://www.scopus.com/inward/record.url?scp=85054799324&partnerID=8YFLogxK
U2 - 10.1016/j.asoc.2018.09.029
DO - 10.1016/j.asoc.2018.09.029
M3 - Article
AN - SCOPUS:85054799324
SN - 1568-4946
VL - 73
SP - 914
EP - 920
JO - Applied Soft Computing Journal
JF - Applied Soft Computing Journal
ER -