Bridging accuracy and interpretability: A rescaled cluster-then-predict approach for enhanced credit scoring

Huei Wen Teng*, Ming Hsuan Kang, I. Han Lee, Le Chi Bai

*Corresponding author for this work

Research output: Article, peer-reviewed

1 Citation (Scopus)

Abstract

Credit scoring is pivotal in the financial industry for assessing individuals’ creditworthiness and optimizing financial institutions’ risk-adjusted returns. While the XGBoost algorithm stands as the state-of-the-art classifier for credit scoring, its complexity impedes interpretation, which is critical for stakeholders’ decision-making. This paper introduces a novel approach termed the “Rescaled Cluster-then-Predict Method,” aimed at enhancing both the interpretability and predictive performance of credit scoring models. Our method first rescales the features and then clusters the data into subgroups. We then fit a Logistic Regression within each subgroup to generate predictions. The paper's primary contributions are twofold. First, empirical evaluations on two distinct datasets demonstrate that our proposed method attains performance competitive with XGBoost while substantially improving interpretability. Notably, in some instances, the Logistic Regression outperforms XGBoost. Second, we show that clustering only the positive cases, as opposed to the entire dataset, yields comparable results while markedly reducing computational requirements. These insights hold significant practical implications for the financial industry, which consistently seeks credit scoring models that are not only accurate but also interpretable and computationally efficient.
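
The pipeline described in the abstract (rescale, cluster, then fit a Logistic Regression per subgroup) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not specify the rescaling scheme or the clustering algorithm, so standardization and k-means are stand-ins, the class name `ClusterThenPredict` and its parameters are hypothetical, and the data are synthetic. The `positives_only` flag mimics the variant in which only the positive cases are clustered.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler


class ClusterThenPredict:
    """Hypothetical sketch: rescale, cluster, one logistic regression per cluster."""

    def __init__(self, n_clusters=3, positives_only=False, random_state=0):
        self.n_clusters = n_clusters
        self.positives_only = positives_only
        self.random_state = random_state

    def fit(self, X, y):
        # Step 1: rescale the features (standardization is an assumption).
        self.scaler_ = StandardScaler().fit(X)
        Xs = self.scaler_.transform(X)

        # Step 2: learn cluster centers from all rows or from positive cases only
        # (k-means is an assumption), then assign every row to its nearest center.
        rows = Xs[y == 1] if self.positives_only else Xs
        self.kmeans_ = KMeans(
            n_clusters=self.n_clusters, n_init=10, random_state=self.random_state
        ).fit(rows)
        labels = self.kmeans_.predict(Xs)

        # Step 3: fit one logistic regression inside each cluster.
        self.models_, self.fallback_ = {}, {}
        for k in range(self.n_clusters):
            mask = labels == k
            if mask.any() and np.unique(y[mask]).size == 2:
                self.models_[k] = LogisticRegression(max_iter=1000).fit(Xs[mask], y[mask])
            else:
                # Empty or single-class cluster: fall back to a constant rate.
                self.fallback_[k] = float(y[mask].mean()) if mask.any() else float(y.mean())
        return self

    def predict_default_proba(self, X):
        """Return the predicted probability of the positive class for each row."""
        Xs = self.scaler_.transform(X)
        labels = self.kmeans_.predict(Xs)
        proba = np.empty(len(Xs))
        for k in range(self.n_clusters):
            mask = labels == k
            if not mask.any():
                continue
            if k in self.models_:
                proba[mask] = self.models_[k].predict_proba(Xs[mask])[:, 1]
            else:
                proba[mask] = self.fallback_[k]
        return proba


# Synthetic, imbalanced data standing in for a credit dataset.
X, y = make_classification(
    n_samples=600, n_features=6, n_informative=4, weights=[0.8, 0.2], random_state=0
)
X_tr, X_te, y_tr, y_te = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

for positives_only in (False, True):
    model = ClusterThenPredict(n_clusters=3, positives_only=positives_only).fit(X_tr, y_tr)
    auc = roc_auc_score(y_te, model.predict_default_proba(X_te))
    print(f"positives_only={positives_only}: test AUC = {auc:.3f}")
```

Each fitted per-cluster model exposes ordinary logistic regression coefficients (`coef_`), which is where the interpretability of a cluster-then-predict design comes from. Clustering only the positive cases shrinks the input to the clustering step, which is consistent with the computational saving the abstract reports.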

Original language: English
Article number: 103005
Journal: International Review of Financial Analysis
Volume: 91
Publication status: Published - Jan 2024
