Abstract
Credit scoring, which forecasts the probability of loan default from borrower attributes and credit history, remains a crucial task in the financial industry. As borrower profiles grow more complex, identifying the features that most improve scoring accuracy has become increasingly difficult. This paper presents a systematic, multidimensional evaluation of how different feature selection techniques, namely wrapper-based, filter-based, and embedded methods, affect the performance of machine learning classifiers such as Random Forest (RF) and Extreme Gradient Boosting (XGBoost). The influence of data resampling techniques for addressing class imbalance is also explored. All combinations are evaluated under three settings: original, oversampled, and undersampled data, using three publicly available datasets: the German, Taiwan, and Australian credit scoring datasets. Experimental results show that ensemble classifiers, especially XGBoost and RF, consistently outperform single-classifier models. In addition, feature selection methods, particularly embedded and wrapper techniques, enhance model performance and reduce false positive and false negative rates across all three datasets.
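The embedded feature selection family evaluated above can be illustrated with a minimal sketch. The code below is a hypothetical example, not the paper's actual pipeline: it uses scikit-learn's `SelectFromModel` with a Random Forest on a synthetic, class-imbalanced stand-in for a credit-scoring dataset; all parameter choices (sample size, class weights, importance threshold) are illustrative assumptions.

```python
# Hedged sketch of embedded feature selection: a Random Forest's feature
# importances drive the selection, then a classifier is refit on the
# reduced feature set. Synthetic data stands in for a credit dataset.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SelectFromModel
from sklearn.metrics import f1_score
from sklearn.model_selection import train_test_split

# Imbalanced synthetic data (80% non-default, 20% default) -- illustrative only.
X, y = make_classification(n_samples=2000, n_features=30, n_informative=8,
                           weights=[0.8, 0.2], random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)

# Embedded selection: keep features whose importance is at or above the median.
selector = SelectFromModel(
    RandomForestClassifier(n_estimators=200, random_state=0),
    threshold="median").fit(X_tr, y_tr)
X_tr_sel, X_te_sel = selector.transform(X_tr), selector.transform(X_te)

# Refit the classifier on the selected features and score the minority class.
clf = RandomForestClassifier(n_estimators=200, random_state=0).fit(X_tr_sel, y_tr)
f1 = f1_score(y_te, clf.predict(X_te_sel))
print(f"{X_tr_sel.shape[1]} features kept; F1 = {f1:.3f}")
```

Wrapper methods would instead search feature subsets by repeatedly refitting the model, and filter methods would rank features by a model-independent statistic before any training.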