An Enhanced TSA-MLP Model for Identifying Credit Default Problems

Abstract

Credit default has always been one of the critical factors in the development of personal credit business. Establishing a default identification model helps lenders avoid defaults effectively. Existing methods for identifying credit default have three problems: (1) they have difficulty dealing with non-linear data; (2) local stagnation results in a high error rate; and (3) premature convergence leads to a low classification rate. In this paper, the sinhTSA-MLP default risk identification model is proposed to solve these problems. In this model, the proposed sinhTSA method effectively avoids falling into local optima and premature convergence, and benchmark test results demonstrate that sinhTSA is superior to the compared methods. In the two experiments, the classification rate reaches 77.35% and 96.48%, respectively. Therefore, the sinhTSA-MLP default identification model has particular advantages in identifying credit problems. The experiments support the feasibility of the model, which can help lenders manage credit default more deliberately.

Keywords

sinhTSA-MLP, default identification model, credit risk, default problems

Introduction

Credit risk, also known as default risk, refers to the risk of economic loss caused by the failure of a counterparty to fulfill the obligations in the agreed contract (Oreski & Oreski, 2014). When borrowers neglect their obligation to repay principal and interest, the lender's actual return may deviate from the expected return; this is the primary type of financial risk (Ramos-Tallada, 2015). In recent years, international economic growth has been weak, and research on credit default remains insufficient. Among banks, the management and control mechanisms for credit default are not yet advanced, and insufficient information confronts research on credit default with many challenges (Butaru et al., 2016). Therefore, there is a need to create a systematic prevention and control mechanism. That is to say, more research on credit default, a forward-looking default supervision system, and stronger default prediction and default identification are necessary.

With the increasingly diversified assets of financial institutions and the rise of internet finance, the management of credit assets has become more complex, and bank credit accordingly faces more defaults (Ozbayoglu et al., 2020). The loss of bank capital caused by credit default not only affects the survival and development of the bank itself but also triggers a chain reaction among related parties (Bhattacharya et al., 2020). To achieve risk control, traditional financial institutions often build credit scoring models based on rules or statistical analysis of historical data. Such traditional customer credit default identification systems struggle to meet actual needs. Therefore, proposing an appropriate and effective method to identify credit default has become a focal point for both scholars and practitioners (Wang, Ma, et al., 2014).

Many methods have been proposed to address credit default identification problems (Chou et al., 2017; Liu et al., 2019). They can be divided into three categories: expert judgments (Huang et al., 2005), statistical methods, and Computational Intelligence (CI) approaches (Chou et al., 2017). Supervised learning, the most common CI method for credit default identification, performs better than the traditional methods (Carcillo et al., 2021; Zhang et al., 2021). At present, the traditional methods include Random Forest (RF; Rao et al., 2020), Decision Tree (DT; Abri Aghdam et al., 2021), Logistic Regression (LR; Fan et al., 2020), Support Vector Machines (SVM; Danenas & Garsva, 2015), and so on.

Credit default identification belongs to supervised learning in machine learning and has an explicit target variable: the type of customer (Jordan & Mitchell, 2015; Ping & Yongheng, 2011). With the evolution of artificial intelligence, artificial neural networks (ANN) have been employed to forecast credit default (Lopez-Garcia et al., 2020; Thomas, 2000). The construction process of supervised learning is the same as that of the traditional methods and follows basic business norms (Rtayli & Enneya, 2020). The independent and dependent variables, including many typical indicators and good or bad samples, are determined by business needs. In practice, machine learning has not yet had much impact on the existing credit business, so there is room for its promotion and application. Owing to the complex and non-linear characteristics of the financial market, these methods generally have limitations in timeliness and data integrity, so comprehensive and reliable risk control cannot be achieved. Moreover, local stagnation and premature convergence frequently occur (Malhotra & Malhotra, 2003; Pławiak et al., 2019). Consequently, the traditional statistical methods are not suitable for analyzing complex, high-dimensional, and noisy data.

To summarize, in the big data environment with its massive transaction data, improved artificial neural networks are emerging that can handle high-dimensional, complex data (Falavigna, 2012; Meng et al., 2021), and they have been applied in intelligent finance and big data risk control. In a financial setting, such a model can be trained to identify credit default quickly. The Multi-Layer Perceptron (MLP) method achieves high accuracy in classification and real-world problems (Feng et al., 2020; Mohammadi et al., 2021). The MLP with a hidden layer is a typical ANN architecture. It needs to be optimized because it tends to fall into local optima when dealing with massive data (Meng et al., 2021). Traditional swarm intelligence methods are relatively mature for solving specific problems (Ertenlice & Kalayci, 2018). This motivated us to use swarm intelligence methods to optimize the weights and biases of the MLP to achieve optimal performance (Meng et al., 2021; Mirjalili et al., 2012). Because of these merits, MLP can effectively prevent credit risks and reduce the non-performing loan ratio. MLP follows the development trend and exploits new advantages such as the internet and big data, which compensates for the shortcomings of traditional default identification methods and leaves more room for improving credit default identification. In this paper, a swarm intelligence optimization method optimizes the weights and biases of the MLP used for credit default identification, which makes up for the limitations of traditional techniques.

The Tree-Seed Algorithm (TSA) is a swarm intelligence optimization method that imitates the propagation relationship between trees and seeds (Kiran, 2015). At present, TSA has been applied in many fields, including engineering optimization (Jiang et al., 2020a), the symmetric traveling salesman problem (Cinar et al., 2020), and the optimal power flow problem (El-Fergany & Hasanien, 2018). Compared with traditional and some meta-heuristic methods, it has room for research and development because it has few parameters (Kiran & Hakli, 2021) and is easy to implement (Jiang et al., 2020b). Meanwhile, it has attracted the attention of many scholars, and many variants have been proposed to improve its performance and solve real-world problems, as shown in Table 1. These variants have produced significant results in their areas. However, the basic TSA is not sufficient to optimize an MLP that processes massive data, so it is necessary to propose a TSA variant that enhances the performance of TSA and further optimizes the MLP.

Table 1.

The Existing Variants of the Basic TSA.

Literature and author	Method	Mechanism
Kiran and Hakli (2021)	NTSA	The novel algorithm is based on four different algorithmic approaches, which use two different solution-generating mechanisms to better balance local and global search abilities.
Jiang et al. (2020a)	fb_TSA	Through feedback mechanism, the ST and the number of seeds are dynamically adjusted to achieve the balance between exploration and exploitation.
Jiang et al. (2020b)	STSA	An adaptive automatic adjustment mechanism and the new initialization for the number of seeds to balance exploration and exploitation.
Ding et al. (2020)	C-Jaya-TSA	A hybrid swarm intelligence technique based on Jaya and TSA.
Jiang et al. (2020c)	TSASC	Two features from sine-cosine method are integrated into the TSA to balance the exploration and exploitation.
Jiang et al. (2019)	EST-TSA	Redesigns the balance rule between exploitation and exploration and improves the original position updating formula of TSA.
Cinar et al. (2020)	DTSA	The integration of the swap, shift, and symmetry transformation operators.
Ding et al. (2019)	I-TSA	The Lévy search mechanism and a new updating equation are introduced to improve the original TSA.
Babalik et al. (2018)	CTSA	Application of Deb's rules to solve constrained optimization problems.
Cinar and Kiran (2018)	LogicTSA, SimTSA, SimLogicTSA	Logic gates (LogicTSA) and similarity measurement techniques (SimTSA) are used to improve the performance.
Kiran (2017)	TSAWP	A new control parameter named as withering process (WP) to enhance the performance.

Here, we can get the motivation of this paper:

The MLP credit default risk identification model can effectively identify credit default problems.

The TSA variant improves the MLP performance to obtain a precise classification rate.

From two aspects of MLP and swarm intelligence optimization method, this paper aims to build a credit default identification model based on an intelligent optimization method. To achieve the above research objectives, the following research contents are drawn up:

Firstly, based on the TSA, candidate and adjustment mechanisms are introduced to propose the TSA variant called sinhTSA.

Secondly, the sinhTSA-MLP credit default identification model is proposed to obtain the default results and improve the final classification rate and accuracy.

Thirdly, the feasibility and effectiveness of the proposed sinhTSA-MLP default identification model are verified by financial credit identification data.

Literature Review

Theory

With the rapid development of personal consumer loans, default events have become common. Many factors influence credit default. Because of asymmetric information, banks and other lenders are in a relatively weak position compared with borrowers. Banks often do not know the borrower’s repayment motivation, repayment ability, or “project risk” (Ma, 2020). Currently, in the risk control process, the first and second lines of defense of banks are usually triggered only by the emergence of obvious risk indicators, such as too many overdue days on loans or loss of contact with borrowers. This is a typical stop-loss risk control method, so there is a certain lag in risk identification and remedial measures. According to the possibility of repayment, loans are classified as default or non-default, so as to reveal the real value of the loan. Personal credit default is related to the characteristics of individual loans (Zhang, 2013), including age, education level, length of service, residence, family income, loan-to-income ratio, credit card debts, other debts, sex, the value of fixed assets, loan term, whether there is a mortgage, family structure, and so on. For example, women tend to prefer stability and are less likely to default than men.

Data Mining Techniques

Logistic regression (LR) can predict the credit risk of small and medium-sized enterprises for financial institutions (Zhu et al., 2016) as well as consumer default risk (Costa e Silva et al., 2020). Naive Bayes (NB) is a classification method based on Bayes' theorem and the assumption of conditional independence among features (Chen et al., 2020). The two most widely used classification models are the decision tree model (Zhou et al., 2021) and the naive Bayesian model (NBM; Yager, 2006). First payment default (FPD) loan prediction has been addressed with NB (Koç & Sevgi̇li̇, 2020). K-Nearest Neighbor (KNN) applies not only to consumer credit risk (Kruppa et al., 2013) but also to bank loan default prediction (Arora & Kaur, 2020; Kou et al., 2014). Consequently, KNN has good prospects in predicting credit default.

This paper uses a hybrid model to identify credit risk. A hybrid model uses the relevant data to generate several learners based on certain rules and then integrates these learners into one model through a model integration strategy. In the output stage, the results of the learners are fused using predetermined judgment criteria, and the fused result is the output of the hybrid model. Using the proposed model, this paper selects two data sets and judges whether a customer defaults by taking the influencing factors as the input.

Method

Tree-seed Algorithm

Tree-seed Algorithm (TSA) is a heuristic method that simulates the propagation behavior between trees and seeds. It consists of the following essential steps.

Firstly, trees are generated through the initialization phase.

Secondly, each parent tree generates seeds, with the search tendency (ST) controlling which generation rule is used.

Thirdly, when the fitness value of a seed is less than that of its tree, the seed replaces the tree.

Finally, when the maximum number of iterations is reached, the global optimum found so far is returned.
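The four steps above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the function name tsa_minimize, the two seed update rules, the random scaling factor alpha in [-1, 1], the number of seeds per tree (10% to 25% of the population), and the clipping to the search bounds are all assumptions, since this section does not spell them out.

```python
import random


def tsa_minimize(objective, dim, bounds, n_trees=10, st=0.1, max_iter=100, rng_seed=0):
    """Minimal Tree-Seed Algorithm sketch for minimization (illustrative only)."""
    if n_trees < 2:
        raise ValueError("need at least two trees to pick a random partner")
    rng = random.Random(rng_seed)
    low, high = bounds

    # Step 1: initialize the trees uniformly at random inside the bounds.
    trees = [[rng.uniform(low, high) for _ in range(dim)] for _ in range(n_trees)]
    fitness = [objective(t) for t in trees]

    for _ in range(max_iter):
        best = list(trees[min(range(n_trees), key=lambda i: fitness[i])])
        for i in range(n_trees):
            # Step 2: the parent tree produces seeds (10%-25% of the
            # population size here; this range is an assumption).
            n_seeds = rng.randint(max(1, n_trees // 10), max(1, n_trees // 4))
            best_seed, best_seed_fit = None, float("inf")
            for _ in range(n_seeds):
                r = rng.choice([k for k in range(n_trees) if k != i])
                seed = []
                for d in range(dim):
                    alpha = rng.uniform(-1.0, 1.0)
                    if rng.random() < st:
                        # ST branch (assumed rule): move relative to the best tree.
                        value = trees[i][d] + alpha * (best[d] - trees[r][d])
                    else:
                        # Otherwise (assumed rule): move relative to the current tree.
                        value = trees[i][d] + alpha * (trees[i][d] - trees[r][d])
                    seed.append(min(max(value, low), high))  # keep inside bounds
                seed_fit = objective(seed)
                if seed_fit < best_seed_fit:
                    best_seed, best_seed_fit = seed, seed_fit
            # Step 3: a seed replaces its tree only when it has lower fitness.
            if best_seed_fit < fitness[i]:
                trees[i], fitness[i] = best_seed, best_seed_fit

    # Step 4: after the last iteration, return the best tree found.
    winner = min(range(n_trees), key=lambda i: fitness[i])
    return trees[winner], fitness[winner]
```

For example, tsa_minimize(lambda x: sum(v * v for v in x), 2, (-5.0, 5.0)) searches for the minimum of a two-dimensional sphere function. Because a tree is replaced only by a better seed, the best fitness never gets worse from one iteration to the next.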

Multi-Layer Perceptron

Multi-Layer Perceptron (MLP) has been widely applied to financial problems (Chen et al., 2016). For example, it obtains superior outcomes for the bankruptcy prediction of Iranian companies (Mokhatab Rafiei et al., 2011).

MLP is a type of Artificial Neural Network (ANN; Turkoglu & Kaya, 2020). In addition to the input and output layers, it can have multiple hidden layers between them. Equations (1)–(4) describe how the MLP output is computed.

Firstly, calculate the weighted sum of the inputs by equation (1):

s_{j} = \sum_{i = 1}^{n} (w_{ij} X_{i}) - \theta_{j}, \quad j = 1, 2, \dots, h

(1)

Secondly, calculate the output for the hidden nodes by equation (2):

S_{j} = \mathrm{sigmoid}(s_{j}) = \frac{1}{1 + \exp(- s_{j})}, \quad j = 1, 2, \dots, h

(2)

Thirdly, calculate the result of the hidden node to get the final output by equations (3) and (4):

o_{k} = \sum_{j = 1}^{h} (w_{jk} S_{j}) - \theta_{k}^{'}, \quad k = 1, 2, \dots, m

(3)

O_{k} = \mathrm{sigmoid}(o_{k}) = \frac{1}{1 + \exp(- o_{k})}, \quad k = 1, 2, \dots, m

(4)

where n is the number of input nodes, $w_{ij}$ is the weight linking the ith input node and the jth hidden node, and $X_{i}$ is the ith input. $w_{jk}$ is the weight connecting the jth hidden node and the kth output node. $\theta_{j}$ and $\theta_{k}^{'}$ are the biases of the jth hidden node and the kth output node, respectively, and h and m are the numbers of hidden and output nodes.
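As a minimal sketch, the forward pass of equations (1)–(4) for a single hidden layer can be written in plain Python. The function and argument names (mlp_forward, w_in, theta_hidden, w_out, theta_out) and the nested-list layout of the weights are illustrative assumptions, not part of the original model description.

```python
import math


def sigmoid(x):
    """Logistic function, written to avoid overflow for large negative x."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def mlp_forward(x, w_in, theta_hidden, w_out, theta_out):
    """Forward pass of a one-hidden-layer MLP.

    x            : list of n inputs
    w_in[i][j]   : weight from input node i to hidden node j
    theta_hidden : list of h hidden biases
    w_out[j][k]  : weight from hidden node j to output node k
    theta_out    : list of m output biases
    """
    n, h, m = len(x), len(theta_hidden), len(theta_out)
    # Eq. (1): weighted sum of the inputs minus the hidden bias.
    s = [sum(w_in[i][j] * x[i] for i in range(n)) - theta_hidden[j] for j in range(h)]
    # Eq. (2): sigmoid activation of the hidden nodes.
    S = [sigmoid(v) for v in s]
    # Eq. (3): weighted sum of the hidden outputs minus the output bias.
    o = [sum(w_out[j][k] * S[j] for j in range(h)) - theta_out[k] for k in range(m)]
    # Eq. (4): sigmoid activation of the output nodes.
    return [sigmoid(v) for v in o]
```

With all weights and biases set to zero, every weighted sum is zero and each output equals sigmoid(0) = 0.5, which is a quick sanity check for the indexing.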

Proposed sinhTSA

The candidate mechanism

The candidate mechanism is introduced to enhance global diversity and exploration ability. It also adjusts the convergence speed of the original TSA and guides the search toward the global optimum.

B_{tree (avg)} = \frac{B_{tree (1)} + B_{tree (2)} + B_{tree (3)} + B_{tree (4)}}{4}

(5)

B_{tree, pool} = \{B_{tree (1)}, B_{tree (2)}, B_{tree (3)}, B_{tree (4)}, B_{tree (avg)}\}

(6)

where $B_{tree (1)}, B_{tree (2)}, B_{tree (3)}, B_{tree (4)}$ are the four best trees and $B_{tree (avg)}$ is the average of these four trees.
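A short sketch of equations (5) and (6) follows. It assumes a minimization problem, so the "best" trees are those with the lowest fitness, and the function name candidate_pool is hypothetical.

```python
def candidate_pool(trees, fitness):
    """Build the candidate pool: the four best trees plus their average.

    trees   : list of position vectors (lists of floats)
    fitness : list of fitness values, lower is better (assumption)
    """
    if len(trees) < 4:
        raise ValueError("the candidate pool needs at least four trees")
    # Indices of the four trees with the lowest fitness.
    top_idx = sorted(range(len(trees)), key=lambda i: fitness[i])[:4]
    top4 = [list(trees[i]) for i in top_idx]
    dim = len(top4[0])
    # Eq. (5): component-wise average of the four best trees.
    avg = [sum(t[d] for t in top4) / 4.0 for d in range(dim)]
    # Eq. (6): the pool holds the four best trees and the average tree.
    return top4 + [avg]
```

The pool always contains five vectors, so a candidate can later be drawn from it by index.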

An adjustment mechanism k with iteration

The main contribution is the definition of suitable hyperbolic coefficients, which dynamically regulate the expansion factor over the iterations and are inspired by the hyperbolic sine function shown in equation (7).

\sinh (x) = \frac{e^{x} - e^{- x}}{2}

(7)

The hyperbolic coefficients (k₁, k₂) change with the number of iterations and are updated by equations (8) and (9), where iter is the current iteration and maxiter is the maximum number of iterations.

k_{1} = \sinh (2) + \frac{\sinh (- \frac{3 \cdot iter}{maxiter})}{\sinh (3)}

(8)

k_{2} = \frac{\sinh (\frac{3 \cdot iter}{maxiter})}{\sinh (3)}

(9)

In the basic TSA, the seed generation mechanism results in premature convergence, while the tree update mechanism leads to local stagnation. The hyperbolic coefficients reduce local stagnation and increase global diversity to a certain extent. The adjustment coefficients k₁ and k₂ vary with the iteration count: by equations (8) and (9), k₁ decreases from sinh(2) to sinh(2) − 1, while k₂ increases from 0 to 1. Toward the end of the iterations, they help fine-tune the current search area to find the global optimum.
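To make the behavior of the two coefficients concrete, the sketch below simply evaluates equations (8) and (9) as printed. The function name hyperbolic_coefficients and its argument names are illustrative assumptions.

```python
import math


def hyperbolic_coefficients(iteration, max_iter):
    """Evaluate k1 (Eq. 8) and k2 (Eq. 9) for the given iteration."""
    ratio = 3.0 * iteration / max_iter
    k1 = math.sinh(2.0) + math.sinh(-ratio) / math.sinh(3.0)
    k2 = math.sinh(ratio) / math.sinh(3.0)
    return k1, k2


if __name__ == "__main__":
    # Print the coefficients at a few points of a 100-iteration run.
    for it in (0, 25, 50, 75, 100):
        k1, k2 = hyperbolic_coefficients(it, 100)
        print(it, round(k1, 4), round(k2, 4))
```

With these formulas, k₁ starts at sinh(2) and k₂ starts at 0; at the final iteration k₁ equals sinh(2) − 1 and k₂ equals 1.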

Combining the two mechanisms gives the new seed production mechanism. When ST is less than the random constant, equation (10) uses the candidate and adjustment mechanisms to generate the seed from the current and random trees, which increases global search diversity and decreases the possibility of local stagnation. Otherwise, equation (11) generates the seed from the current, best, and random trees, which increases the accuracy of finding the global optimum, avoids premature convergence, and accelerates convergence.

S_{j, d} = Tcand_{k, d} + (T_{i, d} - T_{r, d}) \cdot k_{1}

(10)

S_{j, d} = T_{i, d} + (B_{i, d} - T_{r, d}) \cdot k_{2}

(11)
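Equations (10) and (11) can be sketched as a single seed-generation routine. Several details are assumptions made only for illustration: the branch is chosen independently for each dimension, one candidate is drawn uniformly from the pool for each seed, the random tree r differs from the current tree i, and the name sinh_tsa_seed is hypothetical.

```python
import random


def sinh_tsa_seed(trees, i, best, pool, k1, k2, st, rng):
    """Generate one seed for tree i following Eqs. (10) and (11).

    trees : list of tree positions
    best  : position of the best tree
    pool  : candidate pool (e.g., four best trees and their average)
    k1,k2 : hyperbolic coefficients for the current iteration
    st    : search tendency
    rng   : random.Random instance
    """
    r = rng.choice([t for t in range(len(trees)) if t != i])  # random partner tree
    cand = rng.choice(pool)  # assumption: one candidate per seed, drawn uniformly
    seed = []
    for d in range(len(trees[i])):
        if st < rng.random():
            # Eq. (10): candidate tree plus scaled difference of current and random trees.
            value = cand[d] + (trees[i][d] - trees[r][d]) * k1
        else:
            # Eq. (11): current tree plus scaled difference of best and random trees.
            value = trees[i][d] + (best[d] - trees[r][d]) * k2
        seed.append(value)
    return seed
```

Setting st above 1 forces the equation (11) branch and setting it below 0 forces the equation (10) branch, which is convenient for checking each rule in isolation.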

The IEEE CEC 2014 benchmark test functions, which include unimodal, multimodal, hybrid, and composition functions, are used to verify the performance of sinhTSA. To fully test the search precision and convergence rate of the proposed sinhTSA, eight representative methods are used for comparison: GA (Holland & Reitman, 1977), ABC (Karaboga & Ozturk, 2011), BA (Yang & He, 2013), DE (Wang, Li, et al., 2014), SCA (Mirjalili, 2016), BOA (Arora & Singh, 2019), EST-TSA (Jiang et al., 2019), and STSA (Jiang et al., 2020b). The parameter settings of these methods are shown in Table 2. Table 3 shows the experimental results on the standard test set, and the best results are shown in bold. The smaller the value, the better the performance. Table 4 shows the results of the Wilcoxon rank sum test, a nonparametric method for testing whether there is a significant difference between the distributions of the populations from which two paired samples come. A p-value less than α indicates that sinhTSA is significantly better than the compared algorithm. Tables 3 and 4 show that sinhTSA has advantages in dealing with complex problems. Hence, sinhTSA can be used to train the MLP and form a model to identify credit default.

Table 2.

The Initial Parameters Settings for Corresponding Methods.

Method	Parameter	Value
GA (Holland & Reitman, 1977)	Type	Real coded
	Selection	Roulette wheel
	Crossover	Probability = .7
	Mutation	Probability = .2
DE (Wang, Li, et al., 2014)	Scale factor primary	0.6
	Scale factor secondary 1	0.5
	Scale factor secondary 2	0.3
	Crossover rate	0.8
	Mutation scheme	1
	Sorted selection	0
STSA (Jiang et al., 2020b)	ST	0.1
EST-TSA (Jiang et al., 2019)	ST	0.1
BA (Yang & He, 2013)	Loudness (A)	0.5
	Pulse rate (a)	0.5
	Frequency minimum	0
	Frequency maximum	2
SCA (Mirjalili, 2016)	a	2
	r₁	Linearly decreased from a to 0
BOA (Arora & Singh, 2019)	p	0.8
	Power exponent	0.1
	Sensory modality	0.01

Table 3.

The Results of the sinhTSA and Other Methods.

Function		sinhTSA	TSA	EST-TSA	DE	BA	GA	STSA	SCA	ABC	BOA
Unimodal Functions	F1	3.39546E+08	2.27037E+09	1.44219E+09	2.41422E+09	2.05117E+10	3.87567E+09	1.12957E+10	4.97094E+09	1.74246E+10	9.39854E+09
	F2	2.27626E+07	5.28157E+10	4.10819E+10	1.07730E+10	5.29066E+11	2.16332E+11	4.63057E+11	2.32768E+11	5.66610E+11	2.95182E+11
	F3	1.85873E+04	3.43771E+05	2.88086E+05	4.01181E+05	3.63047E+06	2.86961E+05	8.18550E+05	4.00092E+05	1.78930E+06	3.28212E+05
Multimodal Functions	F4	7.72380E+02	8.89927E+03	7.82436E+03	2.02510E+03	2.41497E+05	5.32228E+04	1.61554E+05	5.49467E+04	2.10931E+05	1.07037E+05
	F5	5.21331E+02	5.21360E+02	5.21344E+02	5.21411E+02	5.21543E+02	5.21382E+02	5.21364E+02	5.21394E+02	5.21378E+02	5.21412E+02
	F6	7.23215E+02	7.43732E+02	7.39570E+02	7.64007E+02	7.77526E+02	7.51749E+02	7.63430E+02	7.59589E+02	7.66614E+02	7.55081E+02
	F7	7.01115E+02	1.15543E+03	1.08121E+03	7.66975E+02	5.53836E+03	2.88271E+03	4.91776E+03	2.95812E+03	5.49186E+03	3.83535E+03
	F8	1.40473E+03	1.88983E+03	1.90919E+03	1.78638E+03	2.71920E+03	2.01967E+03	2.59021E+03	2.17620E+03	2.69708E+03	2.16178E+03
	F9	1.87114E+03	2.09419E+03	2.19640E+03	1.98028E+03	3.17931E+03	2.25513E+03	3.00979E+03	2.39414E+03	3.34093E+03	2.40085E+03
	F10	2.09232E+04	2.96095E+04	2.90068E+04	2.92610E+04	3.69478E+04	3.05620E+04	3.28091E+04	3.18577E+04	3.07016E+04	3.33463E+04
	F11	3.04394E+04	3.20615E+04	2.99118E+04	3.40591E+04	3.68181E+04	2.98003E+04	3.28839E+04	3.33669E+04	3.52541E+04	3.30960E+04
	F12	1.20390E+03	1.20452E+03	1.20423E+03	1.20538E+03	1.20691E+03	1.20440E+03	1.20443E+03	1.20489E+03	1.20496E+03	1.20522E+03
	F13	1.30068E+03	1.30338E+03	1.30252E+03	1.30090E+03	1.31288E+03	1.30793E+03	1.31161E+03	1.30803E+03	1.31303E+03	1.30960E+03
	F14	1.40040E+03	1.53972E+03	1.50329E+03	1.40955E+03	2.79295E+03	2.05486E+03	2.62966E+03	2.04287E+03	3.07616E+03	2.33572E+03
	F15	1.60624E+03	6.89148E+05	2.55932E+05	2.32220E+04	3.75940E+08	4.82087E+06	2.08899E+08	9.19389E+06	6.81539E+08	2.55419E+07
	F16	1.64662E+03	1.64689E+03	1.64677E+03	1.64760E+03	1.64876E+03	1.64681E+03	1.64756E+03	1.64755E+03	1.64849E+03	1.64739E+03
Hybrid Functions	F17	2.68815E+07	2.22350E+08	1.52979E+08	3.21219E+08	3.83205E+09	7.30620E+08	1.15674E+09	6.78554E+08	1.67370E+09	1.76963E+09
	F18	1.07119E+05	3.20493E+03	3.42645E+03	1.73023E+06	7.58147E+10	2.16863E+10	2.22456E+10	1.45383E+10	8.08493E+08	4.04451E+10
	F19	2.01744E+03	2.15316E+03	2.12300E+03	2.06813E+03	2.13242E+04	5.89738E+03	6.51836E+03	4.50720E+03	4.74239E+03	1.24079E+04
	F20	3.94551E+04	2.61785E+05	2.18876E+05	1.18865E+06	5.34169E+07	4.58291E+05	4.20631E+06	8.59189E+05	1.47022E+07	1.51876E+06
	F21	1.14693E+07	8.31503E+07	5.07210E+07	1.37788E+08	1.72030E+09	1.68577E+08	4.94436E+08	2.68921E+08	1.00425E+09	5.23683E+08
	F22	5.79760E+03	7.04892E+03	6.68073E+03	6.45941E+03	2.58269E+06	1.91405E+04	1.08225E+04	9.22401E+03	1.09404E+04	4.98043E+05
Composition Functions	F23	2.64884E+03	2.77018E+03	2.50000E+03	2.72803E+03	9.35259E+03	3.13178E+03	5.92918E+03	4.36581E+03	6.41485E+03	2.50000E+03
	F24	2.78744E+03	2.98643E+03	2.60000E+03	2.91048E+03	4.15912E+03	2.75238E+03	3.94206E+03	3.18147E+03	4.33685E+03	2.60000E+03
	F25	2.82466E+03	3.04470E+03	2.70000E+03	3.07542E+03	3.98103E+03	2.73000E+03	3.83414E+03	3.10719E+03	3.92945E+03	2.70000E+03
	F26	2.82455E+03	2.99401E+03	2.80000E+03	3.24152E+03	3.92861E+03	2.80237E+03	2.71950E+03	2.92857E+03	2.73627E+03	2.80000E+03
	F27	5.45758E+03	6.33631E+03	6.61698E+03	6.67247E+03	8.78594E+03	8.33461E+03	7.45197E+03	7.41930E+03	7.25452E+03	8.24340E+03
	F28	9.81319E+03	2.08535E+04	5.58124E+03	8.70205E+03	4.25371E+04	3.60509E+04	2.33372E+04	2.57620E+04	2.63754E+04	2.71039E+04
	F29	5.44081E+04	2.15025E+07	8.33143E+07	5.36778E+05	2.29831E+09	1.91569E+09	1.43202E+09	1.91974E+09	1.04227E+05	3.10000E+03
	F30	7.93135E+04	2.41660E+06	1.15228E+06	1.16230E+06	4.87134E+08	1.38853E+08	5.95475E+07	6.21143E+07	7.18491E+04	3.20000E+03
Friedman mean rank		1.73	4.00	2.60	4.40	9.80	5.30	7.00	6.27	7.70	6.10
Rank		1	3	2	4	10	5	8	7	9	6

Table 4.

The Results of the Wilcoxon Rank Sum Test for the sinhTSA and Other Methods.

Methods		W⁺	W⁻	Better	Worse	Percentage	p Value	α = .05	α = .1
sinhTSA	vs. TSA	444	21	29	1	96.7	1.3601E-05	YES	YES
sinhTSA	vs. DE	465	0	30	0	100	8.4661E-06	YES	YES
sinhTSA	vs. BA	465	0	30	0	100	1.7344E-06	YES	YES
sinhTSA	vs. GA	445	20	27	0	90	3.7243E-05	YES	YES
sinhTSA	vs. STSA	465	0	30	0	100	3.1817E-06	YES	YES
sinhTSA	vs. SCA	465	0	30	0	100	1.7344E-06	YES	YES
sinhTSA	vs. ABC	441	24	28	2	93.3	1.7988E-05	YES	YES
sinhTSA	vs. EST-TSA	411	54	24	6	80	3.3789E-03	YES	YES
sinhTSA	vs. BOA	397	68	24	6	80	7.1570E-04	YES	YES
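Table 4 reports rank sums W⁺ and W⁻ for paired results on the 30 benchmark functions. The sketch below shows one way such sums could be computed from two lists of paired results. It is an illustration under stated assumptions, not the authors' procedure: lower values are taken to be better, W⁺ collects the ranks of pairs where the first method wins, tied absolute differences share an average rank, and zero differences are discarded. The function name signed_rank_sums is hypothetical.

```python
def signed_rank_sums(a, b):
    """Return (w_plus, w_minus) for paired results a and b.

    w_plus sums the ranks of pairs where a is lower (better) than b,
    w_minus sums the ranks of pairs where a is higher than b.
    """
    # Differences b - a; zero differences are dropped (one common convention).
    diffs = [y - x for x, y in zip(a, b) if y != x]
    order = sorted(range(len(diffs)), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * len(diffs)
    pos = 0
    while pos < len(order):
        # Find the run of equal absolute differences and give it the average rank.
        end = pos
        while end + 1 < len(order) and abs(diffs[order[end + 1]]) == abs(diffs[order[pos]]):
            end += 1
        avg_rank = (pos + end) / 2.0 + 1.0
        for idx in order[pos:end + 1]:
            ranks[idx] = avg_rank
        pos = end + 1
    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    w_minus = sum(r for r, d in zip(ranks, diffs) if d < 0)
    return w_plus, w_minus
```

When no pair has a zero difference, the two sums add up to n(n + 1)/2. For n = 30 functions this is 465, which agrees with the W⁺ + W⁻ total of every row in Table 4.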

The Proposed sinhTSA-MLP Credit Default Identification Model

The sinhTSA-MLP Credit Default Identification Model

We use sinhTSA to construct the credit default identification model. By continuously optimizing the weights and biases, sinhTSA improves the performance of the MLP, which raises the classification rate and reduces the error rate.

Parameter Setting and Criteria

The sinhTSA-MLP is compared with other methods to verify the ability of the proposed method to identify credit default. Table 2 shows the parameter settings of the comparison methods. Reasonable evaluation criteria are also of great importance; in this paper, equation (12) computes the mean squared error (MSE).

MSE = \sum_{i = 1}^{m} {(o_{i}^{k} - d_{i}^{k})}^{2}

(12)

where m is the number of outputs, and $d_{i}^{k}$ and $o_{i}^{k}$ are the desired output and actual output of the ith input when the kth training sample is used. The average MSE ($\bar{MSE}$) is used to ensure the efficiency, and it is computed by equation (13).

\bar{MSE} = \sum_{k = 1}^{s} \frac{\sum_{i = 1}^{m} {(o_{i}^{k} - d_{i}^{k})}^{2}}{s}

(13)

where s is the number of training samples. Training the MLP is thus formulated as an optimization problem: sinhTSA searches for the vector $\vec{v}$ of weights and biases that minimizes the average MSE, as stated in equation (14).

minimize : F (\vec{v}) = \bar{MSE}

(14)
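The objective in equation (14) can be sketched as a fitness function that a swarm optimizer such as sinhTSA would minimize. The layout of the flat vector (input-to-hidden weights, hidden biases, hidden-to-output weights, output biases) and all function names are assumptions made for illustration; the forward pass follows equations (1)–(4).

```python
import math


def _sigmoid(x):
    """Logistic function, written to avoid overflow for large negative x."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def decode(v, n, h, m):
    """Split a flat vector into MLP parameters (layout is an assumption)."""
    if len(v) != n * h + h + h * m + m:
        raise ValueError("vector length does not match the network size")
    pos = 0
    w_in = [v[pos + i * h: pos + (i + 1) * h] for i in range(n)]
    pos += n * h
    theta_hidden = v[pos: pos + h]
    pos += h
    w_out = [v[pos + j * m: pos + (j + 1) * m] for j in range(h)]
    pos += h * m
    theta_out = v[pos: pos + m]
    return w_in, theta_hidden, w_out, theta_out


def forward(x, w_in, theta_hidden, w_out, theta_out):
    """Forward pass following Eqs. (1)-(4)."""
    h, m = len(theta_hidden), len(theta_out)
    S = [_sigmoid(sum(w_in[i][j] * x[i] for i in range(len(x))) - theta_hidden[j])
         for j in range(h)]
    return [_sigmoid(sum(w_out[j][k] * S[j] for j in range(h)) - theta_out[k])
            for k in range(m)]


def average_mse_fitness(v, samples, targets, n, h, m):
    """Eq. (13): squared error summed over outputs, averaged over samples."""
    params = decode(v, n, h, m)
    total = 0.0
    for x, d in zip(samples, targets):
        o = forward(x, *params)
        total += sum((o_i - d_i) ** 2 for o_i, d_i in zip(o, d))  # Eq. (12)
    return total / len(samples)
```

An optimizer would call average_mse_fitness on each candidate vector and keep the vector with the lowest value. For a network with n inputs, h hidden nodes, and m outputs, the vector has n·h + h + h·m + m entries under this layout.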

The error rate is one of the criteria for evaluating a model (Jain & Duin, 2000; Lessmann et al., 2015), and it is not sensitive to the classification accuracy of the model. Therefore, the error rate is used as a criterion in this paper.

Description of the Data

The Taiwan Data Set was selected because it is widely used and has served to compare the predictive accuracy of default probability among six data mining methods. To further verify the effectiveness of the proposed method, the South German Credit (UPDATE) Data Set is also used.

Taiwan data set

Data from an important bank (a cash and credit card issuer) in Taiwan from April to October 2005 are adopted to train and verify the proposed default identification model. The data come from the UCI repository and include a total of 25,000 observations, of which 5,529 (22.12%) are cardholders with default payment. The default payment is coded as a binary variable (Yes = 1 and No = 0). As shown in Table 5, the 23 variables are selected as inputs and the default is selected as output (Steenackers & Goovaerts, 1989; Yeh & Lien, 2009).

Table 5.

The 23 Variables for Taiwan Data Set.

Variables	Meanings
LIMIT BAL (Input 1)	Amount of given credit in NT dollars (includes individual and family/supplementary credit)
SEX (Input 2)	Gender (1 = male, 2 = female)
EDUCATION (Input 3)	1 = graduate school, 2 = university, 3 = high school, 4 = others, 5 = unknown, 6 = unknown
MARRIAGE (Input 4)	Marital status (1 = married, 2 = single, 3 = others)
AGE (Input 5)	Age in years
PAY 0 (Input 6)	Repayment status in September, 2005 (−1 = pay duly, 1 = payment delay for 1 month, 2 = payment delay for 2 months, 8 = payment delay for 8 months, 9 = payment delay for 9 months and above)
PAY 2 (Input 7)	Repayment status in August, 2005 (scale same as above)
PAY 3 (Input 8)	Repayment status in July, 2005 (scale same as above)
PAY 4 (Input 9)	Repayment status in June, 2005 (scale same as above)
PAY 5 (Input 10)	Repayment status in May, 2005 (scale same as above)
PAY 6 (Input 11)	Repayment status in April, 2005 (scale same as above)
BILL AMT1 (Input 12)	Amount of bill statement in September, 2005 (NT dollar)
BILL AMT2 (Input 13)	Amount of bill statement in August, 2005 (NT dollar)
BILL AMT3 (Input 14)	Amount of bill statement in July, 2005 (NT dollar)
BILL AMT4 (Input 15)	Amount of bill statement in June, 2005 (NT dollar)
BILL AMT5 (Input 16)	Amount of bill statement in May, 2005 (NT dollar)
BILL AMT6 (Input 17)	Amount of bill statement in April, 2005 (NT dollar)
PAY AMT1 (Input 18)	Amount of previous payment in September, 2005 (NT dollar)
PAY AMT2 (Input 19)	Amount of previous payment in August, 2005 (NT dollar)
PAY AMT3 (Input 20)	Amount of previous payment in July, 2005 (NT dollar)
PAY AMT4 (Input 21)	Amount of previous payment in June, 2005 (NT dollar)
PAY AMT5 (Input 22)	Amount of previous payment in May, 2005 (NT dollar)
PAY AMT6 (Input 23)	Amount of previous payment in April, 2005 (NT dollar)

South German credit (UPDATE) data set

The data, donated by the German professor Hans Hofmann via the European Statlog project, are obtained from the UCI repository. The 20 variables (status, duration, credit history, purpose, amount, savings, employment duration, installment rate, personal status sex, other debtors, present residence, property, age, other installment plans, housing, number credits, job, people liable, telephone, and foreign worker) are selected as inputs and the default is selected as output (Fahrmeir & Hamerle, 1981).

For the Taiwan Data Set, one-half of the data is used to train the model, and the remaining data are used to validate the model (32.43%). For the South German Credit (UPDATE) Data Set, 50% of the data is used for training and 50% for validation. To reduce the impact of variable inconsistency, this paper preprocesses the data following Yu et al. (2012).

Empirical Analysis and Suggestion

From the processed data, we can see the distribution of gender, marriage, education, and age among the defaulting customers.

For the Taiwan Data Set, Figure 1 shows the proportion of customers in different situations. The results show that the features of defaulting users are unevenly distributed and that women are more likely to default than men. Customers with a graduate school education tend to default. Customers of different ages have different degrees of default risk.

Figure 1.

Different characteristics of defaulting customers: (a) Proportion of male and female customers in default, (b) Proportion of default customers with different education level, (c) Proportion of default customers with different marriage status, and (d) Proportion of default customers with different age.

For the South German Credit (UPDATE) Data Set, Figure 2 shows some characteristics of the debtors. The results show that the features of defaulting users are unevenly distributed and that married men are more likely to default than divorced men. Debtors who rent their housing and debtors with no savings account tend to default.

Figure 2.

Different characteristics of defaulting customers: (a) Proportion of male and female customers in default, (b) Proportion of quality of debtor’s job, (c) Proportion of housing the debtor lives in, and (d) Proportion of debtors’ savings.

Analysis and Discussion of the Credit Default Identification Through the sinhTSA-MLP

To verify the performance of the proposed model in identifying credit default, several models are compared, namely PSO-MLP (Mirjalili et al., 2012), DE-MLP (Wang, Li, et al., 2014), TSA-MLP (Kiran, 2015), GWO-MLP (Mirjalili, 2015), SCA-MLP (Mirjalili, 2016), GA-MLP (Singh & De, 2017), and AGWO-MLP (Meng et al., 2021). In this section, sinhTSA is combined with MLP to identify credit default. The final classification rate and error rate are the criteria used to evaluate the performance of sinhTSA-MLP.

Taiwan data set

As Table 6 shows, sinhTSA-MLP has the highest final classification rate and the lowest error rate. This is because sinhTSA has strong exploration ability and can avoid local optima. In short, on this data set, using the sinhTSA method improves the classification accuracy and performance of the credit default identification model.

Table 6.

The Results of the Taiwan Data Set.

Metric	sinhTSA-MLP	TSA-MLP	GWO-MLP	DE-MLP	PSO-MLP	SCA-MLP	GA-MLP	AGWO-MLP
Classification rate (%)	77.3565	65.0349	35.4338	77.0336	68.9199	68.1387	20.9353	22.1644
Error rate	0	0.1942	0.1571	0.1958	0.2135	0.1992	0.2175	0.1508

South German credit (UPDATE) data set

As Table 7 shows, sinhTSA-MLP has the highest final classification rate and the lowest error rate. TSA-MLP, GWO-MLP, DE-MLP, PSO-MLP, SCA-MLP, and AGWO-MLP share the same classification rate, but their error rates differ. This indicates that the algorithms differ in convergence, exploration, and exploitation when they update the MLP. From the results, it can be concluded that sinhTSA has a strong exploration ability that avoids local stagnation, and that it can effectively update the weights and biases of the MLP to improve the classification rate.

Table 7.

The Results of the Credit Default Identification.

Metric	sinhTSA-MLP	TSA-MLP	GWO-MLP	DE-MLP	PSO-MLP	SCA-MLP	GA-MLP	AGWO-MLP
Classification rate (%)	96.8	3.2	3.2	3.2	3.2	3.2	10.4	3.2
Error rate	0	0.2668	0.1746	0.2963	0.3102	0.3177	0.3866	0.1736

Tables 6 and 7 show that sinhTSA-MLP has certain advantages in credit default identification. Compared with the basic TSA-MLP, GWO-MLP, AGWO-MLP, DE-MLP, PSO-MLP, SCA-MLP, and GA-MLP, sinhTSA-MLP obtains a higher classification rate and a lower error rate. The error rate indicates that sinhTSA has a strong global search ability, effectively balances exploration and exploitation, and avoids local stagnation, which improves the convergence speed. These abilities enhance the performance of the sinhTSA-MLP credit default identification model. The error rate also suggests that overfitting can occur in the classification process. Nevertheless, a precise classification rate is obtained, which shows that sinhTSA-MLP is an effective credit default identification model.
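The way a metaheuristic updates the weights and biases of an MLP can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the MLP has one hidden layer and a single sigmoid output, all parameters are stored in one flat vector, and the fitness is the mean squared error. The optimizer shown is a simple perturbation search used as a stand-in, and it is not sinhTSA. All function names and toy samples are hypothetical.

```python
import math
import random


def sigmoid(z):
    """Logistic function; the input is clamped to avoid overflow in exp."""
    z = max(-60.0, min(60.0, z))
    return 1.0 / (1.0 + math.exp(-z))


def dimension(n_in, n_hidden):
    """Number of weights and biases in an n_in -> n_hidden -> 1 MLP."""
    return n_hidden * (n_in + 1) + (n_hidden + 1)


def mlp_output(vector, x, n_in, n_hidden):
    """Decode a flat parameter vector and run one forward pass."""
    hidden = []
    idx = 0
    for _ in range(n_hidden):
        weights = vector[idx:idx + n_in]
        bias = vector[idx + n_in]
        idx += n_in + 1
        hidden.append(sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias))
    out_weights = vector[idx:idx + n_hidden]
    out_bias = vector[idx + n_hidden]
    return sigmoid(sum(w * h for w, h in zip(out_weights, hidden)) + out_bias)


def fitness(vector, samples, n_in, n_hidden):
    """Mean squared error over (inputs, target) pairs; lower is better."""
    errors = [(t - mlp_output(vector, x, n_in, n_hidden)) ** 2 for x, t in samples]
    return sum(errors) / len(errors)


def perturbation_search(samples, n_in, n_hidden, iterations=300, seed=0):
    """Placeholder optimizer (NOT sinhTSA): keep a candidate only if it lowers the error."""
    rng = random.Random(seed)
    best = [rng.uniform(-1.0, 1.0) for _ in range(dimension(n_in, n_hidden))]
    best_fit = fitness(best, samples, n_in, n_hidden)
    for _ in range(iterations):
        candidate = [v + rng.gauss(0.0, 0.5) for v in best]
        candidate_fit = fitness(candidate, samples, n_in, n_hidden)
        if candidate_fit < best_fit:
            best, best_fit = candidate, candidate_fit
    return best, best_fit


# Hypothetical toy samples: two normalized features and a 0/1 default label.
toy_samples = [((0.1, 0.9), 1), ((0.8, 0.2), 0), ((0.2, 0.7), 1), ((0.9, 0.1), 0)]
best_vector, best_error = perturbation_search(toy_samples, n_in=2, n_hidden=3)
```

Any population-based method can replace the placeholder search as long as it proposes candidate vectors of the same length and ranks them with the same fitness function. In this framing the compared optimizers differ only in how they generate candidates.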

Countermeasures and Suggestions

Through the above experimental results and discussion, countermeasures and suggestions are put forward as follows:

Bank managers should establish a predictive risk supervision and management system to strengthen the control of credit default. A systematic and comprehensive risk response mechanism should be built to strengthen the industry’s ability to handle and respond to uncertainty. Where the system is imperfect, increasing the risk reserve can effectively resolve the default risk of non-performing loans in commercial banks.

Different approval processes and mechanisms, such as different repayment periods and loan lines, should be set for customers of different types and economic strengths.

To support banking operations, a customer credit database should be built to record each customer’s monthly repayment status.

At the policy-making level, a knowledge system should be formed from the accumulation of practical application data. In addition, policy terms should be updated in a timely manner to prevent new credit risks. Taking the macro-economy as the premise, it is advisable to measure the current and long-term risk of credit loans and to predict the possibility of future losses in the bank credit business, so as to effectively resolve the default risk of bank non-performing loans. Moreover, it is essential to clarify management regulations, apply information technology, and use digital and computational thinking to make more optimized and rational decisions.
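The customer credit database suggested among the countermeasures above could take a form like the following. This is a minimal sketch using Python's built-in sqlite3 module. The table name, column names, and sample rows are hypothetical, and a production system would need far more than this (for example access control and auditing).

```python
import sqlite3

# In-memory database for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE monthly_repayment (
        customer_id TEXT NOT NULL,
        month       TEXT NOT NULL,   -- assumed format: 'YYYY-MM'
        amount_due  REAL NOT NULL,
        amount_paid REAL NOT NULL,
        PRIMARY KEY (customer_id, month)
    )
    """
)

# Hypothetical sample rows, not real customer data.
sample_rows = [
    ("C001", "2021-01", 100.0, 100.0),
    ("C001", "2021-02", 100.0, 40.0),
    ("C002", "2021-01", 200.0, 0.0),
    ("C002", "2021-02", 200.0, 50.0),
]
conn.executemany("INSERT INTO monthly_repayment VALUES (?, ?, ?, ?)", sample_rows)

# Count the months in which each customer paid less than the amount due.
underpaid_months = conn.execute(
    """
    SELECT customer_id, COUNT(*)
    FROM monthly_repayment
    WHERE amount_paid < amount_due
    GROUP BY customer_id
    ORDER BY customer_id
    """
).fetchall()
```

A per-customer count of underpaid months is one example of a repayment feature that such a record could supply to a default identification model.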

Conclusion and Future Prospects

Conclusion

Credit default identification is complex and highly nonlinear. Default identification and evaluation models, such as support vector machine models and expert systems, are constantly emerging. However, credit default identification is still a challenging problem. Different models need to be combined with different application objects to accurately predict the risk in the actual credit process. This paper effectively integrates a swarm intelligence optimization method with an MLP to identify credit default. According to the No Free Lunch theorem, no single optimizer is best on every problem, and the swarm intelligence optimization method TSA has some shortcomings. Therefore, a variant of TSA called sinhTSA is first proposed, built on candidate and adjustment mechanisms. The sinhTSA is tested on the IEEE CEC 2014 benchmark test functions and compared with EST-TSA, STSA, DE, BA, GA, SCA, ABC, and BOA. It is superior to these methods in terms of exploration ability and local optimum avoidance. This demonstrates that sinhTSA can deal with complex, real-world problems.

With the rapid growth of personal credit assets, the risk of these assets is gradually emerging and has drawn the attention of society, regulatory authorities, and the banks themselves. Accurately grasping the risk situation of the personal credit business and promoting its sustainable and healthy development are essential for managers. Therefore, it is necessary to conduct in-depth research on the current risk situation and management means of personal credit in order to understand precisely the problems in the development of personal credit.

The sinhTSA-MLP credit default identification model is proposed to identify credit default. Based on the results, the sinhTSA-MLP model obtains the highest classification rate and the lowest error rate among the compared MLP-based credit default identification models. Its classification rates are the highest on both data sets, at 77.3565% and 96.8%, with the lowest error rates.

By analyzing the experimental results and applying the method, the corresponding countermeasures are given to reduce the possibility of customer default and minimize the default risk.

Future Prospects

Although the default identification accuracy of this model is high, other user behaviors, such as whether users hold car loans or housing loans, can be considered in future research to achieve better prediction.

The artificial neural network method has high accuracy in credit default identification, which can effectively help prevent credit default. It can be further improved for credit risk identification, for example by finding an appropriate network structure and tuning the parameters with a gradient-based learning algorithm. In addition, more complex artificial neural networks could be used to identify credit defaults.

Footnotes

Acknowledgements

The authors are grateful for the financial support of the Foundation of the Education Department of Jilin Province, China (nos. JJKH20200141KJ, JJKH20210133KJ).

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The Foundation of the Education Department of Jilin Province, China (nos. JJKH20200141KJ, JJKH20210133KJ).

ORCID iDs

Jianhua Jiang

Xianqiu Meng

References

Abri Aghdam, Aghajani, Kanani, Soltan Sanjari, Chaibakhsh, Shirvaniyan, Moosavi, Moghaddasi (2021). A novel decision tree approach to predict the probability of conversion to multiple sclerosis in Iranian patients with optic neuritis. Multiple Sclerosis and Related Disorders, 47, 102658.

Arora, Kaur, P. D. (2020). A Bolasso based consistent feature selection enabled random forest classification algorithm: An application to credit risk assessment. Applied Soft Computing, 86, 105936.

Arora, Singh (2019). Butterfly optimization algorithm: A novel approach for global optimization. Soft Computing, 23(3), 715–734.

Babalik, Cinar, A. C., Kiran, M. S. (2018). A modification of tree-seed algorithm using Deb's rules for constrained optimization. Applied Soft Computing, 63, 289–305.

Bhattacharya, Inekwe, J. N., Valenzuela, M. R. (2020). Credit risk and financial integration: An application of network analysis. International Review of Financial Analysis, 72, 1–14.

Butaru, Chen, Clark, Das, A. W., Siddique (2016). Risk and risk management in the credit card industry. Journal of Banking & Finance, 72, 218–239.

Carcillo, Le Borgne, Y. A., Caelen, Kessaci, Oblé, Bontempi (2021). Combining unsupervised and supervised learning in credit card fraud detection. Information Sciences, 557, 317–331.

Chen, Ribeiro, Chen (2016). Financial credit risk assessment: A recent review. Artificial Intelligence Review, 45(1), 1–23.

Chen, Webb, G. I., Liu (2020). A novel selective naïve Bayes algorithm. Knowledge-Based Systems, 192, 105361.

Chou, C. H., Hsieh, S. C., Qiu, C. J. (2017). Hybrid genetic algorithm and fuzzy clustering for bankruptcy prediction. Applied Soft Computing, 56, 298–316.

Cinar, A. C., Kiran, M. S. (2018). Similarity and logic gate-based tree-seed algorithms for binary optimization. Computers & Industrial Engineering, 115, 631–646.

Cinar, A. C., Korkmaz, Kiran, M. S. (2020). A discrete tree-seed algorithm for solving symmetric traveling salesman problem. Engineering Science and Technology, an International Journal, 23(4), 879–890.

Costa e Silva, Lopes, I. C., Correia, Faria (2020). A logistic regression model for consumer default risk. Journal of Applied Statistics, 47, 2879–2894.

Danenas, Garsva (2015). Selection of support vector machines based classifiers for credit risk domain. Expert Systems with Applications, 42(6), 3194–3204.

Ding, Hao (2020). Non-probabilistic method to consider uncertainties in structural damage identification based on hybrid Jaya and tree seeds algorithm. Engineering Structures, 220, 110925.

Ding, Hao, Z. R. (2019). Nonlinear hysteretic parameter identification using an improved tree-seed algorithm. Swarm and Evolutionary Computation, 46, 69–83.

El-Fergany, A. A., Hasanien, H. M. (2018). Tree-seed algorithm for solving optimal power flow problem in large-scale power systems incorporating validations and comparisons. Applied Soft Computing, 64, 307–316.

Ertenlice, Kalayci, C. B. (2018). A survey of swarm intelligence for portfolio optimization: Algorithms and applications. Swarm and Evolutionary Computation, 39, 36–52.

Fahrmeir, Hamerle (1981). Kategoriale Regression in der betrieblichen Planung. Zeitschrift für Operations Research, 25, B63–B78.

Falavigna (2012). Financial ratings with scarce information: A neural network approach. Expert Systems with Applications, 39(2), 1784–1792.

Fan, Bai, Lei, Zhang, K. C., Tan (2020). Privacy preserving based logistic regression on big data. Journal of Network and Computer Applications, 171, 102769.

Feng, S. F., Huang, Boswell, M. K., Xue (2020). A multi-layer perceptron approach for accelerated wave forecasting in Lake Michigan. Ocean Engineering, 211, 107526.

Holland, J. H., Reitman, J. S. (1977). Cognitive systems based on adaptive algorithms. ACM SIGART Bulletin, 63, 49–49.

Huang, Nakamori, Wang, S. Y. (2005). Forecasting stock market movement direction with support vector machine. Computers & Operations Research, 32(10), 2513–2522.

Jain, A. K., Duin, P. W. (2000). Statistical pattern recognition: A review. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(1), 4–37.

Jiang, Meng, Chen, Qiu, Liu (2020a). Enhancing tree-seed algorithm via feed-back mechanism for optimizing continuous problems. Applied Soft Computing, 92, 106314.

Jiang, Meng (2020b). STSA: A sine tree-seed algorithm for complex continuous optimization problems. Physica A: Statistical Mechanics and its Applications, 537(1), 122802.

Jiang, Han, Meng (2020c). TSASC: Tree–seed algorithm with sine–cosine enhancement for continuous optimization problems. Soft Computing, 24(24), 18627–18646.

Jiang, Meng, Qiu (2019). EST-TSA: An effective search tendency based to tree seed algorithm. Physica A: Statistical Mechanics and its Applications, 534, 122323.

Jordan, M. I., Mitchell, T. M. (2015). Machine learning: Trends, perspectives, and prospects. Science, 349(6245), 255–260.

Karaboga, Ozturk (2011). A novel clustering approach: Artificial bee colony (ABC) algorithm. Applied Soft Computing, 11(1), 652–657.

Kiran, M. S. (2015). TSA: Tree-seed algorithm for continuous optimization. Expert Systems with Applications, 42(19), 6686–6698.

Kiran, M. S. (2017). Withering process for tree-seed algorithm. Procedia Computer Science, 111, 46–51.

Kiran, M. S., Hakli (2021). A tree–seed algorithm based on intelligent search mechanisms for continuous optimization. Applied Soft Computing, 98, 106938.

Koç, Sevgili (2020). Consumer loans first payment default detection: A predictive model. Turkish Journal of Electrical Engineering & Computer Sciences, 28(1), 167–181.

Kou, Peng (2014). MCDM approach to evaluating bank loan default models. Technological and Economic Development of Economy, 20(2), 292–311.

Kruppa, Schwarz, Arminger, Ziegler (2013). Consumer credit risk: Individual probability estimates using machine learning. Expert Systems with Applications, 40(13), 5125–5131.

Lessmann, Baesens, Seow, H. V., Thomas, L. C. (2015). Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research. European Journal of Operational Research, 247(1), 124–136.

Liu, Xie, Zhao, Xie, Liu (2019). Novel evolutionary multi-objective soft subspace clustering algorithm for credit risk assessment. Expert Systems with Applications, 138, 112827.

Lopez-Garcia, T. B., Coronado-Mendoza, Domínguez-Navarro, J. A. (2020). Artificial neural networks in microgrids: A review. Engineering Applications of Artificial Intelligence, 95, 103894.

Malhotra, D. K. (2003). Evaluating consumer loans using neural networks. Omega, 31(2), 83–96.

(2020). Prediction of default probability of credit-card bills. Open Journal of Business and Management, 08, 231–244.

Meng, Jiang, Wang (2021). AGWO: Advanced GWO in multi-layer perception optimization. Expert Systems with Applications, 173, 114676.

Mirjalili (2015). How effective is the grey wolf optimizer in training multi-layer perceptrons. Applied Intelligence, 43(1), 150–161.

Mirjalili (2016). SCA: A sine cosine algorithm for solving optimization problems. Knowledge-Based Systems, 96, 120–133.

Mirjalili, Mohd Hashim, S. Z., Moradian Sardroudi (2012). Training feedforward neural networks using hybrid particle swarm optimization and gravitational search algorithm. Applied Mathematics and Computation, 218(22), 11125–11137.

Mohammadi, Guan, Moazenzadeh, Safari, M. J. S. (2021). Implementation of hybrid particle swarm optimization-differential evolution algorithms coupled with multi-layer perceptron for suspended sediment load estimation. CATENA, 198, 105024.

Mokhatab Rafiei, Manzari, S. M., Bostanian (2011). Financial health prediction models using artificial neural networks, genetic algorithm and multivariate discriminant analysis: Iranian evidence. Expert Systems with Applications, 38(8), 10210–10217.

Oreski, Oreski (2014). Genetic algorithm-based heuristic for feature selection in credit risk assessment. Expert Systems with Applications, 41(4), 2052–2064.

Ping, Yongheng (2011). Neighborhood rough set and SVM based hybrid credit scoring classifier. Expert Systems with Applications, 38(9), 11300–11304.

Pławiak, Abdar, Rajendra Acharya (2019). Application of new deep genetic cascade ensemble of SVM classifiers to predict the Australian credit scoring. Applied Soft Computing, 84, 105740.

Ramos-Tallada (2015). Bank risks, monetary shocks and the credit channel in Brazil: Identification and evidence from panel data. Journal of International Money and Finance, 55, 135–161.

Rao, Liu, Goh, Wen (2020). 2-stage modified random forest model for credit risk assessment of P2P network lending to “three rurals” borrowers. Applied Soft Computing, 95, 106570.

Rtayli, Enneya (2020). Enhanced credit card fraud detection based on SVM-recursive feature elimination and hyper-parameters optimization. Journal of Information Security and Applications, 55, 102596.

Singh, K. J., De (2017). MLP-GA based algorithm to detect application layer DDoS attack. Journal of Information Security and Applications, 36, 145–153.

Steenackers, Goovaerts, M. J. (1989). A credit scoring model for personal loans. Insurance: Mathematics and Economics, 8(1), 31–34.

Thomas, L. C. (2000). A survey of credit and behavioural scoring: Forecasting financial risk of lending to consumers. International Journal of Forecasting, 16, 149–172.

Turkoglu, Kaya (2020). Training multi-layer perceptron with artificial algae algorithm. Engineering Science and Technology, an International Journal, 23(6), 1342–1350.

Wang, Yang (2014). An improved boosting based on feature selection for corporate bankruptcy prediction. Expert Systems with Applications, 41(5), 2353–2361.

Wang, Li, H. X., Huang (2014). Differential evolution based on covariance matrix learning and bimodal distribution parameter setting. Applied Soft Computing, 18, 232–247.

Yager, R. R. (2006). An extension of the naive Bayesian classifier. Information Sciences, 176(5), 577–588.

Yang, X. S. (2013). Bat algorithm: Literature review and applications. International Journal of Bio-Inspired Computation, 5(3), 141–149.

Yeh, I. C., Lien, C. H. (2009). The comparisons of data mining techniques for the predictive accuracy of probability of default of credit card clients. Expert Systems with Applications, 36(2), 2473–2480.

Wei, Y. M., Wang (2012). A PSO–GA optimal model to estimate primary energy demand of China. Energy Policy, 42, 329–340.

Zhang, Han, Wang (2021). HOBA: A novel feature engineering methodology for credit card fraud detection with a deep learning architecture. Information Sciences, 557, 302–316.

Zhang, Z. L. (2013). Identification of credit risk of personal loan in commercial bank based on SVM. Applied Mechanics and Materials, 281, 682–687.

Zhou, Zhang, Zhou, Guo (2021). A feature selection algorithm of decision tree based on feature weight. Expert Systems with Applications, 164, 113842.

Zhu, Xie, Sun, Wang, G. J., Yan, X. G. (2016). Predicting China's SME credit risk in supply chain financing by logistic regression, artificial neural network and hybrid models. Sustainability, 8(5), 433.