A Hybrid KNN algorithm with Sugeno measure for the personal credit reference system in China

Abstract

Ever increasing ordinal variables are being collected by the Personal Credit Reference System in China, however this system suffers from analysis of this kind of data, which cannot be calculated by Euclidean distance. In this study, we put forward a hybrid KNN algorithm based on Sugeno measure, and we prove that the error of this algorithm is smaller than that of Euclidean distance, furthermore, we use real data obtained from the Personal Credit Reference System to perform experiments and get the user’s initial portrait. Through the comparisons with Kmeans algorithm and other different distance measures in KNN algorithm, we find that the hybrid KNN algorithm is more suitable for clustering personal credit data.

Keywords

Hybrid KNN clustering personal credit reference system Sugeno measure user’s portrait

Get full access to this article

View all access options for this article.

References

Ezequiel

E.J.P.F.

and Opez-Rubio

, Unsupervised Learning by Cluster Quality Optimization, Information Sciences (2018).

Bahrani

, Minaei-Bidgoli

, Parvin

, Mirzarezaee

, Keshavarz

and Alinejad-Rokny

, User and item profile expansion for dealing with cold start problem, Journal of Intelligent & Fuzzy Systems 38(4) (2020), 4471–4483.

Mikhed

and Vogan

, How data breaches affect consumer credit, Journal of Banking & Finance 88 (2018), 192–207.

Bonchi

, Garcia-Soriano

and Liberty

, Correlation clustering: from theory to practice, Acm Sigkdd International Conference on Knowledge Discovery & Data Mining (2014).

, Cui

, Wang

and Su

, Efficient index-based KNN join processing for high-dimensional data, Information and Software Technology 49(4) (2007), 332–344.

, Zhang

, Huang

and Xiong

, High-dimensional kNN joins with incremental updates, Geoinformatica 14(1) (2009), 55.

Tan

, Zhang

and Wu

, Mutual kNN based spectral clustering, Neural Computing and Applications (2018).

Olivares

, Kermarrec

and Chiluka

, The out-of-core KNN awakens: the light side of computation force on large datasets, Computing 101(1) (2019), 19–38.

, Zhang

, Zhao

, Yang

and Pan

, KNN-based maximum margin and minimum volume hyper-sphere machine for imbalanced data classification, International Journal of Machine Learning and Cybernetics 10(2) (2019), 357–368.

10.

Ali

, Jung

L.T.

, Abdel-Aty

, Abubakar

M.Y.

, Elhoseny

and Ali

, Semantic-k-NN algorithm: An enhanced version of traditional k-NN algorithm, Expert Systems with Applications 151 (2020).

11.

Aburomman

A.A.

and Ibne Reaz

M.B.

, A novel SVM-kNN-PSO ensemble method for intrusion detection system, Applied Soft Computing 38 (2016), 360–372.

12.

Shi

, Han

and Yan

, Adaptive clustering algorithm based on kNN and density, Pattern Recognition Letters 104 (2018), 37–44.

13.

Nordhaug Myhre

, Øyvind Mikalsen

, Løkse

and Jenssen

, Robust clustering using a kNN mode seeking ensemble, Pattern Recognition 76 (2018), 491–505.

14.

Zhang

, Cost-sensitive KNN classification, Neurocomputing (2019).

15.

, Chen

and Song

, Boosted K-nearest neighbor classifiers based on fuzzy granules, Knowledge-Based Systems 195 (2020).

16.

Bhattacharya

, Ghosh

and Chowdhury

A.S.

, An affinity-based new local distance function and similarity measure for kNN algorithm, Pattern Recognition Letters 33(3) (2012), 356–363.

17.

Deng

, Zhu

, Cheng

, Zong

and Zhang

, Efficient kNN classification algorithm for big data, Neurocomputing 195 (2016), 143–148.

18.

Chen

, Hu

, Fan

, Shen

, Zhang

, Liu

, Du

, Li

, Chen

and Li

, Fast density peak clustering for large scale data based on kNN, Knowledge-Based Systems (2019), 104824.

19.

Gou

, Qiu

, Yi

, Xu

, Mao

and Zhan

, A Local Mean Representation-based K-Nearest Neighbor Classifier, ACM Transactions On Intelligent Systems and Technology 10(3) (2019), 1–25.

20.

Zhang

, Li

, Zong

, Zhu

and Cheng

, Learning k for kNN Classification, ACM Transactions on Intelligent Systems and Technology (TIST) 8(3) (2017), 1–19.

21.

and Hwang

S.O.

, Automatic text summarization using string vector based K nearest neighbor, Journal of Intelligent & Fuzzy Systems 35(6) (2018), 6005–6016.

22.

Sugeno

, Theory of fuzzy integrals and its applications, Doctoral Thesis Tokyo Institute of Technology (1974).

23.

Klement

E.P.

, Mesiar

and Pap

, A universal integral as common frame for choquet and Sugeno integral, IEEE Transactions On Fuzzy Systems 18(1) (2010), 178–187.

24.

Agahi

, k-generalized Sugeno integral and its application, Information Sciences 305 (2015), 384–394.

25.

Smrek

, Sugeno integrals with respect to level dependent capacities, Fuzzy Sets and Systems 291 (2016), 33–39.

26.

Halaš

, Mesiar

and Pócs

, A new characterization of the discrete Sugeno integral, Information Fusion 29 (2016), 84–86.

27.

Hala

, Mesiar

and Pócs

, Congruences and the discrete Sugeno integrals on bounded distributive lattices, Information Sciences (2016), 443–448.

28.

Dubois

, Prade

, Rico

and Teheux

, Generalized qualitative Sugeno integrals,416}, Information Sciences 415{– (2017), 429–445.

29.

Brabant

and Couceiro

, k -maxitive Sugeno integrals as aggregation models for ordinal preferences, Fuzzy Sets and Systems (2017).

30.

Shubair

, Ramadass

and Altyeb

, kENFIS: kNN-based evolving neuro-fuzzy inference system for computer worms detection, Journal of Intelligent & Fuzzy Systems 26(4) (2014), 1893–1908.

31.

Wang

M.L.

, Zhang

Z.Q.

, Wang

Q.D.

and Shao

H.Y.

, Adaptive Asymptotic Tracking of Nonlinear Systems Using Nonlinearly Parameterized First-Order Sugeno Fuzzy Approximator, International Journal of Fuzzy Systems 20(4) (2018), 1079–1087.

32.

Boczek

, Hovana

and Hutník

, General form of Chebyshev type inequality for generalized Sugeno integral, International Journal of Approximate Reasoning 115 (2019), 1–12.

33.

Daraby

, Rostampour

, Khodadadi

A.R.

, Rahimi

and Mesiar

, One version of the Prékopa-Leindler type inequality for the Sugeno integral, Fuzzy Sets and Systems (2019).

34.

Beliakov

, Gagolewski

and James

, Robust fitting for the Sugeno integral with respect to general fuzzy measures, Information Sciences (2019).

35.

Román-Flores

, Flores-Franulič0

, Aguirre-Cipe

and Romero-Martínez

, A Sugeno integral inequality of Carleman-Knopp type and some refinements, Fuzzy Sets and Systems (2019).

36.

Halaš

, Mesiar

, Pócs

and Torra

, A note on some algebraic properties of discrete Sugeno integrals, Fuzzy Sets and Systems 355 (2019), 110–120.

37.

Han

, Han

and Zhao

, Orthogonal support vector machine for credit scoring, Engineering Applications of Artificial Intelligence 26(2) (2013), 848–862.