Modeling and Application of Credit Scoring Based on A Multi-Objective Approach to Debtor Data in PT. Bank Riau Kepri

Sugianto - Politeknik Caltex Riau, Pekanbaru, 28265, Indonesia
Yohana Dewi Lulu Widyasari - Politeknik Caltex Riau, Pekanbaru, 28265, Indonesia
Kartina Diah Kusuma Wardhani - Politeknik Caltex Riau, Pekanbaru, 28265, Indonesia


DOI: http://dx.doi.org/10.62527/joiv.8.1.1493

Abstract


Information technology in Indonesia has developed rapidly since the start of Industry 4.0. Many companies now use technology to develop their business, including banks, which use it to analyze prospective customers. New employees find this analysis challenging to interpret and tend to approve prospective customers too readily because they only check whether the general requirements are fulfilled. This research aims to identify the primary and additional factors for analyzing prospective credit customers using the Cross-Industry Standard Process for Data Mining (CRISP-DM). The model in this study was developed using data variables of prospective customers, with health insurance as a moderating variable. The model was tested with six algorithms: Decision Tree (accuracy 92.49%), Random Forest (81.72%), Support Vector Machine (SVM) (91.25%), K-Nearest Neighbor (K-NN) (90.58%), Gradient Boosting (90.69%), and XGBoost (93.27%). At the validation stage, cross-validation was applied with K values of 2, 4, 6, 8, and 10. XGBoost achieved the highest accuracy, 93.27%, at a K value of 8, so the model was implemented using the XGBoost algorithm.
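The validation procedure described in the abstract, repeating cross-validation while varying K, can be illustrated with a minimal sketch. It assumes that K denotes the number of folds, which the abstract does not state explicitly. The data are synthetic, the classifier is a small hand-written nearest-neighbour vote with a fixed neighbourhood size of 3, and all function names are hypothetical. None of this reproduces the authors' debtor data, features, or XGBoost model.

```python
import random


def k_fold_indices(n_samples, k, seed=0):
    """Shuffle the sample indices once and deal them into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def knn_predict(train_X, train_y, x, n_neighbors=3):
    """Majority vote among the nearest training points (squared Euclidean distance)."""
    dists = sorted(
        (sum((a - b) ** 2 for a, b in zip(row, x)), label)
        for row, label in zip(train_X, train_y)
    )
    votes = [label for _, label in dists[:n_neighbors]]
    return max(set(votes), key=votes.count)


def cross_val_accuracy(X, y, k, seed=0):
    """Mean accuracy over k folds; each fold is held out exactly once."""
    scores = []
    for held_out in k_fold_indices(len(X), k, seed):
        held = set(held_out)
        train_idx = [i for i in range(len(X)) if i not in held]
        train_X = [X[i] for i in train_idx]
        train_y = [y[i] for i in train_idx]
        correct = sum(knn_predict(train_X, train_y, X[i]) == y[i] for i in held_out)
        scores.append(correct / len(held_out))
    return sum(scores) / len(scores)


def make_toy_data(n=200, seed=42):
    """Synthetic two-class data (assumption: stands in for real debtor records)."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        label = rng.randint(0, 1)
        center = 2.0 * label
        X.append([rng.gauss(center, 1.0), rng.gauss(center, 1.0)])
        y.append(label)
    return X, y


X, y = make_toy_data()
# Same fold counts as in the abstract: K = 2, 4, 6, 8, 10.
results = {k: cross_val_accuracy(X, y, k) for k in (2, 4, 6, 8, 10)}
best_k = max(results, key=results.get)
for k, acc in results.items():
    print(f"K={k:2d}  mean accuracy={acc:.4f}")
print("best K:", best_k)
```

In practice the same loop would wrap each of the six candidate algorithms, and the fold count and model with the highest mean accuracy would be selected, as the abstract reports for XGBoost at K = 8.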

Keywords


Supervised learning; credit scoring; algorithm; XGBoost; application

