Improved Fuzzy Possibilistic C-Means using Artificial Bee Colony for Clustering New Student’s Financial Capability to Determine Tuition Level

Edi Satriyanto - Politeknik Elektronika Negeri Surabaya, Indonesia
Ni Wayan Surya Wardhani - Universitas Brawijaya Malang, Indonesia
Syaiful Anam - Universitas Brawijaya Malang, Indonesia
Wayan Firdaus Mahmudy - Universitas Brawijaya Malang, Indonesia


Citation Format:



DOI: http://dx.doi.org/10.62527/joiv.8.4.3087

Abstract


Outliers in the dataset will affect the quality of the cluster, so a good clustering method is needed. Based on the Mahalanobis distance method, it is known that the research dataset has outliers. Clustering methods that are often used for this type of data are Fuzzy C-means (FCM), Possibilistic C-means (PCM), and Fuzzy Possibilistic C-means (FPCM). This study aims to develop a clustering method that is more robust to outliers by using the Artificial Bee Colony (ABC) algorithm to minimize the objective function of FPCM. This study produces a new algorithm called Artificial Bee Colony Fuzzy Possibilistic C-Means (ABCFPCM) so that the resulting clusters are not easily trapped in the local optimum. This study also provides cluster centroid initialization using K-Means++ to improve cluster quality. ABCFPCM performs best because it significantly increases the Silhouette value and the Between Sum Squares (BSS) and Total Sum Squares (TSS) ratio. ABCFPCM performance provides the best cluster quality of 72.16% based on the BSS/TSS ratio, FPCM of 70.71%, and FCM K-Means++ of 68.14%. K-Means++ in the cluster method does not affect cluster performance except for FCM, where cluster quality is slightly increased. The centroid results of 8 clusters as the best performance of ABCFPCM are used to determine the tuition rate level. The impact of this study is to improve the quality of FPCM performance because it is no longer trapped in a local optimum at the cluster centroid.

Keywords


Artificial bee colony; between sum squares; fuzzy c-means; fuzzy possibilistic C-means; K-means++; mahalanobis distance; outlier; possibilistic C-means; silhouette; total sum squares

Full Text:

PDF

References


Z. Wang, Q. Zhang, and T. Du, “Industrial Data Outlier Detection Based on Density Peaks Clustering (DPC),” Proc. - 2023 2nd Int. Conf. Mach. Learn. Control. Robot. MLCR 2023, pp. 104–108, 2023.

T. Wang, S. Wu, L. Cao, K. Du, Z. Zhao, and Z. He, “A Novel Possibilistic Clustering Algorithm for Measurement Data of Vehicle MMW Radar,” IEEE Sens. J., vol. 23, no. 15, pp. 17103–17116, 2023.

H. Yu, L. Jiang, J. Fan, S. Xie, and R. Lan, “A feature-weighted suppressed possibilistic fuzzy c-means clustering algorithm and its application on color image segmentation,” Expert Syst. Appl., vol. 241, no. September 2022, p. 122270, 2024.

D. Mansoor Hussain, M. Anuroopa, A. Dharshini, and G. Durganandini, “Efficient Bone Tumor Detection and Classification using Fuzzy C Means Clustering Algorithm,” Proc. 5th Int. Conf. Trends Electron. Informatics, ICOEI 2021, pp. 1422–1428, 2021.

S. M. Alqhtani et al., “Improved Brain Tumor Segmentation and Classification in Brain MRI With FCM-SVM: A Diagnostic Approach,” IEEE Access, vol. 12, no. January, pp. 61312–61335, 2024.

Y. Zhao, H. Bai, X. Mao, P. Li, Q. Yang, and D. Nie, “Distribution network insulation fault risk warning method based on LSTM and FCM,” 2024 4th Int. Conf. Neural Networks, Inf. Commun. Eng. NNICE 2024, pp. 1539–1543, 2024.

A. A. Hussein and H. N. Abdulrazzak, “Fuzzy C-Means Clustering Approach for Green IoT in Smart Cities,” 2024 3rd Int. Conf. Power, Control Comput. Technol. ICPC2T 2024, pp. 588–593, 2024.

F. Bu, C. Hu, Q. Zhang, C. Bai, L. T. Yang, and T. Baker, “A Cloud-Edge-Aided Incremental High-Order Possibilistic c-Means Algorithm for Medical Data Clustering,” IEEE Trans. Fuzzy Syst., vol. 29, no. 1, pp. 148–155, 2021.

C. Wu and X. Zhang, “A Novel Kernelized Total Bregman Divergence-Driven Possibilistic Fuzzy Clustering With Multiple Information Constraints for Image Segmentation,” IEEE Trans. Fuzzy Syst., vol. 30, no. 6, pp. 1624–1639, 2022.

W. Zeng, Y. Liu, H. Cui, R. Ma, and Z. Xu, “Interval possibilistic C-means algorithm and its application in image segmentation,” Inf. Sci. (Ny)., vol. 612, pp. 465–480, 2022.

H. A. Gangadwala and R. M. Gulati, “Analysis of Web Usage Mining Using Various Fuzzy Techniques and Cluster Validity Index,” 2022 1st Int. Conf. Electr. Electron. Inf. Commun. Technol. ICEEICT 2022, pp. 1–7, 2022.

S. Kushwaha and K. S. Jadon, “Improved performance using fuzzy possibilistic C-Means clustering algorithm in wireless sensor network,” Proc. - 2020 IEEE 9th Int. Conf. Commun. Syst. Netw. Technol. CSNT 2020, pp. 134–139, 2020.

R. Devi, “Unsupervised Kernel-Induced Fuzzy Possibilistic C-Means Technique in Investigating Real-World Data,” J. Phys. Conf. Ser., vol. 2199, no. 1, 2022.

E. Satriyanto, N. W. S. Wardhani, S. Anam, and W. F. Mahmudy, “Performance Comparison of Dimensional Reduction using Principal Component Analysis with Alternating Least Squares in Modified Fuzzy Possibilistic C-Means and Fuzzy Possibilistic C-Means,” Int. J. Adv. Sci. Eng. Inf. Technol., vol. 14, no. 2, pp. 483–491, 2024.

N. R. Augad and V. S. Gutte, “Swarm Intelligence For AI Problem Solving,” Proc. - 2023 Int. Conf. Adv. Comput. Commun. Technol. ICACCTech 2023, pp. 590–596, 2023.

F. D. Adriyansyah, D. Witarsyah, D. Syamsuar, A. Luthfi, H. Fakhrurroja, and M. I. Herdiansyah, “Path Planning Using the Bee Colony Algorithm to Find Routes for a Coffee Plantation Using Drones,” Procedia Comput. Sci., vol. 227, pp. 17–26, 2023.

H. Zhang, X. Luo, W. Li, and C. Tong, “A Bluetooth Location Method Based on WKNN and Bee Colony K-means Algorithm,” Proc. - 2023 Int. Conf. Serv. Robot. ICoSR 2023, pp. 125–129, 2023.

A. K. Clustering, “Fault diagnosis of pile machine based on improved artificial bee colony and K-means clustering algorithm,” 2023 IEEE Int. Conf. Control. Electron. Comput. Technol., pp. 215–219, 2023.

P. Mutalik and S. Engineer, “Hybrid of Ant Colony and Artificial Bee Colony Optimization for Resource Allocation in Cognitive Radio Networks,” pp. 2–6, 2024.

W. Alomoush et al., “Fuzzy Clustering Algorithm Based on Improved Global Best-Guided Artificial Bee Colony with New Search Probability Model for Image Segmentation,” Sensors, vol. 22, no. 22, 2022.

D. H. Bangkalang, N. Setiyawati, and H. D. Purnomo, “Comparison between K-Means and K-Means++ Clustering Models Using Singular Value Decomposition (SVD) in Menu Engineering,” Int. J. Informatics Vis., vol. 7, no. 3, pp. 871–877, 2023.

A. S. Yaro, F. Maly, P. Prazak, and K. Maly, “Enhancing Fingerprint Localization Accuracy With Inverse Weight-Normalized Context Similarity Coefficient-Based Fingerprint Similarity Metric,” IEEE Access, vol. PP, p. 1, 2024.

J. Pang, D. Liu, Y. Peng, and X. Peng, “Temporal dependence Mahalanobis distance for anomaly detection in multivariate spacecraft telemetry series,” ISA Trans., vol. 140, pp. 354–367, 2023.

M. Mayrhofer and P. Filzmoser, “Multivariate outlier explanations using Shapley values and Mahalanobis distances,” Econom. Stat., no. xxxx, 2023.

A. M. Ikotun, A. E. Ezugwu, L. Abualigah, B. Abuhaija, and J. Heming, “K-means clustering algorithms: A comprehensive review, variants analysis, and advances in the era of big data,” Inf. Sci. (Ny)., vol. 622, pp. 178–210, 2023.

Y. Ping, H. Li, B. Hao, C. Guo, and B. Wang, “Beyond k-Means++: Towards better cluster exploration with geometrical information,” Pattern Recognit., vol. 146, no. April 2023, p. 110036, 2024.

K. Sethia, J. Singh, and A. Gosain, “ScienceDirect ScienceDirect Handling incomplete data using Radial basis Kernelized Intuitionistic Fuzzy C-Means,” Procedia Comput. Sci., vol. 235, no. 2023, pp. 2518–2528, 2024.

S. E. Hashemi, F. Gholian-Jouybari, and M. Hajiaghaei-Keshteli, “A fuzzy C-means algorithm for optimizing data clustering,” Expert Syst. Appl., vol. 227, no. May, p. 120377, 2023.

V. Antoine, J. A. Guerrero, and G. Romero, “Possibilistic fuzzy c-means with partial supervision,” Fuzzy Sets Syst., vol. 449, pp. 162–186, 2022.

M.-B. Naghi, L. Kovács, and L. Szilágyi, “A generalized fuzzy-possibilistic c-means clustering algorithm,” Acta Univ. Sapientiae, Inform., vol. 15, no. 2, pp. 404–431, 2023.

G. Santhosh and K. V. Prasad, “Energy optimization routing for hierarchical cluster based WSN using artificial bee colony,” Meas. Sensors, vol. 29, no. January, p. 100848, 2023.

M. Zhao, S. Liu, and S. Wang, “Artificial Bee Colony Algorithm with Population Reduction Strategy,” 2023 Int. Conf. Evol. Algorithms Soft Comput. Tech. EASCT 2023, pp. 1–6, 2023.

X. Li, S. Zhang, and P. Shao, “Discrete artificial bee colony algorithm with fixed neighborhood search for traveling salesman problem,” Eng. Appl. Artif. Intell., vol. 131, no. January, p. 107816, 2024.

L. Yu and C. Zhou, “Determining the Best Clustering Number of K-Means Based on Bootstrap Sampling,” Proc. - 2nd Int. Conf. Data Sci. Bus. Anal. ICDSBA 2018, pp. 78–83, 2018.

A. Bosisio et al., “Performance assessment of load profiles clustering methods based on silhouette analysis,” 21st IEEE Int. Conf. Environ. Electr. Eng. 2021 5th IEEE Ind. Commer. Power Syst. Eur. EEEIC / I CPS Eur. 2021 - Proc., pp. 1–6, 2021.

N. Tressa, “Customer-Based Market Segmentation using Clustering in Data mining,” 2024 2nd Int. Conf. Intell. Data Commun. Technol. Internet Things, pp. 687–691, 2024.

D. Bienvenido-Huertas, F. Farinha, M. J. Oliveira, E. M. J. Silva, and R. Lança, “Challenge for planning by using cluster methodology: The case study of the algarve region,” Sustain., vol. 12, no. 4, 2020.