Multi-Objective k-Nearest Neighbor for Breast Cancer Detection
DOI: http://dx.doi.org/10.62527/joiv.9.1.2669
Abstract
Early detection of cancer is crucial. This study aims to increase the efficiency of breast cancer detection using the modified k-nearest neighbor (k-NN) algorithm. Since k-NN faces challenges with sensitivity to k values and computational complexity, a modification of k-NN was proposed, namely a multi-objective k-NN model. It was developed to incorporate multi-objective optimization and local density to create a more robust and efficient classification algorithm. The model dynamically determines the k value based on the sample density, optimizing accuracy and efficiency. Breast cancer data were collected from the University of Wisconsin Hospitals, Madison. The experimental results showed that the multi-objective k-NN model outperformed traditional k-NN and k-NN with feedback support. The proposed model achieved an accuracy of 93.7%, with precision values of 93% for the negative cancer class and 94% for the positive cancer class. These results indicate that the multi-objective k-NN model provides superior accuracy and precision in breast cancer detection, demonstrating its potential for clinical applications.
Keywords
Full Text:
PDFReferences
H. L. Wiraswati et al., “Informasi Dini Terhadap Penyakit Kanker Payudara berbasis Telepon Pintar,” J. Teknol. Inf. dan Ilmu Komput., vol. 9, no. 4, pp. 691–697, 2022.
WHO, “Global Health Estimates 2020: Deaths by Cause, Age, Sex, by Country and by Region, 2000-2019,” 2020. https://www.who.int/data/gho/data/themes/mortality-and-global-health-estimates/ghe-leading-causes-of-death
H. Sung et al., “Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries,” CA Cancer J Clin, vol. 71, no. 3, pp. 209–249, 2021.
A. Jemal, F. Bray, M. M. Center, J. Ferlay, E. Ward, and D. Forman, “Global cancer statistics,” CA Cancer J Clin, vol. 61, no. 2, pp. 69–90, 2011.
M. A. Azeem, M. I. Khan, and S. A. Khan, “COVID-19 Detection via Image Classification using Deep Learning on Chest X-Ray,” in 2021 Ethics and Explainability for Responsible Data Science (EE-RDS), 2021.
B. Z. Hussain, I. Andleeb, M. S. Ansari, A. M. Joshi, and N. Kanwal, “Wasserstein GAN based Chest X-Ray Dataset Augmentation for Deep Learning Models: COVID-19 Detection Use-Case,” in 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), 2022.
S. M. Hasan, M. F. Rabbi, and N. Jahan, “Can Machine Learning Technique Predict the Prostate Cancer accurately?: The fact and remedy,” in 2021 International Conference on Electronics, Communications and Information Technology (ICECIT), 2021.
D. Rawat, “Validating and Strengthen the Prediction Performance Using Machine Learning Models and Operational Research for Lung Cancer,” in 2022 IEEE International Conference on Data Science and Information System (ICDSIS), 2022.
R. K. Barwal and N. Raheja, “A Classification System for Breast Cancer Prediction using SVOF-KNN method,” in 2022 International Conference on Augmented Intelligence and Sustainable Systems (ICAISS), 2022.
A. H. Osman and H. M. A. Aljahdali, “An Effective of Ensemble Boosting Learning Method for Breast Cancer Virtual Screening Using Neural Network Model,” IEEE Access, vol. 8, pp. 39165–39174, 2020.
C. Arthur and K. D. Hartomo, “Enhancing Breast Cancer Prediction with an Advanced K-Nearest Neighbors (KNN) Algorithm Integrated with Feedback Support Mechanism,” Int. Conf. Technol. Eng. Comput. Appl., 2023, doi: 10.1109/ICTECA60133.2023.10491036.
E. Michael, H. Ma, H. Li, and S. Qi, “An Optimized Framework for Breast Cancer Classification Using Machine Learning,” Biomed Res Int, vol. 2022, pp. 1–18, 2022.
M. A. Mezher, A. Altamimi, and R. Altamimi, “An enhanced Genetic Folding algorithm for prostate and breast cancer detection,” PeerJ Comput. Sci., vol. 8, pp. 1–17, 2022.
X. Zhang et al., “Deep Learning Based Analysis of Breast Cancer Using Advanced Ensemble Classifier and Linear Discriminant Analysis,” IEEE Access, vol. 8, pp. 120208–120217, 2020.
Z. Huang and D. Chen, “A Breast Cancer Diagnosis Method Based on VIM Feature Selection and Hierarchical Clustering Random Forest Algorithm,” IEEE Access, vol. 10, pp. 3284–3293, 2021.
P. Israni, “Breast cancer diagnosis (BCD) model using machine learning,” Int. J. Innov. Technol. Explor. Eng., vol. 8, no. 10, pp. 4456–4463, 2019.
M. Rana, P. Chandorkar, Alishiba Dsouza, and N. Kazi, “Breast Cancer Diagnosis and Recurrence Prediciton Using Machine Learning Techniques,” Int. J. Res. Eng. Technol., vol. 4, no. 4, pp. 372–376, 2015.
N. Wu et al., “Deep Neural Networks Improve Radiologists’ Performance in Breast Cancer Screening,” IEEE Trans Med Imaging, vol. 39, no. 4, pp. 1184–1194, 2020.
N. Fatima, L. Liu, S. Hong, and H. Ahmed, “Prediction of Breast Cancer, Comparative Review of Machine Learning Techniques, and Their Analysis,” IEEE Access, vol. 8, pp. 150360–150376, 2020.
A. R. Vaka, B. Soni, and S. R. K., “Breast cancer detection by leveraging Machine Learning,” ICT Express, vol. 6, no. 4, pp. 320–324, 2020.
S. Zhang, “Challenges in KNN Classification,” IEEE Trans. Knowl. Data Eng., vol. 34, no. 10, pp. 4663–4675, 2022, doi: 10.1109/TKDE.2021.3049250.
H. Gweon, M. Schonlau, and S. H. Steiner, “The k conditional nearest neighbor algorithm for classification and class probability estimation,” PeerJ Comput. Sci., vol. 5, pp. 1–21, 2019.
M. Papanikolaou, G. Evangelidis, and S. Ougiaroglou, “Dynamic k determination in k-NN classifier: A literature review,” 12th Int. Conf. Information, Intell. Syst. Appl., 2021, doi: 10.1109/IISA52424.2021.9555525.
S. Zhang, X. Li, M. Zong, X. Zhu, and D. Cheng, “Learning k for kNN Classification,” ACM Trans. Intell. Syst. Technol., vol. 8, no. 3, pp. 1–19, 2017, doi: 10.1145/2990508.
J. Gou, Z. Yi, L. Du, and T. Xiong, “A Local Mean-Based k-Nearest CentroidNeighbor Classifier,” Comput. J., vol. 55, no. 9, pp. 1058–1071, 2012, doi: 10.1093/comjnl/bxr131.
S. A. Dudani, “The Distance-Weighted k-Nearest-Neighbor Rule,” IEEE Trans. Syst. Man. Cybern., vol. SMC-6, no. 4, pp. 325–327, 1976.
S. Mishra and H. Patil, “Improved KNN with Feedback Support,” Int. J. Comput. Appl., vol. 177, no. 1, pp. 1–3, 2017.
N. Maleki, Y. Zeinali, and S. T. A. Niaki, “A k-NN method for lung cancer prognosis with the use of a genetic algorithm for feature selection,” Expert Syst. Appl., vol. 164, p. 113981, 2021, doi: 10.1016/j.eswa.2020.113981.
L. Munkhdalai, T. Munkhdalai, K. H. Park, H. G. Lee, M. Li, and K. H. Ryu, “Mixture of Activation Functions with Extended Min-Max Normalization for Forex Market Prediction,” IEEE Access, vol. 7, pp. 183680–183691, 2019.
A. Bansal and A. Singhrova, “Performance Analysis of Supervised Machine Learning Algorithms for Diabetes and Breast Cancer Dataset,” in 2021 International Conference on Artificial Intelligence and Smart Systems (ICAIS), 2021.
G. P. Dirgantoro, M. A. Soeleman, and C. Supriyanto, “Smoothing Weight Distance to Solve Euclidean Distance Measurement Problems in K-Nearest Neighbor Algorithm,” in 2021 IEEE 5th International Conference on Information Technology, Information Systems and Electrical Engineering (ICITISEE), 2021.
T. Cover and P. Hart, “Nearest Neighbor Pattern Classification,” IEEE Trans. Inf. Theory, vol. 13, no. 1, pp. 21–27, 1967.
J. Zhang, Q. Ding, B. Li, and X. Ye, “Bidirectional k-nearest neighbor spatial crowdsourcing allocation protocol based on edge computing,” PeerJ Comput. Sci., vol. 9, pp. 1–23, 2023.
S. Eltalhi and H. Kutrani, “Breast Cancer Diagnosis and Prediction Using Machine Learning and Data Mining Techniques: A Review,” IOSR J. Dent. Med. Sci., vol. 18, no. 4, pp. 85–94, 2019.
W. Zuo, D. Zhang, and K. Wang, “On kernel difference-weighted k-nearest neighbor classification,” Pattern Anal. Appl., vol. 11, pp. 247–257, 2008.
F. Rustam, A. Mehmood, M. Ahmad, S. Ullah, D. M. Khan, and G. S. Choi, “Classification of Shopify App User Reviews Using Novel Multi Text Features,” IEEE Access, vol. 8, pp. 30234–30244, 2020.
M. I. Prasetiyowati, N. U. Maulidevi, and K. Surendro, “The accuracy of Random Forest performance can be improved by conducting a feature selection with a balancing strategy,” PeerJ Comput. Sci., vol. 8, pp. 1–15, 2022.
S. R. Aziz, T. A. Khan, and A. Nadeem, “Inheritance metrics feats in unsupervised learning to classify unlabeled datasets and clusters in fault prediction,” PeerJ Comput. Sci., vol. 7, pp. 1–30, 2021.