A Multi-Agent K-Means Algorithm for Improved Parallel Data Clustering

Mohammed Ahmed Jubair; Salama A. Mostafa; Aida Mustapha; Zirawani Baharum; Mohamad Aizi Salamat; Aldo Erianda

doi:10.30630/joiv.6.1-2.934

A Multi-Agent K-Means Algorithm for Improved Parallel Data Clustering

Mohammed Ahmed Jubair - Department of Computer Technical Engineering, College of Information Technology, Imam Ja'afar Al-Sadiq University, Al-Muthanna, Iraq
Salama A. Mostafa - Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia, Johor, Malaysia.
Aida Mustapha - Faculty of Applied Sciences and Technology, Universiti Tun Hussein Onn Malaysia, 84600, Panchor, Johor, Malaysia.
Zirawani Baharum - Malaysian Institute of Industrial Technology, Universiti Kuala Lumpur, Persiaran Sinaran Ilmu, Bandar Seri Alam, 81750 Johor, Malaysia
Mohamad Aizi Salamat - Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia, Johor, Malaysia.
Aldo Erianda - Department of Information Technology, Politeknik Negeri Padang, Sumatera Barat, Indonesia

Citation Format:

DOI: http://dx.doi.org/10.30630/joiv.6.1-2.934

Abstract

Due to the rapid increase in data volumes, clustering algorithms are now finding applications in a variety of fields. However, existing clustering techniques have been deemed unsuccessful in managing large data volumes due to the issues of accuracy and high computational cost. As a result, this work offers a parallel clustering technique based on a combination of the K-means and Multi-Agent System algorithms (MAS). The proposed technique is known as Multi-K-means (MK-means). The main goal is to keep the dataset intact while boosting the accuracy of the clustering procedure. The cluster centers of each partition are calculated, combined, and then clustered. The performance of the suggested method's statistical significance was confirmed using the five datasets that served as testing and assessment methods for the proposed algorithm's efficacy. In terms of performance, the proposed MK-means algorithm is compared to the Clustering-based Genetic Algorithm (CGA), the Adaptive Biogeography Clustering-based Genetic Algorithm (ABCGA), and standard K-means algorithms. The results show that the MK-means algorithm outperforms other algorithms because it works by activating agents separately for clustering processes while each agent considers a separate group of features.

Keywords

K-means; decision-making; clustering; multi-agent system.

Full Text:

PDF

References

P. I. Dalatu, A. Fitrianto, and A. Mustapha, â€œHybrid distance functions for K-Means clustering algorithms,â€ Stat. J. IAOS, vol. 33, no. 4, pp. 989â€“996, 2017.

M. A. Jubair, S. A. Mostafa, A. Mustapha, M. H. Hassan, M. A. Salamat, and M. S. Jawad, (2021, September). Exploring the Role of Multi-Agent Systems in Improving K-Means Clustering Method. In 2021 4th International Symposium on Agents, Multi-Agent Systems and Robotics (ISAMSR) (pp. 59-63). IEEE.

Y. Wang, M. Lees, W. Cai, S. Zhou, and M. Y. H. Low, â€œCluster based partitioning for agent-based crowd simulations,â€ In Proceedings of the 2009 Winter Simulation Conference (WSC), pp. 1047â€“1058, 2009.

P. FrÃ¤nti and S. Sieranoja, â€œK-means properties on six clustering benchmark datasetsâ€, Applied Intelligence., vol. 48, no. 12, pp. 4743â€“4759, 2018.

P. Belsis, A. Koutoumanos, and C. Sgouropoulou, â€œPBURC: A patterns-based, unsupervised requirements clustering framework for distributed agile software development,â€ Requirements engineering, vol. 19, no. 2, pp. 213â€“225, 2014.

N. Kaur and S. Aggarwal, â€œDesigning a New Hybrid K-Means Optimization Algorithm,â€ Int. J. of Adv. Res. in Com. Sci., vol. 8, no. 5, pp. 1567â€“1573, 2017.

M. A. Mahmoud, M. S. Ahmad, A. Ahmad, A. Mustapha, M. Z. M. Yusoff, and N. H. A. Hamid, â€œBuilding norms-adaptable agents from potential norms detection techniques (PNDT),â€ Int. J. of Int. Inf. Tech, 9(3), 38-60, 2013.

M. A., Mahmoud, M. S. Ahmad and, M. Z. M. Yusoff, (2016, March). A norm assimilation approach for multi-agent systems in heterogeneous communities. In Asian Conference on Intelligent Information and Database Systems (pp. 354-363). Springer, Berlin, Heidelberg.

S. A. Mostafa, M. S. Ahmad, M. Annamalai, A. Ahmad, S. S. Gunasekaran. â€œA conceptual model of layered adjustable autonomy,â€ In Advances in information systems and technologies 2013 (pp. 619-630). Springer, Berlin, Heidelberg.

S. A. Mostafa, S. S. Gunasekaran, M. S. Ahmad, A. Ahmad, M. Annamalai. and A. Mustapha, (2014, June). Defining tasks and actions complexity-levels via their deliberation intensity measures in the layered adjustable autonomy model. In 2014 International Conference on Intelligent Environments (pp. 52-55). IEEE.

M. A. Mahmoud, M. S. Ahmad, A. Ahmad, M. Z. M. Yusoff, A. Mustapha, and N. H. A. Hamid, â€œObligation and Prohibition Norms Mining Algorithm for Normative Multi-agent Systems,â€ KES-AMSTA, pp. 115-124, 2013.

S. A. Mostafa, , M. S. Ahmad, , M. Annamalai, , A. Ahmad, , and S. S. Gunasekaran, â€œA dynamically adjustable autonomic agent framework,â€ In Advances in information systems and technologies, Springer, Berlin, Heidelberg, pp. 631-642, 2013.

S. A. Mostafa, M. S. Ahmad, A. Mustapha, and M. A. Mohammed, â€œFormulating layered adjustable autonomy for unmanned aerial vehicles,â€ International Journal of Intelligent Computing and Cybernetics, 2017.

S. A. Mostafa, A. Mustapha, A. A. Hazeem, S. H. Khaleefah, and M. A. Mohammed, â€œAn agent-based inference engine for efficient and reliable automated car failure diagnosis assistance,â€ IEEE Access, 6, pp. 8322-8331, 2018.

S. A. Mostafa, M. S. Ahmad, A. Ahmad, M. Annamalai, and S. S. Gunasekaran, (2016, August). A Flexible Human-Agent Interaction model for supervised autonomous systems. In 2016 2nd International Symposium on Agent, Multi-Agent Systems and Robotics (ISAMSR) (pp. 106-111). IEEE.

M. H. Hassan, , S. A. Mostafa, , H. Mahdin, , A. Mustapha, , A. A. Ramli, , M. H. Hassan, and M. A. Jubair, â€œMobile ad-hoc network routing protocols of time-critical events for search and rescue missions,â€ Bulletin of Electrical Engineering and Informatics, 10(1), 192-199, 2021.

M. K. Abd Ghani, M. A. Mohammed, N. Arunkumar, S. A. Mostafa, D. A. Ibrahim, M. K. Abdullah, and M. A. Burhanuddin, â€œDecision-level fusion scheme for nasopharyngeal carcinoma identification using machine learning techniques,â€ Neural Computing and Applications, 32(3), 625-638, 2020.

S. A. Mostafa, M. Aida, M. A. Mohammed, M. S. Ahmad, and M. A. Mahmoud, â€œA fuzzy logic control in adjustable autonomy of a multi-agent system for an automated elderly movement monitoring application,â€ International journal of medical informatics 112, 173-184, 2018.

A. deep and A. Gupta, â€œA Novel Fuzzy-K Means based Support Vector Machine for Software Quality Prediction,â€ Int. J. Eng. Trends Technol., vol. 37, no. 2, pp. 80â€“89, 2016.

M. Mezghani, J. Kang, and F. SÃ¨des, â€œUsing k-means for redundancy and inconsistency detection: Application to industrial requirements,â€ In International Conference on Applications of Natural Language to Information Systems, pp. 501â€“508, 2018.

D. Xu and Y. Tian, â€œA Comprehensive Survey of Clustering Algorithms,â€ Annals of Data Science, vol. 2, no. 2, pp. 165â€“193, 2015.

P. Lin, Y. Wang, H. Qi, and Y. Hong, â€œDistributed Consensus-Based K-Means Algorithm in Switching Multi-Agent Networks,â€ Journal of Systems Science and Complexity, vol. 31, no. 5, pp. 1128â€“1145, 2018.

V. Bhatnagar, R. Majhi, and P. R. Jena, â€œComparative Performance Evaluation of Clustering Algorithms for Grouping Manufacturing Firms,â€ Arabian Journal for Science and Engineering, vol. 43, no. 8, pp. 4071â€“4083, 2018.

N. Mesbahi, O. Kazar, S. Benharzallah, M. Zoubeidi, and S. Bourekkache, â€œMulti-agents approach for data Mining based K-Means for improving the decision process in the ERP systems,â€ International Journal of Decision Support System Technology (IJDSST), vol. 7, no. 2, pp. 1â€“14, 2015.

S. A. Mostafa, S. S. Gunasekaran, S. H. Khaleefah, A. Mustapha, M. A. Jubair, and M. H. Hassan, â€œA fuzzy case-based reasoning model for software requirements specifications quality assessment,â€ International Journal on Advanced Science Engineering and Information Technology, 2019.

A. K. Dubey, U. Gupta, and S. Jain, â€œComparative study of K-means and fuzzy C-means algorithms on the breast cancer data,â€ International Journal on Advanced Science, Engineering and Information Technology, 8(1), 18-29, 2018.

D. Alexi, G. Francisco, C. Juan, V. Tom, and C. Chiara, â€œWater Quality Analysis in Mantaro River Peru, Before and After the Tailingâ€™s Accident Using the Grey Clustering Method,â€ International Journal on Advanced Science, Engineering and Information Technology, vol. 11, no. 3, pp. 917-922, 2021.

M. F. A. Saputra, and T. Widiyaningtyas, A. P. Wibawa, â€œIlliteracy classification using K means-NaÃ¯ve Bayes algorithm,â€ International Journal on Informatics Visualization (JOIV), 2(3), 153-158, 2018.

A. Satar, A. Mohamed, and A. M. Ali, â€œData Mining Techniques for Pandemic Outbreak in Healthcare,â€ International Journal on Informatics Visualization (JOIV), 5(2), 162-169, 2021.

I. T. R. Yanto, R. Setiyowati, and N. Azizah, â€œA Framework of Mutual Information Kullback-Leibler Divergence based for Clustering Categorical Data,â€ International Journal on Informatics Visualization (JOIV), 5(1), 11-15, 2021.

S. Y. Tan, H. Arshad, and A. Abdullah, â€œAn efficient and robust mobile augmented reality application,â€ International Journal on Advanced Science, Engineering and Information Technology, 8, 1672-1678, 2018.

S. Francesca, C. G. Carlo, F. D. N. Luca, and R. Marco, â€œComparison of low-complexity algorithms for real-time QRS detection using standard ECG database,â€ International Journal on Advanced Science, Engineering and Information Technology, 8(2), 307, 2018.

Username
Password
Remember me