A Multi-tier Model and Filtering Approach to Detect Fake News Using Machine Learning Algorithms

Chiung Chang Yu - Universiti Tun Hussein Onn Malaysia, Parit Raja, Batu Pahat, 86400, Malaysia
Isredza Rahmi A Hamid - Universiti Tun Hussein Onn Malaysia, Parit Raja, Batu Pahat, 86400, Malaysia
Zubaile Abdullah - Universiti Tun Hussein Onn Malaysia, Parit Raja, Batu Pahat, 86400, Malaysia
Kuryati Kipli - Universiti Malaysia Sarawak, Kota Samarahan, Sarawak, Malaysia
Hidra Amnur - Politeknik Negeri Padang, Padang, Indonesia


Citation Format:



DOI: http://dx.doi.org/10.62527/joiv.8.2.2703

Abstract


Fake news trends have overgrown in our societies over the years through social media platforms. The goal of spreading fake news can easily mislead and manipulate the public’s opinion. Many previous researchers have proposed this domain using classification algorithms or deep learning techniques. However, machine learning algorithms still suffer from high margin error, which makes them unreliable as every algorithm uses a different way of prediction. Deep learning requires high computation power and a large dataset to operate the classification model. A filtering model with a consensus layer in a multi-tier model is introduced in this research paper. The multi-tier model filters the news label correctly predicted by the first two-tier layer. The consensus layer acts as the final decision when collision results occur in the first two-tier layer. The proposed model is applied to the WEKA software tool to test and evaluate the model from both datasets. Two sequences of classification models are used in this research paper: LR_DT_RF and LR_NB_AdaBoost. The best performance of sequence for both datasets is LR_DT_RF which yields 0.9892 F1-Score, 0.9895 Accuracy, and 0.9790 Matthews Correlation Coefficient (MCC) for ISOT Fake News Dataset, and 0.9913 F1-Score, 0.9853 Accuracy, and 0.9455 MCC for CHECKED Dataset. This research could give researchers an approach for fake news detection on different social platforms and feature-based

Keywords


Consensus Layer; Fake News; Machine Learning; Multi-tier Model

References


X. Zhou and R. Zafarani, “A survey of fake news: Fundamental theories, detection methods, and opportunities,” ACM Computing Surveys (CSUR), vol. 53, no. 5, pp. 1–40, 2020.

X. Zhang and A. A. Ghorbani, “An overview of online fake news: Characterization, detection, and discussion,” Inf Process Manag, vol. 57, no. 2, p. 102025, 2020, doi: 10.1016/j.ipm.2019.03.004.

N. Grinberg, K. Joseph, L. Friedland, B. Swire-Thompson, and D. Lazer, “Fake news on Twitter during the 2016 U.S. presidential election,” Science (1979), vol. 363, no. 6425, pp. 374–378, 2019, doi: 10.1126/science.aau2706.

S. Vosoughi, D. Roy, and S. Aral, “The spread of true and false news online,” Science (1979), vol. 359, no. 6380, pp. 1146–1151, 2018, doi: 10.1126/science.aap9559.

V. Balakrishnan, K. S. Ng, and H. A. Rahim, “To share or not to share – The underlying motives of sharing fake news amidst the COVID-19 pandemic in Malaysia,” Technol Soc, vol. 66, p. 101676, 2021, doi: https://doi.org/10.1016/j.techsoc.2021.101676.

D. M. J. Lazer et al., “The science of fake news,” Science (1979), vol. 359, no. 6380, pp. 1094–1096, 2018, doi: 10.1126/science.aao2998.

S. Hangloo and B. Arora, “Fake News Detection Tools and Methods–A Review,” arXiv preprint arXiv:2112.11185, 2021.

K. Shu, D. Mahudeswaran, S. Wang, D. Lee, and H. Liu, “FakeNewsNet: A Data Repository with News Content, Social Context, and Spatiotemporal Information for Studying Fake News on Social Media,” Big Data, vol. 8, no. 3, pp. 171–188, 2020, doi: 10.1089/big.2020.0062.

A. S. U and F. M. Philip, “A Hybrid Method for Fake Profile Detection in Social Network Using Artificial Intelligence,” 2021.

K. Shu, S. Wang, and H. Liu, “Beyond news contents: The role of social context for fake news detection,” in WSDM 2019 - Proceedings of the 12th ACM International Conference on Web Search and Data Mining, Association for Computing Machinery, Inc, Jan. 2019, pp. 312–320. doi: 10.1145/3289600.3290994.

Z. Khanam, B. N. Alwasel, H. Sirafi, and M. Rashid, “Fake News Detection Using Machine Learning Approaches,” IOP Conf Ser Mater Sci Eng, vol. 1099, no. 1, p. 012040, 2021, doi: 10.1088/1757-899x/1099/1/012040.

S. B. Parikh and P. K. Atrey, “Media-Rich Fake News Detection: A Survey,” in Proceedings - IEEE 1st Conference on Multimedia Information Processing and Retrieval, MIPR 2018, Institute of Electrical and Electronics Engineers Inc., Jun. 2018, pp. 436–441. doi: 10.1109/MIPR.2018.00093.

R. K. Kaliyar, A. Goswami, and P. Narang, “FakeBERT: Fake news detection in social media with a BERT-based deep learning approach,” Multimed Tools Appl, vol. 80, no. 8, pp. 11765–11788, 2021, doi: 10.1007/s11042-020-10183-2.

H. Ahmed, I. Traore, and S. Saad, “ISOT Fake News Dataset,” pp. 1–2, 2017.

C. Yang, X. Zhou, and R. Zafarani, “CHECKED: Chinese COVID-19 fake news dataset,” Soc Netw Anal Min, vol. 11, no. 1, 2021, doi: 10.1007/s13278-021-00766-8.

O. Stitini, S. Kaloun, and O. Bencharef, “Towards the Detection of Fake News on Social Networks Contributing to the Improvement of Trust and Transparency in Recommendation Systems: Trends and Challenges,” Information (Switzerland), vol. 13, no. 3, Mar. 2022, doi: 10.3390/info13030128.

M. Del Vicario, W. Quattrociocchi, A. Scala, and F. Zollo, “Polarization and Fake News: Early Warning of Potential Misinformation Targets,” Feb. 2018, [Online]. Available: http://arxiv.org/abs/1802.01400

M. L. Della Vedova, E. Tacchini, S. Moretécole, G. Ballarin, M. Dipierro, and L. De Alfaro, “Automatic Online Fake News Detection Combining Content and Social Signals,” 2018. [Online]. Available: https://developers.facebook.com/docs/graph-api

H. Rashkin, E. Choi, J. Y. Jang, S. Volkova, and Y. Choi, “Truth of Varying Shades: Analyzing Language in Fake News and Political Fact-Checking,” in Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Copenhagen, Denmark: Association for Computational Linguistics, Sep. 2017, pp. 2931–2937. doi: 10.18653/v1/D17-1317.

R. Megan, “Getting Real about Fake News.” Accessed: Jun. 18, 2023. [Online]. Available: : https://www.kaggle.com/mrisdal/fake-news/data

H. Ahmed, I. Traore, and S. Saad, “Detection of Online Fake News Using N-Gram Analysis and Machine Learning Techniques,” Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 10618 LNCS, no. October, pp. 127–138, 2017, doi: 10.1007/978-3-319-69155-8_9.

J. Shaikh and R. Patil, “Fake news detection using machine learning,” Proceedings - 2020 IEEE International Symposium on Sustainable Energy, Signal Processing and Cyber Security, iSSSC 2020, vol. 2020, 2020, doi: 10.1109/iSSSC50941.2020.9358890.

S. Hakak, M. Alazab, S. Khan, T. R. Gadekallu, P. K. R. Maddikunta, and W. Z. Khan, “An ensemble machine learning approach through effective feature extraction to classify fake news,” Future Generation Computer Systems, vol. 117, pp. 47–58, 2021, doi: 10.1016/j.future.2020.11.022.

B. T.K., C. S. R. Annavarapu, and A. Bablani, “Machine learning algorithms for social media analysis: A survey,” Computer Science Review, vol. 40. Elsevier Ireland Ltd, May 01, 2021. doi: 10.1016/j.cosrev.2021.100395.

M. H. Goldani, S. Momtazi, and R. Safabakhsh, “Detecting fake news with capsule neural networks,” Appl Soft Computing, vol. 101, Mar. 2021, doi: 10.1016/j.asoc.2020.106991.

T. Jiang, J. P. Li, A. U. Haq, A. Saboor, and A. Ali, “A Novel Stacking Approach for Accurate Detection of Fake News,” IEEE Access, vol. 9, pp. 22626–22639, 2021, doi: 10.1109/ACCESS.2021.3056079.

C. Tang, K. Ma, B. Cui, K. Ji, and A. Abraham, “Long text feature extraction network with data augmentation,” Applied Intelligence, 2022, doi: 10.1007/s10489-022-03185-0.

L. Hu, S. Wei, Z. Zhao, and B. Wu, “Deep learning for fake news detection: A comprehensive survey,” AI Open, Oct. 2022, doi: 10.1016/j.aiopen.2022.09.001.

L. Zhou, J. Tao, and D. Zhang, “Does Fake News in Different Languages Tell the Same Story? An Analysis of Multi-level Thematic and Emotional Characteristics of News about COVID-19,” Information Systems Frontiers, Sep. 2022, doi: 10.1007/s10796-022-10329-7.

Y. Yang, S. Nazir, and W. Khalil, “A probabilistic approach toward evaluation of Internet rumor on COVID,” Soft computing, vol. 26, no. 16, pp. 8077–8088, 2022.