Fake News Detection in Indonesian Popular News Portal Using Machine Learning For Visual Impairment

Liliek Triyono; Rahmat Gernowo; Prayitno Prayitno; Mosiur Rahaman; Tri Raharjo Yudantoro

doi:10.30630/joiv.7.3.1243

Liliek Triyono - Diponegoro University, Semarang, Indonesia
Rahmat Gernowo - Diponegoro University, Semarang, Indonesia
Prayitno Prayitno - Politeknik Negeri Semarang, Semarang, Indonesia
Mosiur Rahaman - Asia University, Taichung City 413, Taiwan
Tri Yudantoro - Politeknik Negeri Semarang, Semarang, Indonesia

DOI: http://dx.doi.org/10.30630/joiv.7.3.1243

Abstract

People need to communicate with each other to meet their needs, but the information exchanged in communication, especially online news, often cannot be assessed directly. Readers simply receive the news and are unable to filter out inappropriate content. Media websites convey a great deal of information, and popular news websites are one source for keeping up with the latest news. Delivering news on prominent websites and selecting content that is not false requires a significant amount of work. Crawling the web and analysing very large volumes of data requires massive computing power, so solutions that lower the space and time complexity of the process must be developed. Data mining is seen as a solution to these difficulties because it extracts specific information based on defined attributes. This research investigated a model for identifying false news content in popular Indonesian news. First, a dataset collected from Kaggle is preprocessed. Second, several classification methods are applied to determine which one is optimal for classifying fake news. Third, another public dataset is used to test the methods. Five machine learning classifiers are compared: Support Vector Machine (SVM), Logistic Regression (LR), Decision Tree Classifier (DTC), Gradient Boosting Classifier (GBC), and Random Forest (RF). The classifiers are applied independently and then compared on the basis of receiver operating characteristic curves and accuracy. The experimental results show that DTC has the lowest accuracy, 75.33%, and SVM has the highest accuracy, 83.55%.
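
The two comparison criteria named in the abstract, accuracy and the receiver operating characteristic curve, can be illustrated with a minimal sketch. This is not the authors' code: the labels, the scores, the 0.5 decision threshold, and the function names `accuracy`, `roc_auc`, and `summarize` are all assumptions made for illustration. The area under the ROC curve is computed here through its pairwise-ranking interpretation.

```python
from typing import Sequence, Tuple


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of predicted labels that match the true labels."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve, computed by pairwise comparison.

    This equals the probability that a randomly chosen positive item
    scores higher than a randomly chosen negative item, with ties
    counted as one half.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def summarize(
    y_true: Sequence[int], scores: Sequence[float], threshold: float = 0.5
) -> Tuple[float, float]:
    """Return (accuracy at the given threshold, ROC AUC) for one classifier."""
    y_pred = [1 if s >= threshold else 0 for s in scores]
    return accuracy(y_true, y_pred), roc_auc(y_true, scores)


# Hypothetical toy data, not taken from the paper: 1 = fake, 0 = real.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
# Hypothetical "probability of fake" scores from two imaginary classifiers.
scores_a = [0.9, 0.2, 0.7, 0.6, 0.4, 0.1, 0.8, 0.3]
scores_b = [0.6, 0.5, 0.4, 0.7, 0.8, 0.2, 0.9, 0.3]

for name, scores in (("A", scores_a), ("B", scores_b)):
    acc, auc = summarize(y_true, scores)
    print(f"classifier {name}: accuracy={acc:.3f}, AUC={auc:.3f}")
```

Accuracy depends on the chosen threshold, while the area under the ROC curve summarizes ranking quality across all thresholds, which is one reason to report both when comparing classifiers.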

Keywords

Data mining; Hoax; False News; Visual Impairment
