Content and Network Feature in Attention-based Neural Network for Stance Detection on COVID-19 Vaccination Tweets
DOI: http://dx.doi.org/10.62527/joiv.9.1.2671
Abstract
Stance detection in COVID-19 vaccination utilizing tweets is crucial for several reasons, such as public health communication, monitoring vaccine sentiment, and identifying misinformation. This research aims to explore the use of attention-based neural networks for stance detection in Indonesian COVID-19 vaccination tweets. The research focuses on enhancing accuracy by integrating content and network features. The content features represent the tweet's text, while network features define the user account's following or unfollowing. The primary contribution of this research is the development of an Attention Long Short-Term Memory (AttLSTM) model for stance detection in Indonesian tweets related to the COVID-19 vaccination. This model combines content and network features to improve accuracy in classifying user attitudes. We also highlight the performance differences between Word2Vec and FastText for numerical text representation in the AttLSTM model. The research used the Indonesian COVID-19 vaccination-related tweet dataset from prior research. The dataset is extracted using user metadata to obtain content and network features necessary to represent users' interest in tweets. Our research method involves data preparation, preprocessing, extraction of content and network features, and the development of an AttLSTM model. By integrating content and network features into the AttLSTM model with Word2Vec text representation, the study demonstrates superior performance compared to the LSTM baseline model and FastText. Adding attention mechanisms to the baseline LSTM model can capture crucial information, such as the minority class inside a tweet's text. Future research will involve exploring advanced data processing methods and ensemble learning techniques to further improve the model's performance.
Full Text:
PDFReferences
S. Zhou, J. Lin, L. Tan, and X. Liu, “Condensed Convolution Neural Network by Attention over Self-attention for Stance Detection in Twitter,” in 2019 International Joint Conference on Neural Networks (IJCNN), Budapest: IEEE, Jul. 2019, pp. 1–8. doi: 10.1109/IJCNN.2019.8851965.
S. M. Alhashmi, A. M. Khedr, I. Arif, and M. El Bannany, “Using a Hybrid-Classification Method to Analyze Twitter Data during Critical Events,” IEEE Access, vol. 9, pp. 141023–141035, 2021, doi: 10.1109/ACCESS.2021.3119063.
Z. Su, Y. Xi, R. Cao, H. Tang, and H. Pan, “A Stance Detection Approach Based on Generalized Autoregressive pretrained Language Model in Chinese Microblogs,” in ACM International Conference Proceeding Series, Association for Computing Machinery, Feb. 2021, pp. 232–238. doi: 10.1145/3457682.3457717.
U. A. Siddiqua, A. N. Chy, and M. Aono, “Tweet Stance Detection Using an Attention based Neural Ensemble Model,” Proceedings of NAACL-HLT 2019, pp. 1868–1873, 2019, doi: http://dx.doi.org/10.18653/v1/N19-1185.
P. Wei, W. Mao, and D. Zeng, “A Target-Guided Neural Memory Model for Stance Detection in Twitter,” in 2018 International Joint Conference on Neural Networks (IJCNN), 2018, pp. 1–8. doi: 10.1109/IJCNN.2018.8489665.
A. I. Al-Ghadir, A. M. Azmi, and A. Hussain, “A novel approach to stance detection in social media tweets by fusing ranked lists and sentiments,” Information Fusion, vol. 67, pp. 29–40, Mar. 2021, doi: 10.1016/j.inffus.2020.10.003.
L. Wang and D. Wang, “Solving Stance Detection on Tweets as Multi-Domain and Multi-Task Text Classification,” IEEE Access, vol. 9, pp. 157780–157789, 2021, doi: 10.1109/ACCESS.2021.3129468.
K. Kawintiranon and L. Singh, “Knowledge Enhanced Masked Language Model for Stance Detection,” in Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Association for Computational Linguistics, 2021, pp. 4725–4735. doi: 10.18653/v1/2021.naacl-main.376.
C. B. P. Putra, D. Purwitasari, and A. B. Raharjo, “Stance Detection on Tweets with Multi-task Aspect-based Sentiment: A Case Study of COVID-19 Vaccination,” International Journal of Intelligent Engineering and Systems, vol. 15, no. 5, pp. 515–526, Oct. 2022, doi: 10.22266/ijies2022.1031.45.
F. Cascini et al., “Social media and attitudes towards a COVID-19 vaccination: A systematic review of the literature,” EClinicalMedicine, vol. 48, pp. 1–44, 2022, doi: 10.1016/j.eclinm.2022.101454.
A. ALDayel and W. Magdy, “Stance detection on social media: State of the art and trends,” Inf Process Manag, vol. 58, no. 4, Jul. 2021, doi: 10.1016/j.ipm.2021.102597.
W. Gao, Y. Yang, and Y. Liu, “Stance detection with target and target towards attention,” in Proceedings - 9th IEEE International Conference on Big Knowledge, ICBK 2018, Institute of Electrical and Electronics Engineers Inc., Dec. 2018, pp. 432–439. doi: 10.1109/ICBK.2018.00064.
P. Sobhani, D. Inkpen, and X. Zhu, “Exploring deep neural networks for multitarget stance detection,” Comput Intell, vol. 35, no. 1, pp. 82–97, Feb. 2019, doi: 10.1111/coin.12189.
L. N. N. Do, H. L. Vu, B. Q. Vo, Z. Liu, and D. Phung, “An effective spatial-temporal attention based neural network for traffic flow prediction,” Transp Res Part C Emerg Technol, vol. 108, pp. 12–28, Nov. 2019, doi: 10.1016/j.trc.2019.09.008.
D. Bahdanau, K. Cho, and Y. Bengio, “Neural Machine Translation by Jointly Learning to Align and Translate,” in ICLR 2015, Sep. 2015, pp. 1–15. [Online]. Available: http://arxiv.org/abs/1409.0473
M.-T. Luong, H. Pham, and C. D. Manning, “Effective Approaches to Attention-based Neural Machine Translation,” in Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2015, pp. 1412–1421. doi: 10.48550/arXiv.1508.04025.
A. M. Rush, S. Chopra, and J. Weston, “A Neural Attention Model for Sentence Summarization,” in Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2015, pp. 379–389. doi: 10.48550/arXiv.1509.00685.
K. Xu et al., “Show, Attend and Tell: Neural Image Caption Generation with Visual Attention,” in International Conference on Machine Learning, 2015, pp. 2048–2057.
Y. Bachrach et al., “An Attention Mechanism for Answer Selection Using a Combined Global and Local View,” Jul. 2017, doi: 10.48550/arXiv.1707.01378.
W. S. Paka, R. Bansal, A. Kaushik, S. Sengupta, and T. Chakraborty, “Cross-SEAN: A cross-stitch semi-supervised neural attention model for COVID-19 fake news detection,” Appl Soft Comput, vol. 107, pp. 1–13, Aug. 2021, doi: 10.1016/j.asoc.2021.107393.
D. Purwitasari, C. B. P. Putra, and A. B. Raharjo, “A stance dataset with aspect-based sentiment information from Indonesian COVID-19 vaccination-related tweets,” Data Brief, vol. 47, Apr. 2023, doi: 10.1016/j.dib.2023.108951.
J. P. Singh, A. Kumar, N. P. Rana, and Y. K. Dwivedi, “Attention-Based LSTM Network for Rumor Veracity Estimation of Tweets,” Information Systems Frontiers, vol. 24, no. 2, pp. 459–474, Apr. 2022, doi: 10.1007/s10796-020-10040-5.
A. Aldayel and W. Magdy, “Your stance is exposed! Analysing possible factors for stance detection on social media,” Proc ACM Hum Comput Interact, vol. 3, no. CSCW, pp. 1–20, Nov. 2019, doi: 10.1145/3359307.
M. M. Truşcă, “Efficiency of SVM classifier with Word2Vec and Doc2Vec models,” Proceedings of the International Conference on Applied Statistics, vol. 1, no. 1, pp. 496–503, Oct. 2019, doi: 10.2478/icas-2019-0043.
H. D. Abubakar and M. Umar, “Sentiment Classification: Review of Text Vectorization Methods: Bag of Words, Tf-Idf, Word2vec and Doc2vec,” SLU Journal of Science and Technology, vol. 4, no. 1 & 2, pp. 27–33, Aug. 2022, doi: 10.56471/slujst.v4i.266.
A. Nurdin, B. A. S. Aji, A. Bustamin, and Z. Abidin, “Perbandingan Kinerja Word Embedding Word2Vec, Glove, dan FastText pada Klasifikasi Teks,” Jurnal TEKNOKOMPAK, vol. 14, no. 2, pp. 74–79, 2020.
M. A. Riza and N. Charibaldi, “Emotion Detection in Twitter Social Media Using Long Short-Term Memory (LSTM) and Fast Text,” International Journal of Artificial Intelligence & Robotics (IJAIR), vol. 3, no. 1, pp. 15–26, May 2021, doi: 10.25139/ijair.v3i1.3827.
P. Goyal, S. Pandey, and K. Jain, Deep Learning for Natural Language Processing: Creating Neural Networks with Python. Apress, 2018. doi: 10.1007/978-1-4842-3685-7.
P. Zhou et al., “Attention-Based Bidirectional Long Short-Term Memory Networks for Relation Classification,” in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016, pp. 207–212. doi: http://dx.doi.org/10.18653/v1/P16-2034.
D. H. Fudholi, R. A. N. Nayoan, and S. Rani, “Stock Prediction Based on Twitter Sentiment Extraction Using BiLSTM-Attention,” Indonesian Journal of Electrical Engineering and Informatics, vol. 10, no. 1, pp. 187–198, Mar. 2022, doi: 10.52549/ijeei.10i1.3011.
J. Du, R. Xu, Y. He, and L. Gui, “Stance Classification with Target-Specific Neural Attention Networks,” in 26th International Joint Conference on Artificial Intelligence, Melbourne: International Joint Conferences on Artificial Intelligence, 2017, pp. 3988–3994. doi: 10.24963/ijcai.2017/557.
M. Hardalov, A. Arora, P. Nakov, and I. Augenstein, “Cross-Domain Label-Adaptive Stance Detection,” arXiv preprint, Apr. 2021, doi: 10.48550/arXiv.2104.07467.
F. Alotaibi and V. Gupta, “Sentiment Analysis System using Hybrid Word Embeddings with Convolutional Recurrent Neural Network,” International Arab Journal of Information Technology, vol. 19, no. 3, pp. 330–335, May 2022, doi: 10.34028/iajit/19/3/6.