Neural Collaborative with Sentence BERT for News Recommender System

Budi Juarto - Computer Science Department, School of Computer Science, Bina Nusantara University, Jakarta, Indonesia
Abba Suganda Girsang - Computer Science Department, Master of Computer Science, Binus Graduate Program, Bina Nusantara University, Jakarta, Indonesia


Citation Format:



DOI: http://dx.doi.org/10.30630/joiv.5.4.678

Abstract


The number of news produced every day is as much as 3 million per day, making readers have many choices in choosing news according to each reader's topic and category preferences. The recommendation system can make it easier for users to choose the news to read. The method that can be used in providing recommendations from the same user is collaborative filtering. Neural collaborative filtering is usually being used for recommendation systems by combining collaborative filtering with neural networks. However, this method has the disadvantage of recommending the similarity of news content such as news titles and content to users. This research wants to develop neural collaborative filtering using sentences BERT. Sentence BERT is applied to news titles and news contents that are converted into sentence embedding. The results of this sentence embedding are used in neural collaboration with item id, user id, and news category. We use a Microsoft news dataset of 50,000 users and 51,282 news, with 5,475,542 interactions between users and news. The evaluation carried out in this study uses precision, recall, and ROC curves to predict news clicks by the user. Another evaluation uses a hit ratio with the leave one out method. The evaluation results obtained a precision value of 99.14%, recall of 92.48%, f1-score of 95.69%, and ROC score of 98%. Evaluation measurement using the hit ratio@10 produces a hit ratio of 74% at fiftieth epochs for neural collaborative with sentence BERT which is better than neural collaborative filtering (NCF) and NCF with news category.

Keywords


Recommender system; news; neural networks; sentence BERT, neural collaborative filtering.

Full Text:

PDF

References


Watson, A., (2020). Average circulation of the Wall Street Journal from 2018 to 2020. Retrieved from Statista. [Online]. Available: https://www.statista.com/statistics/193788/average-paid-circulation-of-the-wall-street-journal/.

Joseph, Johnson., (2021). Worldwide digital population as of January 2021. Retrieved from Statista. [Online]. Available: https://www.statista.com/statistics/617136/digital-population-worldwide/ .

Putri, M. W., Muchayan, A., & Kamisutara, M. (2020). Sistem Rekomendasi Produk Pena Eksklusif Menggunakan Metode Content-Based Filtering dan TF-IDF. JOINTECS (Journal of Information Technology and Computer Science), 5(3), 229-236..

Noorhidayah, N., Indriani, F., & Faisal, M. R. (2019). Sistem Rekomendasi Berita Online dengan menggunakan Pembobotan TF-IDF dan Cosine Similarity. Soliter, 2, 85-93.

Wahyudi, Kristian & Latupapua, Johanes & Chandra, Ritchie & Girsang, Ganda. (2020). Hotel Content-Based Recommendation System. Journal of Physics: Conference Series. 1485. 012017. 10.1088/1742-6596/1485/1/012017.

Abraham, Sirajuddin, & Yeni Dwi Rahayu. (2017). Sistem Rekomendasi Artikel Berita Menggunakan Metode Knearest Neighbor Berbasis Website. Prosiding Sensei 2017 1, no. 1

Nastiti, P. (2019). Penerapan Metode Content Based Filtering dalam Implementasi Sistem Rekomendasi Tanaman Pangan. Teknika, 8(1), 1-10.

Kusuma, Hans Satria, and Aina Musdholifah. (2021).Recommendation System for Thesis Topics Using Content-based Filtering. IJCCS (Indonesian Journal of Computing and Cybernetics Systems) 15, no. 1: 65-74.

Parwita, W. G. S. (2019). Pengujian Akurasi Sistem Rekomendasi Berbasis Content-Based Filtering. Inform. Mulawarman J. Ilm. Ilmu Komputer, 14(1), 27..

Girsang, A. S., Isa, S. M., & Harvy, I. (2021, April). Recommendation System Journalist For Getting Top News Based On Twitter Data. In Journal of Physics: Conference Series (Vol. 1807, No. 1, p. 012006). IOP Publishing.

Wang, Tian, and Yuyangzi Fu. "Item-based Collaborative Filtering with BERT." In Proceedings of The 3rd Workshop on e-Commerce and NLP, pp. 54-58. 2020.

Girsang, A. S., Al Faruq, B., Herlianto, H. R., & Simbolon, S. (2020, June). Collaborative recommendation system in users of anime films. In Journal of Physics: Conference Series (Vol. 1566, No. 1, p. 012057). IOP Publishing.

Prayogo, J. E., Suharso, A., & Rizal, A. (2020). Analisis Perbandingan Model Matrix Factorization dan K-Nearest Neighbor dalam Mesin Rekomendasi Collaborative Berbasis Prediksi Rating. Jurnal Informatika Universitas Pamulang, 5(4), 506-514.

Yulian, H., & Martina, I. (2015). Penggunaan Correlation-Based Similarity untuk Sistem Rekomendasi Tanpa Rating. Jurnal Telematika, 10(1), 3.

Priyono, A. B. A. (2017). Performa Apriori dan Collaborative Filtering untuk Sistem Rekomendasi. Jurnal Ilmiah Informatika Komputer, 21(1).

Kurniawan, A. (2016). Sistem Rekomendasi Produk Sepatu Dengan Menggunakan Metode Collaborative Filtering. In Seminar Nasional Teknologi Informasi dan Komunikasi (Vol. 2016, pp. 18-19).

Muliadi, K. H., & Lestari, C. C. (2019). Rancang Bangun Sistem Rekomendasi Tempat Makan Menggunakan Algoritma Typicality Based Collaborative Filtering. Techno. Com, 18(4), 275-287.

Salakhutdinov, R., Mnih, A., & Hinton, G. (2007, June). Restricted Boltzmann machines for collaborative filtering. In Proceedings of the 24th international conference on Machine learning (pp. 791-798).

Indriawan, W., Gufroni, A. I., & Rianto, R. (2020). Sistem Rekomendasi Penjualan Produk Pertanian Menggunakan Metode Item Based Collaborative Filtering. Jurnal Siliwangi Seri Sains dan Teknologi, 6(2).

Wairegi, S., Mwangi, W., & Rimiru, R. (2020). A Framework for Items Recommendation System Using Hybrid Approach. In 2020 IST-Africa Conference (IST-Africa) (pp. 1-15). IEEE.

Wijaya, A., & Alfian, D. (2018). Sistem Rekomendasi Laptop Menggunakan Collaborative Filtering Dan Content-Based Filtering. Jurnal Computech & Bisnis, 12(1), 11-27.

Liu, Y., Tong, Q., Du, Z., & Hu, L. (2014, September). Content-boosted restricted Boltzmann machine for recommendation. In International Conference on Artificial Neural Networks (pp. 773-780). Springer, Cham.

Fitrianti, B. S., Fachurrozi, M., & Yusliani, N. (2018). Sistem Rekomendasi Artikel Ilmiah Berbasis Web Menggunakan Content-based Learning dan Collaborative Filtering. Generic, 10(1), 17-23.

He, X., Liao, L., Zhang, H., Nie, L., Hu, X., & Chua, T. S. (2017, April). Neural collaborative filtering. In Proceedings of the 26th international conference on world wide web (pp. 173-182).

Reimers, N., & Gurevych, I. (2019). Sentence-bert: Sentence embeddings using siamese bert-networks. arXiv preprint arXiv:1908.10084.