BlogNewsRank: Finding and Ranking Frequent News Topics Using Social Media Factors

Harshitha H; Mohammed Rafi

doi:10.30630/joiv.2.3.134

Harshitha H - University B.D.T College of Engineering, Davanagere, Karnataka, India
Mohammed Rafi - University B.D.T College of Engineering, Davanagere, Karnataka, India

Abstract

In the past, mass media sources such as the news media informed us about daily events. Nowadays, social media services such as Twitter provide a huge amount of user-generated data, which has great potential to contain informative news-related content. For these resources to be useful, we must find a way to filter out noise and capture only the content that, based on its similarity to the news media, is considered valuable. Even after noise is removed, information overload may still exist in the remaining data, so it is convenient to prioritize it for consumption. To achieve prioritization, information must be ranked in order of estimated importance, considering three main factors. First, the temporal prevalence of a particular topic in the news media is a factor of importance, and can be considered the media focus (MF) of a topic. Second, the temporal prevalence of the topic in social media indicates its user attention (UA). Last, the interaction between the social media users who mention this topic indicates the strength of the community discussing it, and can be regarded as the user interaction (UI) toward the topic. We propose an unsupervised framework, BlogNewsRank, which identifies news topics prevalent in both social media and the news media, and then ranks them by relevance (frequency) using their degrees of MF, UA, and UI.
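
The ranking step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how MF, UA, and UI are combined, so everything below is an assumption made for illustration: the `Topic` class, the `rank_topics` function, the max-normalization of each factor, and the equal-weight sum are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass
class Topic:
    """A candidate news topic with its three raw factor values (hypothetical container)."""
    name: str
    mf: float  # media focus: prevalence of the topic in the news media
    ua: float  # user attention: prevalence of the topic in social media
    ui: float  # user interaction: strength of the community discussing it


def rank_topics(topics, weights=(1 / 3, 1 / 3, 1 / 3)):
    """Return (name, score) pairs sorted by descending combined score.

    Assumption: each factor is scaled by its maximum over all topics and the
    scaled values are combined as a weighted sum. The paper may use a
    different combination.
    """
    def normalize(values):
        # Scale values into [0, 1] by the largest value; all zeros stay zero.
        hi = max(values, default=0.0)
        return [v / hi if hi > 0 else 0.0 for v in values]

    mf = normalize([t.mf for t in topics])
    ua = normalize([t.ua for t in topics])
    ui = normalize([t.ui for t in topics])
    w_mf, w_ua, w_ui = weights

    scores = [w_mf * m + w_ua * a + w_ui * i for m, a, i in zip(mf, ua, ui)]
    ranked = sorted(zip(topics, scores), key=lambda pair: pair[1], reverse=True)
    return [(topic.name, score) for topic, score in ranked]


if __name__ == "__main__":
    # Made-up factor values, used only to show the call.
    sample = [Topic("A", 10, 5, 2), Topic("B", 2, 1, 1), Topic("C", 4, 5, 4)]
    for name, score in rank_topics(sample):
        print(f"{name}: {score:.3f}")
```

Changing `weights` shifts the emphasis among the three factors, for example toward media focus when news coverage should dominate the ranking.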

Keywords

Topic identification, Topic ranking, Social network analysis, Keyword extraction, Co-occurrence similarity measures, Graph clustering.
