Current browse context:
cs.IR
Change to browse by:
References & Citations
Computer Science > Information Retrieval
Title: Learning to Determine the Quality of News Headlines
(Submitted on 26 Nov 2019 (v1), last revised 19 Apr 2020 (this version, v2))
Abstract: Today, most newsreaders read the online version of news articles rather than traditional paper-based newspapers. Also, news media publishers rely heavily on the income generated from subscriptions and website visits made by newsreaders. Thus, online user engagement is a very important issue for online newspapers. Much effort has been spent on writing interesting headlines to catch the attention of online users. On the other hand, headlines should not be misleading (e.g., clickbaits); otherwise, readers would be disappointed when reading the content. In this paper, we propose four indicators to determine the quality of published news headlines based on their click count and dwell time, which are obtained by website log analysis. Then, we use soft target distribution of the calculated quality indicators to train our proposed deep learning model which can predict the quality of unpublished news headlines. The proposed model not only processes the latent features of both headline and body of the article to predict its headline quality but also considers the semantic relation between headline and body as well. To evaluate our model, we use a real dataset from a major Canadian newspaper. Results show our proposed model outperforms other state-of-the-art NLP models.
Submission history
From: Amin Omidvar [view email][v1] Tue, 26 Nov 2019 00:09:30 GMT (364kb)
[v2] Sun, 19 Apr 2020 23:42:19 GMT (402kb)
Link back to: arXiv, form interface, contact.