Learning to Determine the Quality of News Headlines

Omidvar, Amin; Poormodheji, Hossein; An, Aijun; Edall, Gordon

doi:10.5220/0009367504010409

Full-text links:

Download:

PDF only

Current browse context:

cs.IR

< prev | next >

new | recent | 1911

Computer Science > Information Retrieval

Title: Learning to Determine the Quality of News Headlines

Authors: Amin Omidvar, Hossein Poormodheji, Aijun An, Gordon Edall

(Submitted on 26 Nov 2019 (v1), last revised 19 Apr 2020 (this version, v2))

Abstract: Today, most newsreaders read the online version of news articles rather than traditional paper-based newspapers. Also, news media publishers rely heavily on the income generated from subscriptions and website visits made by newsreaders. Thus, online user engagement is a very important issue for online newspapers. Much effort has been spent on writing interesting headlines to catch the attention of online users. On the other hand, headlines should not be misleading (e.g., clickbaits); otherwise, readers would be disappointed when reading the content. In this paper, we propose four indicators to determine the quality of published news headlines based on their click count and dwell time, which are obtained by website log analysis. Then, we use soft target distribution of the calculated quality indicators to train our proposed deep learning model which can predict the quality of unpublished news headlines. The proposed model not only processes the latent features of both headline and body of the article to predict its headline quality but also considers the semantic relation between headline and body as well. To evaluate our model, we use a real dataset from a major Canadian newspaper. Results show our proposed model outperforms other state-of-the-art NLP models.

Comments:	10 Pages, Accepted at the 12th International Conference on Agents and Artificial Intelligence (ICAART) 2020
Subjects:	Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG)
DOI:	10.5220/0009367504010409
Cite as:	arXiv:1911.11139 [cs.IR]
	(or arXiv:1911.11139v2 [cs.IR] for this version)

Submission history

From: Amin Omidvar [view email]
[v1] Tue, 26 Nov 2019 00:09:30 GMT (364kb)
[v2] Sun, 19 Apr 2020 23:42:19 GMT (402kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1911.11139

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Information Retrieval

Title: Learning to Determine the Quality of News Headlines

Submission history