We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.IR

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Information Retrieval

Title: Learning to Determine the Quality of News Headlines

Abstract: Today, most newsreaders read the online version of news articles rather than traditional paper-based newspapers. Also, news media publishers rely heavily on the income generated from subscriptions and website visits made by newsreaders. Thus, online user engagement is a very important issue for online newspapers. Much effort has been spent on writing interesting headlines to catch the attention of online users. On the other hand, headlines should not be misleading (e.g., clickbaits); otherwise, readers would be disappointed when reading the content. In this paper, we propose four indicators to determine the quality of published news headlines based on their click count and dwell time, which are obtained by website log analysis. Then, we use soft target distribution of the calculated quality indicators to train our proposed deep learning model which can predict the quality of unpublished news headlines. The proposed model not only processes the latent features of both headline and body of the article to predict its headline quality but also considers the semantic relation between headline and body as well. To evaluate our model, we use a real dataset from a major Canadian newspaper. Results show our proposed model outperforms other state-of-the-art NLP models.
Comments: 10 Pages, Accepted at the 12th International Conference on Agents and Artificial Intelligence (ICAART) 2020
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG)
DOI: 10.5220/0009367504010409
Cite as: arXiv:1911.11139 [cs.IR]
  (or arXiv:1911.11139v2 [cs.IR] for this version)

Submission history

From: Amin Omidvar [view email]
[v1] Tue, 26 Nov 2019 00:09:30 GMT (364kb)
[v2] Sun, 19 Apr 2020 23:42:19 GMT (402kb)

Link back to: arXiv, form interface, contact.