We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Summarization of Films and Documentaries Based on Subtitles and Scripts

Abstract: We assess the performance of generic text summarization algorithms applied to films and documentaries, using the well-known behavior of summarization of news articles as reference. We use three datasets: (i) news articles, (ii) film scripts and subtitles, and (iii) documentary subtitles. Standard ROUGE metrics are used for comparing generated summaries against news abstracts, plot summaries, and synopses. We show that the best performing algorithms are LSA, for news articles and documentaries, and LexRank and Support Sets, for films. Despite the different nature of films and documentaries, their relative behavior is in accordance with that obtained for news articles.
Comments: 7 pages, 9 tables, 4 figures, submitted to Pattern Recognition Letters (Elsevier)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
ACM classes: I.2.7
Journal reference: Pattern Recognition Letters, Volume 73, 1 April 2016, Pages 7-12
DOI: 10.1016/j.patrec.2015.12.016
Cite as: arXiv:1506.01273 [cs.CL]
  (or arXiv:1506.01273v3 [cs.CL] for this version)

Submission history

From: David Martins de Matos [view email]
[v1] Wed, 3 Jun 2015 15:07:14 GMT (4285kb,D)
[v2] Thu, 4 Jun 2015 12:41:55 GMT (2056kb,D)
[v3] Wed, 9 Mar 2016 16:50:43 GMT (2615kb,D)

Link back to: arXiv, form interface, contact.