
Title: Between Flexibility and Consistency: Joint Generation of Captions and Subtitles

Abstract: Speech translation (ST) has lately received growing interest for the generation of subtitles without the need for an intermediate source language transcription and timing (i.e. captions). However, the joint generation of source captions and target subtitles not only brings potential advantages in output quality when the two decoding processes inform each other, but is also often required in multilingual scenarios. In this work, we focus on ST models which generate captions and subtitles that are consistent in terms of structure and lexical content. We further introduce new metrics for evaluating subtitling consistency. Our findings show that joint decoding leads to increased performance and consistency between the generated captions and subtitles while still allowing for sufficient flexibility to produce subtitles conforming to language-specific needs and norms.
Comments: Accepted at IWSLT 2021
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2107.06246 [cs.CL]
  (or arXiv:2107.06246v1 [cs.CL] for this version)

Submission history

From: Alina Karakanta
[v1] Tue, 13 Jul 2021 17:06:04 GMT (5241kb)
