We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Why Neural Machine Translation Prefers Empty Outputs

Abstract: We investigate why neural machine translation (NMT) systems assign high probability to empty translations. We find two explanations. First, label smoothing makes correct-length translations less confident, making it easier for the empty translation to finally outscore them. Second, NMT systems use the same, high-frequency EoS word to end all target sentences, regardless of length. This creates an implicit smoothing that increases zero-length translations. Using different EoS types in target sentences of different lengths exposes and eliminates this implicit smoothing.
Comments: 6 pages
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2012.13454 [cs.CL]
  (or arXiv:2012.13454v1 [cs.CL] for this version)

Submission history

From: Kevin Knight [view email]
[v1] Thu, 24 Dec 2020 22:25:22 GMT (31kb)

Link back to: arXiv, form interface, contact.