Title: Zero-pronoun Data Augmentation for Japanese-to-English Translation

Abstract: In Japanese-to-English translation, zero pronouns in Japanese pose a challenge because the model must infer the omitted pronoun and produce it on the English target side. Fully resolving zero pronouns often requires discourse context, but in some cases the local context within a sentence gives clues for inferring the zero pronoun. In this study, we propose a data augmentation method that provides additional training signals for the translation model to learn correlations between local context and zero pronouns. In machine translation experiments in the conversational domain, we show that the proposed method significantly improves the accuracy of zero pronoun translation.
Comments: WAT2021
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2107.00318 [cs.CL]
  (or arXiv:2107.00318v1 [cs.CL] for this version)

Submission history

From: Ryokan Ri
[v1] Thu, 1 Jul 2021 09:17:59 GMT (154kb,D)
