Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations

Ki, Dayeon; Carpuat, Marine

(Submitted on 11 Apr 2024)

Abstract: Machine Translation (MT) remains one of the last NLP tasks where large language models (LLMs) have not yet replaced dedicated supervised systems. This work exploits the complementary strengths of LLMs and supervised MT by guiding LLMs to automatically post-edit MT with external feedback on its quality, derived from Multidimensional Quality Metric (MQM) annotations. Working with LLaMA-2 models, we consider prompting strategies varying the nature of feedback provided and then fine-tune the LLM to improve its ability to exploit the provided guidance. Through experiments on Chinese-English, English-German, and English-Russian MQM data, we demonstrate that prompting LLMs to post-edit MT improves TER, BLEU and COMET scores, although the benefits of fine-grained feedback are not clear. Fine-tuning helps integrate fine-grained feedback more effectively and further improves translation quality based on both automatic and human evaluation.

Comments:	21 pages, 8 figures
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Journal reference:	NAACL 2024 Findings
Cite as:	arXiv:2404.07851 [cs.CL]
	(or arXiv:2404.07851v1 [cs.CL] for this version)

Submission history

From: Dayeon Ki
[v1] Thu, 11 Apr 2024 15:47:10 GMT (9215kb,D)
