We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Consistent Human Evaluation of Machine Translation across Language Pairs

Abstract: Obtaining meaningful quality scores for machine translation systems through human evaluation remains a challenge given the high variability between human evaluators, partly due to subjective expectations for translation quality for different language pairs. We propose a new metric called XSTS that is more focused on semantic equivalence and a cross-lingual calibration method that enables more consistent assessment. We demonstrate the effectiveness of these novel contributions in large scale evaluation studies across up to 14 language pairs, with translation both into and out of English.
Comments: 10 pages
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2205.08533 [cs.CL]
  (or arXiv:2205.08533v1 [cs.CL] for this version)

Submission history

From: Philipp Koehn [view email]
[v1] Tue, 17 May 2022 17:57:06 GMT (266kb)

Link back to: arXiv, form interface, contact.