Title: On the Limits of Minimal Pairs in Contrastive Evaluation

Abstract: Minimal sentence pairs are frequently used to analyze the behavior of language models. It is often assumed that model behavior on contrastive pairs is predictive of model behavior at large. We argue that two conditions are necessary for this assumption to hold. First, a tested hypothesis should be well-motivated, since experiments show that contrastive evaluation can lead to false positives. Second, test data should be chosen so as to minimize the distributional discrepancy between evaluation time and deployment time. For a good approximation of deployment-time decoding, we recommend that minimal pairs be created from machine-generated text rather than from human-written references. We present a contrastive evaluation suite for English-German MT that implements this recommendation.
Comments: BlackboxNLP 2021
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2109.07465 [cs.CL]
  (or arXiv:2109.07465v1 [cs.CL] for this version)

Submission history

From: Jannis Vamvas
[v1] Wed, 15 Sep 2021 17:59:15 GMT (32kb,D)
