We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP

Abstract: Despite its crucial role in research experiments, code correctness is often presumed only on the basis of the perceived quality of results. This assumption comes with the risk of erroneous outcomes and potentially misleading findings. To address this issue, we posit that the current focus on reproducibility should go hand in hand with the emphasis on software quality. We present a case study in which we identify and fix three bugs in widely used implementations of the state-of-the-art Conformer architecture. Through experiments on speech recognition and translation in various languages, we demonstrate that the presence of bugs does not prevent the achievement of good and reproducible results, which however can lead to incorrect conclusions that potentially misguide future research. As a countermeasure, we propose a Code-quality Checklist and release pangoliNN, a library dedicated to testing neural models, with the goal of promoting coding best practices and improving research software quality within the NLP community.
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as: arXiv:2303.16166 [cs.CL]
  (or arXiv:2303.16166v4 [cs.CL] for this version)

Submission history

From: Sara Papi [view email]
[v1] Tue, 28 Mar 2023 17:28:52 GMT (5325kb,D)
[v2] Wed, 29 Mar 2023 07:49:54 GMT (5331kb,D)
[v3] Fri, 31 Mar 2023 08:15:53 GMT (5331kb,D)
[v4] Tue, 15 Aug 2023 21:53:56 GMT (487kb,D)

Link back to: arXiv, form interface, contact.