We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.SE

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Software Engineering

Title: DeepMutants: Training neural bug detectors with contextual mutations

Abstract: Learning-based bug detectors promise to find bugs in large code bases by exploiting natural hints such as names of variables and functions or comments. Still, existing techniques tend to underperform when presented with realistic bugs. We believe bug detector learning to currently suffer from a lack of realistic defective training examples. In fact, real world bugs are scarce which has driven existing methods to train on artificially created and mostly unrealistic mutants. In this work, we propose a novel contextual mutation operator which incorporates knowledge about the mutation context to dynamically inject natural and more realistic faults into code. Our approach employs a masked language model to produce a context-dependent distribution over feasible token replacements. The evaluation shows that sampling from a language model does not only produce mutants which more accurately represent real bugs but also lead to better performing bug detectors, both on artificial benchmarks and on real world source code.
Comments: 12 pages, 10 figures
Subjects: Software Engineering (cs.SE); Machine Learning (cs.LG)
Cite as: arXiv:2107.06657 [cs.SE]
  (or arXiv:2107.06657v1 [cs.SE] for this version)

Submission history

From: Cedric Richter [view email]
[v1] Wed, 14 Jul 2021 12:45:48 GMT (219kb,D)

Link back to: arXiv, form interface, contact.