Title: Mining Software Repositories with a Collaborative Heuristic Repository

Abstract: Many software engineering studies and tasks rely on categorizing software engineering artifacts. In practice, this is done either by defining simple but often imprecise heuristics, or by manually labelling the artifacts. Unfortunately, errors in these categorizations impact the tasks that rely on them. To improve the precision of these categorizations, we propose to gather heuristics in a collaborative heuristic repository, to which researchers can contribute a large number of diverse heuristics for a variety of tasks on a variety of SE artifacts. These heuristics are then leveraged by state-of-the-art weak supervision techniques to train high-quality classifiers, thus improving the categorizations. We present an initial version of the heuristic repository, which we applied to the concrete task of commit classification.
Comments: 5 pages; to appear in Proceedings of ICSE NIER 2021
Subjects: Software Engineering (cs.SE)
Cite as: arXiv:2103.01722 [cs.SE]
  (or arXiv:2103.01722v1 [cs.SE] for this version)
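
The idea described in the abstract, several imprecise heuristics combined into one label per artifact, can be illustrated with a minimal sketch for commit classification. Everything here is an assumption made for illustration: the heuristic functions, the label names `bugfix` and `non-bugfix`, and the keyword lists are hypothetical and are not taken from the paper or its repository. Majority voting is used as a deliberately simple stand-in; the weak supervision techniques the abstract refers to would typically learn how much to trust each heuristic rather than counting votes equally.

```python
from collections import Counter
from typing import Callable, List, Optional

# A heuristic returns a label, or None to abstain when it has no opinion.
Heuristic = Callable[[str], Optional[str]]

# Hypothetical label names, chosen for this sketch only.
BUGFIX = "bugfix"
NON_BUGFIX = "non-bugfix"


def mentions_fix(message: str) -> Optional[str]:
    """Vote 'bugfix' if the commit message contains 'fix' or 'bug'."""
    lowered = message.lower()
    return BUGFIX if ("fix" in lowered or "bug" in lowered) else None


def mentions_refactor(message: str) -> Optional[str]:
    """Vote 'non-bugfix' if the commit message mentions refactoring."""
    return NON_BUGFIX if "refactor" in message.lower() else None


def mentions_docs(message: str) -> Optional[str]:
    """Vote 'non-bugfix' if the commit message looks documentation-related."""
    lowered = message.lower()
    return NON_BUGFIX if ("readme" in lowered or "typo" in lowered) else None


HEURISTICS: List[Heuristic] = [mentions_fix, mentions_refactor, mentions_docs]


def majority_vote(message: str, heuristics: List[Heuristic] = HEURISTICS) -> Optional[str]:
    """Combine heuristic votes; return None if all abstain or the vote is tied."""
    votes = [label for label in (h(message) for h in heuristics) if label is not None]
    if not votes:
        return None
    ranked = Counter(votes).most_common()
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return None  # conflicting heuristics with no clear winner
    return ranked[0][0]


if __name__ == "__main__":
    for msg in ["Fix null pointer in parser", "Refactor build scripts", "Bump version"]:
        print(msg, "->", majority_vote(msg))
```

In a weak supervision pipeline, labels aggregated in this way would serve as noisy training data for a classifier; commits on which every heuristic abstains or the heuristics disagree are exactly the cases where additional contributed heuristics would help.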

Submission history

From: Hlib Babii
[v1] Tue, 2 Mar 2021 13:50:22 GMT (415kb,D)
