We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.SE

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Software Engineering

Title: CodeLabeller: A Web-based Code Annotation Tool for Java Design Patterns and Summaries

Abstract: While constructing supervised learning models, we require labelled examples to build a corpus and train a machine learning model. However, most studies have built the labelled dataset manually, which in many occasions is a daunting task. To mitigate this problem, we have built an online tool called CodeLabeller. CodeLabeller is a web-based tool that aims to provide an efficient approach to handling the process of labelling source code files for supervised learning methods at scale by improving the data collection process throughout. CodeLabeller is tested by constructing a corpus of over a thousand source files obtained from a large collection of open source Java projects and labelling each Java source file with their respective design patterns and summaries. Twenty five experts in the field of software engineering participated in a usability evaluation of the tool using the standard User Experience Questionnaire online survey. The survey results demonstrate that the tool achieves the Good standard on hedonic and pragmatic quality standards, is easy to use and meets the needs of the annotating the corpus for supervised classifiers. Apart from assisting researchers in crowdsourcing a labelled dataset, the tool has practical applicability in software engineering education and assists in building expert ratings for software artefacts.
Comments: 15 pages, 5 Figures, 6 Tables
Subjects: Software Engineering (cs.SE)
Cite as: arXiv:2106.07513 [cs.SE]
  (or arXiv:2106.07513v4 [cs.SE] for this version)

Submission history

From: Najam Nazar Dr [view email]
[v1] Mon, 14 Jun 2021 15:41:21 GMT (1489kb,D)
[v2] Thu, 25 Nov 2021 09:36:11 GMT (3847kb,D)
[v3] Thu, 29 Dec 2022 14:16:02 GMT (820kb)
[v4] Mon, 13 Mar 2023 05:16:15 GMT (1841kb,D)

Link back to: arXiv, form interface, contact.