Title: Simpler Hyperparameter Optimization for Software Analytics: Why, How, When?

Abstract: How can we make software analytics simpler and faster? One method is to match the complexity of analysis to the intrinsic complexity of the data being explored. For example, hyperparameter optimizers find the control settings for data miners that improve the predictions generated via software analytics. Sometimes, very fast hyperparameter optimization can be achieved by "DODGE-ing"; i.e., simply steering away from settings that lead to similar conclusions. But when is it wise to use that simple approach, and when must we use more complex (and much slower) optimizers? To answer this, we applied hyperparameter optimization to 120 SE data sets that explored bad smell detection, predicting GitHub issue close time, bug report analysis, defect prediction, and dozens of other non-SE problems. We find that the simple DODGE works best for data sets with low "intrinsic dimensionality" (u ~ 3) and very poorly for higher-dimensional data (u > 8). Nearly all the SE data seen here was intrinsically low-dimensional, indicating that DODGE is applicable for many SE analytics tasks.
Comments: 15 pages
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Journal reference: Transactions on Software Engineering, 2021
DOI: 10.1109/TSE.2021.3073242
Cite as: arXiv:1912.04061 [cs.SE]
  (or arXiv:1912.04061v5 [cs.SE] for this version)

Submission history

From: Amritanshu Agrawal
[v1] Mon, 9 Dec 2019 14:10:40 GMT (911kb,D)
[v2] Wed, 8 Apr 2020 15:46:19 GMT (284kb,D)
[v3] Thu, 14 Jan 2021 23:04:25 GMT (1684kb,D)
[v4] Fri, 16 Apr 2021 14:28:07 GMT (1877kb,D)
[v5] Thu, 22 Apr 2021 14:29:22 GMT (1877kb,D)