References & Citations
Computer Science > Software Engineering
Title: Applying Information Theory to Software Evolution
(Submitted on 24 Mar 2023 (v1), last revised 26 Apr 2023 (this version, v2))
Abstract: Although information theory has found success in disciplines, the literature on its applications to software evolution is limit. We are still missing artifacts that leverage the data and tooling available to measure how the information content of a project can be a proxy for its complexity. In this work, we explore two definitions of entropy, one structural and one textual, and apply it to the historical progression of the commit history of 25 open source projects. We produce evidence that they generally are highly correlated. We also observed that they display weak and unstable correlations with other complexity metrics. Our preliminary investigation of outliers shows an unexpected high frequency of events where there is considerable change in the information content of the project, suggesting that such outliers may inform a definition of surprisal.
Submission history
From: Sebastian Baltes [view email][v1] Fri, 24 Mar 2023 01:00:45 GMT (1135kb,D)
[v2] Wed, 26 Apr 2023 16:27:37 GMT (450kb,D)
Link back to: arXiv, form interface, contact.