Current browse context:
cond-mat.stat-mech
Change to browse by:
References & Citations
Condensed Matter > Statistical Mechanics
Title: Ising models of deep neural networks
(Submitted on 18 Sep 2022)
Abstract: This work maps deep neural networks to classical Ising spin models, allowing them to be described using statistical thermodynamics. The density of states shows that structures emerge in the weights after they have been trained -- well-trained networks span a much wider range of realizable energies compared to poorly trained ones. These structures propagate throughout the entire network and are not observed in individual layers. The energy values correlate to performance on tasks, making it possible to distinguish networks based on quality without access to data. Thermodynamic properties such as specific heat are also studied, revealing a higher critical temperature in trained networks.
Link back to: arXiv, form interface, contact.