We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:

References & Citations

DBLP - CS Bibliography


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Computer Vision and Pattern Recognition

Title: Fine-tuning Vision Transformers for the Prediction of State Variables in Ising Models

Abstract: Transformers are state-of-the-art deep learning models that are composed of stacked attention and point-wise, fully connected layers designed for handling sequential data. Transformers are not only ubiquitous throughout Natural Language Processing (NLP), but, recently, they have inspired a new wave of Computer Vision (CV) applications research. In this work, a Vision Transformer (ViT) is applied to predict the state variables of 2-dimensional Ising model simulations. Our experiments show that ViT outperform state-of-the-art Convolutional Neural Networks (CNN) when using a small number of microstate images from the Ising model corresponding to various boundary conditions and temperatures. This work opens the possibility of applying ViT to other simulations, and raises interesting research directions on how attention maps can learn about the underlying physics governing different phenomena.
Comments: Accepted at Ml4Physical Sciences Workshop at Neurips 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Statistical Mechanics (cond-mat.stat-mech); Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
Cite as: arXiv:2109.13925 [cs.CV]
  (or arXiv:2109.13925v2 [cs.CV] for this version)

Submission history

From: Arijit Sehanobish [view email]
[v1] Tue, 28 Sep 2021 00:23:31 GMT (2490kb,D)
[v2] Tue, 30 Nov 2021 04:27:14 GMT (1847kb,D)

Link back to: arXiv, form interface, contact.