We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:


References & Citations

DBLP - CS Bibliography


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Distributed, Parallel, and Cluster Computing

Title: Tuna: A Static Analysis Approach to Optimizing Deep Neural Networks

Abstract: We introduce Tuna, a static analysis approach to optimizing deep neural network programs. The optimization of tensor operations such as convolutions and matrix multiplications is the key to improving the performance of deep neural networks. Many deep learning model optimization mechanisms today use dynamic analysis, which relies on experimental execution on a target device to build a data-driven cost model of the program. The reliance on dynamic profiling not only requires access to target hardware at compilation time but also incurs significant cost in machine resources. We introduce an approach that profiles the program by constructing features based on the target hardware characteristics in order. We use static analysis of the relative performance of tensor operations to optimize the deep learning program. Experiments show that our approach can achieve up to 11x performance compared to dynamic profiling based methods with the same compilation time.
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC)
Cite as: arXiv:2104.14641 [cs.DC]
  (or arXiv:2104.14641v3 [cs.DC] for this version)

Submission history

From: Yao Wang [view email]
[v1] Thu, 29 Apr 2021 20:22:02 GMT (458kb,D)
[v2] Tue, 11 May 2021 19:04:53 GMT (458kb,D)
[v3] Sun, 16 May 2021 02:44:50 GMT (460kb,D)

Link back to: arXiv, form interface, contact.