We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.SE

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Software Engineering

Title: CheapET-3: Cost-Efficient Use of Remote DNN Models

Authors: Michael Weiss
Abstract: On complex problems, state of the art prediction accuracy of Deep Neural Networks (DNN) can be achieved using very large-scale models, consisting of billions of parameters. Such models can only be run on dedicated servers, typically provided by a 3rd party service, which leads to a substantial monetary cost for every prediction. We propose a new software architecture for client-side applications, where a small local DNN is used alongside a remote large-scale model, aiming to make easy predictions locally at negligible monetary cost, while still leveraging the benefits of a large model for challenging inputs. In a proof of concept we reduce prediction cost by up to 50% without negatively impacting system accuracy.
Comments: Research Abstract. Contact me for a pre-print of the full paper (currently not yet published)
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
DOI: 10.1145/3540250.3559082
Cite as: arXiv:2208.11552 [cs.SE]
  (or arXiv:2208.11552v1 [cs.SE] for this version)

Submission history

From: Michael Weiss [view email]
[v1] Wed, 24 Aug 2022 13:54:27 GMT (318kb,D)

Link back to: arXiv, form interface, contact.