TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks

Cheng, Heng-Tze; Haque, Zakaria; Hong, Lichan; Ispir, Mustafa; Mewald, Clemens; Polosukhin, Illia; Roumpos, Georgios; Sculley, D; Smith, Jamie; Soergel, David; Tang, Yuan; Tucker, Philipp; Wicke, Martin; Xia, Cassandra; Xie, Jianwei

doi:10.1145/3097983.3098171

Full-text links:

Download:

Current browse context:

cs.DC

< prev | next >

new | recent | 1708

Computer Science > Distributed, Parallel, and Cluster Computing

Title: TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks

Authors: Heng-Tze Cheng, Zakaria Haque, Lichan Hong, Mustafa Ispir, Clemens Mewald, Illia Polosukhin, Georgios Roumpos, D Sculley, Jamie Smith, David Soergel, Yuan Tang, Philipp Tucker, Martin Wicke, Cassandra Xia, Jianwei Xie

(Submitted on 8 Aug 2017)

Abstract: We present a framework for specifying, training, evaluating, and deploying machine learning models. Our focus is on simplifying cutting edge machine learning for practitioners in order to bring such technologies into production. Recognizing the fast evolution of the field of deep learning, we make no attempt to capture the design space of all possible model architectures in a domain- specific language (DSL) or similar configuration language. We allow users to write code to define their models, but provide abstractions that guide develop- ers to write models in ways conducive to productionization. We also provide a unifying Estimator interface, making it possible to write downstream infrastructure (e.g. distributed training, hyperparameter tuning) independent of the model implementation. We balance the competing demands for flexibility and simplicity by offering APIs at different levels of abstraction, making common model architectures available out of the box, while providing a library of utilities designed to speed up experimentation with model architectures. To make out of the box models flexible and usable across a wide range of problems, these canned Estimators are parameterized not only over traditional hyperparameters, but also using feature columns, a declarative specification describing how to interpret input data. We discuss our experience in using this framework in re- search and production environments, and show the impact on code health, maintainability, and development speed.

Comments:	8 pages, Appeared at KDD 2017, August 13--17, 2017, Halifax, NS, Canada
Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
DOI:	10.1145/3097983.3098171
Cite as:	arXiv:1708.02637 [cs.DC]
	(or arXiv:1708.02637v1 [cs.DC] for this version)

Submission history

From: Illia Polosukhin [view email]
[v1] Tue, 8 Aug 2017 20:06:28 GMT (142kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1708.02637

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Distributed, Parallel, and Cluster Computing

Title: TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks

Submission history