Learning to reinforcement learn for Neural Architecture Search

Robles, J. Gomez; Vanschoren, J.

Full-text links:

Download:

Computer Science > Neural and Evolutionary Computing

Title: Learning to reinforcement learn for Neural Architecture Search

Authors: J. Gomez Robles, J. Vanschoren

(Submitted on 9 Nov 2019 (v1), last revised 2 Dec 2019 (this version, v2))

Abstract: Reinforcement learning (RL) is a goal-oriented learning solution that has proven to be successful for Neural Architecture Search (NAS) on the CIFAR and ImageNet datasets. However, a limitation of this approach is its high computational cost, making it unfeasible to replay it on other datasets. Through meta-learning, we could bring this cost down by adapting previously learned policies instead of learning them from scratch. In this work, we propose a deep meta-RL algorithm that learns an adaptive policy over a set of environments, making it possible to transfer it to previously unseen tasks. The algorithm was applied to various proof-of-concept environments in the past, but we adapt it to the NAS problem. We empirically investigate the agent's behavior during training when challenged to design chain-structured neural architectures for three datasets with increasing levels of hardness, to later fix the policy and evaluate it on two unseen datasets of different difficulty. Our results show that, under resource constraints, the agent effectively adapts its strategy during training to design better architectures than the ones designed by a standard RL algorithm, and can design good architectures during the evaluation on previously unseen environments. We also provide guidelines on the applicability of our framework in a more complex NAS setting by studying the progress of the agent when challenged to design multi-branch architectures.

Comments:	32 pages, 21 figures, 9 tables
Subjects:	Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG)
Cite as:	arXiv:1911.03769 [cs.NE]
	(or arXiv:1911.03769v2 [cs.NE] for this version)

Submission history

From: Jorge Gomez Robles [view email]
[v1] Sat, 9 Nov 2019 20:13:00 GMT (3437kb,D)
[v2] Mon, 2 Dec 2019 22:22:35 GMT (3437kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1911.03769

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Neural and Evolutionary Computing

Title: Learning to reinforcement learn for Neural Architecture Search

Submission history