Title: Delving into adversarial attacks on deep policies

Abstract: Adversarial examples have been shown to exist for a variety of deep learning architectures. Deep reinforcement learning has shown promising results for training agent policies directly on raw inputs such as image pixels. In this paper we present a novel study of adversarial attacks on deep reinforcement learning policies. We compare the effectiveness of attacks that use adversarial examples with attacks that use random noise. We present a novel method, based on the value function, for reducing the number of times adversarial examples need to be injected for a successful attack. We further explore how re-training on random noise and Fast Gradient Sign Method (FGSM) perturbations affects resilience against adversarial examples.
Comments: ICLR 2017 Workshop
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as: arXiv:1705.06452 [stat.ML]
  (or arXiv:1705.06452v1 [stat.ML] for this version)
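
Illustrative sketch (not from the paper): the abstract names two ingredients, FGSM perturbations and a value-function criterion for deciding when to inject them. The Python below shows one way these could fit together, under strong assumptions. The policy is a hypothetical linear softmax policy with weights `W`, the value function is a hypothetical linear estimate `v @ x`, and the rule "attack only when the value exceeds `threshold`" is an assumed reading of "based on the value function", not the authors' stated criterion. All function and parameter names are made up for illustration.

```python
import numpy as np

def softmax(z):
    """Numerically stable softmax over a 1-D array of logits."""
    z = z - np.max(z)
    e = np.exp(z)
    return e / e.sum()

def fgsm_perturb(W, x, eps):
    """FGSM step: x + eps * sign(grad_x loss).

    The loss is the cross-entropy between the policy output and the
    action the (assumed linear softmax) policy currently prefers, so
    the step pushes the policy away from its own greedy action.
    """
    p = softmax(W @ x)
    a = int(np.argmax(p))
    onehot = np.zeros_like(p)
    onehot[a] = 1.0
    # Gradient of -log p[a] with respect to x for logits z = W @ x.
    grad_x = W.T @ (p - onehot)
    return x + eps * np.sign(grad_x)

def maybe_attack(W, v, x, eps, threshold):
    """Inject the perturbation only when the value estimate is high.

    Assumption: value = v @ x stands in for a learned value function,
    and the threshold rule is a hypothetical gating criterion.
    Returns the (possibly perturbed) observation and an attack flag.
    """
    value = float(v @ x)
    if value > threshold:
        return fgsm_perturb(W, x, eps), True
    return x, False
```

Gating the attack this way means the perturbation is computed only on a subset of time steps, which is the general idea of reducing the number of injections. The specific gating rule used in the paper should be taken from the paper itself.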

Submission history

From: Jernej Kos
[v1] Thu, 18 May 2017 08:01:53 GMT (1314kb,D)
