Delving into adversarial attacks on deep policies

Kos, Jernej; Song, Dawn

Full-text links:

Download:

Current browse context:

stat.ML

< prev | next >

new | recent | 1705

Statistics > Machine Learning

Title: Delving into adversarial attacks on deep policies

Authors: Jernej Kos, Dawn Song

(Submitted on 18 May 2017)

Abstract: Adversarial examples have been shown to exist for a variety of deep learning architectures. Deep reinforcement learning has shown promising results on training agent policies directly on raw inputs such as image pixels. In this paper we present a novel study into adversarial attacks on deep reinforcement learning polices. We compare the effectiveness of the attacks using adversarial examples vs. random noise. We present a novel method for reducing the number of times adversarial examples need to be injected for a successful attack, based on the value function. We further explore how re-training on random noise and FGSM perturbations affects the resilience against adversarial examples.

Comments:	ICLR 2017 Workshop
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1705.06452 [stat.ML]
	(or arXiv:1705.06452v1 [stat.ML] for this version)

Submission history

From: Jernej Kos [view email]
[v1] Thu, 18 May 2017 08:01:53 GMT (1314kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:1705.06452

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Machine Learning

Title: Delving into adversarial attacks on deep policies

Submission history