Attacking and Defending Deep Reinforcement Learning Policies

Wang, Chao

Authors: Chao Wang

(Submitted on 16 May 2022)

Abstract: Recent studies have shown that deep reinforcement learning (DRL) policies are vulnerable to adversarial attacks, which raises concerns about applying DRL to safety-critical systems. In this work, we take a principled approach and study the robustness of DRL policies to adversarial attacks from the perspective of robust optimization. Within this framework, an optimal adversarial attack minimizes the expected return of the policy, and correspondingly a good defense mechanism should improve the worst-case performance of the policy. Because attackers generally have no access to the training environment, we propose a greedy attack algorithm, which tries to minimize the expected return of the policy without interacting with the environment, and a defense algorithm, which performs adversarial training in a max-min form. Experiments on Atari game environments show that our attack algorithm is more effective, driving the policy's return lower than existing attack algorithms do, and that our defense algorithm yields policies that are more robust than those of existing defense methods to a range of adversarial attacks (including our proposed attack algorithm).
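
The abstract does not spell out the attack itself, so the following is only a toy sketch of the general idea of a greedy, environment-free attack, not the paper's algorithm. It assumes a linear Q-function, an L-infinity perturbation budget `eps`, and a brute-force search over the corners of that budget; the names `q_values`, `greedy_action`, and `greedy_attack` are hypothetical. The attacker perturbs the observation so that the agent picks the action its own Q-function rates lowest at the true observation, which serves here as a stand-in for lowering the return without querying the environment.

```python
from itertools import product


def q_values(weights, obs):
    """Toy linear Q-function (an assumption): one weight row per action."""
    return [sum(w * x for w, x in zip(row, obs)) for row in weights]


def greedy_action(weights, obs):
    """Index of the action with the highest Q-value for this observation."""
    q = q_values(weights, obs)
    return max(range(len(q)), key=q.__getitem__)


def greedy_attack(weights, obs, eps):
    """Search the corners of the L-infinity ball of radius eps around obs.

    Returns (delta, value): the perturbation that makes the agent choose the
    action with the lowest Q-value at the TRUE observation, and that Q-value.
    No environment step is taken; only the agent's own Q-function is used.
    """
    true_q = q_values(weights, obs)
    best_delta = [0.0] * len(obs)
    best_value = true_q[greedy_action(weights, obs)]  # unattacked baseline
    for corner in product((-eps, eps), repeat=len(obs)):
        perturbed = [x + d for x, d in zip(obs, corner)]
        value = true_q[greedy_action(weights, perturbed)]
        if value < best_value:
            best_delta, best_value = list(corner), value
    return best_delta, best_value


# Two actions, two features; each action's value equals one feature.
weights = [[1.0, 0.0], [0.0, 1.0]]
delta, value = greedy_attack(weights, obs=[1.0, 0.8], eps=0.3)
print(delta, value)
```

Enumerating corners costs 2^d evaluations for d features, so it is only viable for tiny inputs; on image observations such as Atari frames a gradient-based search would normally take its place. A max-min defense in the sense of the abstract would then train the policy against perturbations found by an inner attack of this kind.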

Comments:	nine pages
Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR)
Cite as:	arXiv:2205.07626 [cs.LG]
	(or arXiv:2205.07626v1 [cs.LG] for this version)

Submission history

From: Chao Wang
[v1] Mon, 16 May 2022 12:47:54 GMT (4497kb,D)
