AdvFilter: Predictive Perturbation-aware Filtering against Adversarial Attack via Multi-domain Learning

Huang, Yihao; Guo, Qing; Juefei-Xu, Felix; Ma, Lei; Miao, Weikai; Liu, Yang; Pu, Geguang

Full-text links:

Download:

Computer Science > Computer Vision and Pattern Recognition

Title: AdvFilter: Predictive Perturbation-aware Filtering against Adversarial Attack via Multi-domain Learning

Authors: Yihao Huang, Qing Guo, Felix Juefei-Xu, Lei Ma, Weikai Miao, Yang Liu, Geguang Pu

(Submitted on 14 Jul 2021 (v1), last revised 18 Oct 2021 (this version, v2))

Abstract: High-level representation-guided pixel denoising and adversarial training are independent solutions to enhance the robustness of CNNs against adversarial attacks by pre-processing input data and re-training models, respectively. Most recently, adversarial training techniques have been widely studied and improved while the pixel denoising-based method is getting less attractive. However, it is still questionable whether there exists a more advanced pixel denoising-based method and whether the combination of the two solutions benefits each other. To this end, we first comprehensively investigate two kinds of pixel denoising methods for adversarial robustness enhancement (i.e., existing additive-based and unexplored filtering-based methods) under the loss functions of image-level and semantic-level, respectively, showing that pixel-wise filtering can obtain much higher image quality (e.g., higher PSNR) as well as higher robustness (e.g., higher accuracy on adversarial examples) than existing pixel-wise additive-based method. However, we also observe that the robustness results of the filtering-based method rely on the perturbation amplitude of adversarial examples used for training. To address this problem, we propose predictive perturbation-aware & pixel-wise filtering}, where dual-perturbation filtering and an uncertainty-aware fusion module are designed and employed to automatically perceive the perturbation amplitude during the training and testing process. The method is termed as AdvFilter. Moreover, we combine adversarial pixel denoising methods with three adversarial training-based methods, hinting that considering data and models jointly is able to achieve more robust CNNs. The experiments conduct on NeurIPS-2017DEV, SVHN and CIFAR10 datasets and show advantages over enhancing CNNs' robustness, high generalization to different models and noise levels.

Comments:	This work has been accepted to ACM-MM 2021
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
Cite as:	arXiv:2107.06501 [cs.CV]
	(or arXiv:2107.06501v2 [cs.CV] for this version)

Submission history

From: Yihao Huang [view email]
[v1] Wed, 14 Jul 2021 06:08:48 GMT (3360kb,D)
[v2] Mon, 18 Oct 2021 07:51:32 GMT (3138kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2107.06501

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: AdvFilter: Predictive Perturbation-aware Filtering against Adversarial Attack via Multi-domain Learning

Submission history