Constrained Optimization with Dynamic Bound-scaling for Effective NLPBackdoor Defense

Shen, Guangyu; Liu, Yingqi; Tao, Guanhong; Xu, Qiuling; Zhang, Zhuo; An, Shengwei; Ma, Shiqing; Zhang, Xiangyu

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2202

Computer Science > Computation and Language

Title: Constrained Optimization with Dynamic Bound-scaling for Effective NLPBackdoor Defense

Authors: Guangyu Shen, Yingqi Liu, Guanhong Tao, Qiuling Xu, Zhuo Zhang, Shengwei An, Shiqing Ma, Xiangyu Zhang

(Submitted on 11 Feb 2022)

Abstract: We develop a novel optimization method for NLPbackdoor inversion. We leverage a dynamically reducing temperature coefficient in the softmax function to provide changing loss landscapes to the optimizer such that the process gradually focuses on the ground truth trigger, which is denoted as a one-hot value in a convex hull. Our method also features a temperature rollback mechanism to step away from local optimals, exploiting the observation that local optimals can be easily deter-mined in NLP trigger inversion (while not in general optimization). We evaluate the technique on over 1600 models (with roughly half of them having injected backdoors) on 3 prevailing NLP tasks, with 4 different backdoor attacks and 7 architectures. Our results show that the technique is able to effectively and efficiently detect and remove backdoors, outperforming 4 baseline methods.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2202.05749 [cs.CL]
	(or arXiv:2202.05749v1 [cs.CL] for this version)

Submission history

From: Guangyu Shen [view email]
[v1] Fri, 11 Feb 2022 16:40:25 GMT (1186kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2202.05749

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Constrained Optimization with Dynamic Bound-scaling for Effective NLPBackdoor Defense

Submission history