Token-Modification Adversarial Attacks for Natural Language Processing: A Survey

Roth, Tom; Gao, Yansong; Abuadbba, Alsharif; Nepal, Surya; Liu, Wei

(Submitted on 1 Mar 2021 (v1), revised 7 Aug 2023 (this version, v2), latest version 7 Jan 2024 (v3))

Abstract: There are now many adversarial attacks for natural language processing systems. Of these, the vast majority achieve success by modifying individual document tokens; we call such an attack a token-modification attack. Each token-modification attack is defined by a specific combination of fundamental components, such as a constraint on the adversary or a particular search algorithm. Motivated by this observation, we survey existing token-modification attacks and extract the components of each. We structure the survey with an attack-independent framework, which yields an effective categorisation of the field and allows easy comparison of components. This survey aims to guide new researchers into the field and to spark further research into individual attack components.

Comments:	Version 2: updated
Subjects:	Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
Cite as:	arXiv:2103.00676 [cs.CL]
	(or arXiv:2103.00676v2 [cs.CL] for this version)

Submission history

From: Tom Roth
[v1] Mon, 1 Mar 2021 01:00:09 GMT (111kb,D)
[v2] Mon, 7 Aug 2023 03:25:37 GMT (224kb,D)
[v3] Sun, 7 Jan 2024 08:00:31 GMT (515kb,D)
