On the Security Vulnerabilities of Text-to-SQL Models

Peng, Xutan; Zhang, Yipeng; Yang, Jingfeng; Stevenson, Mark

Full-text links:

Download:

Current browse context:

cs.DB

< prev | next >

new | recent | 2211

Computer Science > Computation and Language

Title: On the Security Vulnerabilities of Text-to-SQL Models

Authors: Xutan Peng, Yipeng Zhang, Jingfeng Yang, Mark Stevenson

(Submitted on 28 Nov 2022 (v1), last revised 12 Oct 2023 (this version, v3))

Abstract: Although it has been demonstrated that Natural Language Processing (NLP) algorithms are vulnerable to deliberate attacks, the question of whether such weaknesses can lead to software security threats is under-explored. To bridge this gap, we conducted vulnerability tests on Text-to-SQL systems that are commonly used to create natural language interfaces to databases. We showed that the Text-to-SQL modules within six commercial applications can be manipulated to produce malicious code, potentially leading to data breaches and Denial of Service attacks. This is the first demonstration that NLP models can be exploited as attack vectors in the wild. In addition, experiments using four open-source language models verified that straightforward backdoor attacks on Text-to-SQL systems achieve a 100% success rate without affecting their performance. The aim of this work is to draw the community's attention to potential software security issues associated with NLP algorithms and encourage exploration of methods to mitigate against them.

Comments:	ISSRE 2023: Best Paper Candidate
Subjects:	Computation and Language (cs.CL); Cryptography and Security (cs.CR); Databases (cs.DB); Machine Learning (cs.LG); Software Engineering (cs.SE)
Cite as:	arXiv:2211.15363 [cs.CL]
	(or arXiv:2211.15363v3 [cs.CL] for this version)

Submission history

From: Xutan Peng [view email]
[v1] Mon, 28 Nov 2022 14:38:45 GMT (9176kb,D)
[v2] Fri, 3 Mar 2023 11:10:16 GMT (9282kb,D)
[v3] Thu, 12 Oct 2023 16:12:57 GMT (7718kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2211.15363

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: On the Security Vulnerabilities of Text-to-SQL Models

Submission history