We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Privacy-Preserving Text Classification on BERT Embeddings with Homomorphic Encryption

Abstract: Embeddings, which compress information in raw text into semantics-preserving low-dimensional vectors, have been widely adopted for their efficacy. However, recent research has shown that embeddings can potentially leak private information about sensitive attributes of the text, and in some cases, can be inverted to recover the original input text. To address these growing privacy challenges, we propose a privatization mechanism for embeddings based on homomorphic encryption, to prevent potential leakage of any piece of information in the process of text classification. In particular, our method performs text classification on the encryption of embeddings from state-of-the-art models like BERT, supported by an efficient GPU implementation of CKKS encryption scheme. We show that our method offers encrypted protection of BERT embeddings, while largely preserving their utility on downstream text classification tasks.
Comments: NAACL 2022
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2210.02574 [cs.CL]
  (or arXiv:2210.02574v1 [cs.CL] for this version)

Submission history

From: Minsoo Kim [view email]
[v1] Wed, 5 Oct 2022 21:46:02 GMT (91kb,D)

Link back to: arXiv, form interface, contact.