We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CR

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Cryptography and Security

Title: FLAME: Taming Backdoors in Federated Learning

Abstract: Federated Learning (FL) is a collaborative machine learning approach allowing participants to jointly train a model without having to share their private, potentially sensitive local datasets with others. Despite its benefits, FL is vulnerable to so-called backdoor attacks, in which an adversary injects manipulated model updates into the federated model aggregation process so that the resulting model will provide targeted false predictions for specific adversary-chosen inputs. Proposed defenses against backdoor attacks based on detecting and filtering out malicious model updates consider only very specific and limited attacker models, whereas defenses based on differential privacy-inspired noise injection significantly deteriorate the benign performance of the aggregated model. To address these deficiencies, we introduce FLAME, a defense framework that estimates the sufficient amount of noise to be injected to ensure the elimination of backdoors. To minimize the required amount of noise, FLAME uses a model clustering and weight clipping approach. This ensures that FLAME can maintain the benign performance of the aggregated model while effectively eliminating adversarial backdoors. Our evaluation of FLAME on several datasets stemming from application areas including image classification, word prediction, and IoT intrusion detection demonstrates that FLAME removes backdoors effectively with a negligible impact on the benign performance of the models.
Comments: To appear in the 31st USENIX Security Symposium, August 2022, Boston, MA, USA
Subjects: Cryptography and Security (cs.CR)
Cite as: arXiv:2101.02281 [cs.CR]
  (or arXiv:2101.02281v3 [cs.CR] for this version)

Submission history

From: Duc Thien Nguyen [view email]
[v1] Wed, 6 Jan 2021 21:49:27 GMT (3224kb,D)
[v2] Thu, 21 Jan 2021 15:30:17 GMT (3228kb,D)
[v3] Sun, 9 Jan 2022 19:41:06 GMT (7910kb,D)

Link back to: arXiv, form interface, contact.