We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CR

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Cryptography and Security

Title: OmniLytics: A Blockchain-based Secure Data Market for Decentralized Machine Learning

Abstract: We propose OmniLytics, a blockchain-based secure data trading marketplace for machine learning applications. Utilizing OmniLytics, many distributed data owners can contribute their private data to collectively train an ML model requested by some model owners, and receive compensation for data contribution. OmniLytics enables such model training while simultaneously providing 1) model security against curious data owners; 2) data security against the curious model and data owners; 3) resilience to malicious data owners who provide faulty results to poison model training; and 4) resilience to malicious model owners who intend to evade payment. OmniLytics is implemented as a blockchain smart contract to guarantee the atomicity of payment. In OmniLytics, a model owner splits its model into the private and public parts and publishes the public part on the contract. Through the execution of the contract, the participating data owners securely aggregate their locally trained models to update the model owner's public model and receive reimbursement through the contract. We implement a working prototype of OmniLytics on Ethereum blockchain and perform extensive experiments to measure its gas cost, execution time, and model quality under various parameter combinations. For training a CNN on the MNIST dataset, the MO is able to boost its model accuracy from 62% to 83% within 500ms in blockchain processing time.This demonstrates the effectiveness of OmniLytics for practical deployment.
Comments: An initial version of the article has been published in International Workshop on Federated Learning for User Privacy and Data Confidentiality in Conjunction with ICML 2021(this http URL). This version has been submmited to AAAI'22
Subjects: Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
MSC classes: 68M14, 68T20
ACM classes: I.2.11; C.0
Cite as: arXiv:2107.05252 [cs.CR]
  (or arXiv:2107.05252v4 [cs.CR] for this version)

Submission history

From: Jiacheng Liang [view email]
[v1] Mon, 12 Jul 2021 08:28:15 GMT (2633kb,D)
[v2] Sun, 12 Sep 2021 06:41:09 GMT (11791kb,D)
[v3] Wed, 15 Sep 2021 16:24:13 GMT (11806kb,D)
[v4] Mon, 15 Nov 2021 07:18:28 GMT (5903kb,D)

Link back to: arXiv, form interface, contact.