We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: VFed-SSD: Towards Practical Vertical Federated Advertising

Abstract: As an emerging secure learning paradigm in lever-aging cross-agency private data, vertical federatedlearning (VFL) is expected to improve advertising models by enabling the joint learning of complementary user attributes privately owned by the advertiser and the publisher. However, there are two key challenges in applying it to advertising systems: a) the limited scale of labeled overlapping samples, and b) the high cost of real-time cross-agency serving. In this paper, we propose a semi-supervised split distillation framework VFed-SSD to alleviate the two limitations. We identify that: i)there are massive unlabeled overlapped data available in advertising systems, and ii) we can keep a balance between model performance and inference cost by decomposing the federated model. Specifically, we develop a self-supervised task MatchedPair Detection (MPD) to exploit the vertically partitioned unlabeled data and propose the Split Knowledge Distillation (SplitKD) schema to avoid cross-agency serving. Empirical studies on three industrial datasets exhibit the effectiveness of ourmethods, with the median AUC over all datasets improved by 0.86% and 2.6% in the local andthe federated deployment mode respectively. Overall, our framework provides an efficient federation-enhanced solution for real-time display advertising with minimal deploying cost and significant performance lift.
Comments: Accepted to the Trustworthy Federated Learning workshop of IJCAI2022 (FL-IJCAI22). Old version: Semi-Supervised Cross-Silo Advertising with Partial Knowledge Transfer
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Information Retrieval (cs.IR)
Cite as: arXiv:2205.15987 [cs.LG]
  (or arXiv:2205.15987v4 [cs.LG] for this version)

Submission history

From: Wenjie Li [view email]
[v1] Tue, 31 May 2022 17:45:30 GMT (174kb,D)
[v2] Tue, 21 Jun 2022 12:06:23 GMT (225kb,D)
[v3] Fri, 30 Sep 2022 18:27:05 GMT (105kb,D)
[v4] Sat, 17 Jun 2023 13:17:36 GMT (151kb,D)

Link back to: arXiv, form interface, contact.