We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Towards Practical Few-shot Federated NLP

Abstract: Transformer-based pre-trained models have emerged as the predominant solution for natural language processing (NLP). Fine-tuning such pre-trained models for downstream tasks often requires a considerable amount of labeled private data. In practice, private data is often distributed across heterogeneous mobile devices and may be prohibited from being uploaded. Moreover, well-curated labeled data is often scarce, presenting an additional challenge. To address these challenges, we first introduce a data generator for federated few-shot learning tasks, which encompasses the quantity and skewness of scarce labeled data in a realistic setting. Subsequently, we propose AUG-FedPrompt, a prompt-based federated learning system that exploits abundant unlabeled data for data augmentation. Our experiments indicate that AUG-FedPrompt can perform on par with full-set fine-tuning with a limited amount of labeled data. However, such competitive performance comes at a significant system cost.
Comments: EuroSys23 workshop
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
DOI: 10.1145/3578356.3592575
Cite as: arXiv:2212.00192 [cs.CL]
  (or arXiv:2212.00192v2 [cs.CL] for this version)

Submission history

From: Dongqi Cai [view email]
[v1] Thu, 1 Dec 2022 00:36:48 GMT (10857kb,D)
[v2] Sat, 19 Aug 2023 07:28:53 GMT (8645kb,D)

Link back to: arXiv, form interface, contact.