We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: FewShotTextGCN: K-hop neighborhood regularization for few-shot learning on graphs

Abstract: We present FewShotTextGCN, a novel method designed to effectively utilize the properties of word-document graphs for improved learning in low-resource settings. We introduce K-hop Neighbourhood Regularization, a regularizer for heterogeneous graphs, and show that it stabilizes and improves learning when only a few training samples are available. We furthermore propose a simplification in the graph-construction method, which results in a graph that is $\sim$7 times less dense and yields better performance in little-resource settings while performing on par with the state of the art in high-resource settings. Finally, we introduce a new variant of Adaptive Pseudo-Labeling tailored for word-document graphs. When using as little as 20 samples for training, we outperform a strong TextGCN baseline with 17% in absolute accuracy on average over eight languages. We demonstrate that our method can be applied to document classification without any language model pretraining on a wide range of typologically diverse languages while performing on par with large pretrained language models.
Comments: 8 pages, 4 figures, EACL 2023
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as: arXiv:2301.10481 [cs.CL]
  (or arXiv:2301.10481v2 [cs.CL] for this version)

Submission history

From: Niels van der Heijden [view email]
[v1] Wed, 25 Jan 2023 09:30:32 GMT (915kb,D)
[v2] Mon, 6 Feb 2023 15:53:12 GMT (916kb,D)

Link back to: arXiv, form interface, contact.