Bottleneck Low-rank Transformers for Low-resource Spoken Language Understanding

Wang, Pu; Van hamme, Hugo

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2206

Computer Science > Computation and Language

Title: Bottleneck Low-rank Transformers for Low-resource Spoken Language Understanding

Authors: Pu Wang, Hugo Van hamme

(Submitted on 28 Jun 2022)

Abstract: End-to-end spoken language understanding (SLU) systems benefit from pretraining on large corpora, followed by fine-tuning on application-specific data. The resulting models are too large for on-edge applications. For instance, BERT-based systems contain over 110M parameters. Observing the model is overparameterized, we propose lean transformer structure where the dimension of the attention mechanism is automatically reduced using group sparsity. We propose a variant where the learned attention subspace is transferred to an attention bottleneck layer. In a low-resource setting and without pre-training, the resulting compact SLU model achieves accuracies competitive with pre-trained large models.

Comments:	Accepted by Interspeech 2022
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2206.14318 [cs.CL]
	(or arXiv:2206.14318v1 [cs.CL] for this version)

Submission history

From: Pu Wang [view email]
[v1] Tue, 28 Jun 2022 23:08:32 GMT (203kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2206.14318

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Bottleneck Low-rank Transformers for Low-resource Spoken Language Understanding

Submission history