We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: On the Efficiency of Integrating Self-supervised Learning and Meta-learning for User-defined Few-shot Keyword Spotting

Abstract: User-defined keyword spotting is a task to detect new spoken terms defined by users. This can be viewed as a few-shot learning problem since it is unreasonable for users to define their desired keywords by providing many examples. To solve this problem, previous works try to incorporate self-supervised learning models or apply meta-learning algorithms. But it is unclear whether self-supervised learning and meta-learning are complementary and which combination of the two types of approaches is most effective for few-shot keyword discovery. In this work, we systematically study these questions by utilizing various self-supervised learning models and combining them with a wide variety of meta-learning algorithms. Our result shows that HuBERT combined with Matching network achieves the best result and is robust to the changes of few-shot examples.
Comments: Accepted by SLT 2022
Subjects: Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as: arXiv:2204.00352 [cs.LG]
  (or arXiv:2204.00352v3 [cs.LG] for this version)

Submission history

From: Wei-Tsung Kao [view email]
[v1] Fri, 1 Apr 2022 10:59:39 GMT (1775kb,D)
[v2] Tue, 19 Apr 2022 09:22:26 GMT (1776kb,D)
[v3] Wed, 5 Oct 2022 14:16:58 GMT (1123kb,D)

Link back to: arXiv, form interface, contact.