We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Meta Networks

Abstract: Neural networks have been successfully applied in applications with a large amount of labeled data. However, the task of rapid generalization on new concepts with small training data while preserving performances on previously learned ones still presents a significant challenge to neural network models. In this work, we introduce a novel meta learning method, Meta Networks (MetaNet), that learns a meta-level knowledge across tasks and shifts its inductive biases via fast parameterization for rapid generalization. When evaluated on Omniglot and Mini-ImageNet benchmarks, our MetaNet models achieve a near human-level performance and outperform the baseline approaches by up to 6% accuracy. We demonstrate several appealing properties of MetaNet relating to generalization and continual learning.
Comments: Accepted at ICML 2017 - rewrote: the main section; added: MetaNet algorithmic procedure; performed: Mini-ImageNet evaluation
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:1703.00837 [cs.LG]
  (or arXiv:1703.00837v2 [cs.LG] for this version)

Submission history

From: Tsendsuren Munkhdalai [view email]
[v1] Thu, 2 Mar 2017 15:52:55 GMT (250kb,D)
[v2] Thu, 8 Jun 2017 16:12:40 GMT (254kb,D)

Link back to: arXiv, form interface, contact.