Learning Functions to Study the Benefit of Multitask Learning

Bettgenhäuser, Gabriele; Hedderich, Michael A.; Klakow, Dietrich

(Submitted on 9 Jun 2020 (v1), last revised 28 Sep 2020 (this version, v2))

Abstract: We study and quantify the generalization patterns of multitask learning (MTL) models for sequence labeling tasks. MTL models are trained to optimize a set of related tasks jointly. Although multitask learning has improved performance on some problems, other tasks lose performance when trained together. These mixed results motivate us to study the factors that affect the performance of MTL models. Theoretical bounds and convergence rates for MTL models exist, but they rely on strong assumptions such as task relatedness and the use of balanced datasets. To remedy these limitations, we propose creating a task simulator and using Symbolic Regression to learn expressions relating model performance to possible factors of influence. For MTL, we study model performance against the number of tasks (T), the number of samples per task (n), and the task relatedness measured by the adjusted mutual information (AMI). In our experiments, we empirically found formulas relating model performance to factors of sqrt(n) and sqrt(T), which are consistent with the mathematically proven results in Maurer [2016], and we went further by discovering that performance also relates to a factor of sqrt(AMI).
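
To make the idea of learning an expression that relates performance to a factor such as n more concrete, here is a minimal toy sketch. It is not the authors' Symbolic Regression setup: it only enumerates a handful of hand-picked transforms of n and keeps the one whose least-squares line fits best. The candidate list, the function names `fit_line` and `best_expression`, and the synthetic scores (generated from an assumed formula 0.9 - 2/sqrt(n)) are all hypothetical and chosen purely for illustration.

```python
import math

# Hypothetical candidate transforms of the sample count n.
# A real symbolic regressor searches a far larger expression space.
CANDIDATES = {
    "n": lambda n: float(n),
    "sqrt(n)": lambda n: math.sqrt(n),
    "log(n)": lambda n: math.log(n),
    "1/n": lambda n: 1.0 / n,
    "1/sqrt(n)": lambda n: 1.0 / math.sqrt(n),
}


def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x.

    Returns (a, b, sse), where sse is the sum of squared residuals.
    """
    m = len(xs)
    mean_x = sum(xs) / m
    mean_y = sum(ys) / m
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = mean_y - b * mean_x
    sse = sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))
    return a, b, sse


def best_expression(ns, scores):
    """Fit score = a + b*f(n) for each candidate f; return the best fit."""
    results = {}
    for name, transform in CANDIDATES.items():
        features = [transform(n) for n in ns]
        results[name] = fit_line(features, scores)
    best = min(results, key=lambda key: results[key][2])
    return best, results[best]


# Synthetic, noise-free scores from an ASSUMED formula (not data from the paper).
ns = [100, 200, 400, 800, 1600, 3200]
scores = [0.9 - 2.0 / math.sqrt(n) for n in ns]

name, (a, b, sse) = best_expression(ns, scores)
print(f"best term: {name}, intercept={a:.3f}, slope={b:.3f}")
```

Because the synthetic scores are noise-free, the search recovers the generating term exactly. With measured scores, the candidates would need to be compared on held-out data to avoid favoring overly flexible expressions.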

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
Cite as:	arXiv:2006.05561 [cs.LG]
	(or arXiv:2006.05561v2 [cs.LG] for this version)

Submission history

From: Gabriele Bettgenhäuser
[v1] Tue, 9 Jun 2020 23:51:32 GMT (305kb,D)
[v2] Mon, 28 Sep 2020 06:19:12 GMT (305kb,D)
