Title: Task Formulation Matters When Learning Continually: A Case Study in Visual Question Answering

Abstract: Continual learning aims to train a model incrementally on a sequence of tasks without forgetting previous knowledge. Although continual learning has been widely studied in computer vision, its application to Vision+Language tasks is not straightforward, because settings can be parameterized in multiple ways according to their input modalities. In this paper, we present a detailed study of how different settings affect performance for Visual Question Answering. We first propose three plausible task formulations and demonstrate their impact on the performance of continual learning algorithms. We break down several factors of task similarity, showing that performance and sensitivity to task order depend strongly on the shift of the output distribution. We also investigate the potential of pretrained models and compare the robustness of transformer models with different visual embeddings. Finally, we provide an analysis interpreting model representations and their impact on forgetting. Our results highlight the importance of stabilizing visual representations in deeper layers.
Subjects: Machine Learning (cs.LG)
Cite as: arXiv:2210.00044 [cs.LG]
  (or arXiv:2210.00044v2 [cs.LG] for this version)

Submission history

From: Malvina Nikandrou
[v1] Fri, 30 Sep 2022 19:12:58 GMT (11229kb,D)
[v2] Sat, 20 Jan 2024 19:15:21 GMT (11241kb,D)
