Leveraging Systematic Knowledge of 2D Transformations

Kang, Jiachen; Jia, Wenjing; He, Xiangjian

Computer Science > Computer Vision and Pattern Recognition

(Submitted on 2 Jun 2022 (this version), latest version 23 Apr 2024 (v2))

Abstract: Existing deep learning models suffer a performance drop on out-of-distribution (o.o.d.) data in computer vision tasks. In comparison, humans have a remarkable ability to interpret images, even when the scenes depicted are rare, thanks to the systematicity of their acquired knowledge. This work focuses on 1) the acquisition of systematic knowledge of 2D transformations, and 2) architectural components that can leverage the learned knowledge in image classification tasks in an o.o.d. setting. With a new training methodology based on synthetic datasets constructed under the causal framework, deep neural networks acquire knowledge from semantically different domains (e.g., even from noise) and exhibit a certain level of systematicity in parameter estimation experiments. Based on this, a novel architecture is devised consisting of a classifier, an estimator and an identifier (abbreviated as "CED"). By emulating the "hypothesis-verification" process in human visual perception, CED improves the classification accuracy significantly on test sets under covariate shift.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2206.00893 [cs.CV]
	(or arXiv:2206.00893v1 [cs.CV] for this version)

Submission history

From: Jiachen Kang [view email]
[v1] Thu, 2 Jun 2022 06:46:12 GMT (1533kb,D)
[v2] Tue, 23 Apr 2024 03:23:10 GMT (1538kb,D)
