We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: MetH: A family of high-resolution and variable-shape image challenges

Abstract: High-resolution and variable-shape images have not yet been properly addressed by the AI community. The approach of down-sampling data often used with convolutional neural networks is sub-optimal for many tasks, and has too many drawbacks to be considered a sustainable alternative. In sight of the increasing importance of problems that can benefit from exploiting high-resolution (HR) and variable-shape, and with the goal of promoting research in that direction, we introduce a new family of datasets (MetH). The four proposed problems include two image classification, one image regression and one super resolution task. Each of these datasets contains thousands of art pieces captured by HR and variable-shape images, labeled by experts at the Metropolitan Museum of Art. We perform an analysis, which shows how the proposed tasks go well beyond current public alternatives in both pixel size and aspect ratio variance. At the same time, the performance obtained by popular architectures on these tasks shows that there is ample room for improvement. To wrap up the relevance of the contribution we review the fields, both in AI and high-performance computing, that could benefit from the proposed challenges.
Comments: An improved and extended version of this paper has been published in arXiv:2007.13693 This version is now obsolete
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:1911.08953 [cs.CV]
  (or arXiv:1911.08953v4 [cs.CV] for this version)

Submission history

From: Ferran Parés [view email]
[v1] Wed, 20 Nov 2019 15:01:22 GMT (6003kb,D)
[v2] Fri, 29 Nov 2019 17:30:59 GMT (3002kb,D)
[v3] Thu, 30 Jul 2020 09:27:49 GMT (0kb,I)
[v4] Tue, 29 Sep 2020 11:37:54 GMT (0kb,I)

Link back to: arXiv, form interface, contact.