Title: Facing the Hard Problems in FGVC

Abstract: In fine-grained visual categorization (FGVC), there is a near-singular focus on attaining state-of-the-art (SOTA) accuracy. This work carefully analyzes the performance of recent SOTA methods, both quantitatively and, more importantly, qualitatively. We show that these models universally struggle with certain "hard" images, while also making complementary mistakes. We underscore the importance of such analysis, and demonstrate that combining complementary models can improve accuracy on the popular CUB-200 dataset by over 5%. In addition to a detailed analysis and characterization of the errors made by these SOTA methods, we provide a clear set of recommended directions for future FGVC researchers.
Comments: 17 pages, 6 figures, 2 tables; fixed typo, minor adjustment to format, added equations
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2006.13190 [cs.CV]
  (or arXiv:2006.13190v2 [cs.CV] for this version)

Submission history

From: Connor Anderson
[v1] Tue, 23 Jun 2020 17:44:05 GMT (3430kb,D)
[v2] Wed, 24 Jun 2020 20:24:37 GMT (3431kb,D)
