References & Citations
Computer Science > Computer Vision and Pattern Recognition
Title: ORBIT: A Real-World Few-Shot Dataset for Teachable Object Recognition
(Submitted on 8 Apr 2021 (v1), last revised 8 Oct 2021 (this version, v5))
Abstract: Object recognition has made great advances in the last decade, but predominately still relies on many high-quality training examples per object category. In contrast, learning new objects from only a few examples could enable many impactful applications from robotics to user personalization. Most few-shot learning research, however, has been driven by benchmark datasets that lack the high variation that these applications will face when deployed in the real-world. To close this gap, we present the ORBIT dataset and benchmark, grounded in the real-world application of teachable object recognizers for people who are blind/low-vision. The dataset contains 3,822 videos of 486 objects recorded by people who are blind/low-vision on their mobile phones. The benchmark reflects a realistic, highly challenging recognition problem, providing a rich playground to drive research in robustness to few-shot, high-variation conditions. We set the benchmark's first state-of-the-art and show there is massive scope for further innovation, holding the potential to impact a broad range of real-world vision applications including tools for the blind/low-vision community. We release the dataset at this https URL and benchmark code at this https URL
Submission history
From: Daniela Massiceti [view email][v1] Thu, 8 Apr 2021 15:32:01 GMT (18655kb,D)
[v2] Fri, 9 Apr 2021 16:56:43 GMT (18656kb,D)
[v3] Thu, 10 Jun 2021 14:50:34 GMT (18676kb,D)
[v4] Mon, 16 Aug 2021 16:19:12 GMT (10942kb,D)
[v5] Fri, 8 Oct 2021 13:20:52 GMT (6732kb,D)
Link back to: arXiv, form interface, contact.