Label-Efficient Online Continual Object Detection in Streaming Video

Wu, Jay Zhangjie; Zhang, David Junhao; Hsu, Wynne; Zhang, Mengmi; Shou, Mike Zheng

(Submitted on 1 Jun 2022 (v1), last revised 23 Aug 2023 (this version, v2))

Abstract: Humans can watch a continuous video stream and effortlessly acquire and transfer new knowledge with minimal supervision while retaining previously learnt experiences. In contrast, existing continual learning (CL) methods require fully annotated labels to effectively learn from individual frames in a video stream. Here, we examine a more realistic and challenging problem: Label-Efficient Online Continual Object Detection (LEOCOD) in streaming video. We propose a plug-and-play module, Efficient-CLS, that can be easily inserted into existing continual learners for object detection in video streams, improving them while reducing data annotation costs and model retraining time. We show that our method achieves significant improvement with minimal forgetting across all supervision levels on two challenging CL benchmarks for streaming real-world videos. Remarkably, with only 25% annotated video frames, our method still outperforms the base CL learners, which are trained with 100% annotations on all video frames. The data and source code will be publicly available at this https URL

Comments:	ICCV 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2206.00309 [cs.CV]
	(or arXiv:2206.00309v2 [cs.CV] for this version)

Submission history

From: Jay Zhangjie Wu
[v1] Wed, 1 Jun 2022 08:22:34 GMT (5078kb,D)
[v2] Wed, 23 Aug 2023 15:51:28 GMT (4512kb,D)
