References & Citations
Computer Science > Computer Vision and Pattern Recognition
Title: DASHA: Decentralized Autofocusing System with Hierarchical Agents
(Submitted on 29 Aug 2021 (v1), last revised 2 Feb 2022 (this version, v2))
Abstract: State-of-the-art object detection models are frequently trained offline using available datasets, such as ImageNet: large and overly diverse data that are unbalanced and hard to cluster semantically. This kind of training drops the object detection performance should the change in illumination, in the environmental conditions (e.g., rain), or in the lens positioning (out-of-focus blur) occur. We propose a decentralized hierarchical multi-agent deep reinforcement learning approach for intelligently controlling the camera and the lens focusing settings, leading to a significant improvement beyond the capacity of the popular detection models (YOLO, Faster R-CNN, and Retina are considered). The algorithm relies on the latent representation of the camera's stream and, thus, it is the first method to allow a completely no-reference tuning of the camera, where the system trains itself to auto-focus itself.
Submission history
From: Dmitry V. Dylov [view email][v1] Sun, 29 Aug 2021 13:45:15 GMT (5693kb,D)
[v2] Wed, 2 Feb 2022 11:52:08 GMT (5677kb,D)
Link back to: arXiv, form interface, contact.