Current browse context:
cs.CV
Change to browse by:
References & Citations
Computer Science > Computer Vision and Pattern Recognition
Title: SAM: Squeeze-and-Mimic Networks for Conditional Visual Driving Policy Learning
(Submitted on 6 Dec 2019 (v1), last revised 19 Nov 2020 (this version, v2))
Abstract: We describe a policy learning approach to map visual inputs to driving controls conditioned on turning command that leverages side tasks on semantics and object affordances via a learned representation trained for driving. To learn this representation, we train a squeeze network to drive using annotations for the side task as input. This representation encodes the driving-relevant information associated with the side task while ideally throwing out side task-relevant but driving-irrelevant nuisances. We then train a mimic network to drive using only images as input and use the squeeze network's latent representation to supervise the mimic network via a mimicking loss. Notably, we do not aim to achieve the side task nor to learn features for it; instead, we aim to learn, via the mimicking loss, a representation of the side task annotations directly useful for driving. We test our approach using the CARLA simulator. In addition, we introduce a more challenging but realistic evaluation protocol that considers a run that reaches the destination successful only if it does not violate common traffic rules. A video summarizing this work is available at this https URL , and code is available at this https URL .
Submission history
From: Albert Zhao [view email][v1] Fri, 6 Dec 2019 04:41:51 GMT (1036kb,D)
[v2] Thu, 19 Nov 2020 07:50:13 GMT (445kb,D)
Link back to: arXiv, form interface, contact.