Title: Interpretable Self-Attention Temporal Reasoning for Driving Behavior Understanding

Abstract: Performing driving behaviors based on causal reasoning is essential to ensure driving safety. In this work, we investigated how well state-of-the-art 3D Convolutional Neural Networks (CNNs) classify driving behaviors based on causal reasoning. We proposed a perturbation-based visual explanation method to inspect the models' performance visually. By examining the video attention saliency, we found that existing models could not precisely capture the causes (e.g., a traffic light) of a specific action (e.g., stopping). We therefore proposed the Temporal Reasoning Block (TRB) and introduced it into the models. With the TRB models, we achieved an accuracy of $\mathbf{86.3\%}$, which outperforms the state-of-the-art 3D CNNs from previous works. The attention saliency also demonstrated that the TRB helped the models focus on the causes more precisely. Based on both numerical and visual evaluations, we concluded that our proposed TRB models provide accurate driving behavior prediction by learning the causal reasoning behind the behaviors.
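The abstract does not spell out how the perturbation-based explanation is computed. As a rough illustration of the general idea only, the sketch below uses plain occlusion: mask one spatio-temporal patch at a time and record how much a classifier's score drops. This is a generic technique swapped in for illustration, not the authors' method; the names `occlusion_saliency`, `toy_score`, `patch`, and `fill`, and the frames x rows x columns grayscale layout, are all assumptions.

```python
from typing import Callable, List

# Hypothetical layout: video[t][r][c] is a grayscale intensity.
Video = List[List[List[float]]]


def occlusion_saliency(
    video: Video,
    score_fn: Callable[[Video], float],
    patch: int = 2,
    fill: float = 0.0,
) -> Video:
    """Return, per pixel, the score drop caused by masking its patch."""
    base = score_fn(video)
    n_frames, height, width = len(video), len(video[0]), len(video[0][0])
    saliency = [[[0.0] * width for _ in range(height)] for _ in range(n_frames)]

    for t in range(n_frames):
        for r0 in range(0, height, patch):
            for c0 in range(0, width, patch):
                rows = range(r0, min(r0 + patch, height))
                cols = range(c0, min(c0 + patch, width))

                # Copy the video and blank out one patch in frame t.
                perturbed = [[row[:] for row in frame] for frame in video]
                for r in rows:
                    for c in cols:
                        perturbed[t][r][c] = fill

                # A large drop means the model relied on this patch.
                drop = base - score_fn(perturbed)
                for r in rows:
                    for c in cols:
                        saliency[t][r][c] = drop
    return saliency


def toy_score(video: Video) -> float:
    """Stand-in for a classifier: depends only on two pixels of frame 1."""
    return video[1][0][0] + video[1][0][1]


# Two 4x4 frames of constant intensity.
video = [[[1.0] * 4 for _ in range(4)] for _ in range(2)]
saliency = occlusion_saliency(video, toy_score, patch=2)
```

In this toy setup only the top-left patch of the second frame receives a nonzero value, because it is the only region the stand-in score depends on. With a real video model, `score_fn` would be the class probability for the predicted behavior.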
Comments: Submitted to IEEE ICASSP 2020; PyTorch code will be released soon
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Journal reference: 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI: 10.1109/ICASSP40776.2020.9053783
Cite as: arXiv:1911.02172 [cs.CV]
  (or arXiv:1911.02172v1 [cs.CV] for this version)

Submission history

From: C.-H. Huck Yang
[v1] Wed, 6 Nov 2019 02:49:30 GMT (756kb)