3D Convolutional with Attention for Action Recognition

Shrestha, Labina; Dubey, Shikha; Olimov, Farrukh; Rafique, Muhammad Aasim; Jeon, Moongu

Full-text links:

Download:

PDF only

Current browse context:

cs.CV

< prev | next >

new | recent | 2206

Change to browse by:

Computer Science > Computer Vision and Pattern Recognition

Title: 3D Convolutional with Attention for Action Recognition

Authors: Labina Shrestha, Shikha Dubey, Farrukh Olimov, Muhammad Aasim Rafique, Moongu Jeon

(Submitted on 5 Jun 2022)

Abstract: Human action recognition is one of the challenging tasks in computer vision. The current action recognition methods use computationally expensive models for learning spatio-temporal dependencies of the action. Models utilizing RGB channels and optical flow separately, models using a two-stream fusion technique, and models consisting of both convolutional neural network (CNN) and long-short term memory (LSTM) network are few examples of such complex models. Moreover, fine-tuning such complex models is computationally expensive as well. This paper proposes a deep neural network architecture for learning such dependencies consisting of a 3D convolutional layer, fully connected (FC) layers, and attention layer, which is simpler to implement and gives a competitive performance on the UCF-101 dataset. The proposed method first learns spatial and temporal features of actions through 3D-CNN, and then the attention mechanism helps the model to locate attention to essential features for recognition.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2206.02203 [cs.CV]
	(or arXiv:2206.02203v1 [cs.CV] for this version)

Submission history

From: Shikha Dubey [view email]
[v1] Sun, 5 Jun 2022 15:12:57 GMT (207kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2206.02203

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: 3D Convolutional with Attention for Action Recognition

Submission history