We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Joint Learning On The Hierarchy Representation for Fine-Grained Human Action Recognition

Abstract: Fine-grained human action recognition is a core research topic in computer vision. Inspired by the recently proposed hierarchy representation of fine-grained actions in FineGym and SlowFast network for action recognition, we propose a novel multi-task network which exploits the FineGym hierarchy representation to achieve effective joint learning and prediction for fine-grained human action recognition. The multi-task network consists of three pathways of SlowOnly networks with gradually increased frame rates for events, sets and elements of fine-grained actions, followed by our proposed integration layers for joint learning and prediction. It is a two-stage approach, where it first learns deep feature representation at each hierarchical level, and is followed by feature encoding and fusion for multi-task learning. Our empirical results on the FineGym dataset achieve a new state-of-the-art performance, with 91.80% Top-1 accuracy and 88.46% mean accuracy for element actions, which are 3.40% and 7.26% higher than the previous best results.
Comments: Camera ready for IEEE ICIP 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Journal reference: 2021 IEEE International Conference on Image Processing (ICIP)
DOI: 10.1109/ICIP42928.2021.9506157
Cite as: arXiv:2110.05853 [cs.CV]
  (or arXiv:2110.05853v1 [cs.CV] for this version)

Submission history

From: Mei Chee Leong [view email]
[v1] Tue, 12 Oct 2021 09:37:51 GMT (178kb,D)

Link back to: arXiv, form interface, contact.