Title: Two-Stream Consensus Network: Submission to HACS Challenge 2021 Weakly-Supervised Learning Track

Abstract: This technical report presents our solution to the HACS Temporal Action Localization Challenge 2021, Weakly-Supervised Learning Track. The goal of weakly-supervised temporal action localization is to temporally locate and classify actions of interest in untrimmed videos given only video-level labels. We adopt the two-stream consensus network (TSCN) as the main framework in this challenge. The TSCN consists of a two-stream base model training procedure and a pseudo ground truth learning procedure. The base model training encourages each model to make reliable predictions from a single modality (i.e., RGB or optical flow). The predictions of the two streams are then fused to generate a pseudo ground truth, which is in turn used as supervision to train the base models. On the HACS v1.1.1 dataset, without fine-tuning the feature-extraction I3D models, our method achieves 22.20% on the validation set and 21.68% on the testing set in terms of average mAP. Our solution ranked 2nd in this challenge, and we hope our method can serve as a baseline for future academic research.
Comments: Second place solution to the HACS Weakly-Supervised Temporal Action Localization Challenge 2021. arXiv admin note: text overlap with arXiv:2010.11594
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2106.10829 [cs.CV]
  (or arXiv:2106.10829v3 [cs.CV] for this version)
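
The pseudo ground truth step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the weighted-average fusion with weight `beta`, the fixed `threshold`, and the binary cross-entropy loss are all assumptions chosen for illustration, since the abstract does not specify these details. Each stream is assumed to output one foreground score in [0, 1] per video snippet.

```python
import math
from typing import List


def fuse_attention(rgb: List[float], flow: List[float], beta: float = 0.5) -> List[float]:
    """Late-fuse per-snippet scores of the two streams by a weighted average.

    `beta` is a hypothetical fusion weight, not a value taken from the report.
    """
    if len(rgb) != len(flow):
        raise ValueError("both streams must cover the same snippets")
    return [beta * r + (1.0 - beta) * f for r, f in zip(rgb, flow)]


def hard_pseudo_ground_truth(fused: List[float], threshold: float = 0.5) -> List[int]:
    """Binarize fused scores into per-snippet foreground (1) / background (0) labels.

    The threshold is an illustrative assumption.
    """
    return [1 if score > threshold else 0 for score in fused]


def pseudo_label_loss(pred: List[float], target: List[int], eps: float = 1e-7) -> float:
    """Mean binary cross-entropy between one stream's scores and the pseudo labels."""
    total = 0.0
    for p, t in zip(pred, target):
        p = min(max(p, eps), 1.0 - eps)  # clamp to keep log() finite
        total -= t * math.log(p) + (1 - t) * math.log(1.0 - p)
    return total / len(pred)


# Toy scores for four snippets of one video.
rgb_scores = [0.9, 0.2, 0.6, 0.1]
flow_scores = [0.7, 0.4, 0.2, 0.1]

fused_scores = fuse_attention(rgb_scores, flow_scores)
pseudo_labels = hard_pseudo_ground_truth(fused_scores)

# Each stream is then supervised by the shared pseudo labels.
rgb_loss = pseudo_label_loss(rgb_scores, pseudo_labels)
flow_loss = pseudo_label_loss(flow_scores, pseudo_labels)
```

In this toy case the third snippet is scored as foreground by the RGB stream alone, so the fused pseudo label marks it as background and the RGB stream is penalized for it. This is one way to read how the consensus of the two modalities supervises each base model.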

Submission history

From: Yuanhao Zhai
[v1] Mon, 21 Jun 2021 03:36:36 GMT (860kb,D)
[v2] Sun, 11 Jul 2021 06:35:57 GMT (925kb,D)
[v3] Sun, 17 Apr 2022 18:31:23 GMT (860kb,D)
