Coarse to Fine: Video Retrieval before Moment Localization

Gao, Zijian; Liu, Huanyu; Liu, Jingyu

(Submitted on 14 Oct 2021)

Abstract: Current state-of-the-art methods for video corpus moment retrieval (VCMR) often use a similarity-based feature alignment approach for the sake of convenience and speed. However, late fusion methods such as cosine similarity alignment cannot make full use of the information in both the query text and the videos. In this paper, we combine feature alignment with feature fusion to improve performance on VCMR.
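The contrast the abstract draws, between cheap cosine-similarity alignment and a more expensive fusion step, can be illustrated with a small coarse-to-fine ranking sketch. This is not the authors' implementation: the function names, the toy feature vectors, and in particular `toy_fusion_scorer` (a hand-written stand-in for a learned fusion model) are all assumptions made for illustration. The sketch shortlists videos by cosine similarity and then reranks only the shortlist with the fusion-style scorer.

```python
import numpy as np


def cosine_scores(query, videos):
    """Cosine similarity between one query vector and each row of `videos`."""
    q = query / np.linalg.norm(query)
    v = videos / np.linalg.norm(videos, axis=1, keepdims=True)
    return v @ q


def coarse_to_fine(query, videos, fine_scorer, k=2):
    """Shortlist k videos by cosine similarity, then rerank them with `fine_scorer`.

    Returns the shortlisted video indices, best first under the fine score.
    """
    coarse = cosine_scores(query, videos)
    shortlist = np.argsort(-coarse)[:k]  # coarse stage: cheap alignment
    fine = np.array([fine_scorer(query, videos[i]) for i in shortlist])
    return shortlist[np.argsort(-fine)]  # fine stage: rerank the shortlist only


def toy_fusion_scorer(query, video):
    """Hypothetical stand-in for a learned fusion model (assumption, not from the paper)."""
    return float(np.sum(np.tanh(query * video)))


# Toy features (made up for illustration): three "videos", one "query".
query = np.array([1.0, 0.0])
videos = np.array([[2.0, 0.0], [1.0, 0.1], [0.0, 1.0]])
ranking = coarse_to_fine(query, videos, toy_fusion_scorer, k=2)
```

The design point is that the fine scorer runs on only `k` candidates rather than the whole corpus, so a costlier query-video interaction becomes affordable once the cheap alignment stage has narrowed the search.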

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2110.07201 [cs.CV]
	(or arXiv:2110.07201v1 [cs.CV] for this version)

Submission history

From: Zijian Gao
[v1] Thu, 14 Oct 2021 07:54:36 GMT (19kb)
