We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: SalyPath360: Saliency and Scanpath Prediction Framework for Omnidirectional Images

Abstract: This paper introduces a new framework to predict visual attention of omnidirectional images. The key setup of our architecture is the simultaneous prediction of the saliency map and a corresponding scanpath for a given stimulus. The framework implements a fully encoder-decoder convolutional neural network augmented by an attention module to generate representative saliency maps. In addition, an auxiliary network is employed to generate probable viewport center fixation points through the SoftArgMax function. The latter allows to derive fixation points from feature maps. To take advantage of the scanpath prediction, an adaptive joint probability distribution model is then applied to construct the final unbiased saliency map by leveraging the encoder decoder-based saliency map and the scanpath-based saliency heatmap. The proposed framework was evaluated in terms of saliency and scanpath prediction, and the results were compared to state-of-the-art methods on Salient360! dataset. The results showed the relevance of our framework and the benefits of such architecture for further omnidirectional visual attention prediction tasks.
Comments: Accepted at Electornic Imaging Sympotium 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2201.00096 [cs.CV]
  (or arXiv:2201.00096v1 [cs.CV] for this version)

Submission history

From: Mohamed Amine Kerkouri [view email]
[v1] Sat, 1 Jan 2022 02:37:33 GMT (933kb,D)

Link back to: arXiv, form interface, contact.