Title: PDANet: Pyramid Density-aware Attention Net for Accurate Crowd Counting

Abstract: Crowd counting, i.e., estimating the number of people in a crowded area, has attracted much interest in the research community. Although many approaches have been reported, crowd counting remains an open real-world problem because of the vast scale variations in crowd density within the area of interest and severe occlusion among the crowd. In this paper, we propose a novel Pyramid Density-Aware Attention-based network, abbreviated as PDANet, that leverages attention, pyramid-scale features, and two-branch decoder modules for density-aware crowd counting. PDANet uses these modules to extract features at different scales, focus on the relevant information, and suppress misleading information. We also address the variation in crowdedness levels among different images with a dedicated Density-Aware Decoder (DAD). For this purpose, a classifier evaluates the density level of the input features and then passes them to the corresponding high- or low-crowdedness DAD module. Finally, we generate an overall density map by using the summation of the low- and high-crowdedness density maps as spatial attention. We also employ two losses to create a precise density map for the input scene. Extensive evaluations on challenging benchmark datasets demonstrate the superior performance of the proposed PDANet over well-known state-of-the-art methods in terms of the accuracy of both the counts and the generated density maps.
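
The last steps of the abstract can be made concrete with a small sketch. It is not the authors' implementation: the abstract does not say what the summed map is applied to or how it is normalized, so the sigmoid squashing, the element-wise reweighting of a feature grid, and the names `combine_as_attention` and `crowd_count` are all assumptions made for illustration. The only part taken from standard practice in density-map counting is that the predicted count is the sum of the density map.

```python
import math


def sigmoid(x):
    """Logistic function, used here (as an assumption) to squash values into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def combine_as_attention(low_map, high_map, features):
    """Hypothetical fusion step: sum the two branch maps, squash the sum,
    and use the result as a per-pixel (spatial) weight on `features`.

    All three arguments are 2-D lists of floats with the same shape.
    """
    attention = [
        [sigmoid(lo + hi) for lo, hi in zip(low_row, high_row)]
        for low_row, high_row in zip(low_map, high_map)
    ]
    return [
        [a * f for a, f in zip(att_row, feat_row)]
        for att_row, feat_row in zip(attention, features)
    ]


def crowd_count(density_map):
    """Estimated number of people: the sum over all pixels of the density map."""
    return sum(sum(row) for row in density_map)


# Toy 1x2 example: both branch maps are zero, so every attention weight is 0.5.
weighted = combine_as_attention([[0.0, 0.0]], [[0.0, 0.0]], [[2.0, 4.0]])
count = crowd_count([[0.5, 0.5], [1.0, 0.0]])
```

In the toy example `weighted` is `[[1.0, 2.0]]` and `count` is `2.0`. A real model would operate on multi-channel tensors and learn the branch outputs; plain lists are used here only to keep the sketch dependency-free.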
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as: arXiv:2001.05643 [cs.CV]
  (or arXiv:2001.05643v10 [cs.CV] for this version)

Submission history

From: Saeed Amirgholipour Kasmani
[v1] Thu, 16 Jan 2020 04:26:05 GMT (7008kb,D)
[v2] Wed, 22 Jan 2020 05:25:17 GMT (7432kb,D)
[v3] Wed, 22 Jan 2020 05:29:56 GMT (8054kb,D)
[v4] Wed, 29 Jan 2020 05:31:50 GMT (8054kb,D)
[v5] Tue, 25 Feb 2020 05:56:25 GMT (8055kb,D)
[v6] Tue, 25 Feb 2020 05:57:54 GMT (8060kb,D)
[v7] Thu, 26 Mar 2020 02:44:49 GMT (8650kb,D)
[v8] Sat, 4 Apr 2020 12:20:18 GMT (8650kb,D)
[v9] Wed, 15 Apr 2020 02:21:27 GMT (7796kb,D)
[v10] Wed, 29 Apr 2020 03:04:05 GMT (7276kb,D)
