Voint Cloud: Multi-View Point Cloud Representation for 3D Understanding

Hamdi, Abdullah; Giancola, Silvio; Ghanem, Bernard

Computer Science > Computer Vision and Pattern Recognition

(Submitted on 30 Nov 2021 (this version), latest version 25 Jan 2023 (v2))

Abstract: Multi-view projection methods have demonstrated promising performance on 3D understanding tasks like 3D classification and segmentation. However, it remains unclear how to combine such multi-view methods with the widely available 3D point clouds. Previous methods use unlearned heuristics to combine features at the point level. To this end, we introduce the concept of the multi-view point cloud (Voint cloud), representing each 3D point as a set of features extracted from several view-points. This novel 3D Voint cloud representation combines the compactness of 3D point cloud representation with the natural view-awareness of multi-view representation. Naturally, we can equip this new representation with convolutional and pooling operations. We deploy a Voint neural network (VointNet) with a theoretically established functional form to learn representations in the Voint space. Our novel representation achieves state-of-the-art performance on 3D classification and retrieval on ScanObjectNN, ModelNet40, and ShapeNet Core55. Additionally, we achieve competitive performance for 3D semantic segmentation on ShapeNet Parts. Further analysis shows that VointNet improves the robustness to rotation and occlusion compared to other methods.

Comments:	preprint
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
MSC classes:	68T45
Cite as:	arXiv:2111.15363 [cs.CV]
	(or arXiv:2111.15363v1 [cs.CV] for this version)

Submission history

From: Abdullah Hamdi
[v1] Tue, 30 Nov 2021 13:08:19 GMT (18984kb,D)
[v2] Wed, 25 Jan 2023 16:26:04 GMT (25522kb,D)
