Computer Science > Computer Vision and Pattern Recognition
Title: Voint Cloud: Multi-View Point Cloud Representation for 3D Understanding
(Submitted on 30 Nov 2021 (this version), latest version 25 Jan 2023 (v2))
Abstract: Multi-view projection methods have demonstrated promising performance on 3D understanding tasks like 3D classification and segmentation. However, it remains unclear how to combine such multi-view methods with the widely available 3D point clouds. Previous methods use unlearned heuristics to combine features at the point level. To address this, we introduce the concept of the multi-view point cloud (Voint cloud), representing each 3D point as a set of features extracted from several viewpoints. This novel 3D Voint cloud representation combines the compactness of the 3D point cloud representation with the natural view-awareness of the multi-view representation. Naturally, we can equip this new representation with convolutional and pooling operations. We deploy a Voint neural network (VointNet) with a theoretically established functional form to learn representations in the Voint space. Our novel representation achieves state-of-the-art performance on 3D classification and retrieval on ScanObjectNN, ModelNet40, and ShapeNet Core55. Additionally, we achieve competitive performance for 3D semantic segmentation on ShapeNet Parts. Further analysis shows that VointNet improves robustness to rotation and occlusion compared to other methods.
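To make the representation described in the abstract concrete, the following is a minimal sketch, not the authors' VointNet. It assumes a Voint cloud can be stored as an array of shape (N points, V views, C features), and it adds a per-view visibility mask as an illustrative assumption (the abstract does not specify one). The function name `pool_voints` is hypothetical, and a simple masked max-pool over the view axis stands in for the learned operations the paper describes.

```python
import numpy as np

def pool_voints(features, visible):
    """Masked max-pool over the view axis of a toy Voint cloud.

    features: (N, V, C) array, one C-dim feature per point per view.
    visible:  (N, V) boolean array (an assumption of this sketch),
              True where the point is seen from that view.
    Returns an (N, C) array; points seen from no view get zeros.
    """
    features = np.asarray(features, dtype=float)
    visible = np.asarray(visible, dtype=bool)
    # Exclude unseen views from the max by setting them to -inf.
    masked = np.where(visible[:, :, None], features, -np.inf)
    pooled = masked.max(axis=1)
    # Replace rows that were unseen in every view with zeros.
    seen = visible.any(axis=1)
    return np.where(seen[:, None], pooled, 0.0)

# Toy example: 3 points, 2 views, 2 feature channels.
features = np.array([
    [[1.0, 5.0], [3.0, 2.0]],
    [[4.0, 0.0], [9.0, 9.0]],
    [[7.0, 7.0], [8.0, 8.0]],
])
visible = np.array([
    [True, True],    # seen from both views
    [True, False],   # seen from the first view only
    [False, False],  # seen from no view
])
pooled = pool_voints(features, visible)
print(pooled)
```

The pooled output has one feature vector per 3D point, which illustrates how per-view features can be reduced to a compact point-level representation; a learned aggregation would replace the fixed max in practice.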
Submission history
From: Abdullah Hamdi
[v1] Tue, 30 Nov 2021 13:08:19 GMT (18984kb,D)
[v2] Wed, 25 Jan 2023 16:26:04 GMT (25522kb,D)