

Title: Novel View Synthesis from only a 6-DoF Camera Pose by Two-stage Networks

Abstract: Novel view synthesis is a challenging problem in computer vision and robotics. Unlike existing works, which need reference images or 3D models of the scene to generate images under novel views, we propose a new paradigm for this problem: we synthesize the novel view directly from only a 6-DoF camera pose. Although this setting is the most straightforward one, few works address it. Our experiments nevertheless demonstrate that a concise CNN can yield a meaningful parametric model that reconstructs the correct scene images from the 6-DoF pose alone. To this end, we propose a two-stage learning strategy consisting of two consecutive CNNs: GenNet and RefineNet. GenNet generates a coarse image from a camera pose. RefineNet is a generative adversarial network that refines the coarse image. In this way, we decouple the geometric mapping from the rendering of texture detail. Extensive experiments conducted on public datasets demonstrate the effectiveness of our method. We believe this paradigm is of high research and application value and could be an important direction in novel view synthesis.
Comments: Accepted by International Conference on Pattern Recognition (ICPR 2020)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2010.11468 [cs.CV]
  (or arXiv:2010.11468v1 [cs.CV] for this version)
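
The two-stage data flow described in the abstract (pose to coarse image, coarse image to refined image) can be illustrated with a minimal, untrained sketch. This is not the authors' implementation: in the paper both stages are CNNs and RefineNet is trained adversarially, whereas here GenNet is stood in for by a single random linear layer and RefineNet by a fixed sharpening filter. The image size, the pose layout (x, y, z, roll, pitch, yaw), and every function name below are assumptions made for illustration only.

```python
import math
import random

H, W = 4, 4  # toy image size (assumption; real outputs are full-resolution images)


def make_gen_net(seed=0):
    """Build a stand-in for GenNet: one random linear layer plus a sigmoid.

    Hypothetical placeholder. The real GenNet is a trained CNN.
    """
    rng = random.Random(seed)
    # One weight row per output pixel, one weight per pose component.
    weights = [[rng.uniform(-1.0, 1.0) for _ in range(6)] for _ in range(H * W)]

    def gen_net(pose):
        if len(pose) != 6:
            raise ValueError("expected a 6-DoF pose: (x, y, z, roll, pitch, yaw)")
        flat = []
        for row in weights:
            z = sum(w * p for w, p in zip(row, pose))
            flat.append(1.0 / (1.0 + math.exp(-z)))  # squash to (0, 1)
        # Reshape the flat pixel list into H rows of W intensities.
        return [flat[r * W:(r + 1) * W] for r in range(H)]

    return gen_net


def refine_net(coarse, strength=0.5):
    """Stand-in for RefineNet: add a residual that sharpens the coarse image.

    Hypothetical placeholder. The real RefineNet is a GAN generator;
    this is a fixed unsharp mask so the sketch runs without training.
    """
    h, w = len(coarse), len(coarse[0])
    refined = []
    for r in range(h):
        row = []
        for c in range(w):
            # Mean over the 3x3 neighbourhood, clipped at the image border.
            neighbours = [
                coarse[rr][cc]
                for rr in (r - 1, r, r + 1)
                for cc in (c - 1, c, c + 1)
                if 0 <= rr < h and 0 <= cc < w
            ]
            blur = sum(neighbours) / len(neighbours)
            value = coarse[r][c] + strength * (coarse[r][c] - blur)
            row.append(min(1.0, max(0.0, value)))  # keep intensities in [0, 1]
        refined.append(row)
    return refined


def synthesize(pose, gen_net):
    """Run both stages in sequence: pose -> coarse image -> refined image."""
    return refine_net(gen_net(pose))


if __name__ == "__main__":
    gen = make_gen_net(seed=0)
    image = synthesize([0.1, -0.2, 0.3, 0.0, 0.5, -0.5], gen)
    for row in image:
        print(" ".join(f"{v:.2f}" for v in row))
```

The point of the sketch is the separation of concerns: the first stage is the only one that sees the pose, and the second stage operates purely in image space, which mirrors the decoupling of geometric mapping from texture detail described in the abstract.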

Submission history

From: Yuchao Dai Dr.
[v1] Thu, 22 Oct 2020 06:23:40 GMT (5501kb,D)
