Volumetric 3D Reconstruction with Window-wise Global Feature Aggregation
Shihao Ren (Tsinghua University); Yikang Ding (Tsinghua University); Jinli Liao (Tsinghua University); Xinghui Li (Tsinghua University); Jia Guo (None); Wensen Feng (the Shenzhen Graduate School, Tsinghua University, Shenzhen 518071, China); Xueqian Wang (Tsinghua University)
-
SPS
IEEE Members: $11.00
Non-members: $15.00
Volumetric 3D reconstruction methods have shown great performance in reconstructing indoor scenarios from monocular videos. However, as such approaches utilize discrete feature voxels to encode the observed scenes, the global feature interaction within and across different voxels is ignored, leading to imperfect reconstructions.To solve this problem, we propose a novel volumetric 3D reconstruction method named VolGARecon. The core portion of VolGARecon includes two parts: first, we use an MLP-based weighted fusion module (WFM) to unproject the extracted features to each voxel, which considers the visibility and is capable to reduce the noise caused by occlusion;second, a 3D transformer module (3DTR) is used to perform window-wise global feature interaction in a local sliding window, which strengthens the feature expression in 3D space and benefits estimating more complete and spatially coherent 3D models.In addition, we propose a multi-dimensional hybrid loss (MHL) that incorporates the 3D supervisions in classical volumetric methods and the 2D supervisions in novel view synthesis works.Extensive experiments show our method achieves superior performance on multiple datasets.Code will be made available.