基于神经辐射场的稀疏视角三维重建方法
DOI:
CSTR:
作者:
作者单位:

1.新疆大学智能制造现代产业学院 乌鲁木齐 830017; 2.上海交通大学文化创意产业学院 上海 200240

作者简介:

通讯作者:

中图分类号:

TP391.4;TN.9

基金项目:

国家自然科学基金(52275003)、新疆维吾尔自治区重大科技专项(2023A03001)资助


Sparse perspective 3D reconstruction method based on neural radiation field
Author:
Affiliation:

1.College of Intelligent Manufacturing Modern Industry, Xinjiang University,Urumqi 830017, China; 2.School of Cultural and Creative Industries, Shanghai Jiao Tong University,Shanghai 200240, China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对神经辐射场在稀疏视角输入条件下渲染结果过于平滑,细节缺失严重等问题,提出一个基于信息关注抑制模块和双阶段损失的网络模型。首先,为解决细节缺失问题,提出一个信息关注抑制模块,该模块在全连接层各层之间采用特征向量归一化模块过滤权重异常值,并以残差网络级联全局信息和局部信息,最后利用通道注意力将将融合后的信息根据重要度进行区分,有效提高了采样点特征向量的准确性。然后,为了解决渲染结果过于平滑导致感知精度低的问题,设计了一种双阶段损失函数,将训练过程划分为两个阶段,粗阶段仅以RGB损失和深度损失指导训练,细阶段在此基础上还引入感知损失和全变分损失,通过渐进优化的方式,充分利用图片的高级特征,提升图像感知能力。本文算法与其他经典方法进行对比,在LLFF数据集上,定量结果表明,整体性能取得最优值,比次优算法性能提升1.9%,在DTU数据集上,定性结果显示,Scan37、Scan55和Scan63等场景重建的完整性和细节水平具有明显优势。

    Abstract:

    In order to address the issue of the neural radiation field rendering results being overly smooth when sparse viewpoint input conditions are present, resulting in a lack of detail, a network model based on an information attention suppression module and a two-stage loss function has been proposed. The first step is to propose an information attention suppression module, which uses a feature vector normalization module to filter outliers in the weights between layers of MLP. It also uses a residual network to cascade global and local information and employs channel attention to differentiate fused information based on its degree of importance. This process improves the accuracy of the sampling points′ feature vectors. To address the issue of low perceptual accuracy resulting from overly smooth rendering, a two-stage loss function is proposed. This function partitions the training phase into two stages. In the initial coarse stage, training is guided by RGB and depth loss. Subsequently, in the fine stage, perceptual loss and TV loss are incorporated. This approach enables the utilisation of high-level image features, thereby enhancing the image perception ability via gradual optimization. This paper′s algorithm is compared with other classical methods, and on the LLFF dataset, the quantitative results demonstrate that the overall performance reaches its optimal value, which is 1.9% superior to the performance of the sub-optimal algorithm. Furthermore, on the DTU dataset, the qualitative results indicate that the reconstruction′s completeness and detail level, as observed in Scan37, Scan55, and Scan63, are notably enhanced.

    参考文献
    相似文献
    引证文献
引用本文

张超,袁亮,肖文东,冉腾,吕凯.基于神经辐射场的稀疏视角三维重建方法[J].电子测量技术,2024,47(20):159-166

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2025-01-06
  • 出版日期:
文章二维码