收藏切换
Generative model based unsupervised multi-view stereo network
收藏切换
PDF
Yuxuan PAN1, Rui JIN1, Yu LIU1, Lin ZHANG1, 2
Journal of Graphics | 2026, 47(1) : 29 - 38
Less
收藏切换
Journal of Graphics | 2026, 47(1): 29-38
Image Processing and Computer Vision
Generative model based unsupervised multi-view stereo network
Full
Yuxuan PAN1, Rui JIN1, Yu LIU1, Lin ZHANG1, 2
Affiliations
  • 1 School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • 2 Beijing Big Data Center, Beijing 100086, China
Published: 2026-02-28 doi: 10.11996/JG.j.2095-302X.2026010029
Outline
收藏切换

Existing research on multi-view stereo scheme utilizes depth-estimation algorithms to achieve stereo representation by establishing a mapping relationship between the physical and digital worlds. Supervised learning-based neural networks have achieved accurate and high-fidelity 3D reconstruction results through training. However, in-the-wild visual reconstruction remains challenging due to the lack of rendered depth priors and wide-baseline characteristics of images. A novel system was proposed to obtain optimized depth for naturally collected multi-view images without prior information by applying an unsupervised learning network and semantically optimized Neural Radiation Field (NeRF) rendering. First, preliminary depth information for wild multi-view images were produced without ground truth based on unsupervised deep learning. Subsequently, in a separate NeRF module, a diffusion model was used to construct a surface semantic rendering loss, enabling a fine-grained volumetric representation. Experimental results on the benchmark dataset validated the performance of the proposed system by improving an average of 24.6% of the overall metrics, compared with other state-of-the-art schemes. A novel wild wide-baseline dataset was also applied to verify the generalization performance, and the proposed system reduced the reconstruction error by up to 40.8% compared with all methods.

unsupervised deep learning  /  multi-view stereo  /  3D reconstruction  /  neural radiation field  /  depth optimization
Yuxuan PAN, Rui JIN, Yu LIU, Lin ZHANG. Generative model based unsupervised multi-view stereo network[J]. Journal of Graphics, 2026 , 47 (1) : 29 -38 . DOI: 10.11996/JG.j.2095-302X.2026010029
  • National Key Research and Development Program of China(2023YFB2704500)
  • Beijing Natural Science Foundation(4222033)
Year 2026 volume 47 Issue 1
PDF
10
4
Cite this Article
BibTeX
Article Info
doi: 10.11996/JG.j.2095-302X.2026010029
  • Receive Date:2025-04-29
  • Online Date:2026-05-19
  • Published:2026-02-28
Article Data
Affiliations
History
  • Received:2025-04-29
  • Accepted:2025-06-28
Funding
National Key Research and Development Program of China(2023YFB2704500)
Beijing Natural Science Foundation(4222033)
Affiliations
    1 School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
    2 Beijing Big Data Center, Beijing 100086, China

Corresponding:

ZHANG Lin,E-mail:
References
Share
https://castjournals.cast.org.cn/joweb/txxb/EN/10.11996/JG.j.2095-302X.2026010029
Share to
QR

Scan QR to access full text

Cite this article
BibTeX
Citations
表12种不同金属材料的力学参数

Family
属数
Number of
genus
种数
Number of
species
占总种数比例
Percentage of
total species (%)

Genus
种数
Number of
species
占总种数比例
Percentage of total
species (%)
鹅膏菌科Amanitaceae 2 11 5.26 鹅膏菌属 Amanita 10 4.78
小菇科 Mycenaceae 2 12 5.74 丝盖伞属 Inocybe 5 2.39
多孔菌科 Polyporaceae 8 14 6.70 蜡蘑属 Laccaria 5 2.39
红菇科 Russulaceae 3 23 11.00 小皮伞属 Marasmius 6 2.87
小菇属 Mycena 11 5.26
光柄菇属 Pluteus 5 2.39
红菇属 Russula 17 8.13
栓菌属 Trametes 5 2.39
关闭全屏
  • BibTeX
  • EndNote
  • RefWorks
  • TxT