Research on Multi-View Stereo Network Based on Self-Attention Mechanism

Wenkai Li; Jun Yu; Leilei Fan; Zhiyi Hu

doi:10.2478/ijanmc-2025-0021

.blurhash-client-img { display: none !important; }

Research on Multi-View Stereo Network Based on Self-Attention Mechanism

International Journal of Advanced Network, Monitoring and Controls

Volume 10 (2025): Issue 3 (September 2025)

By: Wenkai Li, Jun Yu, Leilei Fan and Zhiyi Hu

Open Access

|Sep 2025

Xie Qiqi. Multi-view 3D reconstruction based on MVSNet.Qinghai Normal University, 2024.
Search in Google Scholar Back to article
Shi Shuaijie. Research on 3D reconstruction technology for monocular vision based on voxels and point clouds. Harbin Institute of Technology, 2021.
Search in Google Scholar Back to article
Xie, Haozhe et al. "Pix2vox: Context-Aware 3d Reconstruction from Single and Multi-View Images", arXiv: Computer Vision and Pattern Recognition abs/1901.11153.1 (2019): 2690-2698.
Search in Google Scholar Back to article
Xie H, Yao H, Zhang S, et al. Pix2Vox++: Multi-scale Context-aware 3D Object Reconstruction from Single and Multiple Images. International Journal of Computer Vision, 2020, 128(12): 2919-2935.
Search in Google Scholar Back to article
Feng Yajuan. Research on MVS 3D Reconstruction Algorithm Based on Deep Learning. Shanxi University, 2023.
Search in Google Scholar Back to article
Yao, Luo et al. "Mvsnet: Depth Inference For Unstructured Multi-View Stereo", European Conference on Computer Vision 11212. (2018): 785-801.
Search in Google Scholar Back to article
Wang Siqi, Zhang Jiaqiang, Li Liyuan, Li Xiaoyan, Chen Fansheng Application of MVSNet in 3D reconstruction of spatial targets. China Laser: 1-18 [2022-12-24].
Search in Google Scholar Back to article
Yu Jingwei Research on Multi perspective Deep Estimation Methods Based on Deep Learning. Shenyang University of Technology, 2022.
Search in Google Scholar Back to article
Rui C, Songfang H, Jing X, Hao S, et al. Point-Based Multi-View Stereo Network[C], IEEE International Conference on Computer Vision, 2019, 2019(1): 1538-1547.
Search in Google Scholar Back to article
N. Wang, Y. Zhang, Z. Li, et al. Pixel2mesh: Generating 3d mesh models from single rgb images[C]//Proceedings of the European Conference on Computer Vision (ECCV), 2018: 52-67.
Search in Google Scholar Back to article
Liu, Ze et al. “Swin Transformer: Hierarchical Vision Transformer using Shifted Windows.” 2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2021): 9992-10002.
Search in Google Scholar Back to article
Dong, Hao-Chen and Jian Yao. “PatchMVSNet: Patch-wise Unsupervised Multi-View Stereo for Weakly-Textured Surface Reconstruction.”ArXiv abs/2203.02156 (2022)
Search in Google Scholar Back to article
Li, Chenhuan et al. “R3D-SWIN: Use Shifted Window Attention for Single-View 3D Reconstruction.” ArXiv abs/2312.02725 (2023)
Search in Google Scholar Back to article
Dosovitskiy, Alexey et al. “An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale.” ArXiv abs/2010.11929 (2020)
Search in Google Scholar Back to article
Zhang, Xudong et al. “Long-range Attention Network for Multi-View Stereo.” 2021 IEEE Winter Conference on Applications of Computer Vision (WACV) (2021): 3781-3790.
Search in Google Scholar Back to article
Yu, Zheng-Lun et al. “ACR: Attention Collaboration-based Regressor for Arbitrary Two-Hand Reconstruction.” 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023): 12955-12964.
Search in Google Scholar Back to article

Authors

Metrics

Articles in this issue

DOI: https://doi.org/10.2478/ijanmc-2025-0021 | Journal eISSN: 2470-8038

Journal RSS Feed

Language: English

Page range: 1 - 10

Published on: Sep 30, 2025

Published by: Xi’an Technological University

In partnership with: Paradigm Publishing Services

Publication frequency: 4 issues per year

Keywords:

MVSNet,

Related subjects:

Computer sciences, other

© 2025 Wenkai Li, Jun Yu, Leilei Fan, Zhiyi Hu, published by Xi’an Technological University
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.

Volume 10 (2025): Issue 3 (September 2025)