Research on Multi-View Stereo Network Based on Self-Attention Mechanism

By: Wenkai Li, Jun Yu, Leilei Fan and Zhiyi Hu
Open Access | Sep 2025

Language: English
Page range: 1 - 10
Published on: Sep 30, 2025
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2025 Wenkai Li, Jun Yu, Leilei Fan, Zhiyi Hu, published by Xi’an Technological University
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.