| Issue |
Int. J. Simul. Multidisci. Des. Optim.
Volume 16, 2025
Multi-modal Information Learning and Analytics on Cross-Media Data Integration
|
|
|---|---|---|
| Article Number | 14 | |
| Number of page(s) | 21 | |
| DOI | https://doi.org/10.1051/smdo/2025015 | |
| Published online | 21 October 2025 | |
Research Article
Optimization algorithm for 3D image visual communication based on digital image reconstruction
1
School of Big Data and Computer, Hechi University, Yizhou, 546300 Guangxi, China
2
School of Artificial Intelligence, Nanning Vocational and Technical University, Nanning, 530000 Guangxi, China
3
College of Software, Henan University of Engineering, Zhengzhou, 450000 Henan, China
* e-mail: 04004@hcnu.edu.cn
Received:
25
June
2025
Accepted:
18
August
2025
Existing three-dimensional (3D) reconstruction methods have deficiencies in image depth estimation accuracy, texture mapping continuity, and illumination consistency, which lead to obstacles such as geometric distortion and texture fracture in the visual communication of 3D images, affecting the efficiency of information acquisition and immersive experience. To solve these problems, this paper proposes a 3D image visual communication optimization algorithm that integrates neural implicit modeling and a multi-scale visual perception mechanism. By jointly encoding the image depth map and Red-Green-Blue (RGB) map into the Neural Radiance Fields(NeRF) voxel hashing network, the continuity of spatial structure expression and the integrity of texture restoration are improved. Structural similarity constraints and perceptual consistency loss functions are introduced to enhance visual stability and subjective quality under different viewing angles, and context completion and detail enhancement of edge texture missing areas are achieved through graph neural networks. User evaluation results show that this method can shorten the average recognition time by up to 33.1% in all target recognition tasks, improve the average subjective immersion score by up to 58.2%, and reduce the Root Mean Square Error (RMSE) of depth reconstruction in occluded areas to 0.164 meters. The Structural Similarity Index Measure(SSIM) of high-frequency texture areas reached 0.871, and the Learned Perceptual Image Patch Similarity(LPIPS) was stable at 0.162 under ±45° viewing angle offset, which effectively improved the image's structural restoration quality and functional visual communication performance, and provided algorithm support for high-precision virtual expression in multiple scenarios.
Key words: Three-dimensional reconstruction / visual communication optimization / neural radiance fields / depth estimation network / texture enhancement
© Y. Qin et al., Published by EDP Sciences, 2025
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.
