The study of monocular depth estimation is the basis of many vision tasks. Obtaining a depth map with clear edges and rich details from an image is important for subsequent tasks. Aiming at the problem that the current monocular depth estimation model cannot deeply integrate image semantic information and can not make good use of the edge information of image objects, the superpixel topology relationship map is first constructed, and the graph neural network is used to extract the relationship between local edge information. The topological relationship graph with superpixels as nodes is obtained. Secondly, a joint model of depth estimation and semantic segmentation based on the encoder-decoder structure is constructed. By optimizing the joint objective function, the model can fuse edge semantic information, thereby improving the model"s ability to extract local structural information. Through experimental verification in the NYU-Depth V2 dataset, the results show that the model can construct a depth map with rich details and clear edges, which improves the quality of monocular depth visual estimation. Compared with other models, this model has certain advantages.
参考文献
相似文献
引证文献
引用本文
张玉亮,赵智龙,付炜平,等. 融合边缘语义信息的单目深度估计[J]. 科学技术与工程, 2022, 22(7): 2761-2769. Zhang Yuliang, Zhao Zhilong, Fu Weiping, et al. Integrating Spatial Semantic Information for Monocular Depth Estimation[J]. Science Technology and Engineering,2022,22(7):2761-2769.