National Key Research and Development Program of China (No.2016YFB1000400);Harbin Outstanding Youth Talents Fund of Heilongjiang Province (No.2017RAYXJ016);Free Exploration Foundation for Central Universities (No.HEUCF170605);National Natural Science Foundation of China (No.61573284)
RGB-D Scene Parsing Based on Spatial Structured Inference Deep Fusion Networks[J]. Acta Electronica Sinica, 2018, 46(5): 1253-1258.
DOI:
RGB-D Scene Parsing Based on Spatial Structured Inference Deep Fusion Networks[J]. Acta Electronica Sinica, 2018, 46(5): 1253-1258. DOI: 10.3969/j.issn.0372-2112.2018.05.035.
RGB-D Scene Parsing Based on Spatial Structured Inference Deep Fusion Networks
In order to make up the drawbacks that convolutional neural networks lack the ability of spatial structured learning in RGB-D scene parsing
we propose spatial structured inference deep fusion networks (SSIDFNs) on the basis of deep learning
the embedded structural inference layer organically combines conditional random fields (CRFs) and spatial structured inference model
which is able to learn the three-dimensional spatial distributions of objects and three-dimensional spatial relationships among objects in a more comprehensive and accurate way.Furthermore
the feature fusion layer takes both advantages of deep belief networks and improved CRFs
which is able to achieve deep structured learning according to the comprehensive semantic information of objects and semantic correlation in formation among objects.The experimental results demonstrate that the proposed SSIDFNs achieve the best mean accuracy 53.8% and 54.6% on the standard RGB-D datasets NYUDv2 and SUNRGBD respectively
which will be helpful to implement intelligent computer vision tasks
such as robot task planning and self-driving cars.