自动驾驶论文: DFNet: Semantic Segmentation on Panoramic Images with Dynamic Loss Weights and RFB

环视车位检测和车道线分割 DFNet: Semantic Segmentation on Panoramic Images with Dynamic Loss Weights and Residual Fusion Block
PDF: https://arxiv.org/pdf/1806.07226.pdf
PyTorch: https://github.com/shanglianlm0525/PyTorch-Networks

1 概述:

DFNet主要划分为三块: 基本模块(basic module)、特征提取模块(features extraction module)、细化模块(refinement module).

  • 1 选择Densenet作为基本模块(basic module);
  • 2 特征提取模块(features extraction module)由PSPNet提出的金字塔池模块(pyramid pooling module)后接卷积层和一个双线性上采样层组成.
  • 3 细化模块(refinement module)使用卷积层和池化层组成的残差融合块(residual fusion block, RFB) 减轻上采样带来的噪声干扰以及辨别处于类别边界上的点的归属.

2 网络结构

自动驾驶论文: DFNet: Semantic Segmentation on Panoramic Images with Dynamic Loss Weights and RFB

3 创新点

  • a 根据每个batch中的样本动态计算样本权重,权重计算公式如下:
    自动驾驶论文: DFNet: Semantic Segmentation on Panoramic Images with Dynamic Loss Weights and RFB
    其中, wiw_{i}是类别ww的权重, cc 是类别数, i (0,c)i~(0,c), ααββ 分别是wiw_{i}的上下界阈值,避免权重差异过大, NN 是batch中的全部像素数, nin_{i} 是类别ii的像素数,

when nin_{i} = 0, it means that the class i does not appear in this batch, we set the weight to 1. Because we need to increase the effect of small pixel number class on loss, so the smaller the nin_{i} , the larger the wi is. N and c are constant, wi is just changed by nin_{i} . When the nin_{i} is the average number, wiw_{i} is calculated to be 1/2, the multiplicative coefficient of 1/2 is also used to decrease the wiw_{i} of large pixel number of class.

  • b RFB中使用的结构(由卷积层和池化层组成), 实验表明结构(f)性能最好
    自动驾驶论文: DFNet: Semantic Segmentation on Panoramic Images with Dynamic Loss Weights and RFB

4 实验效果展示自动驾驶论文: DFNet: Semantic Segmentation on Panoramic Images with Dynamic Loss Weights and RFB