PyTorch学习-卷积神经网络（高级篇）

GoogLeNet

PyTorch学习-卷积神经网络（高级篇）
观察具有相同结构的块，用函数封装起来，减少代码冗余。

Inception ：电影《盗梦空间》，重用
Concatenate ：拼接
Average Pooling：设置Padding，stride保证输入输出大小相同

$1\times 1$ Conv ：融合了不同通道相同位置的信息
$C\times W \times H => 1 \times W \times H$
作用：减少运算量
使用 $5\times 5$ Convolution
$[email protected] \times 28 => 32 @ 28\times28$
$5^2 \times 28^2 \times 192 \times 32 = 120422400$
使用 $1\times 1$ Convolution
$[email protected] \times 28 =>[email protected] \times 28 => 32 @ 28\times28$
$1^2 \times 28^2 \times 192 \times 16 + 5^2 \times 28^2 \times 16 \times 32 = 12433648$

Plain nets: stacking 3x3 conv layers

PyTorch学习-卷积神经网络（高级篇）

56-layer的效果比20-layer的差，可能是梯度消失或者过拟合
多个小于1的梯度相乘结果趋近于零，权重的更新（w = w - lr*g) 就几乎不更新。

Residual net

PyTorch学习-卷积神经网络（高级篇）
跳链接： $H(x) = F(x)+x$
求导为 $H'(x) = F'(x) + 1$ ，解决了梯度为零的问题

PyTorch学习-卷积神经网络（高级篇）

GoogLeNet

Plain nets: stacking 3x3 conv layers

Residual net

相关推荐