[论文阅读]CCPD|Towards End-to-End License Plate Detection and Recognition: A Large Dataset and Baseline

题目：Towards End-to-End License Plate Detectionand Recognition: A Large Dataset and Baseline
作者：Zhenbo Xu ；Wei Yang；etc…

Motivation:
作者认为目前现有的数据集都比较小，于是作者整个大的CCPD，然后基于这个数据集又设计了一个检测模型。

Methods:
作者通过各种办法做了这个数据集，很丰富，很感人。
然后主要看作者设计的检测模型：The Roadside Parking Net (RPnet)
RPnet：
[论文阅读]CCPD|Towards End-to-End License Plate Detection and Recognition: A Large Dataset and Baseline
the detection module & the recognition module
the detection module来预测 $Cx,Cy,w,h$ ，损失函数为：

the recognition module经过ROI pooling层，预测车牌内容，损失函数为：
[论文阅读]CCPD|Towards End-to-End License Plate Detection and Recognition: A Large Dataset and Baseline
作者先在the detection module模型上单独做预训练，以保证后端ROI pooling 的有效性，这个给出的框才准确。

个人觉得这个不是端到端的训练，如果不对the detection module预训练，roi pooling 将在一定程度上失效。而且最后采用了7个全连接层分类器来预测车牌，计算量也是够了，是我的话会改用全卷积替换，更准确，更快。

experiment:
依然不多说，直接上图
[论文阅读]CCPD|Towards End-to-End License Plate Detection and Recognition: A Large Dataset and Baseline
作者开源的代码中图像直接将720x1160resize 到480x480然后在CCPD数据集上训练，所以效果也只有在CCPD数据集上显得比较好。感觉作者的数据制作非常感人，而模型的新颖程度一般。

【完结】

[论文阅读]CCPD|Towards End-to-End License Plate Detection and Recognition: A Large Dataset and Baseline

相关推荐