[Transfer Learning] Paper Notes: Self Paced Adversarial Training for Multimodal Few-shot Learning

  1. Paper: Self Paced Adversarial Training for Multimodal Few-shot Learning

2018-11-27, WACV-19: Self Paced Adversarial Training for Multimodal Few-shot Learning.

  1. Method
  2. Training a GAN on the source domain

A GAN is first trained on the source-domain data. Specifically, the GAN is set up as a StackGAN, i.e., a chain of l generators G that progressively produce images of increasing resolution. Formally, a StackGAN is structured as follows:

G_1 -> image_1 -> G_2 -> image_2 -> ... -> image_{l-1} -> G_l -> image_l
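To make the generator chain concrete, here is a minimal PyTorch sketch of stacked generators, assuming each stage simply doubles the resolution of the previous stage's output. The module names and layer choices are hypothetical; the actual StackGAN stages are deeper and also condition on a text embedding and noise.

```python
import torch
import torch.nn as nn

class StageGenerator(nn.Module):
    """One stage G_i: upsamples the previous stage's image (hypothetical minimal layout)."""
    def __init__(self, channels=3):
        super().__init__()
        self.net = nn.Sequential(
            nn.Upsample(scale_factor=2, mode="nearest"),          # double the resolution
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
            nn.Tanh(),
        )

    def forward(self, x):
        return self.net(x)

class StackedGenerator(nn.Module):
    """Chain of l generators: G_1 -> image_1 -> G_2 -> ... -> G_l -> image_l."""
    def __init__(self, num_stages=3):
        super().__init__()
        self.stages = nn.ModuleList(StageGenerator() for _ in range(num_stages))

    def forward(self, x):
        images = []
        for stage in self.stages:
            x = stage(x)        # each stage refines/upsamples the previous image
            images.append(x)    # keep intermediate images for stage-wise losses
        return images

# 64x64 seed image -> 128, 256, 512 resolution outputs
imgs = StackedGenerator(num_stages=3)(torch.randn(1, 3, 64, 64))
print([tuple(i.shape[-2:]) for i in imgs])  # [(128, 128), (256, 256), (512, 512)]
```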

  3. Training the GAN on the target domain

A new discriminator D' is produced by changing the number of neurons in the last layer of the StackGAN discriminator D from the original number of (source) classes to the number of new (target) classes.
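Below is a minimal sketch of this head replacement in PyTorch, assuming the discriminator exposes its final classification layer under an attribute named `fc` (a hypothetical layout; the actual StackGAN discriminator differs):

```python
import copy
import torch.nn as nn

def make_target_discriminator(d_source: nn.Module, num_target_classes: int) -> nn.Module:
    """Build D' by copying D and swapping its final classification layer."""
    d_target = copy.deepcopy(d_source)           # keep the pretrained source weights
    in_features = d_target.fc.in_features        # feature size feeding the old head
    d_target.fc = nn.Linear(in_features, num_target_classes)  # new, randomly initialized head
    return d_target

# Example with a toy discriminator whose head is `fc` (hypothetical layout):
class ToyD(nn.Module):
    def __init__(self):
        super().__init__()
        self.features = nn.Flatten()
        self.fc = nn.Linear(64, 200)   # 200 source classes
    def forward(self, x):
        return self.fc(self.features(x))

d_prime = make_target_discriminator(ToyD(), num_target_classes=50)
```

Since only the final layer is re-initialized, the features learned on the source domain in the earlier layers are carried over to the target domain.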

  4. Introducing text to assist classification

Each image is paired with roughly ten textual descriptions. Images are generated from these descriptions, and the discriminator D' ranks the generated images within each class; the top-ranked image is taken as a trusted image, added to the real images, and used together with them to train G and D'.
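The selection step could look like the following sketch, which ranks the generated candidates of each class by D' confidence and keeps the single top-ranked one. Function and variable names are hypothetical, and the real criterion follows the paper's self-paced schedule (per the settings below, exactly one generated image is chosen per iteration):

```python
import torch

@torch.no_grad()
def select_trusted_images(generated, labels, d_prime):
    """Rank each class's generated candidates by D' confidence; keep the top one.
    `generated`: (N, C, H, W) fake images; `labels`: (N,) their class ids."""
    probs = torch.softmax(d_prime(generated), dim=1)    # class probabilities from D'
    trusted_imgs, trusted_labels = [], []
    for c in labels.unique():
        idx = (labels == c).nonzero(as_tuple=True)[0]   # candidates of class c
        scores = probs[idx, c]                          # D' confidence for class c
        best = idx[scores.argmax()]                     # top-ranked candidate
        trusted_imgs.append(generated[best])
        trusted_labels.append(c)
    return torch.stack(trusted_imgs), torch.stack(trusted_labels)

# Demo with a stand-in D' that maps 32x32 RGB images to 50 class logits:
d_prime = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(3 * 32 * 32, 50))
fake = torch.randn(8, 3, 32, 32)
lbls = torch.randint(0, 50, (8,))
imgs, cls = select_trusted_images(fake, lbls, d_prime)
```

The trusted images returned here would then be appended to the real training set before the next update of G and D'.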


  1. Experiments
  2. Datasets

Two datasets are used: CUB, containing bird images, and Oxford-102, containing flower images.

The CUB dataset contains 11,788 images of 200 different bird species, which works out to roughly 59 images per class on average.

  3. Parameter settings

During representation learning, we train a StackGAN for 900 epochs. Similar to [38], we use Adam [10] for optimization. We set the learning rate to 2 × 10^-4 and the batch size to 24 for both G and D. In the initialization phase for self-paced learning, we construct D' by replacing the last layer of D with a linear softmax layer. The resulting network is then optimized using the cross-entropy loss function and an SGD optimizer with learning rate 10^-3 and momentum 0.5. Batch size is set to 32 and training proceeds for 20 epochs. Self-paced learning of D' continues to use the same settings (i.e., SGD with learning rate 10^-3 and momentum 0.5, minimizing a cross-entropy loss). Additionally, Adam's learning rate for G is reduced to 2 × 10^-5. In every iteration we choose exactly one generated image.
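The following sketch wires up these optimizer settings in PyTorch, with stand-in networks for G, D, and D' (the real models are the StackGAN CNNs; shapes here are placeholders):

```python
import torch
import torch.nn as nn

# Stand-in networks for the StackGAN generator G, discriminator D, and the
# adapted discriminator D' (hypothetical shapes; the real models are CNNs).
G = nn.Linear(100, 3 * 64 * 64)
D = nn.Linear(3 * 64 * 64, 200)
D_prime = nn.Linear(3 * 64 * 64, 50)

# Representation learning: Adam with lr = 2e-4 (batch size 24) for G and D.
opt_G = torch.optim.Adam(G.parameters(), lr=2e-4)
opt_D = torch.optim.Adam(D.parameters(), lr=2e-4)

# D' initialization and self-paced phases: SGD, lr = 1e-3, momentum 0.5
# (batch size 32), minimizing cross-entropy.
opt_Dp = torch.optim.SGD(D_prime.parameters(), lr=1e-3, momentum=0.5)
criterion = nn.CrossEntropyLoss()

# During self-paced learning, G's Adam learning rate is reduced to 2e-5.
for group in opt_G.param_groups:
    group["lr"] = 2e-5
```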

  4. Experimental results

[Figures: the paper's experimental results, comparing Finetuning and SPL-D'G, omitted.]

Finetuning refers to transferring the original GAN together with the plain discriminator D; it belongs to meta-learning and can learn features shared by the source and target domains.

SPL-D'G is the model proposed by the authors.

SPL stands for self-paced learning: trusted generated images are progressively added to the original training images and used to train the GAN.