17.Interpretable Adversarial Perturbation in Input Embedding Space for Text

该文章对本人博客论文5.Adversarial Training Methods for Semi-Supervised Text Classification做了进一步的解释实验。
17.Interpretable Adversarial Perturbation in Input Embedding Space for Text

论文5针对NLP中半监督问题提出了解决的方案。
1.对抗训练是一种数剧增强的方法，即在原数据基础上求其一个最大的扰动，得到其对抗样本。
17.Interpretable Adversarial Perturbation in Input Embedding Space for Text

2.虚拟对抗是针对半监督学习，对于没有标签的那一部分，根据KL散度，生成其对抗样本，并将二者的KL散度作为目标函数。

本文将r的搜索空间限制在一个线性空间上，得到其真实的对抗样本，即该对抗样本可以对应具体的自然语言。
17.Interpretable Adversarial Perturbation in Input Embedding Space for Text

通过文本分类任务和文本标注任务可以看到，该方法和原来方法效果差不多，即可用得到真实的对抗样本解释原论文中的对抗样本。

17.Interpretable Adversarial Perturbation in Input Embedding Space for Text

相关推荐