Deep learning III - II Machine Learning Strategy 2 - Mismatched training and dev/test set

Deep learning III - II Machine Learning Strategy 2 - Mismatched training and dev/test set 如何处理不相称的训练和测试集


深度学习需要大量的数据来供其训练学习

  • 当你得到的金标准数据数量较少时(如10,000份),但你可以得到类似的大量数据数据(如200,000份),这时不应该将两部分数据shuffle然后随机分配;应该拿出5,000份金标准数据与那200,000份数据合并为train set,剩余的5,000份分成dev set 和 test set。

    Deep learning III - II Machine Learning Strategy 2 - Mismatched training and dev/test set
    Deep learning III - II Machine Learning Strategy 2 - Mismatched training and dev/test set