Training Set: this data set is used to adjust the weights on the neural network.
Validation Set: this data set is used to minimize over-fitting. You're not adjusting the weights of the network with this data set, you're just verifying that any increase in accuracy over the training data set actually yields an increase in accuracy over a data set that has not been shown to the network before, or at least the network hasn't trained on it (i.e. validation data set). If the accuracy over the training data set increases, but the accuracy over then validation data set stays the same or decreases, then you're over-fitting your neural network and you should stop training.
Testing Set: this data set is used only for testing the final solution in order to confirm the actual predictive power of the network.
本文深入解析了神经网络训练、验证与测试集的用途与区别,通过实例说明如何有效利用这些数据集避免过拟合,确保模型在未知数据上的泛化能力。
18

被折叠的 条评论
为什么被折叠?



