过学习 overfitting

最新推荐文章于 2022-10-17 10:30:50 发布

原创

最新推荐文章于 2022-10-17 10:30:50 发布 · 1.3k 阅读

0 ·

CC 4.0 BY-SA版权

过学习，也称为过度拟合，发生在模型过于复杂导致在独立验证数据上的预测误差显著高于校准数据时。理想的模型应在剩余干扰误差和估计误差之间取得平衡。模型的最优复杂度高度依赖于校准数据集的大小和质量。对于噪声大且数据量有限的集合，简单模型能防止过学习，而对噪声小的大数据集，更复杂的模型可能更优。寻找每个数据集的最优模型复杂度是一项挑战，涉及防止过拟合和欠拟合的问题，这不仅影响神经网络，也影响多种多元校准算法。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

过学习 Overfitting

【原文】http://www.frank-dieterle.de/phd/2_8_1.html

The best measure for the generalizing ability is the error of prediction of as many independent separate validation data as possible. According to figure 2 the error of prediction is composed of two main contributions, the remaining interference error and the estimation error [39]. The interference error is the systematic error (bias) due to unmodeled interference in the data, as the calibration model is not complex enough to capture all the interferences of the relationship between sensor responses and analytes. The estimation error is caused by modeling measured random noise of various kinds. The optimal prediction is obtained, when the remaining interference error and the estimation error balance ea