When Does Label Smoothing Help?

最新推荐文章于 2025-05-26 01:34:11 发布

Adam婷

最新推荐文章于 2025-05-26 01:34:11 发布

阅读量3.4k

点赞数 1

CC 4.0 BY-SA版权

分类专栏： AI程序员算法机器学习深度学习神经网络论文研读观点

本文链接：https://blog.youkuaiyun.com/weixin_41697507/article/details/95095299

AI程序员同时被 3 个专栏收录

166 篇文章 ¥19.90 ¥99.00

订阅专栏

超级会员免费看

机器学习

161 篇文章 ¥19.90 ¥99.00

订阅专栏

超级会员免费看

算法

161 篇文章

订阅专栏

本文探讨了标签平滑如何提高多类神经网络的泛化和学习速度，以及它对模型校准的积极影响。研究发现，标签平滑通过减少过自信的预测并形成类别的紧密聚类，提高了模型的准确性。然而，它也降低了知识蒸馏的有效性，因为丢失了不同类别的实例间相似性的信息。此外，文章通过线性投影的可视化方法揭示了标签平滑对倒数第二层表示的影响。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

When Does Label Smoothing Help?

Rafael Müller, Simon Kornblith, Geoffrey Hinton

Google Brain
Toronto

rafaelmuller@google.com

Abstract

The generalization and learning speed of a multi-class neural network can often be significantly improved by using soft targets that are a weighted average of the hard targets and the uniform distribution over labels. Smoothing the labels in this way prevents the network from becoming over-confident and label smoothing has been used in many state-of-the-art models, including image classification, language translation and speech recognition. Despite its widespread use, label smoothi

了解本专栏