【KD】Correlation Congruence for Knowledge Distillation

moonuke

Published 2019-09-30 18:11:58

Category: Knowledge Distillation

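Copyright notice: This is an original article by the author, released under the CC 4.0 BY-SA license. Reposts must include a link to the original source and this notice.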

Article link: https://blog.youkuaiyun.com/qq_36269513/article/details/101699194

Paper: Correlation Congruence for Knowledge Distillation

1, Motivation:

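Conventional KD matches the teacher and student one instance at a time and does not take the intra-class and inter-class distribution of the teacher's feature space into account, so the student model also ends up lacking the intra-class and inter-class distribution properties we want.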

Usually, the embedding space of teacher possesses the characteristic that intra-class instances cohere together while inter-class instances separate from each other. But its counterpart of student model trained by instance congruence would lack such desired characteristic.

2, Contribution:

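Proposes Correlation Congruence Knowledge Distillation (CCKD), which attends not only to instance congruence but also to correlation congruence. (Instance congruence is realized through PK sampling or clustering over the mini-batch; correlation congruence is realized by constraining a correlation loss between samples i and j.)
Turns the pairwise correlation computation inside a mini-batch into a single large matrix operation over the whole mini-batch, which reduces the computation cost.
Uses different mini-batch sampler strategies.
Runs experiments on CIFAR-100, ImageNet-1K, person re-identification, and face recognition.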
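
To make the "one large matrix" idea concrete, here is a minimal NumPy sketch. It is only an illustration under stated assumptions: cosine similarity stands in for the correlation measure (the paper may use a different kernel), and the function names `correlation_matrix` and `correlation_congruence_loss` as well as the random embeddings are hypothetical, not taken from the paper or any released code.

```python
import numpy as np


def correlation_matrix(features):
    """Pairwise cosine similarity for every sample pair (i, j) in a mini-batch.

    Assumption: cosine similarity is used as the correlation measure.
    The whole n x n matrix comes from one matrix product, no Python loop.
    """
    # L2-normalise each row so a dot product equals cosine similarity.
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    normalized = features / np.clip(norms, 1e-12, None)
    return normalized @ normalized.T  # shape (n, n)


def correlation_congruence_loss(teacher_feats, student_feats):
    """Mean squared difference between teacher and student correlation matrices."""
    n = teacher_feats.shape[0]
    diff = correlation_matrix(teacher_feats) - correlation_matrix(student_feats)
    return float(np.sum(diff ** 2) / (n * n))


# Hypothetical mini-batch of 8 samples; the two embedding sizes may differ
# because only the n x n correlation matrices are compared.
rng = np.random.default_rng(0)
teacher = rng.normal(size=(8, 64))
student = rng.normal(size=(8, 32))

loss_random = correlation_congruence_loss(teacher, student)
loss_same = correlation_congruence_loss(teacher, teacher)
print(loss_random, loss_same)
```

Because the loss compares only the n x n correlation matrices, the teacher and student embeddings do not need the same dimensionality. In a full training setup this term would presumably be added, with some weight, to the usual cross-entropy and instance-level KD losses.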

3, Paper framework:
