Caffe 损失层中loss_weight 如何存储？

最新推荐文章于 2018-07-02 15:06:01 发布

maocaisheng

最新推荐文章于 2018-07-02 15:06:01 发布

阅读量3.1k

点赞数 2

CC 4.0 BY-SA版权

分类专栏： caffe学习

本文链接：https://blog.youkuaiyun.com/u012938704/article/details/71708885

caffe学习专栏收录该内容

9 篇文章

订阅专栏

本文详细介绍了在Caffe框架中如何为损失层设置loss weight，并解释了loss weight的具体存储方式。通过源码解析，展示了在训练过程中如何初始化和设置loss weight。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

一个网络中如果存在多个损失层的话，需要给每个损失层加上loss_weight参数，不加的话默认为1.0。
但是loss_weight如何存储的呢？

这里我是从ContrastiveLossLayer::Backward_cpu中发现的：

const Dtype sign = (i == 0) ? 1 : -1;
const Dtype alpha = sign * top[0]->cpu_diff()[0] /
      static_cast<Dtype>(bottom[i]->num());

其中top[0]->cpu_diff()[0]保存的即为该层的loss_weight。

训练时函数调用如下：

这里写图片描述

在所有层的父类layer.hpp中会执行下列操作：

void SetUp(const vector<Blob<Dtype>*>& bottom,
      const vector<Blob<Dtype>*>& top) {
    InitMutex();
    CheckBlobCounts(bottom, top);
    LayerSetUp(bottom, top);
    Reshape(bottom, top);
    SetLossWeights(top);
  }

先执行完LayerSetUp和Reshape的初始化操作，调用了SetLossWeights，其中caffe_set(count, loss_weight, loss_multiplier);将loss_weight赋值给top[0]->cpu_diff()。

/**
  * Called by SetUp to initialize the weights associated with any top blobs in
  * the loss function. Store non-zero loss weights in the diff blob.
  */
 inline void SetLossWeights(const vector<Blob<Dtype>*>& top) {
   const int num_loss_weights = layer_param_.loss_weight_size();
   if (num_loss_weights) {
     CHECK_EQ(top.size(), num_loss_weights) << "loss_weight must be "
         "unspecified or specified once per top blob.";
     for (int top_id = 0; top_id < top.size(); ++top_id) {
       const Dtype loss_weight = layer_param_.loss_weight(top_id);
       if (loss_weight == Dtype(0)) { continue; }
       this->set_loss(top_id, loss_weight);
       const int count = top[top_id]->count();
       Dtype* loss_multiplier = top[top_id]->mutable_cpu_diff();
       caffe_set(count, loss_weight, loss_multiplier);
     }
   }
 }