读懂caffe模型

最新推荐文章于 2025-05-30 02:58:43 发布

原创最新推荐文章于 2025-05-30 02:58:43 发布 · 231 阅读

0 ·

CC 4.0 BY-SA版权

其他专栏收录该内容

7 篇文章

订阅专栏

本文详细介绍了Caffe模型的构建与配置方法，重点解释了卷积层中lr_mult和decay_mult参数的作用及设置方式，并给出了具体的配置示例。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

caffe 模型可视化网页网址：http://ethereon.github.io/netscope/#/editor
caffe模型是自底向上流的，所以bottom是前一层，top就是本层，输出层

卷积层

　　lr_mult: 學習率的係數，最終的學習率是這個數乘以solver.prototxt配置文檔中的base_lr。如果有兩個lr_mult, 則第一個表示權值的學習率，第二個表示偏置項的學習率。一般偏置項的學習率是權值學習率的兩倍
　　r_mult indicates what to multiply the learning rate by for a particular layer. This is useful if you want to update some layers with a smaller learning rate (e.g. when finetuning some layers while training others from scratch) or if you do not want to update the weights for one layer, decay_mult is the same, just for weight decay.

layer {
  name: "conv_att"
  type: "Convolution"
  bottom: "input_feature"
  top: "conv_att"
  param {
    lr_mult: 1 #kernel lr——mult
    decay_mult: 1
  }
  param {
    lr_mult: 2 # bias lr——mult
    decay_mult: 0 
  }
  convolution_param {
    num_output: 8  #输出feature map个数
    kernel_size: 1 # 卷积核大小
    weight_filler {
      type: "msra"
    }
    bias_filler {
      type: "constant"
    }
  }
}