Siamese Network final convolution operation 代码实现（tf, keras)

最新推荐文章于 2024-04-21 09:41:46 发布

wchenchen

最新推荐文章于 2024-04-21 09:41:46 发布

阅读量607

点赞数

CC 4.0 BY-SA版权

分类专栏： Object racking 文章标签： Siamese Keras 两层的输出卷积 Object tracking Network

本文链接：https://blog.youkuaiyun.com/u012129119/article/details/80267882

Object racking 专栏收录该内容

1 篇文章

订阅专栏

本文介绍了一个特定的卷积操作实现，该操作应用于两个不同尺寸的输入张量，并详细解释了如何使用TensorFlow来完成这一过程。通过tf.nn.convolution函数进行卷积计算，并通过tf.map_fn函数将操作应用于所有输入样本。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

代码实现的功能为下图中最后的两层输出的卷积操作

这里写图片描述

def cross_correlation(inputs):   #([None,22,22,128],[None,6,6,128])
    x = inputs[0]     #[22,22,128]
    #print(x.shape.as_list())
    x = tf.reshape(x, [1] + x.shape.as_list())  #[1,22,22,128]
    z = inputs[1]     #[6,6,128]
    z = tf.reshape(z, z.shape.as_list() + [1])   #[6,6,128,1]
    #print(x.shape.as_list())
    #print(z.shape.as_list())
    #return tf.nn.convolution(x, z, padding='VALID', strides=(1,1))
    a= tf.nn.convolution(x, z, padding='VALID', strides=(1,1))   #[1,17,17,1]
    return a

def x_corr_map(inputs):
    # Note that dtype MUST be specified, 
    # otherwise TF will assert that the input and output structures are the same,
    # which they most certainly are NOT.
    return K.reshape(tf.map_fn(cross_correlation, inputs, dtype=tf.float32, infer_shape=False), shape=(-1,17,17))   
    # [None,17,17]

def x_corr_layer():
    return Lambda(x_corr_map, output_shape=(17,17))

x=x_corr_layer()([s_output,t_output])  
print(x.shape.as_list())    #[None,17,17]

代码解释

s_output 为搜索区域 branch 输出 size: (None, W1, H1, C1) e.g. (None, 22, 22, 128)
t_output为目标区域branch输出 size: (None, W2, H2, C2) e.g. (None, 6, 6, 128)
层x_corr_layer()接受输入 inputs [s_output, t_output]
tf.map_fn 实现将 inputs 在张量第一维上展开口应用到 cross_correlation 函数上。
每一步的输出都体现在代码后面。