PSMNet:Pyramid Stereo Matching Network学习测试笔记03-如何训练网络

最新推荐文章于 2023-05-31 21:26:33 发布

原创最新推荐文章于 2023-05-31 21:26:33 发布 · 1.4k 阅读

3 ·

CC 4.0 BY-SA版权

学习笔记同时被 2 个专栏收录

15 篇文章

订阅专栏

PSMNet

7 篇文章

订阅专栏

本文深入探讨PSMNet立体匹配网络的工作原理及源码实现细节，讲解了网络在训练与测试阶段的不同预处理流程，包括图像裁剪与空白扩充等数据增强手段。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

写在前面的话：
2019年09月28日18:02:55补充说明：优快云博客发布版权更新，如果您看了博客并且用到PSMNet相关东西，请注明引用原作者的文章：

@inproceedings{chang2018pyramid,
title={Pyramid Stereo Matching Network},
author={Chang, Jia-Ren and Chen, Yong-Sheng},
booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
pages={5410–5418},
year={2018}
}

这里开始看PSMNet源码
数据集如何组织输入不看，只看这个网络做了哪些工作。

if self.training:  
           w, h = left_img.size
           th, tw = 256, 512
           x1 = random.randint(0, w - tw)
           y1 = random.randint(0, h - th)
           left_img = left_img.crop((x1, y1, x1 + tw, y1 + th))
           right_img = right_img.crop((x1, y1, x1 + tw, y1 + th))
           dataL = dataL[y1:y1 + th, x1:x1 + tw]
           processed = preprocess.get_transform(augment=False)  
           left_img   = processed(left_img)
           right_img  = processed(right_img)
           return left_img, right_img, dataL
        else:
           w, h = left_img.size
           print('WARRING:\tw = %d\th = %d' % (w, h))
           left_img = left_img.crop((w-960, h-544, w, h))
           right_img = right_img.crop((w-960, h-544, w, h))
           processed = preprocess.get_transform(augment=False)  
           left_img       = processed(left_img)
           right_img      = processed(right_img)
           return left_img, right_img, dataL