论文阅读笔记之——《Deep Stacked Hierarchical Multi-patch Network for Image Deblurring》

最新推荐文章于 2025-06-07 00:30:00 发布

原创最新推荐文章于 2025-06-07 00:30:00 发布 · 3k 阅读

6 ·

CC 4.0 BY-SA版权

图像超分辨率重建同时被 3 个专栏收录

75 篇文章

订阅专栏

卷积神经网络

65 篇文章

订阅专栏

深度学习

63 篇文章

订阅专栏

本文聚焦非均匀图像去模糊，指出传统方法存在运行时间长、增加模型深度无法提升去模糊质量等问题。提出了类似空间金字塔匹配的端到端CNN分层模型DMPHN，打破了限制网络深度的因素，虽结构简单但有效，不过作者未发布源码。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

论文：https://arxiv.org/pdf/1904.03468v1.pdf

本文是做非均匀deblur的。The goal of non-uniform blind image deblurring is to remove the undesired blur caused by the camera motion and the scene dynamics

开篇作者抛出来两个问题：

1) Deconvolution/upsampling operations in the coarse-to-fine scheme result in expensive runtime;（时间长）

2) Simply increasing the model depth with finer-scale levels cannot improve the quality of deblurring.（简单的增加模型的深度是不可以提升performance的）

下面这张图是实时性与performance的图。其实在low-level vision问题上，当performance提到一定程度，大多数都会选择往参数量低或者运算量低的方向走

we present a deep hierarchical （分层的） multi-patch network inspired by Spatial Pyramid Matching （空间金字塔匹配） to deal with blurry images via a fine-to-coarse hierarchical representation.

Prior to the success of deep learning, conventional deblurring methods used to employ a variety of constraints or regularizations to approximate the motion blur filters, involving an expensive non-convex nonlinear optimization. Moreover, the commonly used assumption of spatially-uniform blur kernel is overly restrictive, resulting in a poor deblurring of complex blur patterns.

the “coarse-to-fine” scheme（“粗到细”方案）a sharp image is gradually restored at different resolutions in a pyramid.

we propose a simple yet effective multi-level CNN model called Deep Multi-Patch Hierarchical Network (DMPHN) which uses multi-patch hierarchy as input.

Feature aggregation （聚合） over multiple patches has been used in image classification

Contributions：

1、We propose an end-to-end CNN hierarchical model akin （相似） to Spatial Pyramid Matching (SPM) that performs deblurring in the fine-to-coarse grids thus exploiting multi-patch localized-to-coarse operations.

2、We identify the limitation to stacking depth of current deep deblurring models and introduce novel stacking approaches which overcome this limitation.（打破了限制网络深度的因素）

Conventional image deblurring methods fail to remove non-uniform motion blur due to the use of spatially-invariant （空间不变） deblurring kernel.

Recurrent Neural Network (RNN) is a popular tool employed in deblurring due to its advantage in sequential information processing.

整体的结构如下图2所示

we use the (1-2-4-8) model for illustration purposes. Notation (1-2-4-8) indicates the numbers of image patches from the coarsest to the finniest level

Note that the numbers of input and output patches at each level are different as the main idea of our work is to make the lower level focus on local information (finer grid) to produce residual information for the coarser gird (obtained by concatenating convolutional features).

论文的网络结构跟其他结构对比