Paper Notes of CVPR-0085

本文介绍了一种基于深度生成模型的方法,通过稀疏输入(如不到6%的像素)实现高质量图像重建和编辑。该方法利用轮廓与纹理之间的统计关联,分为两部分:一是重建整体图像结构和颜色;二是恢复纹理细节。实验结果表明,此方法在保真度和效果上具有优越性。

Sparse, Smart Contours to Represent and Edit Images

Abstract

By GAN, reconstruct images with high quality and fidelity from sparse input, e.g., comprising less than 6% of image pixels.

Introduction

This paper proposed a new method based on deep generative models to resolve the conflict between high fidelity and high sparsity. They trained the model to hallucinate appropriately according to the correlation between contours and textures. For instance, knowing that a contour map is of a person’s face, the model can fill in the details of hairs and facial expression based on the statistical correlation trained on a set of facial images. They divided the work into two parts. One is to reconstruct the overall image structures and colors. The other is to recover texture and finetuning.

Method

The model consists of two networks: LFN (“Low Frequency Network”) and HFN (“High Frequency Network”).

The LFN is trained with an L1L_1L1 pixel loss between the reconstructed output image and the ground-truth image. The HFN is conditioned on the sparse contours and the output of the LFN, and trained with a combination of piexl loss and an adversarial loss. Then they use a conditional discriminator to distinguish between the real image and the fake output of HFN.

5c74f2d5a22b3.jpg

In fact, the LFN and HFN is a convolutional encoder and decoder without connections between layers of the encoder and decoder. The discriminator is a combination of a “patch discriminator” and a branch of dilated convolution filters that better capture higher frequencies.

Experiments

In experiments, they show “reconstruction” and “editing” respectively. For evaluation, they use human reters firstly to distinguish the real and fake image. Then, they use FaceNet to test the extent to which their reconstructed faces capture the identity of a person. Finally, they use texture-loss to evaluate the quality of their synthesized texture compared to the source image. Meanwhile, they show comparisons with two baselines.

Conclusion

The authors proposed a method to reconstruct image with high fildety and fine texture from sparse source image (about 6% pixels). Their model include two parts: LFN and HFN based GAN. The former reconstruct paradoxical contours while the latter finetuning the texture in details. Quantity experiments show that their method has superiority in both fildety and effectiveness.

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值