Zeng等人2019 CVPR论文：金字塔上下文编码器网络提升高质量图像修复

PDF文件

下载需积分: 45 | 1.05MB | 更新于2024-08-26 | 75 浏览量 | 举报收藏

立即下载

《Zeng_Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting (CVPR 2019)》是一篇在计算机视觉领域的重要论文，由Yanhong Zeng等人在2019年的CVPR会议上发表。该研究专注于解决高质量图像修复（image inpainting）问题，即如何在损坏的图像区域填充合理且连贯的内容，使得修复后的图像看起来自然且不失真。论文的核心贡献是提出了一种名为Pyramid-Context Encoder Network (PEN-Net)的方法。该网络设计巧妙地结合了多尺度金字塔结构与上下文编码器（context encoder），旨在捕捉图像的全局和局部信息，从而生成更为逼真的修复结果。PEN-Net的主要特点是它能够同时考虑视觉一致性和区域上下文，而不仅仅是简单的像素复制或仅依赖于区域上下文生成新内容。论文中展示了PEN-Net在各种复杂场景下的出色表现，包括建筑立面（facades）、自然景观、人脸以及纹理等。通过比较修复前后，可以明显看出PEN-Net生成的结果不仅填充了缺失的部分，而且保持了整体图像的连贯性和细节一致性。例如，修复后的图像中，建筑物的边缘平滑过渡，人脸表情自然，纹理匹配得当，这些都是高质量图像修复的重要指标。在技术细节上，PEN-Net采用了递归的金字塔架构，这允许模型在不同尺度上处理图像，从大到小逐渐细化修复。同时，上下文编码器部分负责理解周围区域的语义信息，生成与周围环境协调的补丁。整个过程可能涉及到深度学习的卷积神经网络（CNN）模块，如U-Net或变分自编码器（VAE），以及一些强化学习或生成对抗网络（GAN）的技巧来优化生成结果。总结来说，这篇论文对图像修复领域的技术进步做出了重要贡献，展示了利用深度学习技术在保持图像完整性的同时，提升修复质量的可能性。通过阅读和理解这篇论文，研究人员和从业者能了解到如何更好地融合多尺度信息和上下文理解，以达到更高的图像修复精度，这对于数字图像处理、图像修复软件开发以及视觉内容生成等领域具有实际应用价值。

Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting

Yanhong Zeng

1,2∗

, Jianlong Fu

, Hongyang Chao

1,2

, Baining Guo

School of Data and Computer Science, Sun Yat-sen University, Guangzhou, P.R. China

The Key Laboratory of Machine Intelligence and Advanced Computing (Sun Yat-sen University),

Ministry of Education, Guangzhou, P.R. China

Microsoft Research, Beijing, P.R. China

[email protected], {jianf,bainguo}@microsoft.com, [email protected]

Figure 1: High-quality image inpainting results generated by the proposed Pyramid-context ENcoder Network (PEN-Net).

In each pair, the left is a damaged image masked in white, and the right is the result of image inpainting. PEN-Net shows

excellent performance on a variety of images, including facades, natural scene, face and texture. [Best viewed in color]

Abstract

High-quality image inpainting requires ﬁlling missing

regions in a damaged image with plausible content. Exist-

ing works either ﬁll the regions by copying image patches or

generating semantically-coherent patches from region con-

text, while neglect the fact that both visual and semantic

plausibility are highly-demanded. In this paper, we pro-

pose a Pyramid-context ENcoder Network (PEN-Net) for

image inpainting by deep generative models. The PEN-Net

is built upon a U-Net structure, which can restore an image

by encoding contextual semantics from full resolution input,

and decoding the learned semantic features back into im-

ages. Speciﬁcally, we propose a pyramid-context encoder,

which progressively learns region afﬁnity by attention from

∗

This work was performed when the ﬁrst author was visiting Microsoft

Research as a research intern.

a high-level semantic feature map and transfers the learned

attention to the previous low-level feature map. As the miss-

ing content can be ﬁlled by attention transfer from deep to

shallow in a pyramid fashion, both visual and semantic co-

herence for image inpainting can be ensured. We further

propose a multi-scale decoder with deeply-supervised pyra-

mid losses and an adversarial loss. Such a design not only

results in fast convergence in training, but more realistic re-

sults in testing. Extensive experiments on various datasets

show the superior performance of the proposed network.

1. Introduction

Image inpainting aims at ﬁlling missing pixels in a dam-

aged image given a corresponding mask [

2]. This task has

drawn great attention and become a valuable and active re-

search topic for decades [

5, 12, 17], because high-quality

1486

下载后可阅读完整内容，剩余8页未读，继续阅读

开通会员，免费下载（低至0.43元/天)

成为会员后, 你将解锁

下载资源随意下

优质VIP博文免费学

优质文库回答免费看

付费资源9折优惠

C153123shj_2

粉丝: 0

Zeng等人2019 CVPR论文：金字塔上下文编码器网络提升高质量图像修复

Pyramid-Attention-Networks:具有多个图像恢复任务的新SOTA结果的“用于图像恢复的金字塔注意力网络”的PyTorch代码

计算机视觉 顶会 CVPR_2019_全部论文开源代码链接汇总.csv

arm-zeng_6.5_x86_64-linux-gnueabi_by_zenghc.tar.xz

nave-zeng-windows-me-source-code-joke__3-1788-windows source code

tu-xiang-zeng-qiang---.rar_tu

tu_xiang_zeng_qiang.rar_alphabetxgs_eager5zj_matlab_tu_理想低通

mcm2004_b_li_gu_zeng.pdf

镁镀层_英文论文_2014-2018年_谷歌学术

jpeg_torbu_zeng.tar.bz2

vsftpd-2.2.2-27.1.x86_64.rpm

最新资源

计算机视觉顶会 CVPR_2019_全部论文开源代码链接汇总.csv