Fugu-MT Paper Translation (Abstract): CoordFill: Efficient High-Resolution Image Inpainting via Parameterized Coordinate Querying

Paper summary: CoordFill: Efficient High-Resolution Image Inpainting via Parameterized Coordinate Querying

arxiv url: http://arxiv.org/abs/2303.08524v1
Date: Wed, 15 Mar 2023 11:13:51 GMT
Status: Translation complete
Last updated in system: 2023-03-16 14:10:54.552862
Title: CoordFill: Efficient High-Resolution Image Inpainting via Parameterized Coordinate Querying
Title (reference translation): CoordFill: High-Resolution Image Inpainting via Parameterized Coordinate Querying
Authors: Weihuang Liu, Xiaodong Cun, Chi-Man Pun, Menghan Xia, Yong Zhang, Jue Wang
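Abstract summary: In this paper, we try to break these limitations for the first time, thanks to the recent development of continuous implicit representation. Experiments show that the proposed method can process 2048$\times$2048 images in real time using a GTX 2080 Ti GPU.
Reference score (independently computed attention score): 52.91778151771145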
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Image inpainting aims to fill the missing hole of the input. It is hard to solve this task efficiently for high-resolution images, for two reasons: (1) A large receptive field must be handled for high-resolution image inpainting. (2) The general encoder and decoder network synthesizes many background pixels synchronously due to the form of the image matrix. In this paper, we try to break the above limitations for the first time thanks to the recent development of continuous implicit representation. In detail, we down-sample and encode the degraded image to produce the spatial-adaptive parameters for each spatial patch via an attentional Fast Fourier Convolution (FFC)-based parameter generation network. Then, we take these parameters as the weights and biases of a series of multi-layer perceptrons (MLPs), where the input is the encoded continuous coordinates and the output is the synthesized color value. Thanks to the proposed structure, we encode the high-resolution image only at a relatively low resolution to capture a larger receptive field. Then, the continuous position encoding helps to synthesize photo-realistic high-frequency textures by re-sampling the coordinates at a higher resolution. Also, our framework enables us to query only the coordinates of missing pixels, in parallel, yielding a more efficient solution than the previous methods. Experiments show that the proposed method achieves real-time performance on 2048$\times$2048 images using a single GTX 2080 Ti GPU and can handle 4096$\times$4096 images, with much better performance than existing state-of-the-art methods visually and numerically. The code is available at: https://github.com/NiFangBaAGe/CoordFill.
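Abstract (reference translation): Image inpainting aims to fill the missing hole of the input. For high-resolution images, it is difficult to solve this task efficiently for two reasons: (1) a large receptive field must be handled; (2) general encoder and decoder networks synthesize many background pixels synchronously because of the form of the image matrix. In this paper, thanks to the recent development of continuous implicit representation, we try to break these limitations for the first time. We down-sample and encode the degraded image with a Fast Fourier Convolution (FFC)-based parameter generation network to produce spatially adaptive parameters for each spatial patch. These parameters are then taken as the weights and biases of a series of multi-layer perceptrons (MLPs), whose input is the encoded continuous coordinates and whose output is the synthesized color value. With the proposed method, the high-resolution image is encoded only at a relatively low resolution, which enlarges the receptive field. The continuous position encoding then helps to synthesize photo-realistic high-frequency textures by re-sampling the coordinates at a higher resolution. The framework also makes it possible to query only the coordinates of missing pixels, in parallel, yielding a more efficient solution than previous methods. Experiments show that the proposed method processes 2048$\times$2048 images in real time using a GTX 2080 Ti GPU and can handle 4096$\times$4096 images. The code is available at https://github.com/NiFangBaAGe/CoordFill.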
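
The querying step described in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the parameter generation network is replaced by a hypothetical `fake_patch_params` that returns random weights, and the sin/cos encoding, the ReLU hidden layer, the layer sizes, and the use of global coordinates in [-1, 1] are all assumptions made for illustration. What it shows is the control flow only: each patch owns its own MLP weights and biases, and only coordinates under the mask are queried.

```python
import math
import random


def encode(u, v, num_freqs=4):
    """Sin/cos encoding of a continuous coordinate (assumed form, not from the paper)."""
    feats = []
    for k in range(num_freqs):
        freq = (2 ** k) * math.pi
        feats += [math.sin(freq * u), math.cos(freq * u),
                  math.sin(freq * v), math.cos(freq * v)]
    return feats  # length 4 * num_freqs


def linear(weights, bias, vec):
    """Dense layer: weights is a list of rows, one row per output unit."""
    return [sum(w * x for w, x in zip(row, vec)) + b
            for row, b in zip(weights, bias)]


def mlp(params, feats):
    """Two-layer MLP whose weights and biases are the per-patch parameters."""
    hidden = [max(0.0, h) for h in linear(params["w1"], params["b1"], feats)]
    return linear(params["w2"], params["b2"], hidden)  # three color channels


def fake_patch_params(grid, in_dim, hidden=8, seed=0):
    """Hypothetical stand-in for the parameter generation network.

    Returns random (untrained) parameters for each cell of a grid x grid
    patch layout; a real model would predict them from the degraded image.
    """
    rng = random.Random(seed)

    def mat(rows, cols):
        return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]

    return {(py, px): {"w1": mat(hidden, in_dim), "b1": [0.0] * hidden,
                       "w2": mat(3, hidden), "b2": [0.5] * 3}
            for py in range(grid) for px in range(grid)}


def inpaint(image, mask, params, grid, num_freqs=4):
    """Fill masked pixels by querying the MLP of the patch that contains them."""
    height, width = len(mask), len(mask[0])
    out = [row[:] for row in image]  # known pixels are copied, never recomputed
    queried = 0
    for y in range(height):
        for x in range(width):
            if not mask[y][x]:
                continue  # background pixel: no query issued
            patch = (y * grid // height, x * grid // width)
            u = 2.0 * (x + 0.5) / width - 1.0   # pixel center mapped to [-1, 1]
            v = 2.0 * (y + 0.5) / height - 1.0
            out[y][x] = mlp(params[patch], encode(u, v, num_freqs))
            queried += 1
    return out, queried
```

Because the coordinates are continuous, the same `params` could be queried on a finer pixel grid (a larger `height` and `width` with the same `grid`) without re-running the encoder, which is the property the abstract relies on for high-resolution output. The loop here is sequential for readability; the abstract describes the queries as running in parallel.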

Related papers