Fugu-MT 論文翻訳(概要): DeepRFTv2: Kernel-level Learning for Image Deblurring

論文の概要: DeepRFTv2: Kernel-level Learning for Image Deblurring

arxiv url: http://arxiv.org/abs/2511.21132v1
Date: Wed, 26 Nov 2025 07:30:41 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-27 18:37:59.010661
Title: DeepRFTv2: Kernel-level Learning for Image Deblurring
Title（参考訳）: DeepRFTv2: 画像劣化のためのカーネルレベルの学習
Authors: Xintian Mao, Haofei Song, Yin-Nian Liu, Qingli Li, Yan Wang,
Abstract要約: ネットワークがデブロアを学ぼうとするなら、その曖昧なプロセスを理解するべきであることはよく知られている。本稿では,フリエカーネル推定器(FKE)を提案する。提案手法は,最先端の動作不良を解消し,他のカーネル関連問題に対処する可能性を示す。
参考スコア（独自算出の注目度）: 21.144472443267
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: It is well-known that if a network aims to learn how to deblur, it should understand the blur process. Blurring is naturally caused by the convolution of the sharp image with the blur kernel. Thus, allowing the network to learn the blur process in the kernel-level can significantly improve the image deblurring performance. But, current deep networks are still at the pixel-level learning stage, either performing end-to-end pixel-level restoration or stage-wise pseudo kernel-level restoration, failing to enable the deblur model to understand the essence of the blur. To this end, we propose Fourier Kernel Estimator (FKE), which considers the activation operation in Fourier space and converts the convolution problem in the spatial domain to a multiplication problem in Fourier space. Our FKE, jointly optimized with the deblur model, enables the network to learn the kernel-level blur process with low complexity and without any additional supervision. Furthermore, we change the convolution object of the kernel from ``image" to network extracted ``feature", whose rich semantic and structural information is more suitable to blur process learning. With the convolution of the feature and the estimated kernel, our model can learn the essence of blur in kernel-level. To further improve the efficiency of feature extraction, we design a decoupled multi-scale architecture with multiple hierarchical sub-unets with a reversible strategy, which allows better multi-scale encoding and decoding in low training memory. Extensive experiments indicate that our method achieves state-of-the-art motion deblurring results and show potential for handling other kernel-related problems. Analysis also shows our kernel estimator is able to learn physically meaningful kernels. The code will be available at https://github.com/DeepMed-Lab-ECNU/Single-Image-Deblur.
Abstract（参考訳）: ネットワークがデブロアを学ぼうとするなら、その曖昧なプロセスを理解するべきであることはよく知られている。ブラッシングは、シャープ画像とぼやけたカーネルとの畳み込みによって自然に発生する。これにより、カーネルレベルでのぼやけた処理をネットワークで学べるようにすることで、画像劣化性能を大幅に向上させることができる。しかし、現在のディープネットワークはまだピクセルレベルの学習段階にあり、エンド・ツー・エンドのピクセルレベルの復元や、ステージ単位の擬似カーネルレベルの復元を行うかのいずれかを行う。この目的のために、フーリエ空間における活性化操作を考慮し、空間領域における畳み込み問題をフーリエ空間における乗算問題に変換するフーリエカーネル推定器(FKE)を提案する。我々のFKEはデブロアモデルと共同で最適化されており、ネットワークは複雑さが低く、追加の監督なしにカーネルレベルのぼやけたプロセスを学ぶことができる。さらに、カーネルの畳み込みオブジェクトを `image' から "feature" へ変更する。この特徴と推定されたカーネルの畳み込みにより、我々のモデルはカーネルレベルの曖昧さの本質を学ぶことができる。特徴抽出の効率をさらに高めるため,低トレーニングメモリにおけるマルチスケール符号化と復号化を向上する,可逆的な戦略で複数の階層的なサブユニットを持つデカップリング型マルチスケールアーキテクチャを設計する。実験の結果,提案手法は最先端の動作不良を解消し,他のカーネル関連問題に対処できる可能性が示唆された。分析により、我々のカーネル推定器は物理的に意味のあるカーネルを学習できることを示している。コードはhttps://github.com/DeepMed-Lab-ECNU/Single-Image-Deblur.comから入手できる。

論文の概要: DeepRFTv2: Kernel-level Learning for Image Deblurring

関連論文リスト