Fugu-MT 論文翻訳(概要): Combined Dictionary Unfolding Network with Gradient-Adaptive Fidelity for Transferable Multi-Source Fusion

論文の概要: Combined Dictionary Unfolding Network with Gradient-Adaptive Fidelity for Transferable Multi-Source Fusion

arxiv url: http://arxiv.org/abs/2605.00461v1
Date: Fri, 01 May 2026 06:52:29 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-04 17:43:28.88566
Title: Combined Dictionary Unfolding Network with Gradient-Adaptive Fidelity for Transferable Multi-Source Fusion
Title（参考訳）: 転送可能な多ソース核融合のためのグラディエント適応忠実度付き複合辞書展開ネットワーク
Authors: Ge Luo, Jun-Jie Huang, Qi Yu, Tianrui Liu, Ke Liang, Yuming Xiang, Wentao Zhao, Xinwang Liu, Meng Wang,
Abstract要約: CDNetはマルチソース画像融合のための軽量複合辞書展開ネットワークである。マルチ露光画像融合、赤外・可視画像融合、医用画像融合、セマンティックセグメンテーションのための赤外・可視画像融合の4つの課題についてCDNetを評価する。
参考スコア（独自算出の注目度）: 51.66903887088365
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Deep Unfolding Network-based methods have emerged as effective solutions for multi-source image fusion by combining model-driven iterative optimization with data-driven deep learning. However, most existing deep unfolding image fusion methods are derived from alternating minimization, which updates the features of different modalities separately. This design introduces considerable computational and memory overhead, limiting deployment on resource-constrained edge devices. To address this issue, we propose CDNet, a lightweight Combined Dictionary Unfolding Network for multi-source image fusion. Rather than introducing a new sparse coding prior or empirically compressing an existing fusion network, CDNet translates the unique-common decomposition prior of coupled dictionary learning into a structurally constrained joint unfolding architecture. The resulting CDBlock follows a block-sparse interaction topology and performs a model-derived joint update of common and modality-specific representations, thereby streamlining feature learning and improving efficiency.In addition, we design a compact High- and Low-frequency Image Fidelity loss for unsupervised training without ground-truth images. We evaluate CDNet on four tasks, including multi-exposure image fusion, infrared and visible image fusion, medical image fusion, and infrared and visible image fusion for semantic segmentation. Experimental results show that CDNet achieves competitive or superior fusion performance with high efficiency. For infrared and visible image fusion, CDNet outperforms competing methods on four of six metrics on the TNO dataset and five of six metrics on the RoadScene dataset. In particular, it surpasses the second-best method by 1.23 dB and 1.59 dB in PSNR on TNO and RoadScene, respectively.
Abstract（参考訳）: Deep Unfolding Networkベースの手法は、モデル駆動の反復最適化とデータ駆動のディープラーニングを組み合わせることで、マルチソース画像融合の効果的なソリューションとして登場した。しかし、既存のディープ展開画像融合法のほとんどは、異なるモードの特徴を別々に更新する交互化最小化法に由来する。この設計では、計算とメモリのオーバーヘッドが大幅に増加し、リソース制約のあるエッジデバイスへのデプロイメントが制限される。この問題に対処するため,マルチソース画像融合のための軽量複合辞書展開ネットワークCDNetを提案する。既存の核融合ネットワークを事前にあるいは経験的に圧縮する新しいスパースコーディングを導入する代わりに、CDNetは結合辞書学習に先立って、一意の分解を構造的に制約された結合展開アーキテクチャに変換する。得られたCDBlockは,ブロックスパース相互作用トポロジに従って,共通表現とモダリティ表現のモデルによる共同更新を行い,特徴学習の合理化と効率の向上を実現した。マルチ露光画像融合、赤外・可視画像融合、医用画像融合、セマンティックセグメンテーションのための赤外・可視画像融合の4つの課題についてCDNetを評価する。実験の結果, CDNetは高い効率で高い核融合性能を達成できることがわかった。赤外線と可視画像の融合では、CDNetはTNOデータセットで6つのメトリクスのうち4つ、RoadSceneデータセットで6つのメトリクスのうち5つで競合する手法を上回っている。特に、TNOおよびRoadScene上のPSNRでは、それぞれ1.23dBと1.59dBの2番目に高い方法を上回っている。

論文の概要: Combined Dictionary Unfolding Network with Gradient-Adaptive Fidelity for Transferable Multi-Source Fusion

関連論文リスト