Fugu-MT 論文翻訳(概要): UniFusion: A Unified Image Fusion Framework with Robust Representation and Source-Aware Preservation

論文の概要: UniFusion: A Unified Image Fusion Framework with Robust Representation and Source-Aware Preservation

arxiv url: http://arxiv.org/abs/2603.14214v1
Date: Sun, 15 Mar 2026 04:07:06 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-17 16:19:35.6743
Title: UniFusion: A Unified Image Fusion Framework with Robust Representation and Source-Aware Preservation
Title（参考訳）: UniFusion:ロバスト表現とソース認識保存を備えた統合イメージ融合フレームワーク
Authors: Xingyuan Li, Songcheng Du, Yang Zou, HaoYuan Xu, Zhiying Jiang, Jinyuan Liu,
Abstract要約: We propose UniFusion, a unified image fusion framework to achieve cross-task generalization。融合出力と入力の整合性を維持するために再構成調整損失を導入する。複数の融合タスクにわたる実験は、UniFusionの優れた視覚的品質、一般化能力、現実のシナリオへの適応性を示す。
参考スコア（独自算出の注目度）: 18.352691348247294
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Image fusion aims to integrate complementary information from multiple source images to produce a more informative and visually consistent representation, benefiting both human perception and downstream vision tasks. Despite recent progress, most existing fusion methods are designed for specific tasks (i.e., multi-modal, multi-exposure, or multi-focus fusion) and struggle to effectively preserve source information during the fusion process. This limitation primarily arises from task-specific architectures and the degradation of source information caused by deep-layer propagation. To overcome these issues, we propose UniFusion, a unified image fusion framework designed to achieve cross-task generalization. First, leveraging DINOv3 for modality-consistent feature extraction, UniFusion establishes a shared semantic space for diverse inputs. Second, to preserve the understanding of each source image, we introduce a reconstruction-alignment loss to maintain consistency between fused outputs and inputs. Finally, we employ a bilevel optimization strategy to decouple and jointly optimize reconstruction and fusion objectives, effectively balancing their coupling relationship and ensuring smooth convergence. Extensive experiments across multiple fusion tasks demonstrate UniFusion's superior visual quality, generalization ability, and adaptability to real-world scenarios. Code is available at https://github.com/dusongcheng/UniFusion.
Abstract（参考訳）: Image fusionは、複数のソースイメージからの補完的な情報を統合して、より情報的かつ視覚的に一貫性のある表現を生成し、人間の知覚と下流の視覚タスクの両方に役立てることを目的としている。近年の進歩にもかかわらず、ほとんどの既存の融合法は特定のタスク(マルチモーダル、マルチエクスポージャー、マルチフォーカスフュージョン)のために設計されており、融合プロセス中にソース情報を効果的に保存するのに苦労している。この制限は、主にタスク固有のアーキテクチャと、ディープレイヤの伝搬によって引き起こされるソース情報の劣化から生じる。これらの課題を克服するために,マルチタスクの一般化を実現するための統合画像融合フレームワークUniFusionを提案する。まず、DINOv3をモダリティ一貫性のある特徴抽出に利用し、UniFusionは多様な入力のための共有意味空間を確立する。第2に、各ソースイメージの理解を維持するために、融合出力と入力の整合性を維持するために再構成調整損失を導入する。最後に, 2段階の最適化手法を用いて, 再構成と融合の目的を分離し, 協調的に最適化し, 結合関係を効果的にバランスさせ, 円滑な収束を確保する。複数の融合タスクにわたる大規模な実験は、UniFusionの優れた視覚的品質、一般化能力、現実のシナリオへの適応性を示している。コードはhttps://github.com/dusongcheng/UniFusion.comで入手できる。

論文の概要: UniFusion: A Unified Image Fusion Framework with Robust Representation and Source-Aware Preservation

関連論文リスト