Fugu-MT 論文翻訳(概要): FMRFusion: Frequency-Aware Multi-View Representation Learning for Heterogeneous Image Fusion

論文の概要: FMRFusion: Frequency-Aware Multi-View Representation Learning for Heterogeneous Image Fusion

arxiv url: http://arxiv.org/abs/2606.07985v1
Date: Sat, 06 Jun 2026 05:23:06 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-09 14:42:05.600302
Title: FMRFusion: Frequency-Aware Multi-View Representation Learning for Heterogeneous Image Fusion
Title（参考訳）: FMRフュージョン:不均一画像融合のための周波数対応多視点表現学習
Authors: Tao Zhoua, Yunlong Liu, Qinghui Chen, Zekai Zhang, Minlong Sun, Changlin Biana, Dagang Li, Wenmin Wang, Jinglin Zhang,
Abstract要約: FMRFusionは異種画像融合のための周波数認識型表現学習ネットワークである。識別構造を捉えるために, マルチスケール構造トラヒック認識モジュールが導入された。クロスビュー補間相互作用を具体化し、反射光情報と放射強度応答の相補的特性を明示的にモデル化し、融合させる。
参考スコア（独自算出の注目度）: 22.180711075004538
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Infrared and visible image fusion aims to generate a composite image that retains significant target information and preserves detailed textures, integrating two heterogeneous modalities. Previous image fusion methods typically adopt a single-module stacking approach to extract features from the two modalities. However, these approaches may result in incomplete learning of their distinct characteristics, thereby limiting the fusion effectiveness and constrain ing robustness in real-world heterogeneous data scenarios. To address these challenges, we propose FMRFusion, a frequency-aware multi-view representation learning network for Heterogeneous Image Fusion. A Multi-Scale Struc tural Perception Module is introduced to effectively capture discriminative structures, extracting fine-grained local structures and essential contextual information. A bilinear frequency decomposition mechanism is employed to sepa rate features into high-frequency and low-frequency components, enabling joint modeling of local details and global representations across different frequency domains. Moreover, a Cross-View Complementary Interaction is incorpo rated to explicitly model and fuse the complementary characteristics between reflected light information and radiative intensity responses, facilitating effective cross-view interaction. We further improve the Performance of the fused results by flow matching, which progressively refines the fused features by learning the transformation from coarse data to high-quality representations. Extensive experiments conducted on multiple benchmark datasets demonstrate that FMRFusion achieves superior and consistent performance across a range of fusion tasks, especially in nighttime scenarios
Abstract（参考訳）: 赤外線と可視光の融合は、重要なターゲット情報を保持し、詳細なテクスチャを保存し、2つの不均一なモダリティを統合する合成画像を生成することを目的としている。従来の画像融合法は、通常、2つのモードから特徴を抽出するために単一モジュール積み重ね方式を採用する。しかし、これらのアプローチは、それらの特徴を不完全に学習し、現実の異種データシナリオにおける融合の有効性と制約的堅牢性を制限する。これらの課題に対処するために、異種画像融合のための周波数対応多視点表現学習ネットワークであるFMRFusionを提案する。識別構造を効果的に捉え, きめ細かな局所構造と重要な文脈情報を抽出するために, マルチスケール・ストラクチャー・パーセプション・モジュールが導入された。双線形周波数分解機構を用いて、特徴を高周波および低周波成分に分解し、異なる周波数領域にわたる局所的詳細と大域的表現の連成モデリングを可能にする。さらに、クロスビュー補完相互作用を具体化し、反射光情報と放射強度応答の相補的特性を明示的にモデル化し、効果的にクロスビュー相互作用を促進する。粗いデータから高品質な表現への変換を学習することで、融合した特徴を段階的に洗練するフローマッチングにより、融合した結果の性能をさらに向上する。複数のベンチマークデータセットで実施された大規模な実験により、FMRFusionは、特に夜間シナリオにおいて、様々な融合タスクにおいて、優れた、一貫したパフォーマンスを達成することが示された。

論文の概要: FMRFusion: Frequency-Aware Multi-View Representation Learning for Heterogeneous Image Fusion

関連論文リスト