Fugu-MT 論文翻訳(概要): All-weather Multi-Modality Image Fusion: Unified Framework and 100k Benchmark

論文の概要: All-weather Multi-Modality Image Fusion: Unified Framework and 100k Benchmark

arxiv url: http://arxiv.org/abs/2402.02090v2
Date: Mon, 11 Nov 2024 12:11:40 GMT
ステータス: 翻訳完了
システム内更新日: 2024-11-28 17:07:30.559077
Title: All-weather Multi-Modality Image Fusion: Unified Framework and 100k Benchmark
Title（参考訳）: All-weather Multi-Modality Image Fusion: Unified Frameworkと100kベンチマーク
Authors: Xilai Li, Wuyang Liu, Xiaosong Li, Fuqiang Zhou, Huafeng Li, Feiping Nie,
Abstract要約: MMIF(Multi-modality Image fusion)は、異なる画像モダリティからの相補的な情報を組み合わせて、より包括的で客観的なシーン解釈を提供する。既存のMMIF法では、現実の場面で異なる気象干渉に抵抗する能力が欠如しており、自律運転のような実践的な応用では利用できない。この文脈で効率的なマルチタスクを実現するために,全天候MMIFモデルを提案する。実世界の場面と合成シーンの両方における実験結果から,提案アルゴリズムは細部回復と多モード特徴抽出に優れることが示された。
参考スコア（独自算出の注目度）: 42.49073228252726
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Multi-modality image fusion (MMIF) combines complementary information from different image modalities to provide a more comprehensive and objective interpretation of scenes. However, existing MMIF methods lack the ability to resist different weather interferences in real-world scenes, preventing them from being useful in practical applications such as autonomous driving. To bridge this research gap, we proposed an all-weather MMIF model. Achieving effective multi-tasking in this context is particularly challenging due to the complex and diverse nature of weather conditions. A key obstacle lies in the 'black box' nature of current deep learning architectures, which restricts their multi-tasking capabilities. To overcome this, we decompose the network into two modules: a fusion module and a restoration module. For the fusion module, we introduce a learnable low-rank representation model to decompose images into low-rank and sparse components. This interpretable feature separation allows us to better observe and understand images. For the restoration module, we propose a physically-aware clear feature prediction module based on an atmospheric scattering model that can deduce variations in light transmittance from both scene illumination and reflectance. We also construct a large-scale multi-modality dataset with 100,000 image pairs across rain, haze, and snow conditions, covering various degradation levels and diverse scenes to thoroughly evaluate image fusion methods in adverse weather. Experimental results in both real-world and synthetic scenes show that the proposed algorithm excels in detail recovery and multi-modality feature extraction. The code is available at https://github.com/ixilai/AWFusion.
Abstract（参考訳）: MMIF(Multi-modality Image fusion)は、異なる画像モダリティからの相補的な情報を組み合わせて、より包括的で客観的なシーン解釈を提供する。しかし、既存のMMIF法では、現実の場面で異なる気象干渉に抵抗する能力が欠如しており、自律運転のような実用的な応用では利用できない。この研究ギャップを埋めるため、我々は全天候MMIFモデルを提案した。この文脈で効果的なマルチタスクを実現することは、気象条件の複雑で多様な性質のために特に困難である。重要な障害は、現在のディープラーニングアーキテクチャの‘ブラックボックス’の性質にある。これを解決するために,ネットワークを融合モジュールと復元モジュールの2つのモジュールに分割する。融合モジュールに対しては,画像を低ランクかつスパースなコンポーネントに分解する,学習可能な低ランク表現モデルを導入する。この解釈可能な特徴分離により、イメージをよりよく観察し理解することができます。再生モジュールでは,光透過率の変動をシーン照明と反射の両方から推定できる大気散乱モデルに基づいて,物理的に認識可能な特徴予測モジュールを提案する。また, 降雨, ヘイズ, 積雪条件にまたがる画像ペア10万枚からなる大規模多モードデータセットを構築し, 各種劣化レベルと多様な場面を網羅し, 悪天候下での画像融合手法を徹底的に評価した。実世界の場面と合成シーンの両方における実験結果から,提案アルゴリズムは細部回復と多モード特徴抽出に優れることが示された。コードはhttps://github.com/ixilai/AWFusion.comで公開されている。

論文の概要: All-weather Multi-Modality Image Fusion: Unified Framework and 100k Benchmark

関連論文リスト