Fugu-MT paper translation (abstract): FlexiD-Fuse: Flexible number of inputs multi-modal medical image fusion based on diffusion model

Paper overview: FlexiD-Fuse: Flexible number of inputs multi-modal medical image fusion based on diffusion model

arxiv url: http://arxiv.org/abs/2509.09456v1
Date: Thu, 11 Sep 2025 13:41:14 GMT
Status: translation complete
Last updated in system: 2025-09-12 16:52:24.400733
Title: FlexiD-Fuse: Flexible number of inputs multi-modal medical image fusion based on diffusion model
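Authors: Yushen Xu, Xiaosong Li, Yuchun Wang, Xiaoqi Cheng, Huafeng Li, Haishu Tan
Abstract summary: FlexiD-Fuse is a diffusion-based image fusion network designed to accommodate a flexible number of input modalities. It can process two-modal and tri-modal medical image fusion end to end with the same weights. By incorporating the Expectation-Maximization algorithm into the diffusion sampling process, FlexiD-Fuse can generate high-quality fused images.
Reference score (independently computed attention score): 17.729495428690107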
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Different modalities of medical images provide unique physiological and anatomical information about diseases. Multi-modal medical image fusion integrates useful information from complementary medical images of different modalities, producing a fused image that comprehensively and objectively reflects lesion characteristics to assist doctors in clinical diagnosis. However, existing fusion methods can only handle a fixed number of modality inputs, such as accepting only two-modal or tri-modal inputs, and cannot directly process varying numbers of inputs, which hinders their application in clinical settings. To tackle this issue, we introduce FlexiD-Fuse, a diffusion-based image fusion network designed to accommodate a flexible number of input modalities. It can process two-modal and tri-modal medical image fusion end to end with the same weights. FlexiD-Fuse transforms the diffusion fusion problem, which supports only fixed-condition inputs, into a maximum likelihood estimation problem based on the diffusion process and hierarchical Bayesian modeling. By incorporating the Expectation-Maximization algorithm into the diffusion sampling iteration process, FlexiD-Fuse can generate high-quality fused images with cross-modal information from source images, independently of the number of input images. We compared our method with the latest two-modal and tri-modal medical image fusion methods on the Harvard datasets, evaluating them with nine popular metrics. The experimental results show that our method achieves the best performance in medical image fusion with varying inputs. We also conducted extensive extension experiments on infrared-visible, multi-exposure, and multi-focus image fusion tasks with arbitrary numbers of inputs, and compared the results with the respective SOTA methods. The results of the extension experiments consistently demonstrate the effectiveness and superiority of our method.
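The abstract's idea of treating fusion as maximum likelihood estimation solved with Expectation-Maximization, independent of the number of inputs, can be illustrated with a toy sketch. This is not the FlexiD-Fuse algorithm: it contains no diffusion model, and the function name `em_fuse`, the per-pixel Student-t noise model, and the parameters `nu` and `sigma` are all assumptions made for illustration. It only shows why a likelihood-based formulation accepts two, three, or more sources without any change to the code.

```python
import numpy as np

def em_fuse(sources, nu=4.0, sigma=0.1, n_iters=20):
    """Toy EM fusion (hypothetical, not the paper's method).

    Each source pixel is modeled as the fused pixel plus Student-t noise,
    written as a Gaussian with a latent per-pixel precision (a simple
    hierarchical model). EM then reduces to iteratively reweighted averaging.
    """
    # Stack K source images of equal shape into an array of shape (K, H, W).
    y = np.stack([np.asarray(s, dtype=np.float64) for s in sources], axis=0)
    x = y.mean(axis=0)  # initial fused estimate: plain average
    for _ in range(n_iters):
        # E-step: expected latent precision of every source pixel
        # given the current fused estimate.
        r2 = ((y - x) / sigma) ** 2
        w = (nu + 1.0) / (nu + r2)
        # M-step: maximize the expected log-likelihood, which gives
        # a precision-weighted mean over the K sources.
        x = (w * y).sum(axis=0) / w.sum(axis=0)
    return x

# The same function handles two or three inputs without modification.
rng = np.random.default_rng(0)
a, b, c = (rng.random((8, 8)) for _ in range(3))
fused_two = em_fuse([a, b])
fused_three = em_fuse([a, b, c])
```

According to the abstract, the actual method interleaves its EM updates with the diffusion sampling iterations; the sketch omits that part entirely and keeps only the variable-input likelihood view.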

Related papers