Fugu-MT 論文翻訳(概要): A Language-Signal-Vision Multimodal Framework for Multitask Cardiac Analysis

論文の概要: A Language-Signal-Vision Multimodal Framework for Multitask Cardiac Analysis

arxiv url: http://arxiv.org/abs/2508.13072v1
Date: Mon, 18 Aug 2025 16:43:31 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-19 14:49:11.489623
Title: A Language-Signal-Vision Multimodal Framework for Multitask Cardiac Analysis
Title（参考訳）: マルチタスク心臓分析のための言語信号ビジョンマルチモーダルフレームワーク
Authors: Yuting Zhang, Tiantian Geng, Luoying Hao, Xinxing Cheng, Alexander Thorley, Xiaoxia Wang, Wenqi Lu, Sandeep S Hothi, Lei Wei, Zhaowen Qiu, Dipak Kotecha, Jinming Duan,
Abstract要約: TGMM ( Textual Guidance Multimodal fusion for Multiple Heartc Task) を開発した。本研究は,複数のモダリティにまたがる重要な特徴を体系的に検討し,臨床意思決定における相乗的貢献を解明した。
参考スコア（独自算出の注目度）: 37.18952260878238
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Contemporary cardiovascular management involves complex consideration and integration of multimodal cardiac datasets, where each modality provides distinct but complementary physiological characteristics. While the effective integration of multiple modalities could yield a holistic clinical profile that accurately models the true clinical situation with respect to data modalities and their relatives weightings, current methodologies remain limited by: 1) the scarcity of patient- and time-aligned multimodal data; 2) reliance on isolated single-modality or rigid multimodal input combinations; 3) alignment strategies that prioritize cross-modal similarity over complementarity; and 4) a narrow single-task focus. In response to these limitations, a comprehensive multimodal dataset was curated for immediate application, integrating laboratory test results, electrocardiograms, and echocardiograms with clinical outcomes. Subsequently, a unified framework, Textual Guidance Multimodal fusion for Multiple cardiac tasks (TGMM), was proposed. TGMM incorporated three key components: 1) a MedFlexFusion module designed to capture the unique and complementary characteristics of medical modalities and dynamically integrate data from diverse cardiac sources and their combinations; 2) a textual guidance module to derive task-relevant representations tailored to diverse clinical objectives, including heart disease diagnosis, risk stratification and information retrieval; and 3) a response module to produce final decisions for all these tasks. Furthermore, this study systematically explored key features across multiple modalities and elucidated their synergistic contributions in clinical decision-making. Extensive experiments showed that TGMM outperformed state-of-the-art methods across multiple clinical tasks, with additional validation confirming its robustness on another public dataset.
Abstract（参考訳）: 現代の心臓血管管理には複雑な考慮と多モード心的データセットの統合が含まれており、それぞれのモダリティは異なるが相補的な生理的特徴を提供する。複数のモダリティの効果的な統合は、データモダリティとその関連性重み付けに関する真の臨床状況を正確にモデル化する総合的な臨床プロファイルをもたらす可能性があるが、現在の方法論は以下の通りである。 1) 患者及び時間順のマルチモーダルデータの不足 2) 孤立した単一モダリティ又は剛性多モード入力の組み合わせに依存している。 3 相互類似性を相補性より優先する調整戦略及び 4) シングルタスクの焦点が狭いこと。これらの制約に応えて、検査結果、心電図、心エコー図を臨床結果と統合した総合的マルチモーダルデータセットを即時適用した。その後,TGMM(Textual Guidance Multimodal fusion for Multiple Heartc Task)という統合フレームワークが提案された。 TGMMには3つの重要な要素が組み込まれている。 1)MedFlexFusionモジュールは、医療モダリティのユニークかつ相補的な特性を捉え、多様な心臓ソースからのデータとその組み合わせを動的に統合するように設計されている。 2 心臓病の診断、リスク階層化及び情報検索を含む多様な臨床目的に適したタスク関連表現を導出するためのテキストガイダンスモジュール 3) これらのタスクの最終的な決定を生成する応答モジュール。さらに, 本研究は, 複数のモダリティにまたがる重要な特徴を体系的に検討し, 臨床意思決定における相乗的貢献を解明した。大規模な実験により、TGMMは複数の臨床タスクで最先端の手法よりも優れており、別の公開データセットでその堅牢性を確認した。

論文の概要: A Language-Signal-Vision Multimodal Framework for Multitask Cardiac Analysis

関連論文リスト