Fugu-MT 論文翻訳(概要): J-RAS: Enhancing Medical Image Segmentation via Retrieval-Augmented Joint Training

論文の概要: J-RAS: Enhancing Medical Image Segmentation via Retrieval-Augmented Joint Training

arxiv url: http://arxiv.org/abs/2510.09953v2
Date: Tue, 14 Oct 2025 15:57:51 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-15 14:23:56.901417
Title: J-RAS: Enhancing Medical Image Segmentation via Retrieval-Augmented Joint Training
Title（参考訳）: J-RAS:検索統合トレーニングによる医用画像セグメンテーションの強化
Authors: Salma J. Ahmed, Emad A. Mohammed, Azam Asilian Bidgoli,
Abstract要約: 本稿では,探索モデルとセグメンテーションモデルを統合したガイド画像セグメンテーションのジョイントトレーニング手法を提案する。どちらのモデルも最適化されており、セグメンテーションモデルは取得した画像とマスクのペアを利用して解剖学的理解を深めることができる。 2つのベンチマークデータセット上で、U-Net、TransUNet、SAM、SegFormerを含む複数のセグメンテーションバックボーンにわたるJ-RASを検証する。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Image segmentation, the process of dividing images into meaningful regions, is critical in medical applications for accurate diagnosis, treatment planning, and disease monitoring. Although manual segmentation by healthcare professionals produces precise outcomes, it is time-consuming, costly, and prone to variability due to differences in human expertise. Artificial intelligence (AI)-based methods have been developed to address these limitations by automating segmentation tasks; however, they often require large, annotated datasets that are rarely available in practice and frequently struggle to generalize across diverse imaging conditions due to inter-patient variability and rare pathological cases. In this paper, we propose Joint Retrieval Augmented Segmentation (J-RAS), a joint training method for guided image segmentation that integrates a segmentation model with a retrieval model. Both models are jointly optimized, enabling the segmentation model to leverage retrieved image-mask pairs to enrich its anatomical understanding, while the retrieval model learns segmentation-relevant features beyond simple visual similarity. This joint optimization ensures that retrieval actively contributes meaningful contextual cues to guide boundary delineation, thereby enhancing the overall segmentation performance. We validate J-RAS across multiple segmentation backbones, including U-Net, TransUNet, SAM, and SegFormer, on two benchmark datasets: ACDC and M&Ms, demonstrating consistent improvements. For example, on the ACDC dataset, SegFormer without J-RAS achieves a mean Dice score of 0.8708$\pm$0.042 and a mean Hausdorff Distance (HD) of 1.8130$\pm$2.49, whereas with J-RAS, the performance improves substantially to a mean Dice score of 0.9115$\pm$0.031 and a mean HD of 1.1489$\pm$0.30. These results highlight the method's effectiveness and its generalizability across architectures and datasets.
Abstract（参考訳）: イメージセグメンテーションは、画像を意味のある領域に分割するプロセスであり、正確な診断、治療計画、疾患モニタリングのための医学的応用において重要である。医療専門家による手動セグメンテーションは正確な結果をもたらすが、人的専門性の違いにより、時間を要する、コストがかかる、変動しがちである。人工知能(AI)ベースの手法は、セグメンテーションタスクを自動化してこれらの制限に対処するために開発されたが、実際にはほとんど利用できない大規模な注釈付きデータセットを必要とすることが多く、患者間変異や稀な病理症例による多様な画像条件の一般化に苦慮することが多い。本稿では,検索モデルとセグメンテーションモデルを統合したガイド画像セグメンテーションのジョイントトレーニング手法であるJ-RASを提案する。どちらのモデルも共同最適化されており、抽出した画像とマスクのペアを利用して解剖学的理解を深めることが可能であり、一方、検索モデルは単純な視覚的類似性を超えたセグメンテーション関連の特徴を学習する。この共同最適化により、検索が意味のある文脈的手がかりに積極的に寄与して境界線を導出し、全体のセグメンテーション性能を向上させる。我々は、U-Net、TransUNet、SAM、SegFormerを含む複数のセグメンテーションバックボーンにわたるJ-RASを、ACDCとM&Mの2つのベンチマークデータセットで検証し、一貫した改善を実証した。例えば、ACDCデータセットでは、J-RASのないSegFormerは平均Diceスコアが0.8708$\pm$0.042、平均Hausdorff Distance(HD)が1.8130$\pm$2.49、一方J-RASでは平均Diceスコアが0.9115$\pm$0.031、平均HDが1.1489$\pm$0.30となる。これらの結果は,本手法の有効性と,アーキテクチャとデータセット間の一般化性を強調した。

論文の概要: J-RAS: Enhancing Medical Image Segmentation via Retrieval-Augmented Joint Training

関連論文リスト