Fugu-MT 論文翻訳(概要): Multi-Sequence Parotid Gland Lesion Segmentation via Expert Text-Guided Segment Anything Model

論文の概要: Multi-Sequence Parotid Gland Lesion Segmentation via Expert Text-Guided Segment Anything Model

arxiv url: http://arxiv.org/abs/2508.09645v1
Date: Wed, 13 Aug 2025 09:25:29 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-14 20:42:00.832817
Title: Multi-Sequence Parotid Gland Lesion Segmentation via Expert Text-Guided Segment Anything Model
Title（参考訳）: エキスパートテキスト誘導セグメントモデルによる多列耳下腺病変分割
Authors: Zhongyuan Wu, Chuan-Xian Ren, Yu Wang, Xiaohua Ban, Jianning Xiao, Xiaohui Duan,
Abstract要約: 耳下腺セグメントモデル(PG-SAM)は、耳下腺病変セグメント化の専門知識を取り入れている。 PG-SAMは、3つの独立した臨床センターにまたがる耳下腺病変の分節における最先端のパフォーマンスを達成する。
参考スコア（独自算出の注目度）: 10.892625199593644
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Parotid gland lesion segmentation is essential for the treatment of parotid gland diseases. However, due to the variable size and complex lesion boundaries, accurate parotid gland lesion segmentation remains challenging. Recently, the Segment Anything Model (SAM) fine-tuning has shown remarkable performance in the field of medical image segmentation. Nevertheless, SAM's interaction segmentation model relies heavily on precise lesion prompts (points, boxes, masks, etc.), which are very difficult to obtain in real-world applications. Besides, current medical image segmentation methods are automatically generated, ignoring the domain knowledge of medical experts when performing segmentation. To address these limitations, we propose the parotid gland segment anything model (PG-SAM), an expert diagnosis text-guided SAM incorporating expert domain knowledge for cross-sequence parotid gland lesion segmentation. Specifically, we first propose an expert diagnosis report guided prompt generation module that can automatically generate prompt information containing the prior domain knowledge to guide the subsequent lesion segmentation process. Then, we introduce a cross-sequence attention module, which integrates the complementary information of different modalities to enhance the segmentation effect. Finally, the multi-sequence image features and generated prompts are feed into the decoder to get segmentation result. Experimental results demonstrate that PG-SAM achieves state-of-the-art performance in parotid gland lesion segmentation across three independent clinical centers, validating its clinical applicability and the effectiveness of diagnostic text for enhancing image segmentation in real-world clinical settings.
Abstract（参考訳）: 耳下腺病変の分節は耳下腺疾患の治療に不可欠である。しかし, 変化の大きさと複雑な病変の境界から, 正確な耳下腺病変の分節はいまだに困難である。近年,Segment Anything Model (SAM)ファインチューニングは,医用画像のセグメンテーションにおいて顕著な性能を示した。それでもSAMの相互作用セグメンテーションモデルは、現実世界では非常に入手が難しい正確な病変プロンプト(ポイント、ボックス、マスクなど)に大きく依存している。また、現在の医用画像セグメンテーション法は、セグメンテーションを行う際に、医療専門家のドメイン知識を無視して自動的に生成される。これらの制約に対処するため, 耳下腺病変分類のための専門領域知識を取り入れた, テキストガイド付きSAMの専門診断モデルである耳下腺セグメントモデル(PG-SAM)を提案する。具体的には、まず、先行ドメイン知識を含むプロンプト情報を自動生成し、その後の病変分割プロセスを導出する専門家診断レポートガイドプロンプト生成モジュールを提案する。次に、異なるモードの補完情報を統合してセグメント化効果を高めるクロスシーケンスアテンションモジュールを提案する。最後に、複数シーケンスの画像特徴と生成されたプロンプトをデコーダに入力してセグメント化結果を得る。実験の結果,PG-SAMは3つの独立した臨床センターにまたがる耳下腺病変セグメンテーションの最先端性を実現し,その臨床応用性および臨床現場における画像セグメンテーション向上のための診断テキストの有効性を検証した。

論文の概要: Multi-Sequence Parotid Gland Lesion Segmentation via Expert Text-Guided Segment Anything Model

関連論文リスト