Fugu-MT 論文翻訳(概要): An Automated Retrieval-Augmented Generation LLaMA-4 109B-based System for Evaluating Radiotherapy Treatment Plans

論文の概要: An Automated Retrieval-Augmented Generation LLaMA-4 109B-based System for Evaluating Radiotherapy Treatment Plans

arxiv url: http://arxiv.org/abs/2509.20707v2
Date: Mon, 29 Sep 2025 02:04:09 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-30 14:13:47.622569
Title: An Automated Retrieval-Augmented Generation LLaMA-4 109B-based System for Evaluating Radiotherapy Treatment Plans
Title（参考訳）: 放射線治療計画評価のためのLLaMA-4 109B自動検索システム
Authors: Junjie Cui, Peilong Wang, Jason Holmes, Leshan Sun, Michael L. Hinni, Barbara A. Pockaj, Sujay A. Vora, Terence T. Sio, William W. Wong, Nathan Y. Yu, Steven E. Schild, Joshua R. Niska, Sameer R. Keole, Jean-Claude M. Rwigema, Samir H. Patel, Lisa A. McGee, Carlos A. Vargas, Wei Liu,
Abstract要約: 我々は,LLaMA-4 109Bを用いた放射線治療計画の自動化,プロトコル認識,解釈可能な評価のためのRAGシステムを開発した。 RAGシステムは5つのSentenceTransformerバックボーンに最適化された検索エンジン、コホート類似性に基づくパーセンタイル予測コンポーネント、臨床制約チェッカーの3つのコアモジュールを統合している。
参考スコア（独自算出の注目度）: 2.2532577733932038
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Purpose: To develop a retrieval-augmented generation (RAG) system powered by LLaMA-4 109B for automated, protocol-aware, and interpretable evaluation of radiotherapy treatment plans. Methods and Materials: We curated a multi-protocol dataset of 614 radiotherapy plans across four disease sites and constructed a knowledge base containing normalized dose metrics and protocol-defined constraints. The RAG system integrates three core modules: a retrieval engine optimized across five SentenceTransformer backbones, a percentile prediction component based on cohort similarity, and a clinical constraint checker. These tools are directed by a large language model (LLM) using a multi-step prompt-driven reasoning pipeline to produce concise, grounded evaluations. Results: Retrieval hyperparameters were optimized using Gaussian Process on a scalarized loss function combining root mean squared error (RMSE), mean absolute error (MAE), and clinically motivated accuracy thresholds. The best configuration, based on all-MiniLM-L6-v2, achieved perfect nearest-neighbor accuracy within a 5-percentile-point margin and a sub-2pt MAE. When tested end-to-end, the RAG system achieved 100% agreement with the computed values by standalone retrieval and constraint-checking modules on both percentile estimates and constraint identification, confirming reliable execution of all retrieval, prediction and checking steps. Conclusion: Our findings highlight the feasibility of combining structured population-based scoring with modular tool-augmented reasoning for transparent, scalable plan evaluation in radiation therapy. The system offers traceable outputs, minimizes hallucination, and demonstrates robustness across protocols. Future directions include clinician-led validation, and improved domain-adapted retrieval models to enhance real-world integration.
Abstract（参考訳）: 目的:LLaMA-4109Bを用いた放射線治療計画の自動化,プロトコル認識,解釈可能な評価のためのRAGシステムの開発。方法と材料:4つの疾患部位にわたる614件の放射線治療計画のマルチプロトコールデータセットを作成し,正常化線量測定値とプロトコル定義制約を含む知識ベースを構築した。 RAGシステムは5つのSentenceTransformerバックボーンに最適化された検索エンジン、コホート類似性に基づくパーセンタイル予測コンポーネント、臨床制約チェッカーの3つのコアモジュールを統合している。これらのツールは、多段階のプロンプト駆動推論パイプラインを使用して、簡潔で基礎的な評価を生成する、大規模言語モデル(LLM)によって誘導される。結果: 検索ハイパーパラメータは, root mean squared error (RMSE), mean absolute error (MAE), and clinicallymotivated accuracy thresholdsを併用したスカラー化損失関数上で, Gaussian Process を用いて最適化された。 All-MiniLM-L6-v2をベースとした最良の構成は、5パーセントのポイントマージンとサブ2ptのMAEで完全な近接精度を達成した。エンドツーエンドでテストすると、RAGシステムは、パーセンタイル推定と制約識別の両方において、スタンドアロンの検索と制約チェックモジュールによる計算値との100%の一致を達成し、全ての検索、予測、チェックステップの信頼性を確認した。結論: この知見は, 放射線治療における透過的かつスケーラブルな計画評価のための, 構造的人口ベーススコアとモジュラーツール強化推論の併用の可能性を明らかにするものである。このシステムはトレーサブルな出力を提供し、幻覚を最小限に抑え、プロトコル間の堅牢性を示す。将来的な方向性としては、クリニック主導のバリデーションや、実際の統合を強化するためのドメイン適応検索モデルの改善がある。

論文の概要: An Automated Retrieval-Augmented Generation LLaMA-4 109B-based System for Evaluating Radiotherapy Treatment Plans

関連論文リスト