Fugu-MT 論文翻訳(概要): CLIN-LLM: A Safety-Constrained Hybrid Framework for Clinical Diagnosis and Treatment Generation

論文の概要: CLIN-LLM: A Safety-Constrained Hybrid Framework for Clinical Diagnosis and Treatment Generation

arxiv url: http://arxiv.org/abs/2510.22609v1
Date: Sun, 26 Oct 2025 10:11:53 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-28 15:28:15.293891
Title: CLIN-LLM: A Safety-Constrained Hybrid Framework for Clinical Diagnosis and Treatment Generation
Title（参考訳）: CLIN-LLM : 臨床診断と治療のための安全制約付きハイブリッドフレームワーク
Authors: Md. Mehedi Hasan, Rafid Mostafiz, Md. Abir Hossain, Bikash Kumar Paul,
Abstract要約: LLM(Large Language Model)ベースのシステムは、しばしば医学的な根拠がなく、不確実性を定量化できない。 CLIN-LLMは,マルチモーダル患者エンコーディング,不確実性校正疾患分類,検索・拡張治療生成を統合した安全拘束型ハイブリッドパイプラインである。
参考スコア（独自算出の注目度）: 0.31984926651189866
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Accurate symptom-to-disease classification and clinically grounded treatment recommendations remain challenging, particularly in heterogeneous patient settings with high diagnostic risk. Existing large language model (LLM)-based systems often lack medical grounding and fail to quantify uncertainty, resulting in unsafe outputs. We propose CLIN-LLM, a safety-constrained hybrid pipeline that integrates multimodal patient encoding, uncertainty-calibrated disease classification, and retrieval-augmented treatment generation. The framework fine-tunes BioBERT on 1,200 clinical cases from the Symptom2Disease dataset and incorporates Focal Loss with Monte Carlo Dropout to enable confidence-aware predictions from free-text symptoms and structured vitals. Low-certainty cases (18%) are automatically flagged for expert review, ensuring human oversight. For treatment generation, CLIN-LLM employs Biomedical Sentence-BERT to retrieve top-k relevant dialogues from the 260,000-sample MedDialog corpus. The retrieved evidence and patient context are fed into a fine-tuned FLAN-T5 model for personalized treatment generation, followed by post-processing with RxNorm for antibiotic stewardship and drug-drug interaction (DDI) screening. CLIN-LLM achieves 98% accuracy and F1 score, outperforming ClinicalBERT by 7.1% (p < 0.001), with 78% top-5 retrieval precision and a clinician-rated validity of 4.2 out of 5. Unsafe antibiotic suggestions are reduced by 67% compared to GPT-5. These results demonstrate CLIN-LLM's robustness, interpretability, and clinical safety alignment. The proposed system provides a deployable, human-in-the-loop decision support framework for resource-limited healthcare environments. Future work includes integrating imaging and lab data, multilingual extensions, and clinical trial validation.
Abstract（参考訳）: 特に診断リスクの高い異種患者では, 正確な症状と症状の分類, 臨床的根拠による治療勧告は依然として困難である。既存の大規模言語モデル(LLM)ベースのシステムは医療基盤を欠いていることが多く、不確実性を定量化できないため、出力が安全でない。 CLIN-LLMは,マルチモーダル患者エンコーディング,不確実性校正疾患分類,検索・拡張治療生成を統合した安全拘束型ハイブリッドパイプラインである。 The framework fine-tunes BioBERT on 1,200 Clinical Case from the Symptom2Disease dataset and integrates Focal Loss with Monte Carlo Dropout to enable confidence-aware predictions from free-text symptoms and structured vitals。低確実性のケース(18%)は、専門家のレビューのために自動的にフラグ付けされ、人間の監視が保証される。治療生成のためにCLIN-LLMはBiomedical Sentence-BERTを使用して、26万サンプルのMedDialog corpusからトップk関連対話を検索する。検索されたエビデンスと患者コンテキストは、パーソナライズされた治療生成のための微調整されたFLAN-T5モデルに入力され、その後RxNormで抗生物質のスチュワードシップとドラッグ・ドラッグ・インタラクション(DDI)スクリーニングを行う。 CLIN-LLMの精度は98%,F1スコアは7.1%(p<0。安全性の低い抗生物質は、GPT-5と比較して67%減少する。これらの結果から,CLIN-LLMの堅牢性,解釈可能性,臨床安全アライメントが示唆された。提案システムは、リソース制限された医療環境に対して、デプロイ可能な、ループ内意思決定支援フレームワークを提供する。今後の開発には、画像と実験室のデータの統合、多言語拡張、臨床試験の検証が含まれる。

論文の概要: CLIN-LLM: A Safety-Constrained Hybrid Framework for Clinical Diagnosis and Treatment Generation

関連論文リスト