Fugu-MT 論文翻訳(概要): Training a Large Language Model for Medical Coding Using Privacy-Preserving Synthetic Clinical Data

論文の概要: Training a Large Language Model for Medical Coding Using Privacy-Preserving Synthetic Clinical Data

arxiv url: http://arxiv.org/abs/2603.23515v1
Date: Fri, 06 Mar 2026 02:02:49 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-06 02:36:12.999916
Title: Training a Large Language Model for Medical Coding Using Privacy-Preserving Synthetic Clinical Data
Title（参考訳）: プライバシ保存臨床データを用いた医用符号化のための大規模言語モデルの訓練
Authors: John Cook, Michael Wyatt, Peng Wei, Iris Chin, Santosh Gupta, Van Zyl Van Vuuren, Richie Siburian, Amanda Spicer, Kristen Viviano, Alda Cami, Raunaq Malhotra, Zhewei Yao, Jeff Rasley, Gaurav Kaushik,
Abstract要約: 本稿では,現代のオープンウェイト基盤モデルが,専門家レベルの医療用コーディングタスクに適応できるかどうかを検討する。 Llama 3-70B を EHR-grounded templates と code policy から生成された臨床ノートとゴールドコードに微調整する。合成コーパスの微調整の後、精度の高いF1は0.70を超え、両方のコードシステムで絶対的な利得を示している。
参考スコア（独自算出の注目度）: 10.24013423253024
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Improving the accuracy and reliability of medical coding reduces clinician burnout and supports revenue cycle processes, freeing providers to focus more on patient care. However, automating the assignment of ICD-10-CM and CPT codes from clinical documentation remains a challenge due to heterogeneous records, nuanced coding guidelines, and long-tail distributions. Large language models have been proposed to help or automate specific medical coding tasks. However, foundation models are not explicitly trained for medical coding and zero-shot coding has yielded poor results. We investigate whether a modern open-weight foundation model can be adapted for an expert-level medical coding task using privacy-preserving synthetic training data derived from electronic health records. We fine-tune Llama 3-70B on pairs of clinical notes and gold codes generated from EHR-grounded templates and coding policies, then evaluate exact-code prediction for ICD-10-CM and CPT. A zero-shot baseline with the unadapted model achieved an F1 score of 0.18 for exact code match. After fine-tuning on the synthetic corpus, exact-match F1 exceeded 0.70, representing a large absolute gain across both code systems. Notably, performance remained high on complex categories that often require multi-step clinical reasoning and code composition, including Advanced Illness and Frailty classes, and the model retained its performance on medical comprehension tasks. These results indicate that synthetic, policy-aware data can efficiently teach a general-purpose large language model to support precise medical coding without exposing protected health information. The approach offers a practical path for training coding agents safely and iteratively on specific tasks that represent real-world populations.
Abstract（参考訳）: 医療コーディングの正確性と信頼性の向上により、クリニックのバーンアウトが減少し、収益サイクルのプロセスがサポートされ、提供者は患者のケアに集中できるようになる。しかし,ICD-10-CMとCPT符号の自動割り当ては,異質な記録,ニュアンスな符号化ガイドライン,長いテール分布などにより,依然として困難である。特定の医学的コーディングタスクを支援するか、自動化するために、大規模な言語モデルが提案されている。しかし、基礎モデルは医学的コーディングのために明示的に訓練されていないため、ゼロショットコーディングは結果の低さをもたらしている。電子健康記録から得られたプライバシー保護型総合訓練データを用いて,現代のオープンウェイト基盤モデルを専門家レベルの医療コーディングタスクに適用できるかどうかを検討する。 ICD-10-CM と CPT の正確なコード予測を行うため,EHR のテンプレートと符号化ポリシから生成された臨床メモとゴールドコードに Llama 3-70B を微調整した。未適応モデルのゼロショットベースラインは、正確なコードマッチングでF1スコアが0.18に達した。合成コーパスの微調整の後、精度の高いF1は0.70を超え、両方のコードシステムで絶対的な利得を示している。特に、多段階の臨床推論とコード構成を必要とする複雑なカテゴリーでは、高度な病気や虚偽のクラスを含むパフォーマンスが高く、医療的理解タスクではそのパフォーマンスが保たれていた。これらの結果から,保護された健康情報を公開することなく,総合的な大規模言語モデルを効率的に学習し,正確な医学的コーディングを支援することが示唆された。このアプローチは、現実世界の人口を表す特定のタスクに対して、安全かつ反復的にコーディングエージェントを訓練するための実践的なパスを提供する。

論文の概要: Training a Large Language Model for Medical Coding Using Privacy-Preserving Synthetic Clinical Data

関連論文リスト