Fugu-MT 論文翻訳(概要): Enhancing LLM Medical Coding with Structured External Knowledge

論文の概要: Enhancing LLM Medical Coding with Structured External Knowledge

arxiv url: http://arxiv.org/abs/2605.27377v2
Date: Thu, 28 May 2026 04:48:22 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-15 07:09:36.503843
Title: Enhancing LLM Medical Coding with Structured External Knowledge
Title（参考訳）: 構造化外部知識によるLLM医療符号化の強化
Authors: Yidong Gan, David D. Nguyen, Yang Lin, Peter Zhong, Thanh Vu, Long Duong, Yuan-Fang Li,
Abstract要約: 既存の LLM ベースの自動化手法は LLM の内部知識に大きく依存している。 RAG-Codingは,外部知識を構造化してLLMを増強するエージェントフリーな手法である。 MDACE-2025では、RAG-Codingはすべてのベースラインを上回り、更新されたガイドラインに対する効果的な一般化を実証している。
参考スコア（独自算出の注目度）: 22.558079959261164
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Accurate medical coding requires consulting authoritative resources such as the ICD tabular list and coding guidelines. Existing LLM-based automated methods largely rely on LLMs' internal knowledge, which is prone to hallucination and cannot keep pace with guideline updates. We introduce RAG-Coding, an agentic, training-free method that augments LLMs with structured external knowledge: the tabular list is encoded as a knowledge graph capturing hierarchical and instructional code relationships, and the guidelines are distilled into concise, code-specific summaries rather than retrieved as raw text. To enable our study, we also introduce MDACE-2025, expert re-annotations of the MDACE dataset under the 2025 ICD-10-CM/PCS guidelines, adding code sequencing and justification comments. On MDACE, RAG-Coding outperforms the best LLM-based baseline by 3--13\% in micro-F1 across five LLM backbones, and achieves comparable micro- and macro-F1 to the supervised state-of-the-art, with higher recall ($+$11\%) at the cost of precision ($-$6\%). On MDACE-2025, RAG-Coding outperforms all baselines, demonstrating effective generalisation to updated guidelines. Ablations confirm stepwise gains, highlighting the importance of integrating structured external knowledge for LLM-based medical coding.
Abstract（参考訳）: 正確な医療用コーディングには、ICD表表やコーディングガイドラインなどの信頼できるリソースをコンサルティングする必要がある。既存の LLM ベースの自動化手法は LLM の内部知識に大きく依存している。表表リストは階層的および命令的コード関係を抽出した知識グラフとしてエンコードされ、ガイドラインは原文として検索されるのではなく、簡潔でコード固有の要約に蒸留される。 2025 ICD-10-CM/PCSガイドラインに基づくMDACEデータセットのエキスパートアノテーションであるMDACE-2025も導入し,コードシークエンシングと正当化コメントを追加した。 MDACEでは、RAG-Codingは5つのLDMバックボーンのマイクロF1において3-13\%の最高のLCMベースラインを上回り、監督された最先端技術に匹敵するマイクロF1とマクロF1を達成し、精度の犠牲(-6\%)で高いリコール(+11\%)を達成している。 MDACE-2025では、RAG-Codingはすべてのベースラインを上回り、更新されたガイドラインに対する効果的な一般化を実証している。アブレーションは段階的に向上し、LLMベースの医療コーディングのための構造化外的知識の統合の重要性を強調している。

論文の概要: Enhancing LLM Medical Coding with Structured External Knowledge

関連論文リスト