Fugu-MT Paper Translation (Summary): Planner-Auditor Twin: Agentic Discharge Planning with FHIR-Based LLM Planning, Guideline Recall, Optional Caching and Self-Improvement

Paper summary: Planner-Auditor Twin: Agentic Discharge Planning with FHIR-Based LLM Planning, Guideline Recall, Optional Caching and Self-Improvement

arxiv url: http://arxiv.org/abs/2601.21113v1
Date: Wed, 28 Jan 2026 23:04:11 GMT
Status: Translation complete
System update date: 2026-01-30 16:22:49.47444
Title: Planner-Auditor Twin: Agentic Discharge Planning with FHIR-Based LLM Planning, Guideline Recall, Optional Caching and Self-Improvement
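Title (reference translation): Planner-Auditor Twin: Agentic discharge planning with FHIR-based LLM planning, guideline recall, optional caching, and self-improvement
Authors: Kaiyuan Wu, Aditya Nagori, Rishikesan Kamaleswaran
Abstract summary: Large language models (LLMs) show promise for clinical discharge planning, but their use is constrained by hallucination, omissions, and miscalibrated confidence. The paper introduces a self-improving, cache-optional Planner-Auditor framework that improves safety and reliability.
Reference score (independently computed attention score): 2.0755366440393748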
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Objective: Large language models (LLMs) show promise for clinical discharge planning, but their use is constrained by hallucination, omissions, and miscalibrated confidence. We introduce a self-improving, cache-optional Planner-Auditor framework that improves safety and reliability by decoupling generation from deterministic validation and targeted replay. Materials and Methods: We implemented an agentic, retrospective, FHIR-native evaluation pipeline using MIMIC-IV-on-FHIR. For each patient, the Planner (LLM) generates a structured discharge action plan with an explicit confidence estimate. The Auditor is a deterministic module that evaluates multi-task coverage, tracks calibration (Brier score, ECE proxies), and monitors action-distribution drift. The framework supports two-tier self-improvement: (i) within-episode regeneration when enabled, and (ii) cross-episode discrepancy buffering with replay for high-confidence, low-coverage cases. Results: While context caching improved performance over baseline, the self-improvement loop was the primary driver of gains, increasing task coverage from 32% to 86%. Calibration improved substantially, with reduced Brier/ECE and fewer high-confidence misses. Discrepancy buffering further corrected persistent high-confidence omissions during replay. Discussion: Feedback-driven regeneration and targeted replay act as effective control mechanisms to reduce omissions and improve confidence reliability in structured clinical planning. Separating an LLM Planner from a rule-based, observational Auditor enables systematic reliability measurement and safer iteration without model retraining. Conclusion: The Planner-Auditor framework offers a practical pathway toward safer automated discharge planning using interoperable FHIR data access and deterministic auditing, supported by reproducible ablations and reliability-focused evaluation.
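Abstract (reference translation): Objective: Large language models (LLMs) show promise for clinical discharge planning, but their use is constrained by hallucination, omissions, and miscalibrated confidence. This paper proposes a self-improving, cache-optional Planner-Auditor framework. Materials and Methods: An agentic, retrospective, FHIR-native evaluation pipeline was implemented using MIMIC-IV-on-FHIR. For each patient, the Planner (LLM) generates a structured discharge action plan with an explicit confidence estimate. The Auditor is a deterministic module that evaluates multi-task coverage, tracks calibration (Brier score, ECE proxies), and monitors action-distribution drift. The framework supports two-tier self-improvement: (i) within-episode regeneration when enabled, and (ii) discrepancy buffering with replay for high-confidence, low-coverage cases. Results: Context caching improved performance over baseline, but the self-improvement loop was the primary driver of gains, increasing task coverage from 32% to 86%. Calibration improved substantially, with reduced Brier/ECE and fewer high-confidence misses. Discrepancy buffering further corrected persistent high-confidence omissions during replay. Discussion: Feedback-driven regeneration and targeted replay act as effective control mechanisms to reduce omissions and improve confidence reliability in structured clinical planning. Separating an LLM Planner from a rule-based, observational Auditor enables systematic reliability measurement and safer iteration without model retraining. Conclusion: The Planner-Auditor framework offers a practical pathway toward safer automated discharge planning using interoperable FHIR data access and deterministic auditing.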
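
The abstract describes the Auditor as a deterministic module that scores task coverage, tracks calibration (Brier score, ECE), and buffers high-confidence, low-coverage cases for replay. The following is a minimal Python sketch of that idea, not the authors' implementation. The task names in `REQUIRED_TASKS`, the `Plan` and `DiscrepancyBuffer` types, the thresholds, and the choice to treat "full coverage" as the binary outcome for calibration are all assumptions made for illustration; the abstract does not specify them.

```python
from dataclasses import dataclass, field

# Hypothetical task names: the abstract does not list the audited tasks.
REQUIRED_TASKS = frozenset(
    {"medication_reconciliation", "follow_up_appointment", "patient_education"}
)


@dataclass
class Plan:
    patient_id: str
    actions: set            # task names the Planner included in the plan
    confidence: float       # Planner's self-reported confidence in [0, 1]


def coverage(plan, required=REQUIRED_TASKS):
    """Fraction of required tasks that appear in the plan."""
    return len(plan.actions & required) / len(required)


def brier_score(confidences, outcomes):
    """Mean squared difference between confidence and binary outcome."""
    pairs = list(zip(confidences, outcomes))
    return sum((c - o) ** 2 for c, o in pairs) / len(pairs)


def expected_calibration_error(confidences, outcomes, n_bins=10):
    """Equal-width-bin ECE: weighted mean of |avg confidence - accuracy|."""
    bins = [[] for _ in range(n_bins)]
    for c, o in zip(confidences, outcomes):
        idx = min(int(c * n_bins), n_bins - 1)  # confidence 1.0 goes in last bin
        bins[idx].append((c, o))
    total = sum(len(b) for b in bins)
    ece = 0.0
    for b in bins:
        if not b:
            continue
        avg_conf = sum(c for c, _ in b) / len(b)
        accuracy = sum(o for _, o in b) / len(b)
        ece += (len(b) / total) * abs(avg_conf - accuracy)
    return ece


@dataclass
class DiscrepancyBuffer:
    """Collects high-confidence, low-coverage cases for later replay."""
    conf_threshold: float = 0.8   # assumed cut-off for "high confidence"
    cov_threshold: float = 0.5    # assumed cut-off for "low coverage"
    cases: list = field(default_factory=list)

    def audit(self, plan):
        cov = coverage(plan)
        if plan.confidence >= self.conf_threshold and cov < self.cov_threshold:
            self.cases.append(plan.patient_id)
        return cov


plans = [
    Plan("p1", {"medication_reconciliation"}, 0.9),  # confident but incomplete
    Plan("p2", set(REQUIRED_TASKS), 0.9),            # confident and complete
]
buffer = DiscrepancyBuffer()
coverages = [buffer.audit(p) for p in plans]
# Assumption: a plan counts as a "hit" only when every required task is covered.
outcomes = [1.0 if c == 1.0 else 0.0 for c in coverages]
confidences = [p.confidence for p in plans]
brier = brier_score(confidences, outcomes)
ece = expected_calibration_error(confidences, outcomes)
```

Because the checks are plain set operations and arithmetic, repeated runs on the same plans give the same scores, which is the property that lets an auditor of this kind measure reliability without retraining the Planner. In this toy run, only `p1` lands in `buffer.cases`, since it pairs high confidence with low coverage.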
