Fugu-MT 論文翻訳(概要): RePrompT: Recurrent Prompt Tuning for Integrating Structured EHR Encoders with Large Language Models

論文の概要: RePrompT: Recurrent Prompt Tuning for Integrating Structured EHR Encoders with Large Language Models

arxiv url: http://arxiv.org/abs/2604.17725v1
Date: Mon, 20 Apr 2026 02:20:13 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-21 21:52:52.662263
Title: RePrompT: Recurrent Prompt Tuning for Integrating Structured EHR Encoders with Large Language Models
Title（参考訳）: RePrompT: 構造化HRエンコーダと大規模言語モデルの統合のための繰り返しプロンプトチューニング
Authors: Arya Hadizadeh Moghaddam, Drew Ross, Mohsen Nayebi Kerdabadi, Dongjie Wang, Zijun Yao,
Abstract要約: 本稿では,構造化EHRエンコーダを即時チューニングにより統合する時間認識フレームワークRePrompTを紹介する。 MIMIC-IIIとMIMIC-IVの実験では、RePrompTはEHRベースのベースラインとLLMベースのベースラインの両方を一貫して上回っている。
参考スコア（独自算出の注目度）: 12.004161606345084
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large Language Models (LLMs) have shown strong promise for mining Electronic Health Records (EHRs) by reasoning over longitudinal clinical information to capture context-rich patient trajectories. However, leveraging LLMs for structured EHRs (e.g., standardized diagnosis and medication codes) presents two key challenges. First, translating time-stamped EHR sequences into plain text can obscure both temporal structure and code identities, weakening the ability to capture code co-occurrence and longitudinal regularities. Second, unlike cohort-trained predictive models that learn a shared, task-aligned representation space across patients, LLMs are often applied in a case-isolated inference setting where each patient is processed independently without leveraging population-level patterns. To address these challenges, we introduce RePrompT, a time-aware LLM framework that integrates structured EHR encoders through prompt tuning, without modifying underlying architectures. Specifically, RePrompT recurrently incorporates latent states from prior visits to preserve longitudinal information, and injects population-level information through trainable prompt tokens derived from a cohort-trained, task-aligned EHR encoder. Experiments on MIMIC-III and MIMIC-IV demonstrate that RePrompT consistently outperforms both EHR-based and LLM-based baselines across multiple clinical prediction tasks.
Abstract（参考訳）: 大規模言語モデル (LLMs) は, 文脈に富む患者軌跡を捉えるために, 経時的臨床情報を解析することにより電子健康記録 (EHRs) のマイニングを強く約束している。しかしながら、構造化EMH(例えば、標準化された診断と治療基準)にLLMを活用することは、2つの重要な課題を提示する。まず、タイムスタンプされたEHRシーケンスをプレーンテキストに変換することで、時間的構造とコードの同一性の両方を曖昧にし、コード共起と縦長の規則性をキャプチャする能力を弱めることができる。第2に、患者間で共有されたタスク整合表現空間を学習するコホート学習予測モデルとは異なり、各患者が集団レベルのパターンを活用せずに独立に処理されるケース分離推論環境では、LLMがしばしば適用される。これらの課題に対処するため、我々は、基盤となるアーキテクチャを変更することなく、即時チューニングにより構造化されたEHRエンコーダを統合する、タイムアウェアなLLMフレームワークであるRePrompTを紹介した。具体的には、RePrompTは、前回の訪問から潜伏状態に繰り返し組み込んで長手情報を保持し、コホート訓練されたタスク対応のEHRエンコーダから導出される訓練可能なプロンプトトークンを通じて、人口レベルの情報を注入する。 MIMIC-IIIとMIMIC-IVの実験では、RePrompTは複数の臨床予測タスクにおいて、EHRベースのベースラインとLLMベースのベースラインを一貫して上回っている。

論文の概要: RePrompT: Recurrent Prompt Tuning for Integrating Structured EHR Encoders with Large Language Models

関連論文リスト