Fugu-MT 論文翻訳(概要): Clin-JEPA: A Multi-Phase Co-Training Framework for Joint-Embedding Predictive Pretraining on EHR Patient Trajectories

論文の概要: Clin-JEPA: A Multi-Phase Co-Training Framework for Joint-Embedding Predictive Pretraining on EHR Patient Trajectories

arxiv url: http://arxiv.org/abs/2605.10840v2
Date: Tue, 12 May 2026 07:38:53 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-13 18:21:07.141789
Title: Clin-JEPA: A Multi-Phase Co-Training Framework for Joint-Embedding Predictive Pretraining on EHR Patient Trajectories
Title（参考訳）: Clin-JEPA: EHR患者軌道上での複合組込み予測予測のための多相協調学習フレームワーク
Authors: Yixuan Yang, Mehak Arora, Ryan Zhang, Baraa Abed, Junseob Kim, Tilendra Choudhary, Md Hassanuzzaman, Kevin Zhu, Ayman Ali, Chengkun Yang, Alasdair Edward Gent, Victor Moas, Rishikesan Kamaleswaran,
Abstract要約: Clin-JEPAは、HR患者軌道上での予測(JEPA)事前トレーニングのための多相コトレーニングフレームワークである。 MIMIC-IV ICUデータでは、3つの独立した評価がフレームワークをサポートしている。
参考スコア（独自算出の注目度）: 7.992399976910451
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We present Clin-JEPA, a multi-phase co-training framework for joint-embedding predictive (JEPA) pretraining on EHR patient trajectories. JEPA architectures have enabled latent-space planning in robotics and high-quality representation learning in vision, but extending the paradigm to EHR data -- to obtain a single backbone that simultaneously forecasts patient trajectories and serves diverse downstream risk-prediction tasks without per-task fine-tuning -- remains an open challenge. Existing JEPA frameworks either discard the predictor after pretraining (I-JEPA, V-JEPA) or train it on a frozen pretrained encoder (V-JEPA 2-AC), leaving the encoder unaware of the rollout signal that the retained predictor must use at inference; co-training the encoder and predictor under a shared JEPA prediction objective would supply this grounding, but naïve co-training is unstable, with representation collapse and online/target drift causing autoregressive rollout to diverge. Clin-JEPA's five-phase pretraining curriculum -- predictor warmup, joint refinement, EMA target alignment, hard sync, and predictor finalization -- addresses each failure mode by phase, stably co-training a Qwen3-8B-based encoder and a 92M-parameter latent trajectory predictor. On MIMIC-IV ICU data, three independent evaluations support the framework: (1) latent $\ell_1$ rollout drift uniquely converges ($-$15.7%) over 48-hour horizons while baselines and ablations diverge (+3% to +4951%); (2) the encoder learns a clinically discriminative latent geometry (deteriorating-patient cohorts displace 4.83$\times$ further than stable patients in latent space, vs $\leq$2.62$\times$ for baseline encoders); (3) a single backbone outperforms strong tabular and sequence baselines on multi-task downstream evaluation. Clin-JEPA achieves mean AUROC 0.851 on ICareFM EEP and 0.883 on 8 binary risk tasks (+0.038 and +0.041 vs baseline average).
Abstract（参考訳）: Clin-JEPAは関節内包予測(JEPA)のための多相協調訓練フレームワークである。 JEPAアーキテクチャは、ロボット工学における潜時空間計画と視覚における高品質な表現学習を可能にしたが、パラダイムをEHRデータに拡張することで、患者の軌道を同時に予測し、タスク毎の微調整なしでさまざまな下流リスク予測タスクを提供する単一のバックボーンを取得することは、依然としてオープンな課題である。既存のJEPAフレームワークは、事前トレーニング後の予測器(I-JEPA、V-JEPA)を捨てるか、あるいは凍結した事前訓練されたエンコーダ(V-JEPA 2-AC)でトレーニングし、保持された予測器が推論時に使用しなければならないロールアウト信号に気づかないままにしておく。 Clin-JEPAの5段階事前訓練カリキュラム -- 予測器のウォームアップ、共同改良、EMAターゲットアライメント、ハードシンク、予測器ファイナライゼーション -- は、Qwen3-8Bベースのエンコーダと92Mの潜航軌道予測器を安定的に併用して、各障害モードにフェーズ毎に対処する。 MIMIC-IV ICUデータでは、3つの独立した評価がフレームワークをサポートしている: (1) latent $\ell_1$ rollout driftは48時間水平線上に一意に収束するが、ベースラインとアブレーションは分岐し(+3%から+4951%)、 (2) encoderは臨床的に識別可能な潜伏幾何学を学習する(deteriorating-patient cohorts displace 4.83$\times$は潜伏空間の安定な患者より多く、対して$\leq$2.62$\times$はベースラインエンコーダに対して)。 Clin-JEPAは、ICareFM EEP上の平均AUROC 0.851と、8つのバイナリリスクタスク(ベースライン平均に対する+0.038と+0.041)における0.883を達成する。

論文の概要: Clin-JEPA: A Multi-Phase Co-Training Framework for Joint-Embedding Predictive Pretraining on EHR Patient Trajectories

関連論文リスト