Fugu-MT 論文翻訳(概要): DiffSight-Former: Modeling Structural Differences and Temporal Dynamics for Glaucoma Progression Prediction

論文の概要: DiffSight-Former: Modeling Structural Differences and Temporal Dynamics for Glaucoma Progression Prediction

arxiv url: http://arxiv.org/abs/2606.09140v1
Date: Mon, 08 Jun 2026 07:39:10 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-09 14:42:06.805421
Title: DiffSight-Former: Modeling Structural Differences and Temporal Dynamics for Glaucoma Progression Prediction
Title（参考訳）: DiffSight-Former:緑内障進展予測のための構造差と時間ダイナミクスのモデル化
Authors: Yi Huang, Lei Bi, Jinman Kim,
Abstract要約: 緑内障は世界中で不可逆的な盲目の原因となっている。深層学習は、基礎画像解析において有望なパフォーマンスを達成した。眼底画像からの緑内障進展予測のためのフレームワークであるDiffSight-Formerを提案する。
参考スコア（独自算出の注目度）: 15.917908024992805
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Glaucoma is a leading cause of irreversible blindness worldwide, and early detection from fundus images is critical for effective disease management. While deep learning has achieved promising performance in fundus image analysis, most existing methods rely on single time-point images and fail to capture longitudinal structural and vascular changes associated with disease progression. Sequential fundus images acquired during clinical follow-up provide valuable temporal information; however, current sequential models often struggle to detect subtle early progression signals and commonly depend on fixed-length inputs or diagnostic cues from already glaucomatous images, limiting their clinical utility for early prediction. To address these limitations, we propose DiffSight-Former, a framework for glaucoma progression prediction from sequential fundus images. It incorporates a time-variant feature extraction module based on a fundus-specific foundation model to obtain robust anatomical representations. A multi-structure difference modeling module is introduced to quantify progression-related changes in the optic disc/cup region and retinal vasculature. These representations are integrated with temporal interval embeddings and processed by a time-aware Transformer to model disease progression and estimate the probability of future glaucoma onset. Experiments were conducted on two longitudinal datasets, SIGF (405 sequences) and GRAPE (263 sequences). On SIGF, DiffSight-Former achieved an AUC of 91.54% and a sensitivity of 92.16% for progression prediction. On GRAPE, it achieved an average accuracy of 87.48% across three clinical visual-field progression criteria. Compared with existing approaches, DiffSight-Former demonstrates strong performance and robustness across different temporal settings, highlighting its potential for longitudinal glaucoma monitoring and early risk prediction.
Abstract（参考訳）: 緑内障は世界中で不可逆性失明の主要な原因であり、眼底画像からの早期発見は効果的な疾患管理に重要である。深層学習は根底画像解析において有望な性能を達成したが、既存のほとんどの手法は単一のタイムポイント画像に依存しており、疾患進行に伴う縦断的な構造変化や血管変化を捉えていない。臨床経過観察中に得られた連続眼底画像は、貴重な時間的情報を提供するが、現在のシーケンシャルモデルでは、微妙な初期進行信号の検出に苦慮することが多く、通常、既存の緑内障画像からの定長入力や診断方法に依存しており、早期予測の臨床的有用性を制限している。これらの制約に対処するため,連続した眼底画像から緑内障進行予測を行うフレームワークであるDiffSight-Formerを提案する。根本的基盤モデルに基づく時間変化特徴抽出モジュールを組み込んで、堅牢な解剖学的表現を得る。光ディスク/カップ領域と網膜血管の進行に伴う変化を定量化するために、多構造差モデリングモジュールを導入する。これらの表現は、時間間隔埋め込みと統合され、時間認識変換器によって処理され、疾患の進行をモデル化し、将来の緑内障発症確率を推定する。 SIGF(405塩基配列)とGRAPE(263塩基配列)の2つの縦断データセットを用いて実験を行った。 SIGFでは、DiffSight-FormerのAUCは91.54%、感度は92.16%に達した。 GRAPEでは、3つの臨床視野進行基準で平均87.48%の精度を達成した。既存のアプローチと比較して、DiffSight-Formerは、異なる時間的設定にわたって、強いパフォーマンスと堅牢性を示し、縦型緑内障のモニタリングと早期リスク予測の可能性を強調している。

論文の概要: DiffSight-Former: Modeling Structural Differences and Temporal Dynamics for Glaucoma Progression Prediction

関連論文リスト