Fugu-MT Paper Translation (Abstract): Paper Title: LoV3D: Grounding Cognitive Prognosis Reasoning in Longitudinal 3D Brain MRI via Regional Volume Assessments

Paper overview: Paper Title: LoV3D: Grounding Cognitive Prognosis Reasoning in Longitudinal 3D Brain MRI via Regional Volume Assessments

arxiv url: http://arxiv.org/abs/2603.12071v1
Date: Thu, 12 Mar 2026 15:40:59 GMT
Status: Translation complete
Last updated in system: 2026-03-13 14:46:26.189774
Title: Paper Title: LoV3D: Grounding Cognitive Prognosis Reasoning in Longitudinal 3D Brain MRI via Regional Volume Assessments
Title (reference translation): Paper: LoV3D: Grounding Cognitive Prognosis Reasoning in Longitudinal 3D Brain MRI via Regional Volume Assessments
Authors: Zhaoyang Jiang, Zhizhong Fu, David McAllister, Yunsoo Kim, Honghan Wu,
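Abstract summary: This paper proposes LoV3D, a pipeline for training 3D vision-language models that read longitudinal T1-weighted brain MRI. The pipeline grounds the final diagnosis by enforcing label consistency, longitudinal coherence, and biological plausibility. On a subject-level held-out ADNI test set, LoV3D reaches 93.7% three-class diagnostic accuracy.
Reference score (independently computed attention metric): 14.481985722970238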
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Longitudinal brain MRI is essential for characterizing the progression of neurological diseases such as Alzheimer's disease. However, current deep-learning tools fragment this process: classifiers reduce a scan to a label, volumetric pipelines produce uninterpreted measurements, and vision-language models (VLMs) may generate fluent but potentially hallucinated conclusions. We present LoV3D, a pipeline for training 3D vision-language models that read longitudinal T1-weighted brain MRI, produce a region-level anatomical assessment, compare it longitudinally with the prior scan, and finally output a three-class diagnosis (Cognitively Normal, Mild Cognitive Impairment, or Dementia) along with a synthesized diagnostic summary. The stepped pipeline grounds the final diagnosis by enforcing label consistency, longitudinal coherence, and biological plausibility, thereby reducing the risk of hallucination. The training process introduces a clinically weighted Verifier that automatically scores candidate outputs against normative references derived from standardized volume metrics, driving Direct Preference Optimization without a single human annotation. On a subject-level held-out ADNI test set (479 scans, 258 subjects), LoV3D achieves 93.7% three-class diagnostic accuracy (+34.8% over the no-grounding baseline), 97.2% two-class diagnostic accuracy (+4% over the SOTA), and 82.6% region-level anatomical classification accuracy (+33.1% over VLM baselines). Zero-shot transfer yields 95.4% on MIRIAD (100% Dementia recall) and 82.9% three-class accuracy on AIBL, confirming high generalizability across sites, scanners, and populations. Code is available at https://github.com/Anonymous-TEVC/LoV-3D.
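Abstract (reference translation): Longitudinal brain MRI is indispensable for characterizing how neurological diseases such as Alzheimer's disease progress. Current deep-learning tools, however, split this process apart: classifiers reduce a scan to a label, volumetric pipelines produce measurements that nobody interprets, and vision-language models (VLMs) can generate conclusions that are fluent but possibly hallucinated. The paper proposes LoV3D, a pipeline for training 3D vision-language models that read longitudinal T1-weighted brain MRI, carry out a region-level anatomical assessment, compare the result longitudinally with the previous scan, and finally output a three-class diagnosis (Cognitively Normal, Mild Cognitive Impairment, or Dementia) together with a synthesized diagnostic summary. The stepped pipeline grounds the final diagnosis by enforcing label consistency, longitudinal coherence, and biological plausibility, which lowers the risk of hallucination. Training introduces a clinically weighted Verifier that automatically scores candidate outputs against normative references derived from standardized volume metrics, and these scores drive Direct Preference Optimization without any human annotation. On a subject-level held-out ADNI test set (479 scans, 258 subjects), LoV3D achieves 93.7% three-class diagnostic accuracy (+34.8% over the no-grounding baseline), 97.2% two-class diagnostic accuracy (+4% over the SOTA), and 82.6% region-level anatomical classification accuracy (+33.1% over VLM baselines). Zero-shot transfer reaches 95.4% on MIRIAD (100% Dementia recall) and 82.9% three-class accuracy on AIBL, confirming high generalizability across sites, scanners, and populations. The code is available at https://github.com/Anonymous-TEVC/LoV-3D.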
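
The abstract describes a Verifier that scores candidate outputs against normative volume references and uses those scores to drive Direct Preference Optimization. The following is a minimal illustrative sketch of that general idea only, not the authors' implementation: the region names, weights, z-score cutoff, and every function name here are assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical clinical weights per region (illustrative values, not from the paper).
REGION_WEIGHTS: Dict[str, float] = {
    "hippocampus": 3.0,
    "entorhinal_cortex": 2.0,
    "amygdala": 1.0,
}


def label_from_z(z: float, threshold: float = -1.5) -> str:
    """Map a normative volume z-score to a coarse label (assumed cutoff)."""
    return "atrophy" if z <= threshold else "normal"


def verifier_score(candidate: Dict[str, str], z_scores: Dict[str, float]) -> float:
    """Weighted fraction of regions where the candidate's label matches
    the label derived from the normative z-score."""
    total = sum(REGION_WEIGHTS[region] for region in z_scores)
    agree = sum(
        REGION_WEIGHTS[region]
        for region, z in z_scores.items()
        if candidate.get(region) == label_from_z(z)
    )
    return agree / total


def preference_pair(
    candidates: List[Dict[str, str]], z_scores: Dict[str, float]
) -> Tuple[Dict[str, str], Dict[str, str]]:
    """Return (chosen, rejected): the highest- and lowest-scoring candidates,
    which could serve as one preference pair for DPO-style training."""
    ranked = sorted(candidates, key=lambda c: verifier_score(c, z_scores))
    return ranked[-1], ranked[0]


# Toy example: two candidate region-level assessments for one scan.
z_scores = {"hippocampus": -2.0, "entorhinal_cortex": -0.5, "amygdala": -1.0}
candidate_a = {"hippocampus": "atrophy", "entorhinal_cortex": "normal", "amygdala": "normal"}
candidate_b = {"hippocampus": "normal", "entorhinal_cortex": "normal", "amygdala": "normal"}
chosen, rejected = preference_pair([candidate_a, candidate_b], z_scores)
```

In this toy setup the candidate that misses the heavily weighted hippocampus finding scores lower and becomes the rejected side of the pair, so no human label is needed to rank the two outputs.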
