Fugu-MT 論文翻訳(概要): EvoClinician: A Self-Evolving Agent for Multi-Turn Medical Diagnosis via Test-Time Evolutionary Learning

論文の概要: EvoClinician: A Self-Evolving Agent for Multi-Turn Medical Diagnosis via Test-Time Evolutionary Learning

arxiv url: http://arxiv.org/abs/2601.22964v1
Date: Fri, 30 Jan 2026 13:26:18 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-02 18:28:15.468715
Title: EvoClinician: A Self-Evolving Agent for Multi-Turn Medical Diagnosis via Test-Time Evolutionary Learning
Title（参考訳）: EvoClinician:テスト時間進化学習による多施設医療診断のためのセルフ進化エージェント
Authors: Yufei He, Juncheng Liu, Zhiyuan Hu, Yulin Chen, Yue Liu, Yuan Sui, Yibo Li, Nuo Chen, Jun Hu, Bryan Hooi, Xinxing Xu, Jiang Bian,
Abstract要約: エージェントのマルチターン診断能力を評価するためのベンチマークであるMed-Inquireを提案する。次に、テスト時に効率的な診断戦略を学ぶ自己進化エージェントであるEvoClinicianを紹介する。実験の結果,EvoClinicianは連続学習ベースラインや,メモリエージェントなどの自己進化エージェントよりも優れていた。
参考スコア（独自算出の注目度）: 72.70291772077738
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Prevailing medical AI operates on an unrealistic ''one-shot'' model, diagnosing from a complete patient file. However, real-world diagnosis is an iterative inquiry where Clinicians sequentially ask questions and order tests to strategically gather information while managing cost and time. To address this, we first propose Med-Inquire, a new benchmark designed to evaluate an agent's ability to perform multi-turn diagnosis. Built upon a dataset of real-world clinical cases, Med-Inquire simulates the diagnostic process by hiding a complete patient file behind specialized Patient and Examination agents. They force the agent to proactively ask questions and order tests to gather information piece by piece. To tackle the challenges posed by Med-Inquire, we then introduce EvoClinician, a self-evolving agent that learns efficient diagnostic strategies at test time. Its core is a ''Diagnose-Grade-Evolve'' loop: an Actor agent attempts a diagnosis; a Process Grader agent performs credit assignment by evaluating each action for both clinical yield and resource efficiency; finally, an Evolver agent uses this feedback to update the Actor's strategy by evolving its prompt and memory. Our experiments show EvoClinician outperforms continual learning baselines and other self-evolving agents like memory agents. The code is available at https://github.com/yf-he/EvoClinician
Abstract（参考訳）: 一般的な医療AIは非現実的な「ワンショット」モデルで動作し、完全な患者ファイルから診断する。しかし、現実の診断は、臨床医がコストと時間を管理しながら戦略的に情報を収集するための質問や検査を順次依頼する反復的な調査である。そこで本研究では,エージェントのマルチターン診断能力を評価するためのベンチマークであるMed-Inquireを提案する。 Med-Inquireは、実際の臨床症例のデータセットに基づいて、患者ファイルを完全に隠蔽することで、診断プロセスをシミュレートする。彼らはエージェントに積極的に質問をさせ、情報を一つずつ集めるようにテストを命じる。 Med-Inquireがもたらす課題に対処するために、テスト時に効率的な診断戦略を学ぶ自己進化エージェントであるEvoClinicianを紹介します。アクターエージェントは診断を試み、プロセスグレーダーエージェントは、臨床利益とリソース効率の両方についてそれぞれのアクションを評価し、クレジット割り当てを行う。最後に、Evolverエージェントは、このフィードバックを使用してアクターの戦略を更新し、そのプロンプトと記憶を進化させる。実験の結果,EvoClinicianは連続学習ベースラインや,メモリエージェントなどの自己進化エージェントよりも優れていた。コードはhttps://github.com/yf-he/EvoClinicianで公開されている。

論文の概要: EvoClinician: A Self-Evolving Agent for Multi-Turn Medical Diagnosis via Test-Time Evolutionary Learning

関連論文リスト