Fugu-MT 論文翻訳(概要): LLM-Guided Evolution for Medical Decision Pipelines

論文の概要: LLM-Guided Evolution for Medical Decision Pipelines

arxiv url: http://arxiv.org/abs/2606.07342v1
Date: Fri, 05 Jun 2026 14:53:16 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-08 14:33:29.797321
Title: LLM-Guided Evolution for Medical Decision Pipelines
Title（参考訳）: 医療用パイプラインのLCMガイドによる進化
Authors: Ivan Sviridov, Artem Oskin, Ivan Panin, Iaroslav Bespalov, Dmitry Dylov, Ivan Oseledets, Aleksandr Nesterov,
Abstract要約: LLM誘導MAP-Elitesの進化を医学的意思決定戦略の発見のための推論時代替として検討する。我々は,タスク固有の適合関数によって最適化された実行可能アーティファクトの進化的探索として,トリアージ,インタラクティブコンサルテーション,医用画像分類を定式化する。
参考スコア（独自算出の注目度）: 36.34631320654623
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Adapting large language models (LLMs) to clinical workflows often requires costly fine-tuning or manual prompt and pipeline engineering. We study LLM-guided MAP-Elites evolution as an inference-time alternative for discovering medical decision strategies and provide an implementation repository at https://github.com/univanxx/llm_guided_evo_medical. We formulate urgency triage, interactive consultation, and medical image classification as evolutionary searches over executable artifacts optimized by task-specific fitness functions. Across all three settings, evolution improves over manually designed baselines under practical constraints. In triage, evolved programs increase Semigran accuracy from $77.3\%$ to $87.1\%$ and emergency recall from $0.60$ to $0.97$, while improving safety-weighted held-out MIMIC-ESI performance. In interactive consultation, evolved policies improve the accuracy--cost frontier across Llama-3, Qwen-3.5, and Gemma-4 and transfer to held-out iCRAFTMD. In PneumoniaMNIST, prompt-only evolution improves frozen MedGemma VLMs while preserving strict JSON outputs. Qualitative analysis shows that the gains come from interpretable program-level mechanisms, calibrated triage boundaries, targeted evidence acquisition, selective commitment, and finding-oriented visual decision rules, rather than superficial prompt rewording alone.
Abstract（参考訳）: 大規模言語モデル(LLM)を臨床ワークフローに適合させるには、手動のプロンプトとパイプラインエンジニアリングのコストがかかることが多い。我々は LLM 誘導 MAP-Elites の進化を,医学的意思決定戦略を発見するための推論時代替手段として検討し, https://github.com/univanxx/llm_guided_evo_medical で実装リポジトリを提供する。タスク固有のフィットネス機能によって最適化された実行可能アーティファクトの進化的探索として、緊急トリアージ、インタラクティブコンサルテーション、医用画像分類を定式化する。これら3つの設定全体で、実践的な制約の下で手作業で設計されたベースラインよりも進化が改善される。トリアージでは、進化したプログラムは、セミグラの精度を7.3.%から87.1.%に引き上げ、緊急時のリコールを0.60ドルから0.97ドルに引き上げ、安全性に富んだMIMIC-ESIのパフォーマンスを改善した。インタラクティブなコンサルティングにおいて、進化したポリシーは、Llama-3、Qwen-3.5、Gemma-4をまたいだ精度・コストのフロンティアを改善し、保留中のiCRAFTMDに転送する。 PneumoniaMNISTでは、プロンプトのみの進化は、厳格なJSON出力を保持しながら、凍結したMedGemma VLMを改善する。質的な分析は、プログラムレベルのメカニズム、校正されたトリアージ境界、標的となる証拠の取得、選択的なコミットメント、探索指向の視覚的決定規則から得られる利益が、表面的なプロンプトの言い換えだけでは得られないことを示している。

論文の概要: LLM-Guided Evolution for Medical Decision Pipelines

関連論文リスト