Fugu-MT Paper Translation (Summary): When Gradients Collide: Failure Modes of Multi-Objective Prompt Optimization for LLM Judges

Paper summary: When Gradients Collide: Failure Modes of Multi-Objective Prompt Optimization for LLM Judges

arxiv url: http://arxiv.org/abs/2605.26046v1
Date: Mon, 25 May 2026 17:08:55 GMT
Status: Translation complete
Last updated in system: 2026-05-26 19:50:20.538232
Title: When Gradients Collide: Failure Modes of Multi-Objective Prompt Optimization for LLM Judges
Authors: Parth Darshan, Abhishek Divekar
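Abstract summary: Shows that the conflict-resolution toolkit of multi-task learning (PCGrad, MGDA) does not apply to the multi-objective textual gradient setting. Gradient specificity drops by 59% (from 9.0 to 3.7) when the gradient LLM processes multiple criteria jointly. Identifies two separable failure modes: optimization-time gradient dilution and inference-time instruction interference.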
Reference score (independently computed attention score): 0.3580891736370874
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Customizing an LLM judge to a specific task or domain often involves optimizing its prompt across multiple evaluation criteria simultaneously. Textual gradient methods automate this for a single judge criterion, but they produce natural-language critiques, not numerical vectors. Thus, the conflict-resolution toolkit of multi-task learning (PCGrad, MGDA) does not apply to the multi-objective textual gradient setting. We test five decomposition modes of textual gradient optimizers by varying how much cross-task information the loss, gradient, and optimizer LLMs share. In 6 of 10 configurations, we observe that optimization never improves over the initial prompt. Gradient specificity drops by 59% (from 9.0 to 3.7) when the gradient LLM processes multiple criteria jointly. Separately, we observe that naively combining per-task instructions into a single prompt degrades Spearman's rho by 5.3%. These results identify two separable failure modes: optimization-time gradient dilution and inference-time instruction interference, which together constrain the design space for multi-objective judge customization using textual feedback.
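For readers unfamiliar with PCGrad, the following is a minimal sketch (not taken from the paper) of the kind of conflict resolution the abstract refers to. It assumes gradients are plain lists of floats and uses a fixed projection order, whereas the published PCGrad method shuffles the order of the other tasks; the function names `dot`, `project_conflict`, and `pcgrad_combine` are hypothetical. The point it illustrates is that every step relies on dot products and vector arithmetic, which have no direct counterpart for natural-language critiques.

```python
from typing import List, Sequence


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def project_conflict(g_i: Sequence[float], g_j: Sequence[float]) -> List[float]:
    """Remove from g_i its component along g_j, but only when the two
    gradients conflict (negative dot product)."""
    d = dot(g_i, g_j)
    norm_sq = dot(g_j, g_j)
    if d >= 0 or norm_sq == 0:
        return list(g_i)  # no conflict (or zero vector): leave g_i unchanged
    scale = d / norm_sq
    return [x - scale * y for x, y in zip(g_i, g_j)]


def pcgrad_combine(grads: Sequence[Sequence[float]]) -> List[float]:
    """Project each task gradient against every other task's original
    gradient, then sum the projected gradients.
    Assumption: fixed iteration order instead of PCGrad's random order."""
    projected = []
    for i, g in enumerate(grads):
        g_proj = list(g)
        for j, other in enumerate(grads):
            if i != j:
                g_proj = project_conflict(g_proj, other)
        projected.append(g_proj)
    return [sum(components) for components in zip(*projected)]


# Two conflicting task gradients (their dot product is negative).
g1 = [1.0, 0.0]
g2 = [-1.0, 1.0]
combined = pcgrad_combine([g1, g2])
```

With textual gradients, `g1` and `g2` would be critique strings, so neither the conflict test (`d >= 0`) nor the projection step is defined without some additional numerical representation.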
