Fugu-MT 論文翻訳(概要): Beyond Compromise: Pareto-Lenient Consensus for Efficient Multi-Preference LLM Alignment

論文の概要: Beyond Compromise: Pareto-Lenient Consensus for Efficient Multi-Preference LLM Alignment

arxiv url: http://arxiv.org/abs/2604.05965v1
Date: Tue, 07 Apr 2026 14:58:35 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-08 17:42:09.89832
Title: Beyond Compromise: Pareto-Lenient Consensus for Efficient Multi-Preference LLM Alignment
Title（参考訳）: 妥協を超えて: 効率の良いマルチパラメータLLMアライメントのためのパレートLenient Consensus
Authors: Renxuan Tan, Rongpeng Li, Zhifeng Zhao, Honggang Zhang,
Abstract要約: 動的ネゴシエーションプロセスとしてアライメントを再定義するゲーム理論フレームワークを提案する。厳密なアプローチとは異なり、PLCは局所的な劣化を動的に許容するコンセンサス駆動の lenient rectification を導入する。この研究は、MPAの有望な道として交渉主導型アライメントの可能性を強調している。
参考スコア（独自算出の注目度）: 12.219758006484689
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Transcending the single-preference paradigm, aligning LLMs with diverse human values is pivotal for robust deployment. Contemporary Multi-Objective Preference Alignment (MPA) approaches predominantly rely on static linear scalarization or rigid gradient projection to navigate these trade-offs. However, by enforcing strict conflict avoidance or simultaneous descent, these paradigms often prematurely converge to local stationary points. While mathematically stable, these points represent a conservative compromise where the model sacrifices potential global Pareto improvements to avoid transient local trade-offs. To break this deadlock, we propose Pareto-Lenient Consensus (PLC), a game-theoretic framework that reimagines alignment as a dynamic negotiation process. Unlike rigid approaches, PLC introduces consensus-driven lenient gradient rectification, which dynamically tolerates local degradation provided there is a sufficient dominant coalition surplus, thereby empowering the optimization trajectory to escape local suboptimal equilibrium and explore the distal Pareto-optimal frontier. Theoretical analysis validates PLC can facilitate stalemate escape and asymptotically converge to a Pareto consensus equilibrium. Moreover, extensive experiments show that PLC surpasses baselines in both fixed-preference alignment and global Pareto frontier quality. This work highlights the potential of negotiation-driven alignment as a promising avenue for MPA. Our codes are available at https://anonymous.4open.science/r/aaa-6BB8.
Abstract（参考訳）: 単一参照パラダイムを超越して、LLMをさまざまな人的価値と整合させることが、ロバストなデプロイメントにおいて重要なのです。現代のMulti-Objective Preference Alignment (MPA) アプローチは、これらのトレードオフをナビゲートするために、静的な線形スカラー化や厳密な勾配投影に依存している。しかし、厳密な衝突回避や同時降下を強制することにより、これらのパラダイムは早期に局所的な定常点に収束することが多い。数学的には安定しているが、これらの点は、過渡的な局所的なトレードオフを避けるために、モデルが世界的なパレートの改善を犠牲にする保守的な妥協を表している。このデッドロックを破るために,動的ネゴシエーションプロセスとしてアライメントを再定義するゲーム理論フレームワークであるPareto-Lenient Consensus (PLC)を提案する。厳密なアプローチとは異なり、PLCは、十分な支配的な連立余剰が存在する場合、局所的な劣化を動的に許容し、最適化軌道が局所的最適均衡から逃れ、遠位パレート・最適フロンティアを探索する。理論的解析により、PLCはスタレマトエスケープを促進し、漸近的にパレートのコンセンサス均衡に収束する。さらに、広範な実験により、PLCは固定参照アライメントとグローバルパレートフロンティア品質の両方において、ベースラインを超越していることが示されている。この研究は、MPAの有望な道として交渉主導型アライメントの可能性を強調している。私たちのコードはhttps://anonymous.4open.science/r/aa-6BB8.comで公開されています。

論文の概要: Beyond Compromise: Pareto-Lenient Consensus for Efficient Multi-Preference LLM Alignment

関連論文リスト