Fugu-MT 論文翻訳(概要): MGDA-Decoupled: Geometry-Aware Multi-Objective Optimisation for DPO-based LLM Alignment

論文の概要: MGDA-Decoupled: Geometry-Aware Multi-Objective Optimisation for DPO-based LLM Alignment

arxiv url: http://arxiv.org/abs/2604.20685v1
Date: Wed, 22 Apr 2026 15:33:45 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-23 15:36:11.197819
Title: MGDA-Decoupled: Geometry-Aware Multi-Objective Optimisation for DPO-based LLM Alignment
Title（参考訳）: MGDA-Decoupled:DPOに基づくLLMアライメントのための幾何学的多目的最適化
Authors: Andor Vári-Kakas, Ji Won Park, Natasa Tagasovska,
Abstract要約: 幾何学に基づく多目的最適化アルゴリズムMGDA-Decoupledを導入する。それぞれの目的の収束ダイナミクスを明示的に説明しながら、共通の降下方向を見つける。 UltraFeedbackデータセットの実験では、MGDA-Decoupledがゴールデンレスポンスに対して最高勝利率を達成した。
参考スコア（独自算出の注目度）: 6.301256425456381
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Aligning large language models (LLMs) to desirable human values requires balancing multiple, potentially conflicting objectives such as helpfulness, truthfulness, and harmlessness, which presents a multi-objective optimisation challenge. Most alignment pipelines rely on a fixed scalarisation of these objectives, which can introduce procedural unfairness by systematically under-weighting harder-to-optimise or minority objectives. To promote more equitable trade-offs, we introduce MGDA-Decoupled, a geometry-based multi-objective optimisation algorithm that finds a shared descent direction while explicitly accounting for each objective's convergence dynamics. In contrast to prior methods that depend on reinforcement learning (e.g., GAPO) or explicit reward models (e.g., MODPO), our approach operates entirely within the lightweight Direct Preference Optimisation (DPO) paradigm. Experiments on the UltraFeedback dataset show that geometry-aware methods -- and MGDA-Decoupled in particular -- achieve the highest win rates against golden responses, both overall and per objective.
Abstract（参考訳）: 大きな言語モデル(LLM)を望ましい人的価値に適応させるには、多目的最適化の課題を示す、有益性、真実性、無害性といった、競合する可能性のある複数の目標のバランスが必要である。ほとんどのアライメントパイプラインは、これらの目的の固定されたスカラー化に依存しており、システマティックに過度に過度に過度に最適化する、あるいは少数な目的を導入することによって、手続き上の不公平をもたらす可能性がある。より公平なトレードオフを促進するため、MGDA-Decoupledという幾何に基づく多目的最適化アルゴリズムを導入し、各対象の収束ダイナミクスを明示的に考慮しながら、共有降下方向を求める。強化学習(GAPOなど)や明示的な報酬モデル(MODPOなど)に依存する従来の手法とは対照的に,本手法は軽量な直接選好最適化(DPO)パラダイムで完全に動作する。 UltraFeedbackデータセットの実験では、ジオメトリ対応のメソッド -- 特にMGDA-Decoupled -- が、全体と目的の両方において、ゴールデンレスポンスに対して最高の勝利率を達成したことが示されている。

論文の概要: MGDA-Decoupled: Geometry-Aware Multi-Objective Optimisation for DPO-based LLM Alignment

関連論文リスト