Fugu-MT 論文翻訳(概要): Internalizing Self-Consistency in Language Models: Multi-Agent Consensus Alignment

論文の概要: Internalizing Self-Consistency in Language Models: Multi-Agent Consensus Alignment

arxiv url: http://arxiv.org/abs/2509.15172v2
Date: Tue, 30 Sep 2025 19:57:55 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-02 17:16:29.738926
Title: Internalizing Self-Consistency in Language Models: Multi-Agent Consensus Alignment
Title（参考訳）: 言語モデルにおける自己整合性の内部化:マルチエージェント・コンセンサスアライメント
Authors: Ankur Samanta, Akshayaa Magesh, Youliang Yu, Runzhe Wu, Ayush Jain, Daniel Jiang, Boris Vidolov, Paul Sajda, Yonathan Efroni, Kaveh Hassani,
Abstract要約: 言語モデル(LM)は矛盾する推論子であり、しばしば同じプロンプトに対する矛盾した応答を生成する。適切に整合した推論モデルの本質的な性質として自己整合性を定式化し、MACA(Multi-Agent Consensus Alignment)を導入する。 MACAは、エージェントが自分自身をより決定的かつ簡潔に教えることを可能にし、外部の監督なしにマルチエージェント設定におけるピアインサイトをより活用する。
参考スコア（独自算出の注目度）: 22.305033366660187
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Language Models (LMs) are inconsistent reasoners, often generating contradictory responses to identical prompts. While inference-time methods can mitigate these inconsistencies, they fail to address the core problem: LMs struggle to reliably select reasoning pathways leading to consistent outcomes under exploratory sampling. To address this, we formalize self-consistency as an intrinsic property of well-aligned reasoning models and introduce Multi-Agent Consensus Alignment (MACA), a reinforcement learning framework that post-trains models to favor reasoning trajectories aligned with their internal consensus using majority/minority outcomes from multi-agent debate. These trajectories emerge from deliberative exchanges where agents ground reasoning in peer arguments, not just aggregation of independent attempts, creating richer consensus signals than single-round majority voting. MACA enables agents to teach themselves to be more decisive and concise, and better leverage peer insights in multi-agent settings without external supervision, driving substantial improvements across self-consistency (+27.6% on GSM8K), single-agent reasoning (+23.7% on MATH), sampling-based inference (+22.4% Pass@20 on MATH), and multi-agent ensemble decision-making (+42.7% on MathQA). These findings, coupled with strong generalization to unseen benchmarks (+16.3% on GPQA, +11.6% on CommonsenseQA), demonstrate robust self-alignment that more reliably unlocks latent reasoning potential of language models.
Abstract（参考訳）: 言語モデル(LM)は矛盾する推論子であり、しばしば同じプロンプトに対する矛盾した応答を生成する。推論時間法はこれらの矛盾を緩和するが、中核的な問題に対処することができない: LMは探索サンプリングの下で一貫した結果をもたらす推論経路を確実に選択するのに苦労する。そこで本研究では, 自己整合性を, 適切に整合した推論モデルの本質的な性質として定式化し, マルチエージェント・コンセンサス・アライメント(MACA)を導入した。これらの軌道は、独立した試みの集合だけでなく、エージェントが単独の過半数投票よりもリッチなコンセンサス信号を生成するような議論的な交換から生まれる。 MACAにより、エージェントはより決定的かつ簡潔で、外部の監督なしにマルチエージェント設定におけるピアインサイトをより活用し、自己整合性(GSM8Kでは+27.6%)、シングルエージェント推論(MATHでは+23.7%)、サンプリングベース推論(MATHでは+22.4%)、マルチエージェントアンサンブル意思決定(MathQAでは+42.7%)で大幅に改善される。これらの発見とGPQAで+16.3%、CommonsenseQAで+11.6%)の強い一般化は、言語モデルの潜在推論可能性をより確実に解き放つ堅牢な自己アライメントを示す。

論文の概要: Internalizing Self-Consistency in Language Models: Multi-Agent Consensus Alignment

関連論文リスト