Fugu-MT 論文翻訳(概要): Beyond Majority Voting: LLM Aggregation by Leveraging Higher-Order Information

論文の概要: Beyond Majority Voting: LLM Aggregation by Leveraging Higher-Order Information

arxiv url: http://arxiv.org/abs/2510.01499v1
Date: Wed, 01 Oct 2025 22:21:50 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-03 16:59:20.892744
Title: Beyond Majority Voting: LLM Aggregation by Leveraging Higher-Order Information
Title（参考訳）: 多数投票を超えて - 高次情報の活用によるLLM集約
Authors: Rui Ai, Yuqi Pan, David Simchi-Levi, Milind Tambe, Haifeng Xu,
Abstract要約: 最適重み(OW)と逆サプライシング人気度(ISP)という2つの新しいアグリゲーションアルゴリズムを開発した。我々の理論的分析は、これらの手法が軽微な仮定の下での多数決の本質的な制限を確実に緩和することを示している。我々は,我々のアルゴリズムを人工データセット,UltraFeedbackやMMLUなどのLLMファインチューニングベンチマーク,実世界の医療環境ARMMAN上で実証的に検証した。
参考スコア（独自算出の注目度）: 57.397381631496906
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: With the rapid progress of multi-agent large language model (LLM) reasoning, how to effectively aggregate answers from multiple LLMs has emerged as a fundamental challenge. Standard majority voting treats all answers equally, failing to consider latent heterogeneity and correlation across models. In this work, we design two new aggregation algorithms called Optimal Weight (OW) and Inverse Surprising Popularity (ISP), leveraging both first-order and second-order information. Our theoretical analysis shows these methods provably mitigate inherent limitations of majority voting under mild assumptions, leading to more reliable collective decisions. We empirically validate our algorithms on synthetic datasets, popular LLM fine-tuning benchmarks such as UltraFeedback and MMLU, and a real-world healthcare setting ARMMAN. Across all cases, our methods consistently outperform majority voting, offering both practical performance gains and conceptual insights for the design of robust multi-agent LLM pipelines.
Abstract（参考訳）: マルチエージェント大規模言語モデル(LLM)推論の急速な進歩により、複数のLLMからの回答を効果的に集約する方法が根本的な課題として浮上した。標準多数決は全ての答えを等しく扱い、不均一性やモデル間の相関を考慮していない。本研究では, 最適重み(OW)と逆サプライシング人気度(ISP)という2つの新しいアグリゲーションアルゴリズムを設計し, 1次情報と2次情報の両方を活用する。我々の理論的分析は、これらの手法が穏やかな仮定の下で多数決の固有の制限を確実に緩和し、より信頼性の高い集団決定につながることを示している。我々は,我々のアルゴリズムを人工データセット,UltraFeedbackやMMLUなどのLLMファインチューニングベンチマーク,実世界の医療環境ARMMAN上で実証的に検証した。いずれの場合も,我々の手法は多数決を一貫して上回り,ロバストなマルチエージェントLLMパイプラインの設計において,実用的な性能向上と概念的洞察を提供する。

論文の概要: Beyond Majority Voting: LLM Aggregation by Leveraging Higher-Order Information

関連論文リスト