Fugu-MT 論文翻訳(概要): Don't Throw Away Your Beams: Improving Consistency-based Uncertainties in LLMs via Beam Search

論文の概要: Don't Throw Away Your Beams: Improving Consistency-based Uncertainties in LLMs via Beam Search

arxiv url: http://arxiv.org/abs/2512.09538v1
Date: Wed, 10 Dec 2025 11:24:29 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-11 15:14:53.494839
Title: Don't Throw Away Your Beams: Improving Consistency-based Uncertainties in LLMs via Beam Search
Title（参考訳）: ビームをスローしない:ビームサーチによるLCMの一貫性に基づく不確実性の改善
Authors: Ekaterina Fadeeva, Maiya Goloburda, Aleksandr Rubashevskii, Roman Vashurin, Artem Shelmanov, Preslav Nakov, Mrinmaya Sachan, Maxim Panov,
Abstract要約: 整合性に基づく不確実性推定の候補を生成するためにビームサーチを用いる新しい手法のファミリーを導入する。我々は、6つのQAデータセットに対する我々のアプローチを実証的に評価し、その多項サンプリングに対する一貫した改善が最先端のUQパフォーマンスをもたらすことを発見した。
参考スコア（独自算出の注目度）: 111.6996614063716
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Consistency-based methods have emerged as an effective approach to uncertainty quantification (UQ) in large language models. These methods typically rely on several generations obtained via multinomial sampling, measuring their agreement level. However, in short-form QA, multinomial sampling is prone to producing duplicates due to peaked distributions, and its stochasticity introduces considerable variance in uncertainty estimates across runs. We introduce a new family of methods that employ beam search to generate candidates for consistency-based UQ, yielding improved performance and reduced variance compared to multinomial sampling. We also provide a theoretical lower bound on the beam set probability mass under which beam search achieves a smaller error than multinomial sampling. We empirically evaluate our approach on six QA datasets and find that its consistent improvements over multinomial sampling lead to state-of-the-art UQ performance.
Abstract（参考訳）: 一貫性に基づく手法は、大規模言語モデルにおける不確実性定量化(UQ)に対する効果的なアプローチとして現れている。これらの手法は典型的には、マルチノミカルサンプリングによって得られた数世代に依拠し、それらの合意レベルを測定する。しかし, 短時間のQAでは, ピーク分布による重複が生じる傾向があり, その確率性は, 走行中の不確実性推定にかなりのばらつきをもたらす。本稿では, ビームサーチを用いて, 整合性に基づくUQの候補を生成する手法を提案する。また、ビーム探索がマルチパラメータサンプリングよりも誤差の少ないビームセット確率質量の理論的下界も提供する。我々は、6つのQAデータセットに対する我々のアプローチを実証的に評価し、その多項サンプリングに対する一貫した改善が最先端のUQパフォーマンスをもたらすことを発見した。

論文の概要: Don't Throw Away Your Beams: Improving Consistency-based Uncertainties in LLMs via Beam Search

関連論文リスト