Fugu-MT Paper Translation (Abstract): Second Guess: Detecting Uncertainty Through Abstention and Answer Stability in Small Language Models

Paper overview: Second Guess: Detecting Uncertainty Through Abstention and Answer Stability in Small Language Models

arxiv url: http://arxiv.org/abs/2605.25394v1
Date: Mon, 25 May 2026 03:38:54 GMT
Status: Translation complete
Last updated in system: 2026-05-26 19:50:19.274447
Title: Second Guess: Detecting Uncertainty Through Abstention and Answer Stability in Small Language Models
Title (reference translation): Second Guess: Detecting Uncertainty Through Abstention and Answer Stability in Small Language Models
Authors: Ashwath Vaithinathan Aravindan, Mayank Kejriwal
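Abstract summary: Large language models often generate confident but incorrect answers rather than abstaining when uncertain. The paper proposes _Second Guess_, a lightweight, parameter-free prompting technique for abstention in multiple-choice question answering (MCQA). Second Guess achieves a composite risk improvement of 10.81%.
Reference score (independently computed attention measure): 2.5782420501870296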
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models often generate confident but incorrect answers rather than abstaining when uncertain. This problem is particularly acute for small language models (SLMs), where computational constraints and autonomous operation amplify the need for reliable uncertainty detection. We propose _Second Guess_, a lightweight, parameter-free prompting technique for abstention in multiple-choice question answering (MCQA) that is well-suited for SLMs. Our key empirical insight is that models which truly know an answer will select it consistently, while uncertain models exhibit unstable behavior when an "I don't know" option is added. Evaluated on four open models (2B-8B parameters) and four benchmarks, Second Guess achieves the highest composite risk improvement of 10.81%. Notably, it maintains an 8% composite risk improvement on fine-tuned models where entropy-based methods degrade, and improves most for lower-performing models. All code and results required to reproduce this work are available at https://github.com/Mystic-Slice/second-guess
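Abstract (reference translation): Large language models often produce confident but incorrect answers instead of abstaining when they are uncertain. The problem is especially acute for small language models (SLMs), where computational constraints and autonomous operation increase the need for reliable uncertainty detection. This paper proposes _Second Guess_, a lightweight, parameter-free prompting technique for abstention in multiple-choice question answering (MCQA) that is well suited to SLMs. The key empirical insight is that a model that truly knows an answer selects it consistently, whereas an uncertain model behaves unstably once an "I don't know" option is added. Evaluated on four open models (2B-8B parameters) and four benchmarks, Second Guess achieves the highest composite risk improvement, 10.81%. Notably, it maintains an 8% composite risk improvement on fine-tuned models, where entropy-based methods degrade, and it improves most for lower-performing models. The code and results needed to reproduce this work are available at https://github.com/Mystic-Slice/second-guess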
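The stability idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation (their code is in the linked repository), and the details are assumptions: `ask` is a hypothetical callable that wraps a model and returns the index of the option it picks, the question is posed once without and once with an appended "I don't know" option, and the sketch abstains when the model either picks that option or changes its answer.

```python
from typing import Callable, Optional, Sequence

# Assumed wording of the extra option; the paper's exact prompt may differ.
IDK_OPTION = "I don't know"


def second_guess(
    question: str,
    options: Sequence[str],
    ask: Callable[[str, Sequence[str]], int],
) -> Optional[int]:
    """Return the index of the chosen option, or None to abstain.

    `ask` is a hypothetical model wrapper: given a question and a list of
    options, it returns the index of the option the model selects.
    """
    # First pass: the original options only.
    first = ask(question, options)

    # Second pass: the same options plus an explicit "I don't know".
    augmented = list(options) + [IDK_OPTION]
    second = ask(question, augmented)

    # Abstain if the model picks "I don't know" or its answer is unstable.
    if second == len(options) or second != first:
        return None
    return first


# Toy stand-ins for a model, for illustration only.
def confident_model(question: str, options: Sequence[str]) -> int:
    return 1  # always selects the option at index 1


def unsure_model(question: str, options: Sequence[str]) -> int:
    # Selects "I don't know" when it is offered, otherwise guesses index 0.
    return list(options).index(IDK_OPTION) if IDK_OPTION in options else 0
```

With these toy models, `second_guess("q", ["a", "b", "c"], confident_model)` keeps the answer at index 1, while `unsure_model` leads to abstention. A real deployment would replace the toy functions with calls to an actual language model.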

Related papers