Fugu-MT 論文翻訳(概要): From `May' to `Is': Certainty Distortion in Language Model Rewriting

論文の概要: From `May' to `Is': Certainty Distortion in Language Model Rewriting

arxiv url: http://arxiv.org/abs/2606.07951v1
Date: Sat, 06 Jun 2026 02:53:31 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-09 14:42:05.57964
Title: From `May' to `Is': Certainty Distortion in Language Model Rewriting
Title（参考訳）: May から `Is' へ:言語モデル書き換えにおける確実な歪み
Authors: Catarina G Belem, Shang Wu, Hongyu Yao, Mark Steyvers, Sameer Singh, Padhraic Smyth,
Abstract要約: 言語モデル(LM)における確実性歪みについて検討する。本稿では,集団レベルでの確実性判定と一致するLMに基づく評価基準を提案する。これらの結果から,確実性歪みが最大75%のLM出力に影響を及ぼし,書き直し作業において系統的に非対称であることが示唆された。
参考スコア（独自算出の注目度）: 22.185142741738783
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Humans increasingly turn to Language Models (LMs) in ways that shape beliefs and drive decisions, including discussing, rewriting, and summarizing information from scientific articles, news, and medical reports. However, in these domains, where how confidently a claim is expressed matters, little is known about whether LMs faithfully preserve it. In this work, we investigate certainty distortion in LMs, defined as meaningful changes in expressed certainty when semantic content is preserved. We propose an LM-based evaluation metric that is consistent with population-level judgments of certainty. Using this metric, we characterize certainty distortion across different sizes and families of models in the context of scientific and medical communication tasks. Our results show that certainty distortion affects up to 75\% of LM outputs and is systematically asymmetric in rewriting tasks with most LMs being 1.5-2$\times$ more likely to increase the expressed certainty than to decrease it. These effects can compound over repeated paraphrasing: in the medical domain, claude-haiku-4-5 increases certainty of 20\% examples after a single iteration, increasing to 40\% after five iterations. Prompt-based interventions reduce overall certainty distortion but do not eliminate it. Together, these findings reveal a general bias toward inflating expressed certainty, with direct implications for users who rely on LMs in high-stakes domains.
Abstract（参考訳）: 人間は、信念を形作り、科学的記事、ニュース、医療報告から情報を議論、書き直し、要約するなど決定を導く方法で言語モデル(LM)に目を向けるようになっている。しかし、これらの領域では、主張がいかに自信を持って表現されるかが問題であり、LMがそれを忠実に保存するかどうかはほとんど分かっていない。本研究では,意味的内容が保存されている場合の表現的確実性において意味のある変化として定義されるLMの確実性歪みについて検討する。本稿では,集団レベルでの確実性判定と一致したLMに基づく評価指標を提案する。この測定値を用いて、科学的・医学的なコミュニケーションタスクの文脈において、異なるサイズのモデルやモデルのファミリーにまたがる確実性歪みを特徴づける。以上の結果から,自信の歪みは最大75\%のLM出力に影響し,ほとんどのLMが1.5-2$\times$で書き直し作業において系統的に非対称であることが明らかとなった。医療領域では、クロードハイク-4-5は1回の反復で20 %のサンプルを確実に増加させ、5回の反復で40 %まで増加する。プロンプトに基づく介入は全体的な確実性の歪みを減少させるが、それを排除しない。これらの結果から,高吸収領域のLMに依存するユーザに対して,インフレーションに対する一般的な偏見が示唆された。

論文の概要: From `May' to `Is': Certainty Distortion in Language Model Rewriting

関連論文リスト