Fugu-MT 論文翻訳(概要): Do Retrieval Augmented Language Models Know When They Don't Know?

論文の概要: Do Retrieval Augmented Language Models Know When They Don't Know?

arxiv url: http://arxiv.org/abs/2509.01476v1
Date: Mon, 01 Sep 2025 13:44:15 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-04 15:17:03.722944
Title: Do Retrieval Augmented Language Models Know When They Don't Know?
Title（参考訳）: Retrieval Augmented Language Modelsはいつ知らないのか?
Authors: Youchao Zhou, Heyan Huang, Yicheng Liu, Rui Dai, Xinglin Wang, Xingchen Zhang, Shumin Shi, Yang Deng,
Abstract要約: ALMはいつ知らないのか知っていますか? 期待とは対照的に, LLM は有意なテキストバッファー-拒否行動を示す。提案手法は, 学習後モデルに対する簡易かつ効果的な拒絶手法を開発し, 解答品質を向上する。
参考スコア（独自算出の注目度）: 55.72375712577378
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Existing Large Language Models (LLMs) occasionally generate plausible yet factually incorrect responses, known as hallucinations. Researchers are primarily using two approaches to mitigate hallucinations, namely Retrieval Augmented Language Models (RALMs) and refusal post-training. However, current research predominantly emphasizes their individual effectiveness while overlooking the evaluation of the refusal capability of RALMs. In this study, we ask the fundamental question: Do RALMs know when they don't know? Specifically, we ask three questions. First, are RALMs well-calibrated regarding different internal and external knowledge states? We examine the influence of various factors. Contrary to expectations, we find that LLMs exhibit significant \textbf{over-refusal} behavior. Then, how does refusal post-training affect the over-refusal issue? We investigate the Refusal-aware Instruction Tuning and In-Context Fine-tuning methods. Our results show that the over-refusal problem is mitigated by In-context fine-tuning. but magnified by R-tuning. However, we also find that the refusal ability may conflict with the quality of the answer. Finally, we develop a simple yet effective refusal method for refusal post-trained models to improve their overall answer quality in terms of refusal and correct answers. Our study provides a more comprehensive understanding of the influence of important factors on RALM systems.
Abstract（参考訳）: 既存のLarge Language Models (LLMs) は時折、幻覚(幻覚)として知られる、真に正しくない応答を生成する。研究者は主に幻覚を緩和する2つのアプローチ、すなわちRetrieval Augmented Language Models (RALMs) とRetrieval Post-training(英語版)の2つを使っている。しかし、現在の研究は、ALMの拒絶能力の評価を見越しながら、個々の効果を主に強調している。 ALMはいつ、自分が知らないのかを知っていますか? 具体的には3つの質問をします。まず、ALMは内部と外部の異なる知識状態についてよく校正されているか? 各種要因の影響について検討する。期待とは対照的に, LLM は有意な \textbf{over-refusal} な振る舞いを示す。では、ポストトレーニングの拒絶は、オーバートレーニングの問題にどのように影響しますか? 本稿では,Refusal-aware Instruction Tuning法とIn-Context Fine-tuning法について検討する。以上の結果から,インコンテクストの微調整によりオーバーリファインの問題が緩和されることが示唆された。しかし、Rチューニングによって拡大します。しかし、拒絶能力は答えの質と矛盾する可能性があることも判明した。最後に,リファインダーモデルに対する簡易かつ効果的なリファインダー手法を開発し,リファインダーと正解の両面から全体の回答品質を改善する。本研究は、ALMシステムに対する重要な要因の影響について、より包括的な理解を提供する。

論文の概要: Do Retrieval Augmented Language Models Know When They Don't Know?

関連論文リスト