Fugu-MT Paper Translation (Summary): SafeLLM: Extraction as a Hallucination-Resistant Alternative to Rewriting in Safety-Critical Settings


arxiv url: http://arxiv.org/abs/2606.12897v1
Date: Thu, 11 Jun 2026 04:55:37 GMT
Status: Translation complete
Last updated in system: 2026-06-12 15:55:27.584613
Title: SafeLLM: Extraction as a Hallucination-Resistant Alternative to Rewriting in Safety-Critical Settings
Authors: Julia Ive, Felix Jozsa, Evridiki Georgaki, Nabeel Sheikh, Emma Cattell, Nick Jackson, Paulina Bondaronek, Ciaran Scott Hill, Richard Dobson
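Abstract summary: Retrieval-augmented generation (RAG) systems that rely on free-form rewriting can introduce hallucinations and unstable trade-offs between completeness and conciseness. The paper compares strategies that balance precision, recall and safety across document types and model scales. Experiments are conducted on documents of varying length and structure, including local NHS acute care and oncology guidelines and UK-wide NICE guidelines.
Reference score (attention score computed independently by this site): 2.7975477743127346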
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models (LLMs) are increasingly used to access organisational documentation, including standard operating procedures (SOPs), HR policies and institutional guidelines. However, retrieval-augmented generation (RAG) systems that rely on free-form rewriting can introduce hallucinations and unstable trade-offs between completeness and conciseness, particularly in safety- and compliance-critical settings.
Objectives: To evaluate extraction as a hallucination-resistant alternative to rewriting-based RAG and compare strategies that balance precision, recall and safety across document types and model scales.
Methods: We compare multiple prompting strategies, including line-number-based source selection, extraction of relevant guideline sentences with explicit safety annotations, and a multi-stage pipeline that refines draft answers using supporting evidence from source guidelines. Experiments are conducted on documents of varying length and structure, including local NHS acute care and oncology guidelines and UK-wide NICE guidelines, using both frontier-scale and locally deployable models. Performance is assessed using automatic metrics and human expert evaluation of relevance and completeness.
Results: Line-number selection achieves the strongest results, outperforming direct copying and safety-focused strategies across both large and small models while maintaining high term recall (up to 95%) and close alignment with source text. Safety-oriented approaches improve precision but introduce systematic omissions, while multi-stage filtering further amplifies this trade-off. Performance varies with document structure: line-based extraction excels in protocol-like content, whereas alternative strategies perform better on more verbose documents (up to 97% term recall).
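Illustrative sketch (not from the paper): the abstract does not give the prompts, the output format, or the exact definition of term recall, so the following Python is only a minimal sketch of the general idea behind line-number-based source selection. It assumes the model is shown a numbered guideline and replies with line numbers such as "2, 3-4", which are then resolved back to verbatim source lines so that no rewritten text reaches the user. All function names, the reply format, and the substring-based term recall are hypothetical choices made for this example.

```python
import re


def number_lines(document: str) -> list[str]:
    """Split a document into non-empty lines; list index i holds line i + 1."""
    return [ln.strip() for ln in document.splitlines() if ln.strip()]


def render_numbered(lines: list[str]) -> str:
    """Render lines as '1: text' so a model can refer to them by number."""
    return "\n".join(f"{i}: {ln}" for i, ln in enumerate(lines, start=1))


def parse_selection(reply: str, n_lines: int) -> list[int]:
    """Parse a reply such as '2, 3-4' into sorted, unique, in-range line numbers.

    The reply format is an assumption for this sketch. Out-of-range numbers
    are dropped rather than trusted.
    """
    selected: set[int] = set()
    for start, end in re.findall(r"(\d+)(?:\s*-\s*(\d+))?", reply):
        lo = int(start)
        hi = int(end) if end else lo
        # Clamp to the valid range so a bogus reply cannot select anything else.
        for n in range(max(lo, 1), min(hi, n_lines) + 1):
            selected.add(n)
    return sorted(selected)


def extract(lines: list[str], selection: list[int]) -> list[str]:
    """Return the selected source lines verbatim (no rewriting step)."""
    return [lines[n - 1] for n in selection]


def term_recall(extracted_text: str, reference_terms: list[str]) -> float:
    """Fraction of reference terms found in the extracted text.

    A simple case-insensitive substring match, assumed here for illustration;
    the paper's metric may be defined differently.
    """
    if not reference_terms:
        return 0.0
    text = extracted_text.lower()
    hits = sum(1 for term in reference_terms if term.lower() in text)
    return hits / len(reference_terms)


# Toy placeholder text, not taken from any real guideline.
TOY_GUIDELINE = """Step one: record the observation.
Step two: notify the supervisor.

Step three: escalate if unresolved.
Step four: file the report."""

toy_lines = number_lines(TOY_GUIDELINE)
# In a real system this reply would come from a model prompted with
# render_numbered(toy_lines); here it is hard-coded.
toy_selection = parse_selection("2, 3-4, 9", len(toy_lines))
toy_answer = extract(toy_lines, toy_selection)
```

Because the answer is assembled only from source lines, an invalid model reply can at worst select too few or too many lines; it cannot introduce wording that is absent from the guideline, which is the property the abstract describes as hallucination resistance.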
