Fugu-MT 論文翻訳(概要): Hallucination Mitigating for Medical Report Generation

論文の概要: Hallucination Mitigating for Medical Report Generation

arxiv url: http://arxiv.org/abs/2601.15745v1
Date: Thu, 22 Jan 2026 08:13:59 GMT
ステータス: 翻訳完了
システム内更新日: 2026-01-23 21:37:20.540359
Title: Hallucination Mitigating for Medical Report Generation
Title（参考訳）: 医療報告作成のための幻覚の緩和
Authors: Ruoqing Zhao, Runze Xia, Piji Li,
Abstract要約: ファイングラインドされたtextbfReinforced Rewards textbfMedical Report Generation (KERM) を用いた textbfKnowledge-textbf フレームワークを提案する。本手法は,MedCLIPを知識検索に活用することによりLVLMへの入力を改良し,キュレートされた知識コーパスから関連する病変事実文を取り入れた。次に,検索した知識が患者の臨床的文脈とコンテキスト的に関連していることを保証するために,新たな浄化モジュールを導入する。
参考スコア（独自算出の注目度）: 19.773914014955746
License: http://creativecommons.org/publicdomain/zero/1.0/
Abstract: In the realm of medical report generation (MRG), the integration of natural language processing has emerged as a vital tool to alleviate the workload of radiologists. Despite the impressive capabilities demonstrated by large vision language models (LVLMs) in understanding natural language, their susceptibility to generating plausible yet inaccurate claims, known as ``hallucinations'', raises concerns-especially in the nuanced and critical field of medical. In this work, we introduce a framework, \textbf{K}nowledge-\textbf{E}nhanced with Fine-Grained \textbf{R}einforced Rewards \textbf{M}edical Report Generation (KERM), to tackle the issue. Our approach refines the input to the LVLM by first utilizing MedCLIP for knowledge retrieval, incorporating relevant lesion fact sentences from a curated knowledge corpus. We then introduce a novel purification module to ensure the retrieved knowledge is contextually relevant to the patient's clinical context. Subsequently, we employ fine-grained rewards to guide these models in generating highly supportive and clinically relevant descriptions, ensuring the alignment of model's outputs with desired behaviors. Experimental results on IU-Xray and MIMIC-CXR datasets validate the effectiveness of our approach in mitigating hallucinations and enhancing report quality.
Abstract（参考訳）: 医療報告生成(MRG)の領域では、自然言語処理の統合が、放射線技師の作業負荷を軽減する重要なツールとして現れている。自然言語の理解において大きな視覚言語モデル(LVLM)によって示される印象的な能力にもかかわらず、'hallucinations'として知られる、もっとも正確で不正確なクレームを生成することへの感受性は、特にニュアンスで批判的な医療分野における懸念を高めている。本稿では,この問題に対処するために,ファイングレードされた \textbf{R}einforced Rewards \textbf{M}edical Report Generation (KERM) を用いたフレームワークである \textbf{K}nowledge-\textbf{E}nhancedを導入する。本手法は,MedCLIPを知識検索に活用することによりLVLMへの入力を改良し,キュレートされた知識コーパスから関連する病変事実文を取り入れた。次に,検索した知識が患者の臨床的文脈とコンテキスト的に関連していることを保証するために,新たな浄化モジュールを導入する。続いて、これらのモデルが高度に支援的かつ臨床的に関係のある記述を生成し、モデル出力と望ましい振る舞いとの整合性を確保するために、きめ細かい報酬を用いる。 IU-XrayおよびMIMIC-CXRデータセットの実験結果から,幻覚の緩和と報告品質の向上に対するアプローチの有効性が検証された。

論文の概要: Hallucination Mitigating for Medical Report Generation

関連論文リスト