Fugu-MT 論文翻訳(概要): Mitigating Hallucinations in Multimodal LLMs via Object-aware Preference Optimization

論文の概要: Mitigating Hallucinations in Multimodal LLMs via Object-aware Preference Optimization

arxiv url: http://arxiv.org/abs/2508.20181v1
Date: Wed, 27 Aug 2025 18:02:04 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-29 18:12:01.647242
Title: Mitigating Hallucinations in Multimodal LLMs via Object-aware Preference Optimization
Title（参考訳）: オブジェクト指向選好最適化によるマルチモーダルLLMにおける幻覚の緩和
Authors: Alberto Compagnoni, Davide Caffagni, Nicholas Moratelli, Lorenzo Baraldi, Marcella Cornia, Rita Cucchiara,
Abstract要約: MLLM(Multimodal Large Language Models)は、様々なタスクに対処するための統一インターフェースとして登場した。多くのベンチマークで最先端の結果が誇示されているにもかかわらず、長年の問題はMLLMが幻覚を起こす傾向にある。本稿では,幻覚の問題をアライメント問題として取り上げ,幻覚を伴わないコンテンツを生成するためのMLLMのステアリングを試みる。
参考スコア（独自算出の注目度）: 55.543583937522804
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Multimodal Large Language Models (MLLMs) emerge as a unified interface to address a multitude of tasks, ranging from NLP to computer vision. Despite showcasing state-of-the-art results in many benchmarks, a long-standing issue is the tendency of MLLMs to hallucinate, that is to generate answers to the user's query that are not reflected in the visual input. In this paper, we address the problem of hallucinations as an alignment problem, seeking to steer the MLLM so that it prefers generating content without hallucinations. In contrast to recent approaches that require complicated pipelines to build synthetic preference data for alignment training, often relying on proprietary models, we capitalize on the well-known CHAIR metric, originally proposed to gauge the degree of hallucinations in image captioning. Given a pair of generated answers, we leverage CHAIR to distinguish winner and loser options (i.e., non-hallucinated and hallucinated samples) and fine-tune off-the-shelf MLLMs via Direct Preference Optimization (DPO). The resulting method, which we refer to as CHAIR-DPO, effectively diminishes the amount of hallucinated answers on several hallucination benchmarks, demonstrating the effectiveness of fine-tuning the MLLM with a CHAIR-based reward. Source code and trained models are publicly available at https://github.com/aimagelab/CHAIR-DPO.
Abstract（参考訳）: MLLM(Multimodal Large Language Models)は、NLPからコンピュータビジョンまで、さまざまなタスクに対処するための統一インターフェースとして登場した。多くのベンチマークで最先端の結果を誇示しているにもかかわらず、長年の問題はMLLMが幻覚する傾向にあり、視覚的な入力に反映されないユーザのクエリに対する回答を生成することである。本稿では,幻覚の問題をアライメント問題として取り上げ,幻覚を伴わないコンテンツを生成するためのMLLMのステアリングを試みる。画像キャプションにおける幻覚の度合いを測るために提案されたCHAIR測度を,プロプライエタリなモデルに頼って,アライメントトレーニングのための合成選好データを構築するために複雑なパイプラインを必要とする最近のアプローチとは対照的に,我々はよく知られたCHAIR測度を利用する。一対の回答が得られた場合、CHAIRを利用して勝者と敗者の選択肢(非幻覚的、幻覚的サンプル)を区別し、直接選好最適化(DPO)を介して既成のMLLMを微調整する。結果,CHAIR-DPOは,複数の幻覚ベンチマークにおける幻覚応答量を効果的に低減し,CHAIR-DPOを用いたMLLMの微調整の有効性を示す。ソースコードとトレーニングされたモデルはhttps://github.com/aimagelab/CHAIR-DPOで公開されている。

論文の概要: Mitigating Hallucinations in Multimodal LLMs via Object-aware Preference Optimization

関連論文リスト