Fugu-MT 論文翻訳(概要): Improving Alignment in LVLMs with Debiased Self-Judgment

論文の概要: Improving Alignment in LVLMs with Debiased Self-Judgment

arxiv url: http://arxiv.org/abs/2508.20655v2
Date: Thu, 11 Sep 2025 12:03:20 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-12 13:52:32.828677
Title: Improving Alignment in LVLMs with Debiased Self-Judgment
Title（参考訳）: 脱バイアス自己判断によるLVLMのアライメント改善
Authors: Sihan Yang, Chenhang Cui, Zihao Zhao, Yiyang Zhou, Weilong Yan, Ying Wei, Huaxiu Yao,
Abstract要約: 本稿では,モデルの内部で作成した自己評価指標であるデバイアスド自己判断スコア(debiased self-judgment score)を生成する新しい手法を提案する。提案手法は,復号法と選好調整法の両方を強化し,幻覚の低減,安全性の向上,全体的な能力の向上を実現した。
参考スコア（独自算出の注目度）: 44.8380749479927
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The rapid advancements in Large Language Models (LLMs) and Large Visual-Language Models (LVLMs) have opened up new opportunities for integrating visual and linguistic modalities. However, effectively aligning these modalities remains challenging, often leading to hallucinations--where generated outputs are not grounded in the visual input--and raising safety concerns across various domains. Existing alignment methods, such as instruction tuning and preference tuning, often rely on external datasets, human annotations, or complex post-processing, which limit scalability and increase costs. To address these challenges, we propose a novel approach that generates the debiased self-judgment score, a self-evaluation metric created internally by the model without relying on external resources. This enables the model to autonomously improve alignment. Our method enhances both decoding strategies and preference tuning processes, resulting in reduced hallucinations, enhanced safety, and improved overall capability. Empirical results show that our approach significantly outperforms traditional methods, offering a more effective solution for aligning LVLMs.
Abstract（参考訳）: LLM(Large Language Models)とLVLM(Large Visual-Language Models)の急速な進歩により、視覚的および言語的モダリティを統合する新たな機会が開かれた。しかし、これらのモダリティを効果的に整合させることは依然として困難であり、しばしば幻覚につながる。命令チューニングや優先順位調整のような既存のアライメント手法は、しばしば外部データセット、ヒューマンアノテーション、複雑な後処理に依存し、スケーラビリティを制限しコストを増大させる。これらの課題に対処するために,モデルの内部で外部リソースに依存しない自己評価指標であるデバイアスド自己判断スコア(debiased self-judgment score)を生成する新しい手法を提案する。これにより、モデルは自律的にアライメントを改善することができる。提案手法は,復号法と選好調整法の両方を強化し,幻覚の低減,安全性の向上,全体的な能力の向上を実現した。実験の結果,本手法は従来の手法よりも優れており,LVLMの整合性向上に有効であることがわかった。

論文の概要: Improving Alignment in LVLMs with Debiased Self-Judgment

関連論文リスト