Fugu-MT 論文翻訳(概要): When Understanding Becomes a Risk: Authenticity and Safety Risks in the Emerging Image Generation Paradigm

論文の概要: When Understanding Becomes a Risk: Authenticity and Safety Risks in the Emerging Image Generation Paradigm

arxiv url: http://arxiv.org/abs/2603.24079v1
Date: Wed, 25 Mar 2026 08:35:25 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-26 21:06:11.212191
Title: When Understanding Becomes a Risk: Authenticity and Safety Risks in the Emerging Image Generation Paradigm
Title（参考訳）: リスクを理解するとき--画像生成パラダイムの信頼性と安全性のリスク-
Authors: Ye Leng, Junjie Chu, Mingjie Li, Chenhao Lin, Chao Shen, Michael Backes, Yun Shen, Yang Zhang,
Abstract要約: マルチモーダル大言語モデル(MLLM)は、言語と画像生成の統一パラダイムとして登場した。我々は、安全でないコンテンツ生成と偽画像合成という2つの側面に沿って、新興MLLMの安全性リスクを分析し、比較する。
参考スコア（独自算出の注目度）: 46.5461323436883
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recently, multimodal large language models (MLLMs) have emerged as a unified paradigm for language and image generation. Compared with diffusion models, MLLMs possess a much stronger capability for semantic understanding, enabling them to process more complex textual inputs and comprehend richer contextual meanings. However, this enhanced semantic ability may also introduce new and potentially greater safety risks. Taking diffusion models as a reference point, we systematically analyze and compare the safety risks of emerging MLLMs along two dimensions: unsafe content generation and fake image synthesis. Across multiple unsafe generation benchmark datasets, we observe that MLLMs tend to generate more unsafe images than diffusion models. This difference partly arises because diffusion models often fail to interpret abstract prompts, producing corrupted outputs, whereas MLLMs can comprehend these prompts and generate unsafe content. For current advanced fake image detectors, MLLM-generated images are also notably harder to identify. Even when detectors are retrained with MLLMs-specific data, they can still be bypassed by simply providing MLLMs with longer and more descriptive inputs. Our measurements indicate that the emerging safety risks of the cutting-edge generative paradigm, MLLMs, have not been sufficiently recognized, posing new challenges to real-world safety.
Abstract（参考訳）: 近年,言語と画像生成の統一パラダイムとしてマルチモーダル大規模言語モデル (MLLM) が登場している。拡散モデルと比較すると、MLLMは意味理解の能力が非常に強く、より複雑なテキスト入力を処理し、よりリッチな文脈意味を理解することができる。しかし、この強化されたセマンティック能力は、新たな、潜在的により大きな安全性リスクをもたらす可能性がある。拡散モデルを基準点として、安全でないコンテンツ生成と偽画像合成という2つの側面に沿って、新興MLLMの安全性リスクを体系的に分析・比較する。複数のアンセーフ生成ベンチマークデータセットを通して、MLLMは拡散モデルよりもより安全でない画像を生成する傾向があることを観察する。この違いは、拡散モデルが抽象的なプロンプトの解釈に失敗し、腐敗した出力を生成するのに対して、MLLMはこれらのプロンプトを理解し、安全でないコンテンツを生成できるためである。現在の先進的な偽画像検出器では、MLLM生成画像の識別も特に困難である。検出器がMLLM固有のデータで再訓練されたとしても、MLLMにより長くより記述的な入力を提供することでバイパスすることができる。以上の結果から,最先端生成パラダイムであるMLLMの安全性リスクは十分に認識されておらず,現実の安全性に新たな課題がもたらされることが示唆された。

論文の概要: When Understanding Becomes a Risk: Authenticity and Safety Risks in the Emerging Image Generation Paradigm

関連論文リスト