Fugu-MT 論文翻訳(概要): ACPO: Anchor-Constrained Perceptual Optimization for Diffusion Models with No-Reference Quality Guidance

論文の概要: ACPO: Anchor-Constrained Perceptual Optimization for Diffusion Models with No-Reference Quality Guidance

arxiv url: http://arxiv.org/abs/2604.26348v1
Date: Wed, 29 Apr 2026 07:00:00 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-30 15:59:36.281915
Title: ACPO: Anchor-Constrained Perceptual Optimization for Diffusion Models with No-Reference Quality Guidance
Title（参考訳）: ACPO:非参照品質誘導拡散モデルに対するアンカー制約付き知覚最適化
Authors: Yang Yang, Feifan Meng, Han Fang, Weiming Zhang,
Abstract要約: 本研究では,非参照知覚品質を拡散訓練に取り入れることの問題点について検討する。重要な課題は、非参照画像品質評価(NR-IQA)モデルで提供されるような知覚信号を直接最適化することが、元の拡散目標とのミスマッチを導入することである。安定した知覚適応を可能にするアンカー制約最適化フレームワークを提案する。
参考スコア（独自算出の注目度）: 36.4379549994663
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Diffusion models have achieved remarkable success in image generation, yet their training is predominantly driven by full-reference objectives that enforce pixel-wise similarity to ground-truth images.Such supervision, while effective for fidelity, may insufficient in terms of subjective visual perception quality and text-image semantic consistency. In this work, we investigate the problem of incorporating no-reference perceptual quality into diffusion training. A key challenge is that directly optimizing perceptual signals, such as those provided by no-reference image quality assessment (NR-IQA) models, introduces a mismatch with the original diffusion objective, leading to training instability and distributional drift during fine-tuning. To address this issue, we propose an anchor-constrained optimization framework that enables stable perceptual adaptation. Specifically, we leverage a learned NR-IQA model as a perceptual guidance signal, while introducing an anchor-based regularization that enforces consistency with the base diffusion model in terms of noise prediction. This design effectively balances perceptual quality improvement and generative fidelity, allowing controlled adaptation toward perceptually favorable outputs without compromising the original generative behavior. Extensive experiments demonstrate that our method consistently enhances perceptual quality while preserving generation diversity and training stability, highlighting the effectiveness of anchor-constrained perceptual optimization for diffusion models.
Abstract（参考訳）: 拡散モデルは画像生成において顕著な成功を収めてきたが、そのトレーニングは主に、地平線画像とピクセルワイドな類似性を強制するフルリフレクションの目的によって進められており、その監督は、忠実性に効果があるが、主観的視覚的品質とテキストイメージのセマンティック一貫性の観点からは不十分である。本研究では,非参照知覚品質を拡散訓練に取り入れることの問題点について検討する。重要な課題は、非参照画像品質評価(NR-IQA)モデルで提供されるような知覚信号を直接最適化することが、元の拡散目標とのミスマッチを導入し、微調整中のトレーニング不安定性と分布ドリフトをもたらすことである。この問題に対処するために,安定した知覚適応を可能にするアンカー制約最適化フレームワークを提案する。具体的には、学習したNR-IQAモデルを知覚誘導信号として利用し、ノイズ予測の観点からベース拡散モデルとの整合性を強制するアンカーベース正規化を導入する。この設計は、知覚品質の向上と生成の忠実さを効果的にバランスさせ、本来の生成挙動を損なうことなく、知覚的に好適な出力への制御適応を可能にする。広汎な実験により,本手法は世代多様性と訓練安定性を保ちながら知覚品質を継続的に向上し,拡散モデルに対するアンカー拘束型知覚最適化の有効性を強調した。

論文の概要: ACPO: Anchor-Constrained Perceptual Optimization for Diffusion Models with No-Reference Quality Guidance

関連論文リスト