Fugu-MT 論文翻訳(概要): Stylistic Attribute Control in Latent Diffusion Models

論文の概要: Stylistic Attribute Control in Latent Diffusion Models

arxiv url: http://arxiv.org/abs/2605.02583v1
Date: Mon, 04 May 2026 13:34:14 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-05 20:33:50.304415
Title: Stylistic Attribute Control in Latent Diffusion Models
Title（参考訳）: 潜在拡散モデルにおけるスティリスティック属性制御
Authors: Max Reimann, Benito Buchheim, Jürgen Döllner,
Abstract要約: 潜時拡散モデルにおけるスタイリスティック特性の微粒化パラメトリック制御手法を提案する。我々は、スタイリスティックな微調整と基礎モデルの間のドメインギャップを埋めるためにガイダンス合成を使用する。我々は、スタイリスティックにフィルタリングされた合成データセットから、さまざまなスタイリスティックな属性を学習することで、我々のアプローチを検証する。
参考スコア（独自算出の注目度）: 2.8893654860442872
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Text-to-image diffusion models have revolutionized image synthesis and editing, but precise control over stylistic attributes remains a challenge, often causing unintended content modifications. We propose an approach for fine-grained parametric control of stylistic attributes in latent diffusion models by learning disentangled editing directions from synthetic datasets. We use guidance composition to close the domain gap between stylistically finetuned and foundation models, preserving the original image semantics while applying stylistic adjustments. To ensure consistent edits, we introduce a training regularization loss and enhance DDIM inversion with optimized null-conditional embeddings for real image editing. We validate our approach by learning from stylistically filtered synthetic datasets varying a range of stylistic attributes, including outlines, local contrast, watercolorization effects, and geometric patterns. Our evaluations demonstrate that compared to current text-based editing techniques, our method offers well-integrated, more precise and continuously adjustable stylistic modifications.
Abstract（参考訳）: テキストから画像への拡散モデルは画像合成と編集に革命をもたらしたが、スタイリスティックな属性の正確な制御は依然として困難であり、意図しないコンテンツ修正を引き起こすことが多い。本稿では, 合成データセットから不整合編集方向を学習することにより, 潜時拡散モデルにおけるスタイリスティック特性の詳細なパラメトリック制御を行う手法を提案する。本研究では,スタイリスティックな微調整モデルと基礎モデルとのドメインギャップを埋めるためにガイダンス構成を用い,スタイリスティックな調整を施した上で,元のイメージセマンティクスを保存する。一貫性のある編集を実現するため,実画像編集に最適化されたnull条件埋め込みを用いて,トレーニング正規化損失を導入し,DDIMのインバージョンを向上する。我々は,スタイリスティックにフィルタリングされた合成データセットから,アウトライン,局所コントラスト,水彩色効果,幾何学的パターンなど,さまざまなスタイリスティックな属性を学習することで,我々のアプローチを検証する。提案手法は,従来のテキストベースの編集技術と比較して,よりよく統合され,より正確で,連続的に調整可能なスタイル修正を提供する。

論文の概要: Stylistic Attribute Control in Latent Diffusion Models

関連論文リスト