Fugu-MT paper translation (abstract): Scheduled Style Injection: Expanding the Style-Content Pareto Frontier in Training-Free Diffusion-based Style Transfer

Paper abstract: Scheduled Style Injection: Expanding the Style-Content Pareto Frontier in Training-Free Diffusion-based Style Transfer

arxiv url: http://arxiv.org/abs/2605.26538v1
Date: Tue, 26 May 2026 04:39:40 GMT
Status: translation complete
Last updated in system: 2026-05-27 17:51:41.668108
Title: Scheduled Style Injection: Expanding the Style-Content Pareto Frontier in Training-Free Diffusion-based Style Transfer
Title (reference translation): Scheduled Style Injection: Expanding the Style-Content Pareto Frontier in Training-Free Diffusion-Based Style Transfer
Authors: Amey Sunil Kulkarni
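Abstract summary: Style transfer with pre-trained diffusion models has advanced rapidly, but where in the model should style injection be strongest? StyleID, the leading training-free method, uses a single global parameter (gamma) uniformly across all layers and timesteps, which forces a fixed tradeoff between style quality and content preservation. The paper shows that this tradeoff is unnecessarily rigid.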
Reference score (independently computed attention score): 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Style transfer with pre-trained diffusion models has advanced rapidly, but a core question remains underexplored: where in the model should style injection be strongest? StyleID, the leading training-free method, uses a single global parameter (gamma) uniformly across all layers and timesteps, which forces a fixed tradeoff between style quality and content preservation. We show this tradeoff is unnecessarily rigid. We systematically explore four dimensions of control: varying style injection strength across decoder layers, across denoising timesteps, and scheduling ControlNet geometric conditioning along both axes. The pattern is consistent everywhere: decreasing schedules, with stronger structural signal injection in shallower layers and earlier timesteps, reliably outperform the reverse. Beyond direction, schedule shape matters: cosine and square-root timestep schedules outperform linear. Most importantly, we find that gamma scheduling and ControlNet conditioning are nearly independent. The resulting combined configurations expand the Pareto frontier, offering superior tradeoffs between style fidelity and content preservation compared to any single baseline setting. Our best balanced configuration achieves ArtFID of 27.036 versus StyleID's 28.801, a 6.1% relative improvement, with consistent gains across the full style-content tradeoff frontier. Results are validated across 35 configurations totaling over 28,000 stylized images using four complementary metrics. These findings generalize across SD backbones with identical rank ordering. All modifications are training-free, parameter-free, and require only a few lines of scheduling code; code is available at https://github.com/ameyskulkarni/scheduled_style_injection.
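Abstract (reference translation): Style transfer with pre-trained diffusion models has advanced rapidly, but a central question remains open: where in the model should style injection be strongest? StyleID, the leading training-free method, applies a single global parameter (gamma) uniformly across all layers and timesteps, which fixes the tradeoff between style quality and content preservation. We show that this tradeoff is unnecessarily rigid. We systematically examine four dimensions of control: varying style injection strength across decoder layers, varying it across denoising timesteps, and scheduling ControlNet geometric conditioning along both axes. The pattern is consistent throughout: decreasing schedules, with stronger structural signal injection in shallower layers and at earlier timesteps, reliably outperform the reverse. Beyond direction, the shape of the schedule matters: cosine and square-root timestep schedules outperform linear ones. Most importantly, gamma scheduling and ControlNet conditioning are nearly independent. As a result, the combined configurations expand the Pareto frontier and offer better tradeoffs between style fidelity and content preservation than any single baseline setting. The best balanced configuration achieves an ArtFID of 27.036 versus 28.801 for StyleID, a 6.1% relative improvement. Results are validated across 35 configurations, totaling over 28,000 stylized images, using four complementary metrics. These findings generalize across SD backbones with identical rank ordering. Code is available at https://github.com/ameyskulkarni/scheduled_style_injection.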
