Fugu-MT 論文翻訳(概要): MAST: Mask-Guided Attention Mass Allocation for Training-Free Multi-Style Transfer

論文の概要: MAST: Mask-Guided Attention Mass Allocation for Training-Free Multi-Style Transfer

arxiv url: http://arxiv.org/abs/2604.12281v1
Date: Tue, 14 Apr 2026 04:47:09 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-15 19:11:32.244033
Title: MAST: Mask-Guided Attention Mass Allocation for Training-Free Multi-Style Transfer
Title（参考訳）: MAST:無トレーニングマルチスタイルトランスファーのためのマスクガイド付きマッサージマスアロケーション
Authors: Dongkyung Kang, Jaeyeon Hwang, Junseo Park, Minji Kang, Yeryeong Lee, Beomseok Ko, Hanyoung Roh, Jeongmin Shin, Hyeryung Jang,
Abstract要約: MAST(Mask-Guided Attention Mass Allocation for Training-Free Multi-Style Transfer)を提案する。アーティファクトフリーで構造保存のスタイリングを実現するため、MASTは4つの連結モジュールを統合している。
参考スコア（独自算出の注目度）: 6.817047561934744
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Style transfer aims to render a content image with the visual characteristics of a reference style while preserving its underlying semantic layout and structural geometry. While recent diffusion-based models demonstrate strong stylization capabilities by leveraging powerful generative priors and controllable internal representations, they typically assume a single global style. Extending them to multi-style scenarios often leads to boundary artifacts, unstable stylization, and structural inconsistency due to interference between multiple style representations. To overcome these limitations, we propose MAST (Mask-Guided Attention Mass Allocation for Training-Free Multi-Style Transfer), a novel training-free framework that explicitly controls content-style interactions within the diffusion attention mechanism. To achieve artifact-free and structure-preserving stylization, MAST integrates four connected modules. First, Layout-preserving Query Anchoring prevents global layout collapse by firmly anchoring the semantic structure using content queries. Second, Logit-level Attention Mass Allocation deterministically distributes attention probability mass across spatial regions, seamlessly fusing multiple styles without boundary artifacts. Third, Sharpness-aware Temperature Scaling restores the attention sharpness degraded by multi-style expansion. Finally, Discrepancy-aware Detail Injection adaptively compensates for localized high-frequency detail losses by measuring structural discrepancies. Extensive experiments demonstrate that MAST effectively mitigates boundary artifacts and maintains structural consistency, preserving texture fidelity and spatial coherence even as the number of applied styles increases.
Abstract（参考訳）: スタイル転送は、その基盤となるセマンティックなレイアウトと構造的幾何学を保ちながら、参照スタイルの視覚的特徴を持つコンテンツイメージをレンダリングすることを目的としている。最近の拡散モデルでは、強力な生成先行と制御可能な内部表現を活用することで、強いスタイル化能力を示すが、通常は単一のグローバルなスタイルを仮定する。それらをマルチスタイルのシナリオに拡張することは、境界アーティファクト、不安定なスタイル化、複数のスタイル表現間の干渉による構造的不整合につながることが多い。これらの制約を克服するために,MAST(Mask-Guided Attention Mass Allocation for Training-Free Multi-Style Transfer)を提案する。アーティファクトフリーで構造保存のスタイリングを実現するため、MASTは4つの連結モジュールを統合している。まず、Layout保存クエリアンカリングは、コンテンツクエリを使用してセマンティック構造をしっかりと固定することで、グローバルなレイアウトの崩壊を防ぐ。第二に、ロジトレベルの注意質量配分は、空間領域に注意確率質量を決定的に分散し、境界アーチファクトのない複数のスタイルをシームレスに融合させる。第3に、シャープネスを意識した温度スケーリングは、マルチスタイル拡張によって劣化した注意シャープネスを復元する。最後に、離散性を考慮した詳細インジェクションは、構造的不一致を測定することにより、局所的な高周波詳細損失を適応的に補償する。大規模な実験により, MASTは境界アーチファクトを効果的に緩和し, 構造的整合性を保ち, テクスチャの忠実さと空間コヒーレンスを保ちながら, 適用スタイルの数が増えても維持することを示した。

論文の概要: MAST: Mask-Guided Attention Mass Allocation for Training-Free Multi-Style Transfer

関連論文リスト