Fugu-MT 論文翻訳(概要): DealMaTe: Multi-Dimensional Material Transfer via Diffusion Transformer

論文の概要: DealMaTe: Multi-Dimensional Material Transfer via Diffusion Transformer

arxiv url: http://arxiv.org/abs/2605.15681v1
Date: Fri, 15 May 2026 07:06:39 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-18 21:22:26.203784
Title: DealMaTe: Multi-Dimensional Material Transfer via Diffusion Transformer
Title（参考訳）: DealMaTe:拡散変圧器による多次元物質移動
Authors: Nisha Huang, Yizhou Lin, Jie Guo, Xiu Li, Tong-Yee Lee, Zitong Yu,
Abstract要約: DealMaTeは、テキストガイダンスと参照ネットワークを排除する拡散フレームワークである。 DealMaTeは任意の入力材料の下で顕著な高忠実度物質移動を実現する。
参考スコア（独自算出の注目度）: 45.232470509013815
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recently, diffusion-based material transfer methods rely on image fine-tuning or complex architectures with auxiliary networks but face challenges such as text dependency, additional computational costs, and feature misalignment. To address these limitations, we propose \textbf{DealMaTe}, using \underline{\textbf{de}}pth, norm\underline{\textbf{a}}l, and \underline{\textbf{l}}ighting images for \underline{\textbf{ma}}terial \underline{\textbf{t}}ransf\underline{\textbf{e}}r. DealMaTe is a simplified diffusion framework that eliminates text guidance and reference networks. We design a lightweight 3D information injection method, Multi-Dim 3D Shader LoRA, which, without modifying the base model weights, enables compatible control conditions and achieves harmonious and stable results. Additionally, we optimize the attention mechanism with Shader Causal Mutual Attention and key-value (KV) caching to reduce inference latency caused by multiple conditions, improve computational efficiency, and achieve high-quality material transfer results with low architectural complexity. Extensive experiments covering a wide variety of objects and lighting conditions consistently demonstrate that DealMaTe achieves remarkable high-fidelity material transfer under arbitrary input materials. The code is available at https://github.com/haha-lisa/DealMaTe.
Abstract（参考訳）: 近年,拡散型物質移動法は画像の微調整や補助的ネットワークによる複雑なアーキテクチャに依存しているが,テキスト依存や計算コストの増大,特徴の誤調整といった課題に直面している。これらの制限に対処するため、 \underline{\textbf{de}}pth, norm\underline{\textbf{a}}l, \underline{\textbf{l}}ighting image for \underline{\textbf{ma}}terial \underline{\textbf{t}}ransf\underline{\textbf{e}}rを用いて、 \textbf{DealMaTe}を提案する。 DealMaTeは、テキストのガイダンスと参照ネットワークを排除する単純化された拡散フレームワークである。基本モデルの重みを変更しない軽量な3次元情報注入方式であるMulti-Dim 3D Shader LoRAを設計し、互換性のある制御条件を実現し、調和と安定した結果を得る。さらに,Shader Causal Mutual Attention and Key-value(KV)キャッシングによるアテンション機構を最適化し,複数の条件による推論遅延を低減し,計算効率を向上し,アーキテクチャの複雑さを低減した高品質な物質移動結果を実現する。様々な物体や照明条件を包含する広範囲な実験は、任意の入力材料の下で、DealMaTeが顕著な高忠実性物質移動を達成することを一貫して証明している。コードはhttps://github.com/haha-lisa/DealMaTeで入手できる。

論文の概要: DealMaTe: Multi-Dimensional Material Transfer via Diffusion Transformer

関連論文リスト