Fugu-MT 論文翻訳(概要): EasyVFX: Frequency-Driven Decoupling for Resource-Efficient VFX Generation

論文の概要: EasyVFX: Frequency-Driven Decoupling for Resource-Efficient VFX Generation

arxiv url: http://arxiv.org/abs/2605.22051v1
Date: Thu, 21 May 2026 06:38:22 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-22 20:14:18.524804
Title: EasyVFX: Frequency-Driven Decoupling for Resource-Efficient VFX Generation
Title（参考訳）: EasyVFX: リソース効率の良いVFX生成のための周波数駆動型デカップリング
Authors: Yue Ma, Xu Ye, Qinghe Wang, Yucheng Wang, Hongyu Liu, Yinhan Zhang, Xinyu Wang, Yuanpeng Che, Shanhui Mo, Paul Liang, Fangneng Zhan, Qifeng Chen,
Abstract要約: EasyVFXは、厳密な制約の下で現実的な視覚効果合成を実現するフレームワークである。我々は、EasyVFXが構造的に一貫性があり、視覚的に素晴らしい効果をもたらすことを示し、周波数認識学習がプロ級のVFXを民主化するための重要な触媒であることを証明した。
参考スコア（独自算出の注目度）: 50.24005970263923
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Generating high-fidelity visual effects (VFX) typically demands massive datasets and prohibitive computational power due to the intricate coupling of spatial textures and temporal dynamics. In this paper, we introduce EasyVFX, a resource-efficient framework that achieves realistic VFX synthesis under stringent constraints. Our core philosophy lies in frequency-domain decomposition: we observe that the complexity of VFX can be significantly mitigated by decoupling high-frequency components, which represent intricate spatial appearances, from low-frequency components that encapsulate global motion dynamics. This spectral disentanglement transforms a high-dimensional learning problem into manageable sub-tasks, thereby lowering the optimization barrier and reducing data dependency. Building upon this insight, we propose a two-stage training paradigm. First, we design a Frequency-aware Mixture-of-Experts (Freq-MoE) architecture. By utilizing a soft routing mechanism, our model assigns specialized experts to distinct spectral bands, enabling them to cultivate robust priors for appearance and motion dynamics. This specialization allows the model to acquire foundational VFX knowledge with fewer GPU resources. Second, we introduce a Test-Time Training strategy powered by a novel Frequency-constraint Loss. This allows the pre-trained model to swiftly adapt to specific, unseen effects through localized optimizations, requiring only about 100 steps on a single GPU. Experimental results demonstrate that EasyVFX produces structurally consistent and visually stunning effects, proving that frequency-aware learning is a key catalyst for democratizing professional-grade VFX.
Abstract（参考訳）: 高忠実度視覚効果(VFX)を生成するには、空間テクスチャと時間ダイナミクスの複雑な結合により、大量のデータセットと計算能力が要求される。本稿では,制約下での現実的なVFX合成を実現する,資源効率のよいフレームワークであるEasyVFXを紹介する。我々の中心となる哲学は周波数領域の分解であり、大域的な運動力学を包含する低周波成分から複雑な空間的な外観を表す高周波成分を分離することにより、VFXの複雑さを著しく軽減することができる。このスペクトル歪みは、高次元学習問題を管理可能なサブタスクに変換することにより、最適化障壁を低くし、データ依存性を低減する。この知見に基づいて、2段階のトレーニングパラダイムを提案する。まず、周波数対応Mixture-of-Experts (Freq-MoE)アーキテクチャを設計する。ソフトルーティング機構を利用することで、特殊な専門家を異なるスペクトル帯域に割り当て、外見や動きのダイナミクスの頑健さを育むことができる。この特殊化により、GPUリソースを少なくして基礎的なVFX知識を取得することができる。第2に、新しい周波数制約損失を利用したテストタイムトレーニング戦略を導入する。これにより、トレーニング済みのモデルでは、ローカライズされた最適化を通じて、特定の、見えないエフェクトに迅速に適応することができる。実験により、EasyVFXは、プロ級のVFXを民主化する上で、周波数認識学習が重要な触媒であることを証明し、構造的に一貫性があり、視覚的に素晴らしい効果をもたらすことが示された。

論文の概要: EasyVFX: Frequency-Driven Decoupling for Resource-Efficient VFX Generation

関連論文リスト