Fugu-MT 論文翻訳(概要): Few-step Flow for 3D Generation via Marginal-Data Transport Distillation

論文の概要: Few-step Flow for 3D Generation via Marginal-Data Transport Distillation

arxiv url: http://arxiv.org/abs/2509.04406v1
Date: Thu, 04 Sep 2025 17:24:31 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-05 20:21:10.23334
Title: Few-step Flow for 3D Generation via Marginal-Data Transport Distillation
Title（参考訳）: マルジナル・データ・トランスポート蒸留による3次元生成のための数ステップフロー
Authors: Zanwei Zhou, Taoran Yi, Jiemin Fang, Chen Yang, Lingxi Xie, Xinggang Wang, Wei Shen, Qi Tian,
Abstract要約: 数段階の3次元流体蒸留のための新しいフレームワーク MDT-dist を提案する。我々のアプローチは、事前訓練されたモデルを蒸留して、Marginal-Data Transportを学習する、という第一の目的に基づいて構築されている。提案手法は,A800におけるA800の9.0倍,6.5倍の速度で0.68s (1 Step x2) と0.94s (2 Step x2) のレイテンシを実現するため,各流れトランスのサンプリングステップを25から2に短縮する。
参考スコア（独自算出の注目度）: 104.76254102015794
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Flow-based 3D generation models typically require dozens of sampling steps during inference. Though few-step distillation methods, particularly Consistency Models (CMs), have achieved substantial advancements in accelerating 2D diffusion models, they remain under-explored for more complex 3D generation tasks. In this study, we propose a novel framework, MDT-dist, for few-step 3D flow distillation. Our approach is built upon a primary objective: distilling the pretrained model to learn the Marginal-Data Transport. Directly learning this objective needs to integrate the velocity fields, while this integral is intractable to be implemented. Therefore, we propose two optimizable objectives, Velocity Matching (VM) and Velocity Distillation (VD), to equivalently convert the optimization target from the transport level to the velocity and the distribution level respectively. Velocity Matching (VM) learns to stably match the velocity fields between the student and the teacher, but inevitably provides biased gradient estimates. Velocity Distillation (VD) further enhances the optimization process by leveraging the learned velocity fields to perform probability density distillation. When evaluated on the pioneer 3D generation framework TRELLIS, our method reduces sampling steps of each flow transformer from 25 to 1 or 2, achieving 0.68s (1 step x 2) and 0.94s (2 steps x 2) latency with 9.0x and 6.5x speedup on A800, while preserving high visual and geometric fidelity. Extensive experiments demonstrate that our method significantly outperforms existing CM distillation methods, and enables TRELLIS to achieve superior performance in few-step 3D generation.
Abstract（参考訳）: フローベースの3D生成モデルは通常、推論中に数十のサンプリングステップを必要とする。特にCM(Consistency Models, Consistency Models)は2次元拡散モデルの加速において大きな進歩を遂げているが、より複雑な3次元生成タスクでは未探索のままである。本研究では, MDT-dist という新しいフレームワークを, 数段階の3次元流体蒸留のためのフレームワークとして提案する。我々のアプローチは、事前訓練されたモデルを蒸留して、Marginal-Data Transportを学習する、という第一の目的に基づいて構築されている。この目的を直接学習するには速度場を統合する必要があるが、この積分の実装は困難である。そこで我々は,速度マッチング (VM) と速度蒸留 (VD) という2つの最適化可能な目標を提案する。速度マッチング(VM)は、生徒と教師の間の速度場を安定的に一致させることを学ぶが、必然的にバイアスのある勾配推定を提供する。速度蒸留(VD)は、学習した速度場を利用して確率密度蒸留を行うことにより、最適化プロセスをさらに強化する。先駆的な3D 生成フレームワーク TRELLIS で評価すると,A800 上での 9.0x と 6.5x のレイテンシで,各フロートランスフォーマーのサンプリングステップを 25 から 2 に削減し,0.68s (1 ステップ x2) と 0.94s (2 ステップ x2) を実現した。大規模実験により,本手法は既存のCM蒸留法を著しく上回り,TRELLISが数段3D生成において優れた性能を発揮することが示された。

論文の概要: Few-step Flow for 3D Generation via Marginal-Data Transport Distillation

関連論文リスト