Fugu-MT 論文翻訳(概要): Velocity-Space 3D Asset Editing

論文の概要: Velocity-Space 3D Asset Editing

arxiv url: http://arxiv.org/abs/2605.07385v1
Date: Fri, 08 May 2026 07:42:12 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-11 19:43:38.894093
Title: Velocity-Space 3D Asset Editing
Title（参考訳）: 速度空間3次元アセット編集
Authors: Hao Liu, Yuxuan Lin, Jingfeng Guo, Ruihang Chu, Junjie Wang, Ruotong Li, Yujiu Yang,
Abstract要約: ローカルに3Dアセットを編集し、残りの部分を保存しながらターゲット領域を変更することは、ネイティブな3D編集の基本的な要件である。 Inversion-free, training-free, and mask-free framework that address each problem with a target intervention in the sampler。
参考スコア（独自算出の注目度）: 45.94997633532406
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Editing a 3D asset locally, modifying a target region while preserving the rest, is a fundamental requirement of native 3D editing. Existing methods enforce locality through mechanisms external to the generator, such as manual 3D masks, post-hoc voxel merging, or 2D multi-view lifting. None of them intervene where the corruption actually originates: inside the ODE sampler. For a rectified-flow generator to achieve faithful local editing, its velocity field should be strong over the target editing region while vanishing on preserved content. Yet a single velocity field can hardly satisfy both requirements simultaneously, leading to three problems: (i) identity leakage that keeps the edit signal non-zero on preserved regions; (ii) no dedicated edit-amplification channel, so strengthening the edit inevitably perturbs identity; and (iii) an identity drag at the geometry and material stages, where a global condition pulls every token toward the target. We propose VS3D (Velocity-Space 3D Asset editing}), an inversion-free, training-free, and mask-free framework that addresses each problem with a targeted intervention inside the sampler. VS3D integrates three complementary modules, each corresponding to a specific stage of the editing pipeline. Reconstruction-Anchored Source Injection (RASI) absorbs identity leakage by turning the unconditional embedding into a per-step, asset-specific anchor calibrated through source reconstruction. Partial-Mean Guidance (PMG) amplifies the edit signal by contrasting high- and low-quality subsample estimates of the velocity difference, active only where a consistent edit exists. Twin-Agreement Residual injection (TAR) lets the sampler decide token by token what to preserve at the geometry and material stages.
Abstract（参考訳）: ローカルに3Dアセットを編集し、残りの部分を保存しながらターゲット領域を変更することは、ネイティブな3D編集の基本的な要件である。既存の方法は、手動の3Dマスク、ポストホックのボクセルのマージ、または2Dのマルチビューリフトのようなジェネレータ外部のメカニズムを通して局所性を強制する。それらの誰も、汚職が実際に発生した場所(ODEサンプルラの内部)に介入しません。整流流発生器が忠実な局所編集を実現するためには、その速度場は保存された内容に頼らず、目標編集領域に強くなるべきである。しかし、1つの速度場は両方の要求を同時に満たすことができず、3つの問題に繋がる。一保存地域において、編集信号がゼロに保たないID漏洩 (二専用編集増幅チャンネルがないため、編集が必然的に混乱すること。) 第三に、世界的条件が全てのトークンを目標に向かって引き寄せる幾何学的・物質的段階におけるアイデンティティのドラッグ。そこで我々は,VS3D (Velocity-Space 3D Asset editing}) を提案する。 VS3Dは3つの補完モジュールを統合し、それぞれが編集パイプラインの特定のステージに対応する。レコンストラクション・アンコールド・ソース・インジェクション(RASI)は、ソース・インジェクションによって調整された非条件埋め込みをステップごとのアセット固有のアンカーに変えることで、アイデンティティ・リークを吸収する。部分平均誘導(PMG)は、一貫した編集が存在する場合にのみ、速度差の高および低品質のサブサンプル推定を対比することにより、編集信号を増幅する。 TAR (Twin-Agreement Residual Injection) では、サンプルがトークンをトークンで決定できる。

論文の概要: Velocity-Space 3D Asset Editing

関連論文リスト