Fugu-MT 論文翻訳(概要): Edit3DGS: Unified Framework for Dynamic Head Editing via 2D Instruction-Guided Diffusion and 3D Gaussian Splatting

論文の概要: Edit3DGS: Unified Framework for Dynamic Head Editing via 2D Instruction-Guided Diffusion and 3D Gaussian Splatting

arxiv url: http://arxiv.org/abs/2606.17432v1
Date: Tue, 16 Jun 2026 02:26:20 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-17 17:15:32.221319
Title: Edit3DGS: Unified Framework for Dynamic Head Editing via 2D Instruction-Guided Diffusion and 3D Gaussian Splatting
Title（参考訳）: Edit3DGS: 2Dインストラクション誘導拡散と3Dガウススプラッティングによる動的頭部編集のための統一フレームワーク
Authors: Duy-Dat Tran, Trung-Nghia Le,
Abstract要約: 動的3Dヘッド編集のための統合フレームワークEdit3DGSを提案する。本手法は,画像領域における意味制御性と,時間的に整合した3次元表現とを結合する。
参考スコア（独自算出の注目度）: 3.8101824866013523
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: We present Edit3DGS, a unified framework for dynamic 3D head editing that integrates 2D instruction-guided diffusion with 3D Gaussian splatting. Unlike prior approaches that separately address frame-based edits or static 3D reconstruction, our method couples semantic controllability in the image domain with photorealistic, temporally consistent 3D representations. Given an input video, editable facial regions are masked and modified using a text-conditioned diffusion model to support fine-grained operations such as expression transformation, attribute modification, and appearance refinement. The edited frames are then aggregated through 3D Gaussian splatting to produce a coherent, high-fidelity avatar that preserves both identity and motion dynamics. To enforce consistency, Edit3DGS incorporates multi-view batch editing and lightweight inpainting strategies that recover lost expressions across timesteps. Experimental results demonstrate that our framework enables controllable, artifact-free head editing with smooth temporal transitions, offering practical applications in virtual avatars, immersive communication, film production, and interactive media.
Abstract（参考訳）: 本稿では,動的3次元頭部編集のための統合フレームワークであるEdit3DGSについて述べる。フレームベースの編集や静的な3D再構成を別々に扱う従来の手法とは異なり,本手法は画像領域のセマンティック・コントロール可能性と,時間的に整合したフォトリアリスティックな3D表現を結合する。入力ビデオが与えられると、編集可能な顔領域をテキスト条件の拡散モデルでマスクして修正し、表現変換、属性修正、外観改善などのきめ細かい操作をサポートする。編集されたフレームは3Dガウススプラッティングによって集約され、同一性と運動力学の両方を保存するコヒーレントで高忠実なアバターを生成する。一貫性を強化するため、Edit3DGSはマルチビューバッチ編集と、タイムステップ全体で失われた表現を復元する軽量なインペイント戦略を取り入れている。実験結果から,本フレームワークは,仮想アバター,没入型通信,フィルム制作,インタラクティブメディアなどにおいて,スムーズな時間的遷移を伴う制御可能な,アーティファクトフリーなヘッド編集を可能にすることが示された。

論文の概要: Edit3DGS: Unified Framework for Dynamic Head Editing via 2D Instruction-Guided Diffusion and 3D Gaussian Splatting

関連論文リスト