Fugu-MT 論文翻訳(概要): Feed-forward Gaussian Registration for Head Avatar Creation and Editing

論文の概要: Feed-forward Gaussian Registration for Head Avatar Creation and Editing

arxiv url: http://arxiv.org/abs/2603.15811v1
Date: Mon, 16 Mar 2026 18:42:30 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-18 17:42:06.947839
Title: Feed-forward Gaussian Registration for Head Avatar Creation and Editing
Title（参考訳）: 頭部アバター作成と編集のためのフィードフォワードガウス登録
Authors: Malte Prinzler, Paulo Gotardo, Siyu Tang, Timo Bolkart,
Abstract要約: MATCHは、高品質なアバターの作成と編集のための多視点ガウス登録法である。フレーム当たり0.5秒で多視点画像と対応してガウスのスプレートテクスチャを直接予測する。アバター生成は、最も近い競合するベースラインの10倍高速である。
参考スコア（独自算出の注目度）: 18.828792839070264
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We present MATCH (Multi-view Avatars from Topologically Corresponding Heads), a multi-view Gaussian registration method for high-quality head avatar creation and editing. State-of-the-art multi-view head avatar methods require time-consuming head tracking followed by expensive avatar optimization, often resulting in a total creation time of more than one day. MATCH, in contrast, directly predicts Gaussian splat textures in correspondence from calibrated multi-view images in just 0.5 seconds per frame, without requiring data preprocessing. The learned intra-subject correspondence across frames enables fast creation of personalized head avatars, while correspondence across subjects supports applications such as expression transfer, optimization-free tracking, semantic editing, and identity interpolation. We establish these correspondences end-to-end using a transformer-based model that predicts Gaussian splat textures in the fixed UV layout of a template mesh. To achieve this, we introduce a novel registration-guided attention block, where each UV-map token attends exclusively to image tokens depicting its corresponding mesh region. This design improves efficiency and performance compared to dense cross-view attention. MATCH outperforms existing methods in novel-view synthesis, geometry registration, and head avatar generation, while making avatar creation 10 times faster than the closest competing baseline. The code and model weights are available on the project website.
Abstract（参考訳）: MATCH(Multi-view Avatars from Topologically Cor correspondinging Heads)は,高品質な頭部アバター作成・編集のための多視点ガウス登録法である。最先端のマルチビューヘッドアバター手法では、時間を要するヘッドトラッキングと高価なアバター最適化が必要であり、しばしば1日以上作成される。対照的に、MATCHは、データ前処理を必要とせず、1フレームあたり0.5秒でキャリブレーションされたマルチビュー画像から、ガウススプレートテクスチャを直接予測する。フレーム間の学習されたオブジェクト内対応により、パーソナライズされた頭部アバターの高速作成が可能となり、一方、被写体間の対応は、表現伝達、最適化不要な追跡、意味編集、アイデンティティ補間などのアプリケーションをサポートする。テンプレートメッシュの固定UVレイアウトにおけるガウススプレートテクスチャを予測するトランスフォーマーモデルを用いて,これらの対応関係をエンドツーエンドに構築する。これを実現するために,各UVマップトークンが対応するメッシュ領域を表す画像トークンにのみ参加する,新規な登録誘導型アテンションブロックを導入する。この設計は、高密度のクロスビューアテンションに比べて効率と性能を向上させる。 MATCHは、新しいビュー合成、ジオメトリ登録、ヘッドアバター生成において既存の手法よりも優れており、アバター生成は最も近い競合するベースラインの10倍高速である。コードとモデルの重み付けはプロジェクトのWebサイトにある。

論文の概要: Feed-forward Gaussian Registration for Head Avatar Creation and Editing

関連論文リスト