Fugu-MT 論文翻訳(概要): Efficient Few-shot Identity Preserving Attribute Editing for 3D-aware Deep Generative Models

論文の概要: Efficient Few-shot Identity Preserving Attribute Editing for 3D-aware Deep Generative Models

arxiv url: http://arxiv.org/abs/2510.18287v1
Date: Tue, 21 Oct 2025 04:27:46 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-25 03:08:12.919998
Title: Efficient Few-shot Identity Preserving Attribute Editing for 3D-aware Deep Generative Models
Title（参考訳）: 3次元深部生成モデルの属性編集を効率よく行うFew-shot Identity
Authors: Vishal Vinod,
Abstract要約: 顔の同一性を保存することは、照明の修正、眼鏡の追加・削除、顔の老化、髪型の編集、表現の修正などを可能にする生成タスクである。本稿では,最近の3次元認識深部生成モデルと2次元画像編集技術の進歩に基づく手法を提案する。
参考スコア（独自算出の注目度）: 0.6091702876917279
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Identity preserving editing of faces is a generative task that enables modifying the illumination, adding/removing eyeglasses, face aging, editing hairstyles, modifying expression etc., while preserving the identity of the face. Recent progress in 2D generative models have enabled photorealistic editing of faces using simple techniques leveraging the compositionality in GANs. However, identity preserving editing for 3D faces with a given set of attributes is a challenging task as the generative model must reason about view consistency from multiple poses and render a realistic 3D face. Further, 3D portrait editing requires large-scale attribute labelled datasets and presents a trade-off between editability in low-resolution and inflexibility to editing in high resolution. In this work, we aim to alleviate some of the constraints in editing 3D faces by identifying latent space directions that correspond to photorealistic edits. To address this, we present a method that builds on recent advancements in 3D-aware deep generative models and 2D portrait editing techniques to perform efficient few-shot identity preserving attribute editing for 3D-aware generative models. We aim to show from experimental results that using just ten or fewer labelled images of an attribute is sufficient to estimate edit directions in the latent space that correspond to 3D-aware attribute editing. In this work, we leverage an existing face dataset with masks to obtain the synthetic images for few attribute examples required for estimating the edit directions. Further, to demonstrate the linearity of edits, we investigate one-shot stylization by performing sequential editing and use the (2D) Attribute Style Manipulation (ASM) technique to investigate a continuous style manifold for 3D consistent identity preserving face aging. Code and results are available at: https://vishal-vinod.github.io/gmpi-edit/
Abstract（参考訳）: 顔の同一性保存は、顔の同一性を維持しつつ、照明の変更、眼鏡の追加・削除、顔の老化、毛髪の編集、表情の修正等を可能にする生成タスクである。 2次元生成モデルの最近の進歩により、GANの合成性を利用した簡単な手法で顔の写実的編集が可能になった。しかし、複数のポーズからの視点の一貫性を考慮し、リアルな3D顔を表示する必要があるため、属性セットを持つ3D顔のアイデンティティ保存編集は難しい作業である。さらに、3D画像編集には大規模な属性ラベル付きデータセットが必要であり、低解像度で編集可能であることと高解像度で編集できないこととの間にトレードオフがある。本研究では,フォトリアリスティックな編集に対応する潜在空間方向を特定することで,3次元顔の編集における制約を緩和することを目的とする。そこで本研究では,近年の3D認識深部生成モデルと2D画像編集技術の進歩を基盤として,3D認識深部生成モデルに対する属性編集を効率的に行う手法を提案する。提案手法は,属性のラベル付き画像が10個以下であれば,属性の編集に対応する潜在空間における編集方向を推定するのに十分であることを示す。本研究では,マスクを用いた既存の顔データセットを用いて,編集方向を推定するために必要な属性例の少ない合成画像を得る。さらに,編集の線形性を示すために, (2D) Attribute Style Manipulation (ASM) 技法を逐次的に編集し, 顔の老化を保ちながら, 連続的なスタイル多様体を探索する。コードと結果は、https://vishal-vinod.github.io/gmpi-edit/.com/で公開されている。

論文の概要: Efficient Few-shot Identity Preserving Attribute Editing for 3D-aware Deep Generative Models

関連論文リスト