Fugu-MT 論文翻訳(概要): FiCA: Feed-forward instant Gaussian Codec Avatars from a Single Portrait Image

論文の概要: FiCA: Feed-forward instant Gaussian Codec Avatars from a Single Portrait Image

arxiv url: http://arxiv.org/abs/2606.24232v1
Date: Tue, 23 Jun 2026 07:22:17 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-24 22:16:48.819941
Title: FiCA: Feed-forward instant Gaussian Codec Avatars from a Single Portrait Image
Title（参考訳）: FiCA: 単一ポートレート画像からのフィードフォワードインスタントガウスコーデックアバター
Authors: Kim Youwang, Zhengyu Yang, Liuhao Ge, Yu Rong, Timur Bagautdinov, Su Zhaoen, Nir Sopher, Jovan Popović, Teng Deng, Tae-Hyun Oh, Chen Cao,
Abstract要約: フィードフォワード,即時ガウスコーデックアバター生成パイプラインであるFiCAを導入し,単一のイメージからライフライクなアバターを生成する。実験により, フィードフォワードアプローチによって生成されたアバターは, 多様なアイデンティティを忠実に表現し, 競合する最近の手法によるアバターの視覚的品質を上回っていることが示された。
参考スコア（独自算出の注目度）: 26.57429602358677
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We introduce FiCA, a Feed-forward, instant Gaussian Codec Avatar generation pipeline that creates lifelike avatars from a single portrait image. Generating a photorealistic and drivable avatar from just a single image is significantly challenging due to the limited visual information available to accurately infer the 3D appearance and geometry of human heads. To address this, we develop a novel system that combines human-centric vision foundation models with a diffusion model. This system is designed to fully exploit partial visual observations to generate lifelike human avatars. Our proposed diffusion model learns a generative mapping from these partial observations to complete and authentic 3D mesh reconstruction. Additionally, we introduce a feed-forward mesh refinement network that enhances the fidelity and identity preservation of the generated avatars, eliminating the need for person-specific test-time optimization. By leveraging a universal prior model that decodes a generated mesh into a set of 3D Gaussians, we generate a photorealistic 3D Gaussian avatar, capable of being driven with novel expressions in real-time. Our experiments demonstrate that the avatars generated by our feed-forward approach faithfully represent diverse identities and surpass the visual quality of avatars produced by recent competing methods.
Abstract（参考訳）: 本研究では、フィードフォワード、即時ガウスコーデックアバター生成パイプラインであるFiCAを紹介し、単一のポートレートイメージからライフライクなアバターを生成する。人間の頭部の3D形状や形状を正確に推測するために利用可能な視覚情報が限られているため、単一の画像からフォトリアリスティックで乾燥可能なアバターを生成することは極めて困難である。そこで我々は,人間中心の視覚基盤モデルと拡散モデルを組み合わせた新しいシステムを開発した。このシステムは、部分的な視覚的観察をフル活用して、生命に似た人間のアバターを生成するように設計されている。提案する拡散モデルでは,これらの部分的観測から生成的マッピングを学習し,完全な3Dメッシュ再構成を実現する。さらに,生成されたアバターの忠実度とアイデンティティを向上し,個人固有のテストタイム最適化の必要性を解消するフィードフォワードメッシュリファインメントネットワークを導入する。生成したメッシュを3次元ガウスアンの集合にデコードする普遍的事前モデルを活用することで,新しい表現をリアルタイムに駆動できるフォトリアリスティックな3次元ガウスアバターを生成する。実験により, フィードフォワードアプローチによって生成されたアバターは, 多様なアイデンティティを忠実に表現し, 競合する最近の手法によるアバターの視覚的品質を上回っていることが示された。

論文の概要: FiCA: Feed-forward instant Gaussian Codec Avatars from a Single Portrait Image

関連論文リスト