Fugu-MT 論文翻訳(概要): GO-Renderer: Generative Object Rendering with 3D-aware Controllable Video Diffusion Models

論文の概要: GO-Renderer: Generative Object Rendering with 3D-aware Controllable Video Diffusion Models

arxiv url: http://arxiv.org/abs/2603.23246v1
Date: Tue, 24 Mar 2026 14:14:21 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-25 19:53:37.526066
Title: GO-Renderer: Generative Object Rendering with 3D-aware Controllable Video Diffusion Models
Title（参考訳）: GO-Renderer:3次元制御可能なビデオ拡散モデルによる生成オブジェクトレンダリング
Authors: Zekai Gu, Shuoxuan Feng, Yansong Wang, Hanzhuo Huang, Zhongshuo Du, Chengfeng Zhao, Chengwei Ren, Peng Wang, Yuan Liu,
Abstract要約: GO-Rendererは、任意の照明条件下で高品質なオブジェクトレンダリングを実現するために、ビデオ生成モデルをガイドするフレームワークである。新しい視点でのイメージの合成、新しい照明環境におけるオブジェクトのレンダリング、既存のビデオへのオブジェクト挿入など、オブジェクトレンダリングタスクにおける最先端のパフォーマンスを実現する。
参考スコア（独自算出の注目度）: 10.712561585083222
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Reconstructing a renderable 3D model from images is a useful but challenging task. Recent feedforward 3D reconstruction methods have demonstrated remarkable success in efficiently recovering geometry, but still cannot accurately model the complex appearances of these 3D reconstructed models. Recent diffusion-based generative models can synthesize realistic images or videos of an object using reference images without explicitly modeling its appearance, which provides a promising direction for object rendering, but lacks accurate control over the viewpoints. In this paper, we propose GO-Renderer, a unified framework integrating the reconstructed 3D proxies to guide the video generative models to achieve high-quality object rendering on arbitrary viewpoints under arbitrary lighting conditions. Our method not only enjoys the accurate viewpoint control using the reconstructed 3D proxy but also enables high-quality rendering in different lighting environments using diffusion generative models without explicitly modeling complex materials and lighting. Extensive experiments demonstrate that GO-Renderer achieves state-of-the-art performance across the object rendering tasks, including synthesizing images on new viewpoints, rendering the objects in a novel lighting environment, and inserting an object into an existing video.
Abstract（参考訳）: 画像からレンダリング可能な3Dモデルを再構築するのは便利だが、難しい作業だ。近年のフィードフォワード3次元再構成法は, 幾何を効率的に復元することに成功したが, これらの3次元再構成モデルの複雑な外観を正確にモデル化することはできない。近年の拡散型生成モデルでは、オブジェクトの外観を明示的にモデル化することなく、オブジェクトのリアルなイメージやビデオを合成することが可能であり、オブジェクトのレンダリングに有望な方向を提供するが、視点の正確な制御はできない。本稿ではGO-Rendererを提案する。GO-Rendererは、再構成された3Dプロキシを統合し、任意の照明条件下で高品質なオブジェクトレンダリングを実現するためのビデオ生成モデルを誘導する統合フレームワークである。提案手法は, 再構成された3Dプロキシを用いた正確な視点制御だけでなく, 複雑な材料や照明を明示的にモデル化することなく, 拡散生成モデルを用いて, 異なる照明環境における高品質なレンダリングを可能にする。 GO-Rendererは、新しい視点でのイメージの合成、新しい照明環境におけるオブジェクトのレンダリング、既存のビデオへのオブジェクト挿入など、オブジェクトレンダリングタスクにおける最先端のパフォーマンスを実現する。

論文の概要: GO-Renderer: Generative Object Rendering with 3D-aware Controllable Video Diffusion Models

関連論文リスト