Fugu-MT 論文翻訳(概要): 3DTV: A Feedforward Interpolation Network for Real-Time View Synthesis

論文の概要: 3DTV: A Feedforward Interpolation Network for Real-Time View Synthesis

arxiv url: http://arxiv.org/abs/2604.11211v1
Date: Mon, 13 Apr 2026 09:06:20 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-14 20:13:16.449132
Title: 3DTV: A Feedforward Interpolation Network for Real-Time View Synthesis
Title（参考訳）: 3DTV:リアルタイムビュー合成のためのフィードフォワード補間ネットワーク
Authors: Stefan Schulz, Fernando Edelstein, Hannah Dröge, Matthias B. Hullin, Markus Plack,
Abstract要約: 3DTVはリアルタイムのスパースビューレンダリングのためのフィードネットワークである。ポーズ対応深さモジュールは、粗い深さピラミッドを推定する。フィードフォワードで再トレーニングすることなく動作し、AR/VR、テレプレゼンス、インタラクティブなアプリケーションに実用的だ。
参考スコア（独自算出の注目度）: 40.790950801945584
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Real-time free-viewpoint rendering requires balancing multi-camera redundancy with the latency constraints of interactive applications. We address this challenge by combining lightweight geometry with learning and propose 3DTV, a feedforward network for real-time sparse-view interpolation. A Delaunay-based triplet selection ensures angular coverage for each target view. Building on this, we introduce a pose-aware depth module that estimates a coarse-to-fine depth pyramid, enabling efficient feature reprojection and occlusion-aware blending. Unlike methods that require scene-specific optimization, 3DTV runs feedforward without retraining, making it practical for AR/VR, telepresence, and interactive applications. Our experiments on challenging multi-view video datasets demonstrate that 3DTV consistently achieves a strong balance of quality and efficiency, outperforming recent real-time novel-view baselines. Crucially, 3DTV avoids explicit proxies, enabling robust rendering across diverse scenes. This makes it a practical solution for low-latency multi-view streaming and interactive rendering. Project Page: https://stefanmschulz.github.io/3DTV_webpage/
Abstract（参考訳）: リアルタイム自由視点レンダリングでは、対話型アプリケーションの遅延制約とマルチカメラ冗長性のバランスが要求される。本稿では,軽量な幾何学と学習を組み合わせることでこの問題に対処し,リアルタイムなスパースビュー補間のためのフィードフォワードネットワークである3DTVを提案する。 Delaunayベースのトリプルト選択は、ターゲットビュー毎の角度カバレッジを保証する。これに基づいて、粗大な深さピラミッドを推定し、効率的な特徴の再投影と閉塞認識ブレンディングを可能にする、ポーズ対応深度モジュールを導入する。シーン固有の最適化を必要とする方法とは異なり、3DTVは再トレーニングせずにフィードフォワードで動作し、AR/VR、テレプレゼンス、インタラクティブなアプリケーションに実用的である。マルチビュー・ビデオ・データセットに挑戦する実験により、3DTVは品質と効率のバランスを保ち、最近のリアルタイム・ノベル・ビュー・ベースラインを上回ります。重要な点として、3DTVは露骨なプロキシを避け、多様なシーンで堅牢なレンダリングを可能にする。これにより、低レイテンシのマルチビューストリーミングとインタラクティブレンダリングの実用的なソリューションとなる。プロジェクトページ:https://stefanmschulz.github.io/3DTV_webpage/

論文の概要: 3DTV: A Feedforward Interpolation Network for Real-Time View Synthesis

関連論文リスト