Fugu-MT 論文翻訳(概要): ReconPhys: Reconstruct Appearance and Physical Attributes from Single Video

論文の概要: ReconPhys: Reconstruct Appearance and Physical Attributes from Single Video

arxiv url: http://arxiv.org/abs/2604.07882v1
Date: Thu, 09 Apr 2026 06:51:14 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-10 18:34:05.745572
Title: ReconPhys: Reconstruct Appearance and Physical Attributes from Single Video
Title（参考訳）: ReconPhys: 単一ビデオからのリコンストラクトの出現と物理的属性
Authors: Boyuan Wang, Xiaofeng Wang, Yongkang Li, Zheng Zhu, Yifan Chang, Angen Ye, Guosheng Zhao, Chaojun Ni, Guan Huang, Yijie Ren, Yueqi Duan, Xingang Wang,
Abstract要約: ReconPhysは、単一の単眼ビデオから物理的属性推定と3次元ガウススプラッティング再構成を共同で学習するフィードフォワードフレームワークである。提案手法では, 自己教師型戦略を用いて訓練したデュアルブランチアーキテクチャを用いて, 地上の物理ラベルの必要性を解消する。大規模合成データセットの実験は優れた性能を示した。
参考スコア（独自算出の注目度）: 51.2815592637053
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Reconstructing non-rigid objects with physical plausibility remains a significant challenge. Existing approaches leverage differentiable rendering for per-scene optimization, recovering geometry and dynamics but requiring expensive tuning or manual annotation, which limits practicality and generalizability. To address this, we propose ReconPhys, the first feedforward framework that jointly learns physical attribute estimation and 3D Gaussian Splatting reconstruction from a single monocular video. Our method employs a dual-branch architecture trained via a self-supervised strategy, eliminating the need for ground-truth physics labels. Given a video sequence, ReconPhys simultaneously infers geometry, appearance, and physical attributes. Experiments on a large-scale synthetic dataset demonstrate superior performance: our method achieves 21.64 PSNR in future prediction compared to 13.27 by state-of-the-art optimization baselines, while reducing Chamfer Distance from 0.349 to 0.004. Crucially, ReconPhys enables fast inference (<1 second) versus hours required by existing methods, facilitating rapid generation of simulation-ready assets for robotics and graphics.
Abstract（参考訳）: 物理的確率で厳密でない物体を再構築することは、依然として大きな課題である。既存のアプローチでは、シーンごとの最適化、幾何とダイナミクスの回復に微分可能なレンダリングを利用するが、高価なチューニングや手動のアノテーションが必要であり、実用性と一般化性に制限がある。そこで本研究では, 物理属性推定と3次元ガウススプラッティング再構成を同時学習する最初のフィードフォワードフレームワークであるReconPhysを提案する。提案手法では, 自己教師型戦略を用いて訓練したデュアルブランチアーキテクチャを用いて, 地上の物理ラベルの必要性を解消する。ビデオシーケンスが与えられた後、ReconPhysは幾何学、外観、物理的属性を同時に推測する。提案手法は,最先端の最適化ベースラインによる13.27に対して,将来の予測では21.64 PSNRを達成し,チャンファー距離を0.349から0.004に削減する。重要なこととして、ReconPhysは既存の手法で要求される時間に対して高速な推論((<1秒)を可能にし、ロボット工学とグラフィックスのためのシミュレーション可能なアセットを迅速に生成する。

論文の概要: ReconPhys: Reconstruct Appearance and Physical Attributes from Single Video

関連論文リスト