Fugu-MT 論文翻訳(概要): DeformGS: Scene Flow in Highly Deformable Scenes for Deformable Object Manipulation

論文の概要: DeformGS: Scene Flow in Highly Deformable Scenes for Deformable Object Manipulation

arxiv url: http://arxiv.org/abs/2312.00583v2
Date: Fri, 30 Aug 2024 15:16:43 GMT
ステータス: 翻訳完了
システム内更新日: 2024-09-02 20:31:28.517499
Title: DeformGS: Scene Flow in Highly Deformable Scenes for Deformable Object Manipulation
Title（参考訳）: DeformGS: 変形可能なオブジェクト操作のための高変形性シーンにおけるシーンフロー
Authors: Bardienus P. Duisterhof, Zhao Mandi, Yunchao Yao, Jia-Wei Liu, Jenny Seidenschwarz, Mike Zheng Shou, Deva Ramanan, Shuran Song, Stan Birchfield, Bowen Wen, Jeffrey Ichnowski,
Abstract要約: DeformGSは、複数のカメラからダイナミックなシーンを同時撮影することで、高度に変形可能なシーンのシーンフローを復元するアプローチである。 DeformGSは最先端と比較して平均55.8%の3Dトラッキングを改善している。十分なテクスチャで、DeformGSは1.5 x 1.5 mの布の上で3.3mmの中央値追跡誤差を達成している。
参考スコア（独自算出の注目度）: 66.7719069053058
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Teaching robots to fold, drape, or reposition deformable objects such as cloth will unlock a variety of automation applications. While remarkable progress has been made for rigid object manipulation, manipulating deformable objects poses unique challenges, including frequent occlusions, infinite-dimensional state spaces and complex dynamics. Just as object pose estimation and tracking have aided robots for rigid manipulation, dense 3D tracking (scene flow) of highly deformable objects will enable new applications in robotics while aiding existing approaches, such as imitation learning or creating digital twins with real2sim transfer. We propose DeformGS, an approach to recover scene flow in highly deformable scenes, using simultaneous video captures of a dynamic scene from multiple cameras. DeformGS builds on recent advances in Gaussian splatting, a method that learns the properties of a large number of Gaussians for state-of-the-art and fast novel-view synthesis. DeformGS learns a deformation function to project a set of Gaussians with canonical properties into world space. The deformation function uses a neural-voxel encoding and a multilayer perceptron (MLP) to infer Gaussian position, rotation, and a shadow scalar. We enforce physics-inspired regularization terms based on conservation of momentum and isometry, which leads to trajectories with smaller trajectory errors. We also leverage existing foundation models SAM and XMEM to produce noisy masks, and learn a per-Gaussian mask for better physics-inspired regularization. DeformGS achieves high-quality 3D tracking on highly deformable scenes with shadows and occlusions. In experiments, DeformGS improves 3D tracking by an average of 55.8% compared to the state-of-the-art. With sufficient texture, DeformGS achieves a median tracking error of 3.3 mm on a cloth of 1.5 x 1.5 m in area. Website: https://deformgs.github.io
Abstract（参考訳）: ロボットに布などの変形可能な物体の折り畳み、ドレープ、あるいは再配置を教えることで、さまざまな自動化アプリケーションをアンロックする。剛体物体の操作には顕著な進歩があるが、変形可能な物体を操作することは、しばしば閉塞、無限次元状態空間、複雑な力学など、ユニークな課題を生んでいる。オブジェクトのポーズ推定と追跡が、厳密な操作のためのロボットを支援するのと同じように、高度に変形可能なオブジェクトの密集した3Dトラッキング(シーンフロー)は、模倣学習やリアル2sim転送によるデジタルツインの作成といった既存のアプローチを支援しながら、ロボット工学の新たな応用を可能にする。複数のカメラからダイナミックなシーンを同時撮影することで、高度に変形可能なシーンのシーンフローを復元するDeformGSを提案する。 DeformGSは、最先端で高速なノベルビュー合成のために多数のガウス人の特性を学習する手法であるガウススプラッティングの最近の進歩を基盤としている。 DeformGSは変形関数を学び、標準的性質を持つガウスの集合を世界空間に射影する。変形関数は、ガウスの位置、回転、シャドウスカラーを推測するために、ニューラルボクセル符号化と多層パーセプトロン(MLP)を用いる。運動量と等距離の保存に基づく物理に着想を得た正規化項を施行し、より小さな軌道誤差を伴う軌道を導いた。また、既存の基礎モデルSAMとXMEMを利用してノイズマスクを作成し、ガウス毎のマスクを学習し、物理学に着想を得た正規化を改良する。 DeformGSは、シャドーとオクルージョンを備えた高度に変形可能なシーンで高品質な3Dトラッキングを実現する。実験では、DeformGSは最先端と比較して平均55.8%の3Dトラッキングを改善している。十分なテクスチャで、DeformGSは1.5 x 1.5 mの布の上で3.3mmの中央値追跡誤差を達成している。ウェブサイト:https://deformgs.github.io

論文の概要: DeformGS: Scene Flow in Highly Deformable Scenes for Deformable Object Manipulation

関連論文リスト