Fugu-MT 論文翻訳(概要): HGGT: Robust and Flexible 3D Hand Mesh Reconstruction from Uncalibrated Images

論文の概要: HGGT: Robust and Flexible 3D Hand Mesh Reconstruction from Uncalibrated Images

arxiv url: http://arxiv.org/abs/2603.23997v1
Date: Wed, 25 Mar 2026 06:54:34 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-26 21:06:11.170249
Title: HGGT: Robust and Flexible 3D Hand Mesh Reconstruction from Uncalibrated Images
Title（参考訳）: HGGT:未校正画像からのロバストでフレキシブルな3Dハンドメッシュ再構成
Authors: Yumeng Liu, Xiao-Xiao Long, Marc Habermann, Xuanze Yang, Cheng Lin, Yuan Liu, Yuexin Ma, Wenping Wang, Ligang Liu,
Abstract要約: 高忠実度3Dハンドジオメトリはコンピュータビジョンにおいて重要な課題である。スケーラブルなアプリケーションは、正確性とデプロイメントの柔軟性の両方を必要とします。本研究では、3Dハンドメッシュとカメラのポーズを非校正視点から推定するフィードフォワードアーキテクチャを提案する。
参考スコア（独自算出の注目度）: 81.42866295265443
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Recovering high-fidelity 3D hand geometry from images is a critical task in computer vision, holding significant value for domains such as robotics, animation and VR/AR. Crucially, scalable applications demand both accuracy and deployment flexibility, requiring the ability to leverage massive amounts of unstructured image data from the internet or enable deployment on consumer-grade RGB cameras without complex calibration. However, current methods face a dilemma. While single-view approaches are easy to deploy, they suffer from depth ambiguity and occlusion. Conversely, multi-view systems resolve these uncertainties but typically demand fixed, calibrated setups, limiting their real-world utility. To bridge this gap, we draw inspiration from 3D foundation models that learn explicit geometry directly from visual data. By reformulating hand reconstruction from arbitrary views as a visual-geometry grounded task, we propose a feed-forward architecture that, for the first time in literature, jointly infers 3D hand meshes and camera poses from uncalibrated views. Extensive evaluations show that our approach outperforms state-of-the-art benchmarks and demonstrates strong generalization to uncalibrated, in-the-wild scenarios. Here is the link of our project page: https://lym29.github.io/HGGT/.
Abstract（参考訳）: 画像から高忠実度3Dハンドジオメトリを復元することは、コンピュータビジョンにおいて重要な課題であり、ロボット工学、アニメーション、VR/ARといった領域において重要な価値を持っている。重要なことは、スケーラブルなアプリケーションは精度とデプロイメントの柔軟性の両方を必要としており、インターネットから大量の非構造化イメージデータを活用したり、複雑なキャリブレーションなしでコンシューマグレードのRGBカメラへのデプロイを可能にする能力を必要としている。しかし、現在の手法はジレンマに直面している。シングルビューアプローチはデプロイが容易だが、深さの曖昧さと閉塞に悩まされている。逆に、マルチビューシステムはこれらの不確実性を解消するが、通常は固定されたキャリブレーションされたセットアップを必要とし、現実のユーティリティを制限している。このギャップを埋めるために、視覚データから直接明示的な幾何学を学習する3D基礎モデルからインスピレーションを得る。任意の視点からの手振りを視覚的接地課題として再構成することにより,文献の中で初めて,非校正された視点から3次元のメッシュとカメラのポーズを共同で推論するフィードフォワードアーキテクチャを提案する。広範に評価した結果,提案手法は最先端のベンチマークより優れており,非校正型インザワイルドシナリオへの強力な一般化が示されている。 https://lym29.github.io/HGGT/。

論文の概要: HGGT: Robust and Flexible 3D Hand Mesh Reconstruction from Uncalibrated Images

関連論文リスト