Fugu-MT 論文翻訳(概要): OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera

論文の概要: OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera

arxiv url: http://arxiv.org/abs/2511.03571v1
Date: Wed, 05 Nov 2025 15:51:42 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-06 18:19:32.474146
Title: OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera
Title（参考訳）: OneOcc:単一パノラマカメラを用いた足ロボットのセマンティック動作予測
Authors: Hao Shi, Ze Wang, Shangwei Guo, Mengfei Duan, Song Wang, Teng Chen, Kailun Yang, Lin Wang, Kaiwei Wang,
Abstract要約: OneOccは視力のみのパノラマSSCフレームワークで、歩行誘導体ジッタと360度連続性のために設計された。 1Occ は、 (i) 二重射影融合 (DP-ER) を用いて、環状パノラマとその等方的展開を利用し、360デグ連続性と格子アライメントを保ち、 (ii) 双格子ボクセル化 (BGV) をカルテシアン空間と円筒極性空間で推論し、離散化バイアスを低減し、自由/占有境界を鋭くし、 (iii) 階層型 AMoE-3D による動的マルチスケール核融合および動的多次元核融合のための軽量デコーダ
参考スコア（独自算出の注目度）: 41.20666727293422
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Robust 3D semantic occupancy is crucial for legged/humanoid robots, yet most semantic scene completion (SSC) systems target wheeled platforms with forward-facing sensors. We present OneOcc, a vision-only panoramic SSC framework designed for gait-introduced body jitter and 360{\deg} continuity. OneOcc combines: (i) Dual-Projection fusion (DP-ER) to exploit the annular panorama and its equirectangular unfolding, preserving 360{\deg} continuity and grid alignment; (ii) Bi-Grid Voxelization (BGV) to reason in Cartesian and cylindrical-polar spaces, reducing discretization bias and sharpening free/occupied boundaries; (iii) a lightweight decoder with Hierarchical AMoE-3D for dynamic multi-scale fusion and better long-range/occlusion reasoning; and (iv) plug-and-play Gait Displacement Compensation (GDC) learning feature-level motion correction without extra sensors. We also release two panoramic occupancy benchmarks: QuadOcc (real quadruped, first-person 360{\deg}) and Human360Occ (H3O) (CARLA human-ego 360{\deg} with RGB, Depth, semantic occupancy; standardized within-/cross-city splits). OneOcc sets new state-of-the-art (SOTA): on QuadOcc it beats strong vision baselines and popular LiDAR ones; on H3O it gains +3.83 mIoU (within-city) and +8.08 (cross-city). Modules are lightweight, enabling deployable full-surround perception for legged/humanoid robots. Datasets and code will be publicly available at https://github.com/MasterHow/OneOcc.
Abstract（参考訳）: 足とヒューマノイドロボットにはロバストな3Dセマンティック占領が不可欠であるが、ほとんどのセマンティック・シーン・コンプリート(SSC)システムは前方センサーを備えた車輪付きプラットフォームをターゲットにしている。歩行導入体ジッタと360{\deg}連続性のために設計された視覚のみのパノラマSSCフレームワークであるOneOccを提案する。 OneOccは、一二重投射核融合(DP-ER)により、環状パノラマとその等角展開を利用して、360{\deg}連続性と格子アライメントを保つこと。 (II)二格子ボクセル化(BGV)をカルテおよび円筒偏極空間で推論し、離散化バイアスを低減し、自由/占有境界を鋭くする。三動的マルチスケール核融合のための階層型AMoE-3Dを用いた軽量デコーダ (4)追加センサを使わずに特徴レベルの動作補正を行うGDC学習。また、我々は2つのパノラマ占有ベンチマークを公表した: QuadOcc (実際の4倍体、一人称360{\deg}) と Human360Occ (H3O) (CARLA human-ego 360{\deg}) 。 OneOccは新たな最先端技術(SOTA)を定め、QuadOccでは強力なビジョンベースラインと人気のあるLiDARを破り、H3Oでは+3.83 mIoU(都市内)と+8.08(都市横断)を獲得した。モジュールは軽量で、脚とヒューマノイドロボットのフルサラウンド認識をデプロイできる。データセットとコードはhttps://github.com/MasterHow/OneOcc.comで公開される。

論文の概要: OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera

関連論文リスト