Fugu-MT 論文翻訳(概要): GS-Playground: A High-Throughput Photorealistic Simulator for Vision-Informed Robot Learning

論文の概要: GS-Playground: A High-Throughput Photorealistic Simulator for Vision-Informed Robot Learning

arxiv url: http://arxiv.org/abs/2604.25459v1
Date: Tue, 28 Apr 2026 10:05:39 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-29 16:49:17.813946
Title: GS-Playground: A High-Throughput Photorealistic Simulator for Vision-Informed Robot Learning
Title（参考訳）: GS-Playground:視覚インフォーム型ロボット学習のための高出力フォトリアリスティックシミュレータ
Authors: Yufei Jia, Heng Zhang, Ziheng Zhang, Junzhe Wu, Mingrui Yu, Zifan Wang, Dixuan Jiang, Zheng Li, Chenyu Cao, Zhuoyuan Yu, Xun Yang, Haizhou Ge, Yuchi Zhang, Jiayuan Zhang, Zhenbiao Huang, Tianle Liu, Shenyu Chen, Jiacheng Wang, Bin Xie, Xuran Yao, Xiwa Deng, Guangyu Wang, Jinzhi Zhang, Lei Hao, Zhixing Chen, Yuxiang Chen, Anqi Wang, Hongyun Tian, Yiyi Yan, Zhanxiang Cao, Yizhou Jiang, Hanyang Shao, Yue Li, Lu Shi, Bokui Chen, Wei Sui, Hanqing Cui, Yusen Qin, Ruqi Huang, Lei Han, Tiancai Wang, Guyue Zhou,
Abstract要約: GS-Playgroundは、エンド・ツー・エンドの知覚学習を促進するために設計されたマルチモーダル・シミュレーション・フレームワークである。このシステムは640x480の解像度で104 FPSのスループットを達成し、大規模な視覚的RLの障壁を著しく低減する。
参考スコア（独自算出の注目度）: 55.272458304179025
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Embodied AI research is undergoing a shift toward vision-centric perceptual paradigms. While massively parallel simulators have catalyzed breakthroughs in proprioception-based locomotion, their potential remains largely untapped for vision-informed tasks due to the prohibitive computational overhead of large-scale photorealistic rendering. Furthermore, the creation of simulation-ready 3D assets heavily relies on labor-intensive manual modeling, while the significant sim-to-real physical gap hinders the transfer of contact-rich manipulation policies. To address these bottlenecks, we propose GS-Playground, a multi-modal simulation framework designed to accelerate end-to-end perceptual learning. We develop a novel high-performance parallel physics engine, specifically designed to integrate with a batch 3D Gaussian Splatting (3DGS) rendering pipeline to ensure high-fidelity synchronization. Our system achieves a breakthrough throughput of 10^4 FPS at 640x480 resolution, significantly lowering the barrier for large-scale visual RL. Additionally, we introduce an automated Real2Sim workflow that reconstructs photorealistic, physically consistent, and memory-efficient environments, streamlining the generation of complex simulation-ready scenes. Extensive experiments on locomotion, navigation, and manipulation demonstrate that GS-Playground effectively bridges the perceptual and physical gaps across diverse embodied tasks. Project homepage: https://gsplayground.github.io.
Abstract（参考訳）: 身体的AI研究は、視覚中心の知覚パラダイムへとシフトしつつある。非常に並列なシミュレーターは、プロバイオセプションに基づくローコモーションのブレークスルーを触媒しているが、大規模なフォトリアリスティックレンダリングの計算オーバーヘッドが禁じられているため、視覚的インフォームドなタスクには、そのポテンシャルはほとんど使われていない。さらに、シミュレーション可能な3Dアセットの作成は、労働集約的な手動モデリングに大きく依存する一方で、大きなシミュレートと現実の物理的なギャップは、コンタクトリッチな操作ポリシーの移行を妨げる。これらのボトルネックに対処するために,エンドツーエンドの知覚学習を高速化するマルチモーダル・シミュレーション・フレームワークであるGS-Playgroundを提案する。我々は,高忠実度同期を保証するために,バッチ3Dガウススプラッティング(3DGS)レンダリングパイプラインと統合して設計された,新しい高性能並列物理エンジンを開発した。我々のシステムは640x480の解像度で10^4 FPSのスループットを達成し、大規模視覚RLの障壁を著しく低減する。さらに,フォトリアリスティックな,物理的に一貫性のある,メモリ効率の高い環境を再構築し,複雑なシミュレーション可能なシーンの生成を合理化する,自動Real2Simワークフローを導入する。移動、ナビゲーション、操作に関する大規模な実験は、GS-Playgroundが様々な具体的タスクにまたがる知覚的および身体的ギャップを効果的に橋渡しすることを示した。プロジェクトホームページ: https://gsplayground.github.io

論文の概要: GS-Playground: A High-Throughput Photorealistic Simulator for Vision-Informed Robot Learning

関連論文リスト