Arena 4.0: A Comprehensive ROS2 Development and Benchmarking Platform for Human-centric Navigation Using Generative-Model-based Environment Generation
- URL: http://arxiv.org/abs/2409.12471v1
- Date: Thu, 19 Sep 2024 05:20:13 GMT
- Title: Arena 4.0: A Comprehensive ROS2 Development and Benchmarking Platform for Human-centric Navigation Using Generative-Model-based Environment Generation
- Authors: Volodymyr Shcherbyna1, Linh Kästner, Diego Diaz, Huu Giang Nguyen, Maximilian Ho-Kyoung Schreff, Tim Lenz, Jonas Kreutz, Ahmed Martban, Huajian Zeng, Harold Soh,
- Abstract summary: This paper introduces Arena 4.0, a significant advancement over Arena 3.0, Arena-Bench, Arena 1.0, and Arena 2.0.
Arena 4.0 offers three key novel contributions: (1) a generative-model-based world and scenario generation approach; (2) a comprehensive 3D model database; and (3) a complete migration to ROS 2.
- Score: 7.937317889320489
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Building on the foundations of our previous work, this paper introduces Arena 4.0, a significant advancement over Arena 3.0, Arena-Bench, Arena 1.0, and Arena 2.0. Arena 4.0 offers three key novel contributions: (1) a generative-model-based world and scenario generation approach that utilizes large language models (LLMs) and diffusion models to dynamically generate complex, human-centric environments from text prompts or 2D floorplans, useful for the development and benchmarking of social navigation strategies; (2) a comprehensive 3D model database, extendable with additional 3D assets that are semantically linked and annotated for dynamic spawning and arrangement within 3D worlds; and (3) a complete migration to ROS 2, enabling compatibility with modern hardware and enhanced functionalities for improved navigation, usability, and easier deployment on real robots. We evaluated the platform's performance through a comprehensive user study, demonstrating significant improvements in usability and efficiency compared to previous versions. Arena 4.0 is openly available at https://github.com/Arena-Rosnav.
Related papers
- Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling [80.30976039119236]
Lumina-mGPT 2.0 is a stand-alone, decoder-only autoregressive model.<n>It is trained entirely from scratch, enabling unrestricted architectural design and licensing freedom.<n>It achieves generation quality on par with state-of-the-art diffusion models.
arXiv Detail & Related papers (2025-07-23T17:42:13Z) - NavBench: A Unified Robotics Benchmark for Reinforcement Learning-Based Autonomous Navigation [16.554282855005766]
We present NavBench, a benchmark for training and evaluating Reinforcement Learning-based navigation policies.<n>Our framework standardizes task definitions, enabling different robots to tackle various navigation challenges.<n>By ensuring consistency between simulation and real-world deployment, NavBench simplifies the development of RL-based navigation strategies.
arXiv Detail & Related papers (2025-05-20T15:48:23Z) - 3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models [94.48803082248872]
3D generation is experiencing rapid advancements, while the development of 3D evaluation has not kept pace.
We develop a large-scale human preference dataset 3DGen-Bench.
We then train a CLIP-based scoring model, 3DGen-Score, and a MLLM-based automatic evaluator, 3DGen-Eval.
arXiv Detail & Related papers (2025-03-27T17:53:00Z) - Generative AI Framework for 3D Object Generation in Augmented Reality [0.0]
This thesis integrates state-of-the-art generative AI models for real-time creation of 3D objects in augmented reality (AR) environments.
The framework demonstrates applications across industries such as gaming, education, retail, and interior design.
A significant contribution is democratizing 3D model creation, making advanced AI tools accessible to a broader audience.
arXiv Detail & Related papers (2025-02-21T17:01:48Z) - SE Arena: An Interactive Platform for Evaluating Foundation Models in Software Engineering [0.7511028207083381]
We introduce SE Arena, an interactive platform designed to evaluate foundation models (FMs) in software engineering activities.
SE Arena provides a transparent, open-source leaderboard, supports multi-round conversational score, and enables end-to-end model comparisons.
This paper outlines the design and capabilities of SE Arena, emphasizing its potential to advance the evaluation and practical application of FMs in software engineering.
arXiv Detail & Related papers (2025-02-03T22:19:28Z) - GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency [50.11520458252128]
Existing 3D affordance learning methods struggle with generalization and robustness due to limited annotated data.
We propose GEAL, a novel framework designed to enhance the generalization and robustness of 3D affordance learning by leveraging large-scale pre-trained 2D models.
GEAL consistently outperforms existing methods across seen and novel object categories, as well as corrupted data.
arXiv Detail & Related papers (2024-12-12T17:59:03Z) - Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning [55.339257446600634]
We introduce Robin3D, a powerful 3DLLM trained on large-scale instruction-following data.
We construct 1 million instruction-following data, consisting of 344K Adversarial samples, 508K Diverse samples, and 165K benchmark training set samples.
Robin3D consistently outperforms previous methods across five widely-used 3D multimodal learning benchmarks.
arXiv Detail & Related papers (2024-09-30T21:55:38Z) - Adapting a Foundation Model for Space-based Tasks [16.81793096235458]
In the future of space robotics, we see three core challenges which motivate the use of a foundation model adapted to space-based applications.
In this work, we demonstrate that 1) existing vision-language models are deficient visual reasoners in space-based applications, and 2) fine-tuning a vision-language model on extraterrestrial data significantly improves the quality of responses.
arXiv Detail & Related papers (2024-08-12T05:07:24Z) - Hyp2Nav: Hyperbolic Planning and Curiosity for Crowd Navigation [58.574464340559466]
We advocate for hyperbolic learning to enable crowd navigation and we introduce Hyp2Nav.
Hyp2Nav leverages the intrinsic properties of hyperbolic geometry to better encode the hierarchical nature of decision-making processes in navigation tasks.
We propose a hyperbolic policy model and a hyperbolic curiosity module that results in effective social navigation, best success rates, and returns across multiple simulation settings.
arXiv Detail & Related papers (2024-07-18T14:40:33Z) - NoMaD: Goal Masked Diffusion Policies for Navigation and Exploration [57.15811390835294]
This paper describes how we can train a single unified diffusion policy to handle both goal-directed navigation and goal-agnostic exploration.
We show that this unified policy results in better overall performance when navigating to visually indicated goals in novel environments.
Our experiments, conducted on a real-world mobile robot platform, show effective navigation in unseen environments in comparison with five alternative methods.
arXiv Detail & Related papers (2023-10-11T21:07:14Z) - Arena-Rosnav 2.0: A Development and Benchmarking Platform for Robot
Navigation in Highly Dynamic Environments [2.194096102715421]
Arena-Rosnav 2.0 is an extension to our previous works Arena-Bench and Arena-Rosnav.
It adds a variety of additional modules for developing and benchmarking robotic navigation approaches.
arXiv Detail & Related papers (2023-02-20T15:10:16Z) - Arena-Web -- A Web-based Development and Benchmarking Platform for
Autonomous Navigation Approaches [2.4937400423177767]
We present Arena-Web, a web-based development and evaluation suite for developing, training, and testing DRL-based navigation planners.
The interface is designed to be intuitive and engaging to appeal to non-experts and make the technology accessible to a wider audience.
arXiv Detail & Related papers (2023-02-06T16:06:07Z) - Simple and Effective Synthesis of Indoor 3D Scenes [78.95697556834536]
We study the problem of immersive 3D indoor scenes from one or more images.
Our aim is to generate high-resolution images and videos from novel viewpoints.
We propose an image-to-image GAN that maps directly from reprojections of incomplete point clouds to full high-resolution RGB-D images.
arXiv Detail & Related papers (2022-04-06T17:54:46Z) - Habitat 2.0: Training Home Assistants to Rearrange their Habitat [122.54624752876276]
We introduce Habitat 2.0 (H2.0), a simulation platform for training virtual robots in interactive 3D environments.
We make contributions to all levels of the embodied AI stack - data, simulation, and benchmark tasks.
arXiv Detail & Related papers (2021-06-28T05:42:15Z) - Occupancy Anticipation for Efficient Exploration and Navigation [97.17517060585875]
We propose occupancy anticipation, where the agent uses its egocentric RGB-D observations to infer the occupancy state beyond the visible regions.
By exploiting context in both the egocentric views and top-down maps our model successfully anticipates a broader map of the environment.
Our approach is the winning entry in the 2020 Habitat PointNav Challenge.
arXiv Detail & Related papers (2020-08-21T03:16:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.