From Obstacles to Etiquette: Robot Social Navigation with VLM-Informed Path Selection
- URL: http://arxiv.org/abs/2602.09002v1
- Date: Mon, 09 Feb 2026 18:46:12 GMT
- Title: From Obstacles to Etiquette: Robot Social Navigation with VLM-Informed Path Selection
- Authors: Zilin Fang, Anxing Xiao, David Hsu, Gim Hee Lee,
- Abstract summary: This paper presents a social robot navigation framework that integrates geometric planning with contextual social reasoning.<n>The system first extracts obstacles and human dynamics to generate geometrically feasible candidate paths, then leverages a fine-tuned vision-language model (VLM) to evaluate these paths.<n>Experiments in four social navigation contexts demonstrate that our method achieves the best overall performance with the lowest personal space violation duration, the minimal pedestrian-facing time, and no social zone intrusions.
- Score: 57.74400052368147
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Navigating socially in human environments requires more than satisfying geometric constraints, as collision-free paths may still interfere with ongoing activities or conflict with social norms. Addressing this challenge calls for analyzing interactions between agents and incorporating common-sense reasoning into planning. This paper presents a social robot navigation framework that integrates geometric planning with contextual social reasoning. The system first extracts obstacles and human dynamics to generate geometrically feasible candidate paths, then leverages a fine-tuned vision-language model (VLM) to evaluate these paths, informed by contextually grounded social expectations, selecting a socially optimized path for the controller. This task-specific VLM distills social reasoning from large foundation models into a smaller and efficient model, allowing the framework to perform real-time adaptation in diverse human-robot interaction contexts. Experiments in four social navigation contexts demonstrate that our method achieves the best overall performance with the lowest personal space violation duration, the minimal pedestrian-facing time, and no social zone intrusions. Project page: https://path-etiquette.github.io
Related papers
- LISN: Language-Instructed Social Navigation with VLM-based Controller Modulating [47.62872797480247]
We present LISN-Bench, the first simulation-based benchmark for language-instructed social navigation.<n>We propose Social-Nav-Modulator, a fast-slow hierarchical system where a VLM agent modulates costmaps and controller parameters.<n>Our method achieves an average success rate of 91.3%, which is greater than 63% than the most competitive baseline.
arXiv Detail & Related papers (2025-12-10T18:54:30Z) - Learning Social Heuristics for Human-Aware Path Planning [3.2957337131930493]
Social robotic navigation has been at the center of numerous studies in recent years.<n>We propose Heuristic Planning with Learned Social Value (HPLSV), a method to learn a value function encapsulating the cost of social navigation.
arXiv Detail & Related papers (2025-09-02T09:36:11Z) - GSON: A Group-based Social Navigation Framework with Large Multimodal Model [9.94576166903495]
This paper introduces GSON, a novel group-based social navigation framework.<n>GSON uses visual prompting to enable zero-shot extraction of social relationships among pedestrians.<n>We validate GSON through extensive real-world mobile robot navigation experiments.
arXiv Detail & Related papers (2024-09-26T17:27:15Z) - Learning Social Cost Functions for Human-Aware Path Planning [2.6995631218854235]
We propose a novel method to recognize common social scenarios and modify a traditional planner's cost function to adapt to them.
Our approach allows the robot to learn different social norms with a single learned model, rather than having different modules for each task.
arXiv Detail & Related papers (2024-07-15T08:57:02Z) - Socially Integrated Navigation: A Social Acting Robot with Deep Reinforcement Learning [0.7864304771129751]
Mobile robots are being used on a large scale in various crowded situations and become part of our society.
Socially acceptable navigation behavior of a mobile robot with individual human consideration is an essential requirement for scalable applications and human acceptance.
We propose a novel socially integrated navigation approach where the robot's social behavior is adaptive and emerges from the interaction with humans.
arXiv Detail & Related papers (2024-03-14T18:25:40Z) - Multi-Agent Dynamic Relational Reasoning for Social Robot Navigation [50.01551945190676]
Social robot navigation can be helpful in various contexts of daily life but requires safe human-robot interactions and efficient trajectory planning.
We propose a systematic relational reasoning approach with explicit inference of the underlying dynamically evolving relational structures.
We demonstrate its effectiveness for multi-agent trajectory prediction and social robot navigation.
arXiv Detail & Related papers (2024-01-22T18:58:22Z) - Principles and Guidelines for Evaluating Social Robot Navigation
Algorithms [44.51586279645062]
Social robot navigation is difficult to evaluate because it involves dynamic human agents and their perceptions of the appropriateness of robot behavior.
Our contributions include (a) a definition of a socially navigating robot as one that respects the principles of safety, comfort, legibility, politeness, social competency, agent understanding, proactivity, and responsiveness to context, (b) guidelines for the use of metrics, development of scenarios, benchmarks, datasets, and simulators to evaluate social navigation, and (c) a social navigation metrics framework to make it easier to compare results from different simulators, robots and datasets.
arXiv Detail & Related papers (2023-06-29T07:31:43Z) - Exploiting Socially-Aware Tasks for Embodied Social Navigation [17.48110264302196]
We propose an end-to-end architecture that exploits Socially-Aware Tasks to inject into a reinforcement learning navigation policy.
To this end, our tasks exploit the notion of immediate and future dangers of collision.
We validate our approach on Gibson4+ and Habitat-Matterport3D datasets.
arXiv Detail & Related papers (2022-12-01T18:52:46Z) - Socially Compliant Navigation Dataset (SCAND): A Large-Scale Dataset of
Demonstrations for Social Navigation [92.66286342108934]
Social navigation is the capability of an autonomous agent, such as a robot, to navigate in a'socially compliant' manner in the presence of other intelligent agents such as humans.
Our dataset contains 8.7 hours, 138 trajectories, 25 miles of socially compliant, human teleoperated driving demonstrations.
arXiv Detail & Related papers (2022-03-28T19:09:11Z) - SSAGCN: Social Soft Attention Graph Convolution Network for Pedestrian
Trajectory Prediction [59.064925464991056]
We propose one new prediction model named Social Soft Attention Graph Convolution Network (SSAGCN)
SSAGCN aims to simultaneously handle social interactions among pedestrians and scene interactions between pedestrians and environments.
Experiments on public available datasets prove the effectiveness of SSAGCN and have achieved state-of-the-art results.
arXiv Detail & Related papers (2021-12-05T01:49:18Z) - PHASE: PHysically-grounded Abstract Social Events for Machine Social
Perception [50.551003004553806]
We create a dataset of physically-grounded abstract social events, PHASE, that resemble a wide range of real-life social interactions.
Phase is validated with human experiments demonstrating that humans perceive rich interactions in the social events.
As a baseline model, we introduce a Bayesian inverse planning approach, SIMPLE, which outperforms state-of-the-art feed-forward neural networks.
arXiv Detail & Related papers (2021-03-02T18:44:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.