LST-SLAM: A Stereo Thermal SLAM System for Kilometer-Scale Dynamic Environments
- URL: http://arxiv.org/abs/2602.20925v1
- Date: Tue, 24 Feb 2026 14:04:54 GMT
- Title: LST-SLAM: A Stereo Thermal SLAM System for Kilometer-Scale Dynamic Environments
- Authors: Zeyu Jiang, Kuan Xu, Changhao Chen,
- Abstract summary: LST-SLAM is a novel large-scale stereo thermal SLAM system that achieves robust performance in complex, dynamic scenes.<n>Our approach combines self-supervised thermal feature learning, stereo dual-level motion tracking, and geometric pose optimization.<n>Experiments on kilometer-scale dynamic thermal datasets show that LST-SLAM significantly outperforms recent representative SLAM systems.
- Score: 9.986292956956953
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Thermal cameras offer strong potential for robot perception under challenging illumination and weather conditions. However, thermal Simultaneous Localization and Mapping (SLAM) remains difficult due to unreliable feature extraction, unstable motion tracking, and inconsistent global pose and map construction, particularly in dynamic large-scale outdoor environments. To address these challenges, we propose LST-SLAM, a novel large-scale stereo thermal SLAM system that achieves robust performance in complex, dynamic scenes. Our approach combines self-supervised thermal feature learning, stereo dual-level motion tracking, and geometric pose optimization. We also introduce a semantic-geometric hybrid constraint that suppresses potentially dynamic features lacking strong inter-frame geometric consistency. Furthermore, we develop an online incremental bag-of-words model for loop closure detection, coupled with global pose optimization to mitigate accumulated drift. Extensive experiments on kilometer-scale dynamic thermal datasets show that LST-SLAM significantly outperforms recent representative SLAM systems, including AirSLAM and DROID-SLAM, in both robustness and accuracy.
Related papers
- VIGS-SLAM: Visual Inertial Gaussian Splatting SLAM [75.55522219717137]
We present VIGS-SLAM, a visual-inertial 3D Gaussian Splatting SLAM system.<n>It achieves robust real-time tracking and high-fidelity reconstruction.<n>Our method tightly couples visual and inertial cues within a unified optimization framework.
arXiv Detail & Related papers (2025-12-02T00:19:13Z) - HAD: Hierarchical Asymmetric Distillation to Bridge Spatio-Temporal Gaps in Event-Based Object Tracking [80.07224739976911]
Event cameras offer exceptional temporal resolution and a range (modal)<n> RGB cameras excel at capturing rich texture with high resolution, whereas event cameras offer exceptional temporal resolution and a range (modal)
arXiv Detail & Related papers (2025-10-22T13:15:13Z) - WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments [48.51530726697405]
We present WildGS-SLAM, a robust and efficient monocular RGB SLAM system designed to handle dynamic environments.<n>We introduce an uncertainty map, predicted by a shallow multi-layer perceptron and DINOv2 features, to guide dynamic object removal during both tracking and mapping.<n>Results showcase WildGS-SLAM's superior performance in dynamic environments compared to state-of-the-art methods.
arXiv Detail & Related papers (2025-04-04T19:19:40Z) - SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images [14.322021490470414]
We present DarkSLAM, a noval deep learning-based monocular thermal SLAM system for complex lighting conditions.<n>Our approach incorporates the Efficient Channel Attention (ECA) mechanism in visual odometry and the Selective Kernel Attention (SKA) mechanism in depth estimation.<n>It delivers precise localization and 3D dense mapping even in challenging nighttime environments.
arXiv Detail & Related papers (2025-02-26T08:34:23Z) - Monte Carlo Tree Search with Velocity Obstacles for safe and efficient motion planning in dynamic environments [49.30744329170107]
We propose a novel approach for optimal online motion planning with minimal information about dynamic obstacles.<n>The proposed methodology combines Monte Carlo Tree Search (MCTS), for online optimal planning via model simulations, with Velocity Obstacles (VO), for obstacle avoidance.<n>We show the superiority of our methodology with respect to state-of-the-art planners, including Non-linear Model Predictive Control (NMPC), in terms of improved collision rate, computational and task performance.
arXiv Detail & Related papers (2025-01-16T16:45:08Z) - ROVER: A Multi-Season Dataset for Visual SLAM [7.296917102476635]
ROVER is a benchmark dataset for evaluating visual SLAM algorithms in diverse environmental conditions.<n>It covers 39 recordings across five outdoor locations, collected through all seasons and various lighting scenarios.<n>Results show that while stereo-inertial and RGBD configurations perform better under favorable lighting, most SLAM systems perform poorly in low-light and high-vegetation scenarios.
arXiv Detail & Related papers (2024-12-03T15:34:00Z) - NICER-SLAM: Neural Implicit Scene Encoding for RGB SLAM [111.83168930989503]
NICER-SLAM is a dense RGB SLAM system that simultaneously optimize for camera poses and a hierarchical neural implicit map representation.
We show strong performance in dense mapping, tracking, and novel view synthesis, even competitive with recent RGB-D SLAM systems.
arXiv Detail & Related papers (2023-02-07T17:06:34Z) - Dense RGB-D-Inertial SLAM with Map Deformations [25.03159756734727]
We propose the first tightly-coupled dense RGB-D-inertial SLAM system.
We show that our system is more robust to fast motions and periods of low texture and low geometric variation than a related RGB-D-only SLAM system.
arXiv Detail & Related papers (2022-07-22T08:33:38Z) - PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in
Dynamic Scenes [0.0]
This paper proposes a real-time stereo indirect visual SLAM system, PLD-SLAM, which combines point and line features.
We also present a novel global gray similarity (GGS) algorithm to achieve reasonable selection and efficient loop closure detection.
arXiv Detail & Related papers (2022-07-22T07:40:00Z) - Pushing the Envelope of Rotation Averaging for Visual SLAM [69.7375052440794]
We propose a novel optimization backbone for visual SLAM systems.
We leverage averaging to improve the accuracy, efficiency and robustness of conventional monocular SLAM systems.
Our approach can exhibit up to 10x faster with comparable accuracy against the state-art on public benchmarks.
arXiv Detail & Related papers (2020-11-02T18:02:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.