ThermalLoc: A Vision Transformer-Based Approach for Robust Thermal Camera Relocalization in Large-Scale Environments
- URL: http://arxiv.org/abs/2506.18268v1
- Date: Mon, 23 Jun 2025 03:52:35 GMT
- Title: ThermalLoc: A Vision Transformer-Based Approach for Robust Thermal Camera Relocalization in Large-Scale Environments
- Authors: Yu Liu, Yangtao Meng, Xianfei Pan, Jie Jiang, Changhao Chen,
- Abstract summary: Thermal cameras capture environmental data through heat emission.<n>Traditional visual relocalization methods not directly applicable to thermal images.<n>We introduce ThermalLoc, a novel end-to-end deep learning method for thermal image relocalization.
- Score: 11.322440891972901
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Thermal cameras capture environmental data through heat emission, a fundamentally different mechanism compared to visible light cameras, which rely on pinhole imaging. As a result, traditional visual relocalization methods designed for visible light images are not directly applicable to thermal images. Despite significant advancements in deep learning for camera relocalization, approaches specifically tailored for thermal camera-based relocalization remain underexplored. To address this gap, we introduce ThermalLoc, a novel end-to-end deep learning method for thermal image relocalization. ThermalLoc effectively extracts both local and global features from thermal images by integrating EfficientNet with Transformers, and performs absolute pose regression using two MLP networks. We evaluated ThermalLoc on both the publicly available thermal-odometry dataset and our own dataset. The results demonstrate that ThermalLoc outperforms existing representative methods employed for thermal camera relocalization, including AtLoc, MapNet, PoseNet, and RobustLoc, achieving superior accuracy and robustness.
Related papers
- ThermalDiffusion: Visual-to-Thermal Image-to-Image Translation for Autonomous Navigation [6.524847658755803]
We propose a solution to augment multi-modal datasets with synthetic thermal data to enable widespread and rapid adaptation of thermal cameras.<n>We explore the use of conditional diffusion models to convert existing RGB images to thermal images using self-attention to learn the thermal properties of real-world objects.
arXiv Detail & Related papers (2025-06-26T03:18:22Z) - RS-vHeat: Heat Conduction Guided Efficient Remote Sensing Foundation Model [59.37279559684668]
We introduce RS-vHeat, an efficient multi-modal remote sensing foundation model.<n>Specifically, RS-vHeat applies the Heat Conduction Operator (HCO) with a complexity of $O(N1.5)$ and a global receptive field.<n>Compared to attention-based remote sensing foundation models, we reduce memory usage by 84%, FLOPs by 24% and improves throughput by 2.7 times.
arXiv Detail & Related papers (2024-11-27T01:43:38Z) - T-FAKE: Synthesizing Thermal Images for Facial Landmarking [8.20594611891252]
We introduce the T-FAKE dataset, a large-scale synthetic thermal dataset with sparse and dense landmarks.<n>We propose a novel RGB2Thermal loss function, which enables the domain-adaptive transfer of RGB faces to thermal style.<n>Our models show excellent performance with both sparse 70-point landmarks and dense 478-point landmark annotations.
arXiv Detail & Related papers (2024-08-27T15:07:58Z) - Building Vision Models upon Heat Conduction [66.1594989193046]
This study introduces the Heat Conduction Operator (HCO) built upon the physical heat conduction principle.<n>HCO conceptualizes image patches as heat sources and models their correlations through adaptive thermal energy diffusion.<n> vHeat achieves up to a 3x throughput, 80% less GPU memory allocation, and 35% fewer computational FLOPs compared to the Swin-Transformer.
arXiv Detail & Related papers (2024-05-26T12:58:04Z) - Segmentation Framework for Heat Loss Identification in Thermal Images:
Empowering Scottish Retrofitting and Thermographic Survey Companies [3.663784777941382]
This study proposes a deep learning (DL)-based segmentation framework using the Mask Region Proposal Convolutional Neural Network (Mask RCNN)
The objective of the framework is to identify, and crop heat loss sources caused by weak insulation, while also eliminating obstructive objects present in those images.
arXiv Detail & Related papers (2023-08-07T14:36:49Z) - Simultaneous temperature estimation and nonuniformity correction from
multiple frames [0.0]
Low-cost microbolometer-based IR cameras are prone to spatially nonuniformity and to drift in temperature measurements.
We propose a novel approach for simultaneous temperature estimation and nonuniformity correction (NUC) from multiple frames captured by low-cost microbolometer cameras.
arXiv Detail & Related papers (2023-07-23T11:28:25Z) - Precise Facial Landmark Detection by Reference Heatmap Transformer [52.417964103227696]
We propose a novel Reference Heatmap Transformer (RHT) for more precise facial landmark detection.
The experimental results from challenging benchmark datasets demonstrate that our proposed method outperforms the state-of-the-art methods in the literature.
arXiv Detail & Related papers (2023-03-14T12:26:48Z) - Does Thermal Really Always Matter for RGB-T Salient Object Detection? [153.17156598262656]
This paper proposes a network named TNet to solve the RGB-T salient object detection (SOD) task.
In this paper, we introduce a global illumination estimation module to predict the global illuminance score of the image.
On the other hand, we introduce a two-stage localization and complementation module in the decoding phase to transfer object localization cue and internal integrity cue in thermal features to the RGB modality.
arXiv Detail & Related papers (2022-10-09T13:50:12Z) - Maximizing Self-supervision from Thermal Image for Effective
Self-supervised Learning of Depth and Ego-motion [78.19156040783061]
Self-supervised learning of depth and ego-motion from thermal images shows strong robustness and reliability under challenging scenarios.
The inherent thermal image properties such as weak contrast, blurry edges, and noise hinder to generate effective self-supervision from thermal images.
We propose an effective thermal image mapping method that significantly increases image information, such as overall structure, contrast, and details, while preserving temporal consistency.
arXiv Detail & Related papers (2022-01-12T09:49:24Z) - Meta-UDA: Unsupervised Domain Adaptive Thermal Object Detection using
Meta-Learning [64.92447072894055]
Infrared (IR) cameras are robust under adverse illumination and lighting conditions.
We propose an algorithm meta-learning framework to improve existing UDA methods.
We produce a state-of-the-art thermal detector for the KAIST and DSIAC datasets.
arXiv Detail & Related papers (2021-10-07T02:28:18Z) - Robust pedestrian detection in thermal imagery using synthesized images [39.33977680993236]
We propose a method for improving pedestrian detection in the thermal domain using two stages.
First, a generative data augmentation approach is used, then a domain adaptation method using generated data adapts an RGB pedestrian detector.
Our detector achieves the best single-modality detection results on KAIST with respect to the state-of-the-art.
arXiv Detail & Related papers (2021-02-03T11:08:31Z) - Exploring Thermal Images for Object Detection in Underexposure Regions
for Autonomous Driving [67.69430435482127]
Underexposure regions are vital to construct a complete perception of the surroundings for safe autonomous driving.
The availability of thermal cameras has provided an essential alternate to explore regions where other optical sensors lack in capturing interpretable signals.
This work proposes a domain adaptation framework which employs a style transfer technique for transfer learning from visible spectrum images to thermal images.
arXiv Detail & Related papers (2020-06-01T09:59:09Z) - Unsupervised Image-generation Enhanced Adaptation for Object Detection
in Thermal images [4.810743887667828]
This paper proposes an unsupervised image-generation enhanced adaptation method for object detection in thermal images.
To reduce the gap between visible domain and thermal domain, the proposed method manages to generate simulated fake thermal images.
Experiments demonstrate the effectiveness and superiority of the proposed method.
arXiv Detail & Related papers (2020-02-17T04:53:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.