Real-Time Fusion of Visual and Chart Data for Enhanced Maritime Vision
- URL: http://arxiv.org/abs/2507.13880v1
- Date: Fri, 18 Jul 2025 12:58:11 GMT
- Title: Real-Time Fusion of Visual and Chart Data for Enhanced Maritime Vision
- Authors: Marten Kreis, Benjamin Kiefer,
- Abstract summary: We present a novel approach to enhancing marine vision by fusing real-time visual data with chart information.<n>Our system overlays nautical chart data onto live video feeds by accurately matching detected navigational aids, such as buoys, with their corresponding representations in chart data.
- Score: 2.14769181770878
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper presents a novel approach to enhancing marine vision by fusing real-time visual data with chart information. Our system overlays nautical chart data onto live video feeds by accurately matching detected navigational aids, such as buoys, with their corresponding representations in chart data. To achieve robust association, we introduce a transformer-based end-to-end neural network that predicts bounding boxes and confidence scores for buoy queries, enabling the direct matching of image-domain detections with world-space chart markers. The proposed method is compared against baseline approaches, including a ray-casting model that estimates buoy positions via camera projection and a YOLOv7-based network extended with a distance estimation module. Experimental results on a dataset of real-world maritime scenes demonstrate that our approach significantly improves object localization and association accuracy in dynamic and challenging environments.
Related papers
- RelMap: Reliable Spatiotemporal Sensor Data Visualization via Imputative Spatial Interpolation [18.947107160943595]
This paper introduces a novel shorttemporal data pipeline that achieves reliable results and produces a novel heatmap representation with uncertainty information.<n>We leverage imputation from Neural Networks (GNNs) to enhance visualization reliability and temporal resolution.
arXiv Detail & Related papers (2025-08-02T07:25:23Z) - Scalable Dynamic Origin-Destination Demand Estimation Enhanced by High-Resolution Satellite Imagery Data [1.735650011442274]
This study presents a novel framework for dynamic origin-destination demand estimation (DODE) in multi-class mesoscopic network models.<n>We leverage high-resolution satellite imagery together with conventional traffic data from local sensors.<n>Results show that supplementing traditional data with satellite-derived density significantly improves estimation performance.
arXiv Detail & Related papers (2025-06-25T06:47:06Z) - Deep Domain Adaptation: A Sim2Real Neural Approach for Improving Eye-Tracking Systems [80.62854148838359]
Eye image segmentation is a critical step in eye tracking that has great influence over the final gaze estimate.
We use dimensionality-reduction techniques to measure the overlap between the target eye images and synthetic training data.
Our methods result in robust, improved performance when tackling the discrepancy between simulation and real-world data samples.
arXiv Detail & Related papers (2024-03-23T22:32:06Z) - Image and AIS Data Fusion Technique for Maritime Computer Vision
Applications [1.482087972733629]
We develop a technique that fuses Automatic Identification System (AIS) data with vessels detected in images to create datasets.
Our approach associates detected ships to their corresponding AIS messages by estimating distance and azimuth.
This technique is useful for creating datasets for waterway traffic management, encounter detection, and surveillance.
arXiv Detail & Related papers (2023-12-07T20:54:49Z) - Velocity-Based Channel Charting with Spatial Distribution Map Matching [4.913210912019975]
Fingerprint-based localization improves positioning performance in challenging, non-line-of-sight (NLoS) dominated indoor environments.
Channel-charting avoids this labeling effort as it implicitly associates relative coordinates to the recorded radio signals.
We propose a novel framework that does not require reference positions to keep the models up to date.
arXiv Detail & Related papers (2023-11-14T09:21:09Z) - Unified Visual Relationship Detection with Vision and Language Models [89.77838890788638]
This work focuses on training a single visual relationship detector predicting over the union of label spaces from multiple datasets.
We propose UniVRD, a novel bottom-up method for Unified Visual Relationship Detection by leveraging vision and language models.
Empirical results on both human-object interaction detection and scene-graph generation demonstrate the competitive performance of our model.
arXiv Detail & Related papers (2023-03-16T00:06:28Z) - iSDF: Real-Time Neural Signed Distance Fields for Robot Perception [64.80458128766254]
iSDF is a continuous learning system for real-time signed distance field reconstruction.
It produces more accurate reconstructions and better approximations of collision costs and gradients.
arXiv Detail & Related papers (2022-04-05T15:48:39Z) - Towards Scale Consistent Monocular Visual Odometry by Learning from the
Virtual World [83.36195426897768]
We propose VRVO, a novel framework for retrieving the absolute scale from virtual data.
We first train a scale-aware disparity network using both monocular real images and stereo virtual data.
The resulting scale-consistent disparities are then integrated with a direct VO system.
arXiv Detail & Related papers (2022-03-11T01:51:54Z) - An Online Semantic Mapping System for Extending and Enhancing Visual
SLAM [2.538209532048867]
We present a real-time semantic mapping approach for mobile vision systems with a 2D to 3D object detection pipeline and rapid data association for generated landmarks.
Our system reaches real-time capabilities with an average iteration duration of 65ms and is able to improve the pose estimation of a state-of-the-art SLAM by up to 68% on a public dataset.
arXiv Detail & Related papers (2022-03-08T09:14:37Z) - TANDEM: Tracking and Dense Mapping in Real-time using Deep Multi-view
Stereo [55.30992853477754]
We present TANDEM, a real-time monocular tracking and dense framework.
For pose estimation, TANDEM performs photometric bundle adjustment based on a sliding window of alignments.
TANDEM shows state-of-the-art real-time 3D reconstruction performance.
arXiv Detail & Related papers (2021-11-14T19:01:02Z) - Radar-based Dynamic Occupancy Grid Mapping and Object Detection [55.74894405714851]
In recent years, the classical occupancy grid map approach has been extended to dynamic occupancy grid maps.
This paper presents the further development of a previous approach.
The data of multiple radar sensors are fused, and a grid-based object tracking and mapping method is applied.
arXiv Detail & Related papers (2020-08-09T09:26:30Z) - A Neural Architecture for Detecting Confusion in Eye-tracking Data [1.8655840060559168]
We introduce an architecture that uses RNN and CNN sub-models in parallel to take advantage of the temporal and visuospatial aspects of our data.
Our model outperforms an existing model based on Random Forests resulting in a 22% improvement in combined sensitivity & specificity.
arXiv Detail & Related papers (2020-03-13T18:20:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.