Comparing View-Based and Map-Based Semantic Labelling in Real-Time SLAM
- URL: http://arxiv.org/abs/2002.10342v1
- Date: Mon, 24 Feb 2020 16:12:51 GMT
- Title: Comparing View-Based and Map-Based Semantic Labelling in Real-Time SLAM
- Authors: Zoe Landgraf, Fabian Falck, Michael Bloesch, Stefan Leutenegger,
Andrew Davison
- Abstract summary: Spatial AI systems must build persistent scene representations where geometric models are combined with meaningful semantic labels.
Here, we present an experimental framework and comparison which uses real-time height map fusion as an accessible platform for a fair comparison.
- Score: 21.502428526207233
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Generally capable Spatial AI systems must build persistent scene
representations where geometric models are combined with meaningful semantic
labels. The many approaches to labelling scenes can be divided into two clear
groups: view-based which estimate labels from the input view-wise data and then
incrementally fuse them into the scene model as it is built; and map-based
which label the generated scene model. However, there has so far been no
attempt to quantitatively compare view-based and map-based labelling. Here, we
present an experimental framework and comparison which uses real-time height
map fusion as an accessible platform for a fair comparison, opening up the
route to further systematic research in this area.
Related papers
- Using a negative spatial auto-correlation index to evaluate and improve intrinsic TagMap's multi-scale visualization capabilities [1.829179623372777]
Existing methodologies for tag maps primarily focus on tag layout at specific scales.
We incorporate the negative spatial auto-correlation index into tag maps to assess the uniformity of tag size distribution.
This enhancement involves iteratively filtering out candidate tags and selecting optimal tags that meet the defined index criteria.
arXiv Detail & Related papers (2024-08-08T08:52:27Z) - Label Learning Method Based on Tensor Projection [82.51786483693206]
We propose a label learning method based on tensor projection (LLMTP)
We extend the matrix projection transformation to tensor projection, so that the spatial structure information between views can be fully utilized.
In addition, we introduce the tensor Schatten $p$-norm regularization to make the clustering label matrices of different views as consistent as possible.
arXiv Detail & Related papers (2024-02-26T13:03:26Z) - A Data-efficient Framework for Robotics Large-scale LiDAR Scene Parsing [10.497309421830671]
Existing state-of-the-art 3D point clouds understanding methods only perform well in a fully supervised manner.
This work presents a general and simple framework to tackle point clouds understanding when labels are limited.
arXiv Detail & Related papers (2023-12-03T02:38:51Z) - Semantic-Aware Representation Blending for Multi-Label Image Recognition
with Partial Labels [86.17081952197788]
We propose to blend category-specific representation across different images to transfer information of known labels to complement unknown labels.
Experiments on the MS-COCO, Visual Genome, Pascal VOC 2007 datasets show that the proposed SARB framework obtains superior performance over current leading competitors.
arXiv Detail & Related papers (2022-03-04T07:56:16Z) - AutoGeoLabel: Automated Label Generation for Geospatial Machine Learning [69.47585818994959]
We evaluate a big data processing pipeline to auto-generate labels for remote sensing data.
We utilize the big geo-data platform IBM PAIRS to dynamically generate such labels in dense urban areas.
arXiv Detail & Related papers (2022-01-31T20:02:22Z) - MAOMaps: A Photo-Realistic Benchmark For vSLAM and Map Merging Quality
Assessment [0.0]
We introduce a novel benchmark that is aimed at quantitatively evaluating the quality of vision-based simultaneous localization and mapping (vSLAM) and map merging algorithms.
The dataset is photo-realistic and provides both the localization and the map ground truth data.
To compare the vSLAM-built maps and the ground-truth ones we introduce a novel way to find correspondences between them that takes the SLAM context into account.
arXiv Detail & Related papers (2021-05-31T14:30:36Z) - Debiased Contrastive Learning [64.98602526764599]
We develop a debiased contrastive objective that corrects for the sampling of same-label datapoints.
Empirically, the proposed objective consistently outperforms the state-of-the-art for representation learning in vision, language, and reinforcement learning benchmarks.
arXiv Detail & Related papers (2020-07-01T04:25:24Z) - Rethinking Localization Map: Towards Accurate Object Perception with
Self-Enhancement Maps [78.2581910688094]
This work introduces a novel self-enhancement method to harvest accurate object localization maps and object boundaries with only category labels as supervision.
In particular, the proposed Self-Enhancement Maps achieve the state-of-the-art localization accuracy of 54.88% on ILSVRC.
arXiv Detail & Related papers (2020-06-09T12:35:55Z) - Weakly-Supervised Salient Object Detection via Scribble Annotations [54.40518383782725]
We propose a weakly-supervised salient object detection model to learn saliency from scribble labels.
We present a new metric, termed saliency structure measure, to measure the structure alignment of the predicted saliency maps.
Our method not only outperforms existing weakly-supervised/unsupervised methods, but also is on par with several fully-supervised state-of-the-art models.
arXiv Detail & Related papers (2020-03-17T12:59:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.