Convolutional Bayesian Kernel Inference for 3D Semantic Mapping
- URL: http://arxiv.org/abs/2209.10663v2
- Date: Wed, 31 May 2023 19:41:32 GMT
- Title: Convolutional Bayesian Kernel Inference for 3D Semantic Mapping
- Authors: Joey Wilson, Yuewei Fu, Arthur Zhang, Jingyu Song, Andrew Capodieci,
Paramsothy Jayakumar, Kira Barton, and Maani Ghaffari
- Abstract summary: We introduce a Convolutional Bayesian Kernel Inference layer which learns to perform explicit Bayesian inference.
We learn semantic-geometric probability distributions for LiDAR sensor information and incorporate semantic predictions into a global map.
We evaluate our network against state-of-the-art semantic mapping algorithms on the KITTI data set, demonstrating improved latency with comparable semantic label inference results.
- Score: 1.7615233156139762
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Robotic perception is currently at a cross-roads between modern methods,
which operate in an efficient latent space, and classical methods, which are
mathematically founded and provide interpretable, trustworthy results. In this
paper, we introduce a Convolutional Bayesian Kernel Inference (ConvBKI) layer
which learns to perform explicit Bayesian inference within a depthwise
separable convolution layer to maximize efficency while maintaining reliability
simultaneously. We apply our layer to the task of real-time 3D semantic
mapping, where we learn semantic-geometric probability distributions for LiDAR
sensor information and incorporate semantic predictions into a global map. We
evaluate our network against state-of-the-art semantic mapping algorithms on
the KITTI data set, demonstrating improved latency with comparable semantic
label inference results.
Related papers
- Latent BKI: Open-Dictionary Continuous Mapping in Visual-Language Latent Spaces with Quantifiable Uncertainty [6.986230616834552]
This paper introduces a novel probabilistic mapping algorithm, Latent BKI, which enables open-vocabulary mapping with quantifiable uncertainty.
Latent BKI is evaluated against similar explicit semantic mapping and VL mapping frameworks on the popular MatterPort-3D and Semantic KITTI data sets.
Real-world experiments demonstrate applicability to challenging indoor environments.
arXiv Detail & Related papers (2024-10-15T17:02:32Z) - IPoD: Implicit Field Learning with Point Diffusion for Generalizable 3D Object Reconstruction from Single RGB-D Images [50.4538089115248]
Generalizable 3D object reconstruction from single-view RGB-D images remains a challenging task.
We propose a novel approach, IPoD, which harmonizes implicit field learning with point diffusion.
Experiments conducted on the CO3D-v2 dataset affirm the superiority of IPoD, achieving 7.8% improvement in F-score and 28.6% in Chamfer distance over existing methods.
arXiv Detail & Related papers (2024-03-30T07:17:37Z) - Evidential Semantic Mapping in Off-road Environments with Uncertainty-aware Bayesian Kernel Inference [5.120567378386614]
We propose an evidential semantic mapping framework, which can enhance reliability in perceptually challenging off-road environments.
By adaptively handling semantic uncertainties, the proposed framework constructs robust representations of the surroundings even in previously unseen environments.
arXiv Detail & Related papers (2024-03-21T05:13:34Z) - ConvBKI: Real-Time Probabilistic Semantic Mapping Network with Quantifiable Uncertainty [7.537718151195062]
We develop a modular neural network for real-time colorblack(> 10 Hz) semantic mapping in uncertain environments.
Our approach combines the reliability of classical probabilistic algorithms with the performance and efficiency of modern neural networks.
arXiv Detail & Related papers (2023-10-24T17:30:26Z) - Volumetric Semantically Consistent 3D Panoptic Mapping [77.13446499924977]
We introduce an online 2D-to-3D semantic instance mapping algorithm aimed at generating semantic 3D maps suitable for autonomous agents in unstructured environments.
It introduces novel ways of integrating semantic prediction confidence during mapping, producing semantic and instance-consistent 3D regions.
The proposed method achieves accuracy superior to the state of the art on public large-scale datasets, improving on a number of widely used metrics.
arXiv Detail & Related papers (2023-09-26T08:03:10Z) - Intensity Profile Projection: A Framework for Continuous-Time
Representation Learning for Dynamic Networks [50.2033914945157]
We present a representation learning framework, Intensity Profile Projection, for continuous-time dynamic network data.
The framework consists of three stages: estimating pairwise intensity functions, learning a projection which minimises a notion of intensity reconstruction error.
Moreoever, we develop estimation theory providing tight control on the error of any estimated trajectory, indicating that the representations could even be used in quite noise-sensitive follow-on analyses.
arXiv Detail & Related papers (2023-06-09T15:38:25Z) - X-Distill: Improving Self-Supervised Monocular Depth via Cross-Task
Distillation [69.9604394044652]
We propose a novel method to improve the self-supervised training of monocular depth via cross-task knowledge distillation.
During training, we utilize a pretrained semantic segmentation teacher network and transfer its semantic knowledge to the depth network.
We extensively evaluate the efficacy of our proposed approach on the KITTI benchmark and compare it with the latest state of the art.
arXiv Detail & Related papers (2021-10-24T19:47:14Z) - PDC-Net+: Enhanced Probabilistic Dense Correspondence Network [161.76275845530964]
Enhanced Probabilistic Dense Correspondence Network, PDC-Net+, capable of estimating accurate dense correspondences.
We develop an architecture and an enhanced training strategy tailored for robust and generalizable uncertainty prediction.
Our approach obtains state-of-the-art results on multiple challenging geometric matching and optical flow datasets.
arXiv Detail & Related papers (2021-09-28T17:56:41Z) - Dynamic Semantic Occupancy Mapping using 3D Scene Flow and Closed-Form
Bayesian Inference [3.0389083199673337]
We leverage state-of-the-art semantic segmentation and 3D flow estimation using deep learning to provide measurements for map inference.
We develop a continuous (i.e., can be queried at arbitrary resolution) Bayesian model that propagates the scene with flow and infers a 3D semantic occupancy map with better performance than its static counterpart.
arXiv Detail & Related papers (2021-08-06T15:51:40Z) - Weakly-Supervised Semantic Segmentation by Iterative Affinity Learning [86.45526827323954]
Weakly-supervised semantic segmentation is a challenging task as no pixel-wise label information is provided for training.
We propose an iterative algorithm to learn such pairwise relations.
We show that the proposed algorithm performs favorably against the state-of-the-art methods.
arXiv Detail & Related papers (2020-02-19T10:32:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.