LTS-NET: End-to-end Unsupervised Learning of Long-Term 3D Stable objects
- URL: http://arxiv.org/abs/2301.03426v3
- Date: Mon, 12 Jun 2023 07:18:39 GMT
- Title: LTS-NET: End-to-end Unsupervised Learning of Long-Term 3D Stable objects
- Authors: Ibrahim Hroob, Sergi Molina, Riccardo Polvara, Grzegorz Cielniak and
Marc Hanheide
- Abstract summary: We present an end-to-end data-driven pipeline for determining the long-term stability of objects within a given environment, specifically distinguishing between static and dynamic objects.
Our pipeline includes a labelling method that utilizes historical data from the environment to generate training data for a neural network.
Our approach is evaluated on point cloud data from two parking lots in the NCLT dataset, and the results show that our proposed solution, outperforms direct training of a classification model for static stability vs dynamic object classification.
- Score: 7.491472577165315
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this research, we present an end-to-end data-driven pipeline for
determining the long-term stability status of objects within a given
environment, specifically distinguishing between static and dynamic objects.
Understanding object stability is key for mobile robots since long-term stable
objects can be exploited as landmarks for long-term localisation. Our pipeline
includes a labelling method that utilizes historical data from the environment
to generate training data for a neural network. Rather than utilizing discrete
labels, we propose the use of point-wise continuous label values, indicating
the spatio-temporal stability of individual points, to train a point cloud
regression network named LTS-NET. Our approach is evaluated on point cloud data
from two parking lots in the NCLT dataset, and the results show that our
proposed solution, outperforms direct training of a classification model for
static vs dynamic object classification.
Related papers
- Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic Unbiased Learning [51.170479006249195]
We introduce a new dataset, benchmark, and a dynamic coarse-to-fine learning scheme in this study.
Our proposed dataset, AI-TOD-R, features the smallest object sizes among all oriented object detection datasets.
We present a benchmark spanning a broad range of detection paradigms, including both fully-supervised and label-efficient approaches.
arXiv Detail & Related papers (2024-12-16T09:14:32Z) - Learning a Low-Rank Feature Representation: Achieving Better Trade-Off
between Stability and Plasticity in Continual Learning [20.15493383736196]
In continual learning, networks confront a trade-off between stability and plasticity when trained on a sequence of tasks.
We propose a novel training algorithm called LRFR to bolster plasticity without sacrificing stability.
Using CIFAR-100 and TinyImageNet as benchmark datasets for continual learning, the proposed approach consistently outperforms state-of-the-art methods.
arXiv Detail & Related papers (2023-12-14T08:34:11Z) - PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection [66.94819989912823]
We propose a point-trajectory transformer with long short-term memory for efficient temporal 3D object detection.
We use point clouds of current-frame objects and their historical trajectories as input to minimize the memory bank storage requirement.
We conduct extensive experiments on the large-scale dataset to demonstrate that our approach performs well against state-of-the-art methods.
arXiv Detail & Related papers (2023-12-13T18:59:13Z) - Object Goal Navigation using Data Regularized Q-Learning [9.65323691689801]
Object Goal Navigation requires a robot to find and navigate to an instance of a target object class in a previously unseen environment.
Our framework incrementally builds a semantic map of the environment over time, and then repeatedly selects a long-term goal.
Long-term goal selection is formulated as a vision-based deep reinforcement learning problem.
arXiv Detail & Related papers (2022-08-27T13:26:30Z) - Open-Set Semi-Supervised Learning for 3D Point Cloud Understanding [62.17020485045456]
It is commonly assumed in semi-supervised learning (SSL) that the unlabeled data are drawn from the same distribution as that of the labeled ones.
We propose to selectively utilize unlabeled data through sample weighting, so that only conducive unlabeled data would be prioritized.
arXiv Detail & Related papers (2022-05-02T16:09:17Z) - Learning-based Point Cloud Registration for 6D Object Pose Estimation in
the Real World [55.7340077183072]
We tackle the task of estimating the 6D pose of an object from point cloud data.
Recent learning-based approaches to addressing this task have shown great success on synthetic datasets.
We analyze the causes of these failures, which we trace back to the difference between the feature distributions of the source and target point clouds.
arXiv Detail & Related papers (2022-03-29T07:55:04Z) - Multi-Object Tracking and Segmentation with a Space-Time Memory Network [12.043574473965318]
We propose a method for multi-object tracking and segmentation based on a novel memory-based mechanism to associate tracklets.
The proposed tracker, MeNToS, addresses particularly the long-term data association problem.
arXiv Detail & Related papers (2021-10-21T17:13:17Z) - 3D-FCT: Simultaneous 3D Object Detection and Tracking Using Feature
Correlation [0.0]
3D-FCT is a Siamese network architecture that utilizes temporal information to simultaneously perform the related tasks of 3D object detection and tracking.
Our proposed method is evaluated on the KITTI tracking dataset where it is shown to provide an improvement of 5.57% mAP over a state-of-the-art approach.
arXiv Detail & Related papers (2021-10-06T06:36:29Z) - Learning to Track with Object Permanence [61.36492084090744]
We introduce an end-to-end trainable approach for joint object detection and tracking.
Our model, trained jointly on synthetic and real data, outperforms the state of the art on KITTI, and MOT17 datasets.
arXiv Detail & Related papers (2021-03-26T04:43:04Z) - ePointDA: An End-to-End Simulation-to-Real Domain Adaptation Framework
for LiDAR Point Cloud Segmentation [111.56730703473411]
Training deep neural networks (DNNs) on LiDAR data requires large-scale point-wise annotations.
Simulation-to-real domain adaptation (SRDA) trains a DNN using unlimited synthetic data with automatically generated labels.
ePointDA consists of three modules: self-supervised dropout noise rendering, statistics-invariant and spatially-adaptive feature alignment, and transferable segmentation learning.
arXiv Detail & Related papers (2020-09-07T23:46:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.