LTS-NET: End-to-end Unsupervised Learning of Long-Term 3D Stable objects
- URL: http://arxiv.org/abs/2301.03426v3
- Date: Mon, 12 Jun 2023 07:18:39 GMT
- Title: LTS-NET: End-to-end Unsupervised Learning of Long-Term 3D Stable objects
- Authors: Ibrahim Hroob, Sergi Molina, Riccardo Polvara, Grzegorz Cielniak and
Marc Hanheide
- Abstract summary: We present an end-to-end data-driven pipeline for determining the long-term stability of objects within a given environment, specifically distinguishing between static and dynamic objects.
Our pipeline includes a labelling method that utilizes historical data from the environment to generate training data for a neural network.
Our approach is evaluated on point cloud data from two parking lots in the NCLT dataset, and the results show that our proposed solution outperforms direct training of a classification model for static vs dynamic object classification.
- Score: 7.491472577165315
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this research, we present an end-to-end data-driven pipeline for
determining the long-term stability status of objects within a given
environment, specifically distinguishing between static and dynamic objects.
Understanding object stability is key for mobile robots since long-term stable
objects can be exploited as landmarks for long-term localisation. Our pipeline
includes a labelling method that utilizes historical data from the environment
to generate training data for a neural network. Rather than utilizing discrete
labels, we propose the use of point-wise continuous label values, indicating
the spatio-temporal stability of individual points, to train a point cloud
regression network named LTS-NET. Our approach is evaluated on point cloud data
from two parking lots in the NCLT dataset, and the results show that our
proposed solution outperforms direct training of a classification model for
static vs dynamic object classification.
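The idea of point-wise continuous stability labels derived from historical data can be illustrated with a minimal sketch. The abstract does not describe the labelling method in detail, so everything below is an assumption made for illustration: the score is taken to be the fraction of past sessions in which a point's voxel was occupied, and the names `voxel_of`, `occupancy_frequency`, `stability_labels`, and the 0.5 m voxel size are hypothetical, not taken from the paper.

```python
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple

Point = Tuple[float, float, float]
Voxel = Tuple[int, int, int]


def voxel_of(point: Point, voxel_size: float) -> Voxel:
    """Map a 3D point to the integer index of the voxel that contains it."""
    return tuple(int(c // voxel_size) for c in point)


def occupancy_frequency(
    sessions: Sequence[Sequence[Point]], voxel_size: float
) -> Dict[Voxel, float]:
    """Fraction of sessions in which each voxel held at least one point.

    Assumption: this frequency stands in for "spatio-temporal stability";
    the paper's actual labelling method may differ.
    """
    counts: Dict[Voxel, int] = defaultdict(int)
    for cloud in sessions:
        # Count each voxel at most once per session.
        for voxel in {voxel_of(p, voxel_size) for p in cloud}:
            counts[voxel] += 1
    total = len(sessions)
    return {voxel: n / total for voxel, n in counts.items()}


def stability_labels(
    cloud: Sequence[Point], frequency: Dict[Voxel, float], voxel_size: float
) -> List[float]:
    """Continuous label in [0, 1] per point; voxels never seen before get 0.0."""
    return [frequency.get(voxel_of(p, voxel_size), 0.0) for p in cloud]


# Toy history: a wall near the origin is seen in every session,
# while a parked car near x = 2.2 m is present in only half of them.
history = [
    [(0.1, 0.1, 0.1), (2.2, 0.1, 0.1)],
    [(0.2, 0.1, 0.1)],
    [(0.1, 0.2, 0.1), (2.3, 0.1, 0.1)],
    [(0.15, 0.1, 0.1)],
]
freq = occupancy_frequency(history, voxel_size=0.5)
print(stability_labels(history[0], freq, voxel_size=0.5))  # [1.0, 0.5]
```

Labels of this kind could serve as regression targets for a point cloud network, and a threshold on the predicted value would recover a static vs dynamic decision if a discrete output is needed.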
Related papers
- STCMOT: Spatio-Temporal Cohesion Learning for UAV-Based Multiple Object Tracking [13.269416985959404]
Multiple object tracking (MOT) in Unmanned Aerial Vehicle (UAV) videos is important for diverse applications in computer vision.
We propose a novel Spatio-Temporal Cohesion Multiple Object Tracking framework (STCMOT).
We use historical embedding features to model the representation of ReID and detection features in a sequential order.
Our framework sets a new state-of-the-art performance in MOTA and IDF1 metrics.
arXiv Detail & Related papers (2024-09-17T14:34:18Z)
- SeMoLi: What Moves Together Belongs Together [51.72754014130369]
We tackle semi-supervised object detection based on motion cues.
Recent results suggest that motion-based clustering methods can be used to pseudo-label instances of moving objects.
We re-think this approach and suggest that both object detection and motion-inspired pseudo-labeling can be tackled in a data-driven manner.
arXiv Detail & Related papers (2024-02-29T18:54:53Z) - Learning a Low-Rank Feature Representation: Achieving Better Trade-Off
between Stability and Plasticity in Continual Learning [20.15493383736196]
In continual learning, networks confront a trade-off between stability and plasticity when trained on a sequence of tasks.
We propose a novel training algorithm called LRFR to bolster plasticity without sacrificing stability.
Using CIFAR-100 and TinyImageNet as benchmark datasets for continual learning, the proposed approach consistently outperforms state-of-the-art methods.
arXiv Detail & Related papers (2023-12-14T08:34:11Z) - PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection [66.94819989912823]
We propose a point-trajectory transformer with long short-term memory for efficient temporal 3D object detection.
We use point clouds of current-frame objects and their historical trajectories as input to minimize the memory bank storage requirement.
We conduct extensive experiments on the large-scale dataset to demonstrate that our approach performs well against state-of-the-art methods.
arXiv Detail & Related papers (2023-12-13T18:59:13Z) - Object Goal Navigation using Data Regularized Q-Learning [9.65323691689801]
Object Goal Navigation requires a robot to find and navigate to an instance of a target object class in a previously unseen environment.
Our framework incrementally builds a semantic map of the environment over time, and then repeatedly selects a long-term goal.
Long-term goal selection is formulated as a vision-based deep reinforcement learning problem.
arXiv Detail & Related papers (2022-08-27T13:26:30Z) - Open-Set Semi-Supervised Learning for 3D Point Cloud Understanding [62.17020485045456]
It is commonly assumed in semi-supervised learning (SSL) that the unlabeled data are drawn from the same distribution as that of the labeled ones.
We propose to selectively utilize unlabeled data through sample weighting, so that only conducive unlabeled data would be prioritized.
arXiv Detail & Related papers (2022-05-02T16:09:17Z) - Learning-based Point Cloud Registration for 6D Object Pose Estimation in
the Real World [55.7340077183072]
We tackle the task of estimating the 6D pose of an object from point cloud data.
Recent learning-based approaches to addressing this task have shown great success on synthetic datasets.
We analyze the causes of these failures, which we trace back to the difference between the feature distributions of the source and target point clouds.
arXiv Detail & Related papers (2022-03-29T07:55:04Z) - Multi-Object Tracking and Segmentation with a Space-Time Memory Network [12.043574473965318]
We propose a method for multi-object tracking and segmentation based on a novel memory-based mechanism to associate tracklets.
The proposed tracker, MeNToS, addresses particularly the long-term data association problem.
arXiv Detail & Related papers (2021-10-21T17:13:17Z) - 3D-FCT: Simultaneous 3D Object Detection and Tracking Using Feature
Correlation [0.0]
3D-FCT is a Siamese network architecture that utilizes temporal information to simultaneously perform the related tasks of 3D object detection and tracking.
Our proposed method is evaluated on the KITTI tracking dataset where it is shown to provide an improvement of 5.57% mAP over a state-of-the-art approach.
arXiv Detail & Related papers (2021-10-06T06:36:29Z) - Learning to Track with Object Permanence [61.36492084090744]
We introduce an end-to-end trainable approach for joint object detection and tracking.
Our model, trained jointly on synthetic and real data, outperforms the state of the art on the KITTI and MOT17 datasets.
arXiv Detail & Related papers (2021-03-26T04:43:04Z) - ePointDA: An End-to-End Simulation-to-Real Domain Adaptation Framework
for LiDAR Point Cloud Segmentation [111.56730703473411]
Training deep neural networks (DNNs) on LiDAR data requires large-scale point-wise annotations.
Simulation-to-real domain adaptation (SRDA) trains a DNN using unlimited synthetic data with automatically generated labels.
ePointDA consists of three modules: self-supervised dropout noise rendering, statistics-invariant and spatially-adaptive feature alignment, and transferable segmentation learning.
arXiv Detail & Related papers (2020-09-07T23:46:08Z)