GMA3D: Local-Global Attention Learning to Estimate Occluded Motions of
Scene Flow
- URL: http://arxiv.org/abs/2210.03296v2
- Date: Sun, 23 Jul 2023 04:28:18 GMT
- Title: GMA3D: Local-Global Attention Learning to Estimate Occluded Motions of
Scene Flow
- Authors: Zhiyang Lu, Ming Cheng
- Abstract summary: We propose a GMA3D module based on the transformer framework to infer the motion information of occluded points from the motion information of local and global non-occluded points respectively.
Experiments show that our GMA3D can solve the occlusion problem in the scene flow, especially in the real scene.
- Score: 3.2738068278607426
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Scene flow represents the motion information of each point in the 3D point
clouds. It is a vital downstream method applied to many tasks, such as motion
segmentation and object tracking. However, there are always occlusion points
between two consecutive point clouds, whether from the sparsity data sampling
or real-world occlusion. In this paper, we focus on addressing occlusion issues
in scene flow by the semantic self-similarity and motion consistency of the
moving objects. We propose a GMA3D module based on the transformer framework,
which utilizes local and global semantic similarity to infer the motion
information of occluded points from the motion information of local and global
non-occluded points respectively, and then uses an offset aggregator to
aggregate them. Our module is the first to apply the transformer-based
architecture to gauge the scene flow occlusion problem on point clouds.
Experiments show that our GMA3D can solve the occlusion problem in the scene
flow, especially in the real scene. We evaluated the proposed method on the
occluded version of point cloud datasets and get state-of-the-art results on
the real scene KITTI dataset. To testify that GMA3D is still beneficial to
non-occluded scene flow, we also conducted experiments on non-occluded version
datasets and achieved promising performance on FlyThings3D and KITTI. The code
is available at https://anonymous.4open.science/r/GMA3D-E100.
Related papers
- Aligning Bird-Eye View Representation of Point Cloud Sequences using
Scene Flow [0.0]
Low-resolution point clouds are challenging for object detection methods due to their sparsity.
We develop a plug-in module that enables single-frame detectors to compute scene flow to rectify their Bird-Eye View representation.
arXiv Detail & Related papers (2023-05-04T15:16:21Z) - Unsupervised Learning of 3D Scene Flow with 3D Odometry Assistance [20.735976558587588]
Scene flow estimation is used in various applications such as autonomous driving fields, activity recognition, and virtual reality fields.
It is challenging to annotate scene flow with ground truth for real-world data.
We propose to use odometry information to assist the unsupervised learning of scene flow and use real-world LiDAR data to train our network.
arXiv Detail & Related papers (2022-09-11T21:53:43Z) - AGO-Net: Association-Guided 3D Point Cloud Object Detection Network [86.10213302724085]
We propose a novel 3D detection framework that associates intact features for objects via domain adaptation.
We achieve new state-of-the-art performance on the KITTI 3D detection benchmark in both accuracy and speed.
arXiv Detail & Related papers (2022-08-24T16:54:38Z) - What Matters for 3D Scene Flow Network [44.02710380584977]
3D scene flow estimation from point clouds is a low-level 3D motion perception task in computer vision.
We propose a novel all-to-all flow embedding layer with backward reliability validation during the initial scene flow estimation.
Our proposed model surpasses all existing methods by at least 38.2% on FlyingThings3D dataset and 24.7% on KITTI Scene Flow dataset for EPE3D metric.
arXiv Detail & Related papers (2022-07-19T09:27:05Z) - IDEA-Net: Dynamic 3D Point Cloud Interpolation via Deep Embedding
Alignment [58.8330387551499]
We formulate the problem as estimation of point-wise trajectories (i.e., smooth curves)
We propose IDEA-Net, an end-to-end deep learning framework, which disentangles the problem under the assistance of the explicitly learned temporal consistency.
We demonstrate the effectiveness of our method on various point cloud sequences and observe large improvement over state-of-the-art methods both quantitatively and visually.
arXiv Detail & Related papers (2022-03-22T10:14:08Z) - SCTN: Sparse Convolution-Transformer Network for Scene Flow Estimation [71.2856098776959]
Estimating 3D motions for point clouds is challenging, since a point cloud is unordered and its density is significantly non-uniform.
We propose a novel architecture named Sparse Convolution-Transformer Network (SCTN) that equips the sparse convolution with the transformer.
We show that the learned relation-based contextual information is rich and helpful for matching corresponding points, benefiting scene flow estimation.
arXiv Detail & Related papers (2021-05-10T15:16:14Z) - Weakly Supervised Learning of Rigid 3D Scene Flow [81.37165332656612]
We propose a data-driven scene flow estimation algorithm exploiting the observation that many 3D scenes can be explained by a collection of agents moving as rigid bodies.
We showcase the effectiveness and generalization capacity of our method on four different autonomous driving datasets.
arXiv Detail & Related papers (2021-02-17T18:58:02Z) - Occlusion Guided Scene Flow Estimation on 3D Point Clouds [4.518012967046983]
3D scene flow estimation is a vital tool in perceiving our environment given depth or range sensors.
Here we propose a new scene flow architecture called OGSF-Net which tightly couples the learning for both flow and occlusions between frames.
Their coupled symbiosis results in a more accurate prediction of flow in space.
arXiv Detail & Related papers (2020-11-30T15:22:03Z) - Scene Flow from Point Clouds with or without Learning [47.03163552693887]
Scene flow is the three-dimensional (3D) motion field of a scene.
Current learning-based approaches seek to estimate the scene flow directly from point clouds.
We present a simple and interpretable objective function to recover the scene flow from point clouds.
arXiv Detail & Related papers (2020-10-31T17:24:48Z) - Associate-3Ddet: Perceptual-to-Conceptual Association for 3D Point Cloud
Object Detection [64.2159881697615]
Object detection from 3D point clouds remains a challenging task, though recent studies pushed the envelope with the deep learning techniques.
We propose a domain adaptation like approach to enhance the robustness of the feature representation.
Our simple yet effective approach fundamentally boosts the performance of 3D point cloud object detection and achieves the state-of-the-art results.
arXiv Detail & Related papers (2020-06-08T05:15:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.