3D Siamese Transformer Network for Single Object Tracking on Point
Clouds
- URL: http://arxiv.org/abs/2207.11995v2
- Date: Tue, 26 Jul 2022 08:43:25 GMT
- Title: 3D Siamese Transformer Network for Single Object Tracking on Point
Clouds
- Authors: Le Hui, Lingpeng Wang, Linghua Tang, Kaihao Lan, Jin Xie, Jian Yang
- Abstract summary: Siamese network based trackers formulate 3D single object tracking as cross-correlation learning between point features of a template and a search area.
We explicitly use Transformer to form a 3D Siamese Transformer network for learning robust cross correlation between the template and the search area.
Our method achieves state-of-the-art performance on the 3D single object tracking task.
- Score: 22.48888264770609
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Siamese network based trackers formulate 3D single object tracking as
cross-correlation learning between point features of a template and a search
area. Due to the large appearance variation between the template and search
area during tracking, how to learn the robust cross correlation between them
for identifying the potential target in the search area is still a challenging
problem. In this paper, we explicitly use Transformer to form a 3D Siamese
Transformer network for learning robust cross correlation between the template
and the search area of point clouds. Specifically, we develop a Siamese point
Transformer network to learn shape context information of the target. Its
encoder uses self-attention to capture non-local information of point clouds to
characterize the shape information of the object, and the decoder utilizes
cross-attention to upsample discriminative point features. After that, we
develop an iterative coarse-to-fine correlation network to learn the robust
cross correlation between the template and the search area. It formulates the
cross-feature augmentation to associate the template with the potential target
in the search area via cross attention. To further enhance the potential
target, it employs the ego-feature augmentation that applies self-attention to
the local k-NN graph of the feature space to aggregate target features.
Experiments on the KITTI, nuScenes, and Waymo datasets show that our method
achieves state-of-the-art performance on the 3D single object tracking task.
Related papers
- Multi-Correlation Siamese Transformer Network with Dense Connection for
3D Single Object Tracking [14.47355191520578]
Point cloud-based 3D object tracking is an important task in autonomous driving.
It remains challenging to learn the correlation between the template and search branches effectively with the sparse LIDAR point cloud data.
We present a multi-correlation Siamese Transformer network that has multiple stages and carries out feature correlation at the end of each stage.
arXiv Detail & Related papers (2023-12-18T09:33:49Z) - Correlation Pyramid Network for 3D Single Object Tracking [16.694809791177263]
We propose a novel Correlation Pyramid Network (CorpNet) with a unified encoder and a motion-factorized decoder.
CorpNet achieves state-of-the-art results while running in real-time.
arXiv Detail & Related papers (2023-05-16T06:07:20Z) - Unleash the Potential of Image Branch for Cross-modal 3D Object
Detection [67.94357336206136]
We present a new cross-modal 3D object detector, namely UPIDet, which aims to unleash the potential of the image branch from two aspects.
First, UPIDet introduces a new 2D auxiliary task called normalized local coordinate map estimation.
Second, we discover that the representational capability of the point cloud backbone can be enhanced through the gradients backpropagated from the training objectives of the image branch.
arXiv Detail & Related papers (2023-01-22T08:26:58Z) - CXTrack: Improving 3D Point Cloud Tracking with Contextual Information [59.55870742072618]
3D single object tracking plays an essential role in many applications, such as autonomous driving.
We propose CXTrack, a novel transformer-based network for 3D object tracking.
We show that CXTrack achieves state-of-the-art tracking performance while running at 29 FPS.
arXiv Detail & Related papers (2022-11-12T11:29:01Z) - Point Cloud Registration-Driven Robust Feature Matching for 3D Siamese
Object Tracking [24.97192595209272]
Learning robust feature matching between the template and search area is crucial for 3D Siamese tracking.
We propose a novel point cloud registration-driven Siamese tracking framework, with the intuition that spatially aligned corresponding points tend to achieve consistent feature representations.
Our method consists of two modules, including a tracking-specific nonlocal registration module and a registration-aided Sinkhorn template-feature aggregation module.
arXiv Detail & Related papers (2022-09-14T03:25:04Z) - AGO-Net: Association-Guided 3D Point Cloud Object Detection Network [86.10213302724085]
We propose a novel 3D detection framework that associates intact features for objects via domain adaptation.
We achieve new state-of-the-art performance on the KITTI 3D detection benchmark in both accuracy and speed.
arXiv Detail & Related papers (2022-08-24T16:54:38Z) - ST3D: Self-training for Unsupervised Domain Adaptation on 3D
ObjectDetection [78.71826145162092]
We present a new domain adaptive self-training pipeline, named ST3D, for unsupervised domain adaptation on 3D object detection from point clouds.
Our ST3D achieves state-of-the-art performance on all evaluated datasets and even surpasses fully supervised results on KITTI 3D object detection benchmark.
arXiv Detail & Related papers (2021-03-09T10:51:24Z) - Point Transformer [122.2917213154675]
We investigate the application of self-attention networks to 3D point cloud processing.
We design self-attention layers for point clouds and use these to construct self-attention networks for tasks such as semantic scene segmentation.
Our Point Transformer design improves upon prior work across domains and tasks.
arXiv Detail & Related papers (2020-12-16T18:58:56Z) - Graph Attention Tracking [76.19829750144564]
We propose a simple target-aware Siamese graph attention network for general object tracking.
Experiments on challenging benchmarks including GOT-10k, UAV123, OTB-100 and LaSOT demonstrate that the proposed SiamGAT outperforms many state-of-the-art trackers.
arXiv Detail & Related papers (2020-11-23T04:26:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.