STTracker: Spatio-Temporal Tracker for 3D Single Object Tracking
- URL: http://arxiv.org/abs/2306.17440v1
- Date: Fri, 30 Jun 2023 07:25:11 GMT
- Title: STTracker: Spatio-Temporal Tracker for 3D Single Object Tracking
- Authors: Yubo Cui, Zhiheng Li, Zheng Fang
- Abstract summary: 3D single object tracking with point clouds is a critical task in 3D computer vision.
Previous methods usually input the last two frames and use the template point cloud in previous frame and the search area point cloud in the current frame respectively.
- Score: 11.901758708579642
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: 3D single object tracking with point clouds is a critical task in 3D computer
vision. Previous methods usually input the last two frames and use the
predicted box to get the template point cloud in previous frame and the search
area point cloud in the current frame respectively, then use similarity-based
or motion-based methods to predict the current box. Although these methods
achieved good tracking performance, they ignore the historical information of
the target, which is important for tracking. In this paper, compared to
inputting two frames of point clouds, we input multi-frame of point clouds to
encode the spatio-temporal information of the target and learn the motion
information of the target implicitly, which could build the correlations among
different frames to track the target in the current frame efficiently.
Meanwhile, rather than directly using the point feature for feature fusion, we
first crop the point cloud features into many patches and then use sparse
attention mechanism to encode the patch-level similarity and finally fuse the
multi-frame features. Extensive experiments show that our method achieves
competitive results on challenging large-scale benchmarks (62.6% in KITTI and
49.66% in NuScenes).
Related papers
- SeqTrack3D: Exploring Sequence Information for Robust 3D Point Cloud
Tracking [26.405519771454102]
We introduce Sequence-to-Sequence tracking paradigm and a tracker named SeqTrack3D to capture target motion across continuous frames.
This novel method ensures robust tracking by leveraging location priors from historical boxes, even in scenes with sparse points.
Experiments conducted on large-scale datasets show that SeqTrack3D achieves new state-of-the-art performances.
arXiv Detail & Related papers (2024-02-26T02:14:54Z) - PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection [66.94819989912823]
We propose a point-trajectory transformer with long short-term memory for efficient temporal 3D object detection.
We use point clouds of current-frame objects and their historical trajectories as input to minimize the memory bank storage requirement.
We conduct extensive experiments on the large-scale dataset to demonstrate that our approach performs well against state-of-the-art methods.
arXiv Detail & Related papers (2023-12-13T18:59:13Z) - Motion-to-Matching: A Mixed Paradigm for 3D Single Object Tracking [27.805298263103495]
We propose MTM-Tracker, which combines motion modeling with feature matching into a single network.
In the first stage, we exploit the continuous historical boxes as motion prior and propose an encoder-decoder structure to locate target coarsely.
In the second stage, we introduce a feature interaction module to extract motion-aware features from consecutive point clouds and match them to refine target movement.
arXiv Detail & Related papers (2023-08-23T02:40:51Z) - Modeling Continuous Motion for 3D Point Cloud Object Tracking [54.48716096286417]
This paper presents a novel approach that views each tracklet as a continuous stream.
At each timestamp, only the current frame is fed into the network to interact with multi-frame historical features stored in a memory bank.
To enhance the utilization of multi-frame features for robust tracking, a contrastive sequence enhancement strategy is proposed.
arXiv Detail & Related papers (2023-03-14T02:58:27Z) - MBPTrack: Improving 3D Point Cloud Tracking with Memory Networks and Box
Priors [59.55870742072618]
3D single object tracking has been a crucial problem for decades with numerous applications such as autonomous driving.
We present MBPTrack, which adopts a Memory mechanism to utilize past information and formulates localization in a coarse-to-fine scheme.
arXiv Detail & Related papers (2023-03-09T07:07:39Z) - CXTrack: Improving 3D Point Cloud Tracking with Contextual Information [59.55870742072618]
3D single object tracking plays an essential role in many applications, such as autonomous driving.
We propose CXTrack, a novel transformer-based network for 3D object tracking.
We show that CXTrack achieves state-of-the-art tracking performance while running at 29 FPS.
arXiv Detail & Related papers (2022-11-12T11:29:01Z) - TransPillars: Coarse-to-Fine Aggregation for Multi-Frame 3D Object
Detection [47.941714033657675]
3D object detection using point clouds has attracted increasing attention due to its wide applications in autonomous driving and robotics.
We design TransPillars, a novel transformer-based feature aggregation technique that exploits temporal features of consecutive point cloud frames.
Our proposed TransPillars achieves state-of-art performance as compared to existing multi-frame detection approaches.
arXiv Detail & Related papers (2022-08-04T15:41:43Z) - Learning Spatio-Appearance Memory Network for High-Performance Visual
Tracking [79.80401607146987]
Existing object tracking usually learns a bounding-box based template to match visual targets across frames, which cannot accurately learn a pixel-wise representation.
This paper presents a novel segmentation-based tracking architecture, which is equipped with a local-temporal memory network to learn accurate-temporal correspondence.
arXiv Detail & Related papers (2020-09-21T08:12:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.