TricubeNet: 2D Kernel-Based Object Representation for Weakly-Occluded
Oriented Object Detection
- URL: http://arxiv.org/abs/2104.11435v1
- Date: Fri, 23 Apr 2021 06:50:28 GMT
- Title: TricubeNet: 2D Kernel-Based Object Representation for Weakly-Occluded
Oriented Object Detection
- Authors: Beomyoung Kim, Janghyeon Lee, Sihaeng Lee, Doyeon Kim, and Junmo Kim
- Abstract summary: We present a new approach for oriented object detection, an anchor-free one-stage detector.
This approach, named TricubeNet, represents each object as a 2D Tricube kernel and extracts bounding boxes using appearance-based post-processing.
- Score: 24.44389034373491
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present a new approach for oriented object detection, an anchor-free
one-stage detector. This approach, named TricubeNet, represents each object as
a 2D Tricube kernel and extracts bounding boxes using appearance-based
post-processing. Unlike existing anchor-based oriented object detectors, we can
save the computational complexity and the number of hyperparameters by
eliminating the anchor box in the network design. In addition, by adopting a
heatmap-based detection process instead of the box offset regression, we simply
and effectively solve the angle discontinuity problem, which is one of the
important problems for oriented object detection. To further boost the
performance, we propose some effective techniques for the loss balancing,
extracting the rotation-invariant feature, and heatmap refinement. To
demonstrate the effectiveness of our TricueNet, we experiment on various tasks
for the weakly-occluded oriented object detection. The extensive experimental
results show that our TricueNet is highly effective and competitive for
oriented object detection. The code is available at
https://github.com/qjadud1994/TricubeNet.
Related papers
- The Importance of Anti-Aliasing in Tiny Object Detection [0.0]
This paper applies an existing approach WaveCNet for anti-aliasing to tiny object detection.
We modify the original WaveCNet to apply Wavelet Pooling layers, effectively suppressing aliasing.
We also propose a bottom-heavy version of the backbone, which further improves the performance of tiny object detection.
arXiv Detail & Related papers (2023-10-22T08:02:01Z) - Spatial-Temporal Graph Enhanced DETR Towards Multi-Frame 3D Object Detection [54.041049052843604]
We present STEMD, a novel end-to-end framework that enhances the DETR-like paradigm for multi-frame 3D object detection.
First, to model the inter-object spatial interaction and complex temporal dependencies, we introduce the spatial-temporal graph attention network.
Finally, it poses a challenge for the network to distinguish between the positive query and other highly similar queries that are not the best match.
arXiv Detail & Related papers (2023-07-01T13:53:14Z) - 3D Small Object Detection with Dynamic Spatial Pruning [62.72638845817799]
We propose an efficient feature pruning strategy for 3D small object detection.
We present a multi-level 3D detector named DSPDet3D which benefits from high spatial resolution.
It takes less than 2s to directly process a whole building consisting of more than 4500k points while detecting out almost all objects.
arXiv Detail & Related papers (2023-05-05T17:57:04Z) - 3DMODT: Attention-Guided Affinities for Joint Detection & Tracking in 3D
Point Clouds [95.54285993019843]
We propose a method for joint detection and tracking of multiple objects in 3D point clouds.
Our model exploits temporal information employing multiple frames to detect objects and track them in a single network.
arXiv Detail & Related papers (2022-11-01T20:59:38Z) - PSA-Det3D: Pillar Set Abstraction for 3D object Detection [14.788139868324155]
We propose a pillar set abstraction (PSA) and foreground point compensation (FPC) to improve the detection performance for small object.
The experiments on the KITTI 3D detection benchmark show that our proposed PSA-Det3D outperforms other algorithms with high accuracy for small object detection.
arXiv Detail & Related papers (2022-10-20T03:05:34Z) - Double-Dot Network for Antipodal Grasp Detection [20.21384585441404]
This paper proposes a new deep learning approach to antipodal grasp detection, named Double-Dot Network (DD-Net)
It follows the recent anchor-free object detection framework, which does not depend on empirically pre-set anchors.
An effective CNN architecture is introduced to localize such fingertips, and with the help of auxiliary centers for refinement, it accurately and robustly infers grasp candidates.
arXiv Detail & Related papers (2021-08-03T14:21:17Z) - Delving into Localization Errors for Monocular 3D Object Detection [85.77319416168362]
Estimating 3D bounding boxes from monocular images is an essential component in autonomous driving.
In this work, we quantify the impact introduced by each sub-task and find the localization error' is the vital factor in restricting monocular 3D detection.
arXiv Detail & Related papers (2021-03-30T10:38:01Z) - Reinforced Axial Refinement Network for Monocular 3D Object Detection [160.34246529816085]
Monocular 3D object detection aims to extract the 3D position and properties of objects from a 2D input image.
Conventional approaches sample 3D bounding boxes from the space and infer the relationship between the target object and each of them, however, the probability of effective samples is relatively small in the 3D space.
We propose to start with an initial prediction and refine it gradually towards the ground truth, with only one 3d parameter changed in each step.
This requires designing a policy which gets a reward after several steps, and thus we adopt reinforcement learning to optimize it.
arXiv Detail & Related papers (2020-08-31T17:10:48Z) - Object as Hotspots: An Anchor-Free 3D Object Detection Approach via
Firing of Hotspots [37.16690737208046]
We argue for an approach opposite to existing methods using object-level anchors.
Inspired by compositional models, we propose an object as composition of its interior non-empty voxels, termed hotspots.
Based on OHS, we propose an anchor-free detection head with a novel ground truth assignment strategy.
arXiv Detail & Related papers (2019-12-30T03:02:22Z) - SESS: Self-Ensembling Semi-Supervised 3D Object Detection [138.80825169240302]
We propose SESS, a self-ensembling semi-supervised 3D object detection framework. Specifically, we design a thorough perturbation scheme to enhance generalization of the network on unlabeled and new unseen data.
Our SESS achieves competitive performance compared to the state-of-the-art fully-supervised method by using only 50% labeled data.
arXiv Detail & Related papers (2019-12-26T08:48:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.