EPP-Net: Extreme-Point-Prediction-Based Object Detection
- URL: http://arxiv.org/abs/2104.14066v1
- Date: Thu, 29 Apr 2021 01:01:50 GMT
- Title: EPP-Net: Extreme-Point-Prediction-Based Object Detection
- Authors: Yang Yang, Min Li, Bo Meng, Zihao Huang, Junxing Ren, Degang Sun
- Abstract summary: We present a new anchor-free dense object detector, which regresses the relative displacement vector between each pixel and the four extreme points.
We also propose a new metric to measure the similarity between two groups of extreme points, namely, Extreme Intersection over Union (EIoU)
On the MS-COCO dataset, our method achieves an average precision (AP) of 39.3% with ResNet-50 and an AP of 48.3% with ResNeXt-101-DCN.
- Score: 9.270523894683278
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Object detection can be regarded as a pixel clustering task, and its boundary
is determined by four extreme points (leftmost, top, rightmost, and bottom).
However, most studies focus on the center or corner points of the object, which
are actually conditional results of the extreme points. In this paper, we
present a new anchor-free dense object detector, which directly regresses the
relative displacement vector between each pixel and the four extreme points. We
also propose a new metric to measure the similarity between two groups of
extreme points, namely, Extreme Intersection over Union (EIoU), and incorporate
this EIoU as a new regression loss. Moreover, we propose a novel branch to
predict the EIoU between the ground-truth and the prediction results, and
combine it with the classification confidence as the ranking keyword in
non-maximum suppression. On the MS-COCO dataset, our method achieves an average
precision (AP) of 39.3% with ResNet-50 and an AP of 48.3% with ResNeXt-101-DCN.
The proposed EPP-Net provides a new method to detect objects and outperforms
state-of-the-art anchor-free detectors.
Related papers
- CenterNet++ for Object Detection [174.59360147041673]
Bottom-up approaches are as competitive as the top-down and enjoy higher recall.
Our approach, named CenterNet, detects each object as a triplet keypoints (top-left and bottom-right corners and the center keypoint)
On the MS-COCO dataset, CenterNet with Res2Net-101 and Swin-Transformer achieves APs of 53.7% and 57.1%, respectively.
arXiv Detail & Related papers (2022-04-18T16:45:53Z) - SASA: Semantics-Augmented Set Abstraction for Point-based 3D Object
Detection [78.90102636266276]
We propose a novel set abstraction method named Semantics-Augmented Set Abstraction (SASA)
Based on the estimated point-wise foreground scores, we then propose a semantics-guided point sampling algorithm to help retain more important foreground points during down-sampling.
In practice, SASA shows to be effective in identifying valuable points related to foreground objects and improving feature learning for point-based 3D detection.
arXiv Detail & Related papers (2022-01-06T08:54:47Z) - Anchor Retouching via Model Interaction for Robust Object Detection in
Aerial Images [15.404024559652534]
We present an effective Dynamic Enhancement Anchor (DEA) network to construct a novel training sample generator.
Our method achieves state-of-the-art performance in accuracy with moderate inference speed and computational overhead for training.
arXiv Detail & Related papers (2021-12-13T14:37:20Z) - A Normalized Gaussian Wasserstein Distance for Tiny Object Detection [21.50800336432307]
State-of-the-art detectors do not produce satisfactory results on tiny objects due to lack of appearance information.
We propose a new evaluation metric using Wasserstein distance for tiny object detection.
arXiv Detail & Related papers (2021-10-26T03:43:17Z) - 3D Object Detection Combining Semantic and Geometric Features from Point
Clouds [19.127930862527666]
We propose a novel end-to-end two-stage 3D object detector named SGNet for point clouds scenes.
The VTPM is a Voxel-Point-Based Module that finally implements 3D object detection in point space.
As of September 19, 2021, for KITTI dataset, SGNet ranked 1st in 3D and BEV detection on cyclists with easy difficulty level, and 2nd in the 3D detection of moderate cyclists.
arXiv Detail & Related papers (2021-10-10T04:43:27Z) - PDNet: Towards Better One-stage Object Detection with Prediction
Decoupling [37.83405509385431]
We propose a prediction-target-decoupled detector named PDNet to establish a more flexible detection paradigm.
With a single ResNeXt-64x4d-101 as the backbone, our detector achieves 48.7 AP with single-scale testing.
arXiv Detail & Related papers (2021-04-28T16:48:04Z) - Corner Proposal Network for Anchor-free, Two-stage Object Detection [174.59360147041673]
The goal of object detection is to determine the class and location of objects in an image.
This paper proposes a novel anchor-free, two-stage framework which first extracts a number of object proposals.
We demonstrate that these two stages are effective solutions for improving recall and precision.
arXiv Detail & Related papers (2020-07-27T19:04:57Z) - Scope Head for Accurate Localization in Object Detection [135.9979405835606]
We propose a novel detector coined as ScopeNet, which models anchors of each location as a mutually dependent relationship.
With our concise and effective design, the proposed ScopeNet achieves state-of-the-art results on COCO.
arXiv Detail & Related papers (2020-05-11T04:00:09Z) - PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation [111.7241018610573]
We present PointGroup, a new end-to-end bottom-up architecture for instance segmentation.
We design a two-branch network to extract point features and predict semantic labels and offsets, for shifting each point towards its respective instance centroid.
A clustering component is followed to utilize both the original and offset-shifted point coordinate sets, taking advantage of their complementary strength.
We conduct extensive experiments on two challenging datasets, ScanNet v2 and S3DIS, on which our method achieves the highest performance, 63.6% and 64.0%, compared to 54.9% and 54.4% achieved by former best
arXiv Detail & Related papers (2020-04-03T16:26:37Z) - Robust 6D Object Pose Estimation by Learning RGB-D Features [59.580366107770764]
We propose a novel discrete-continuous formulation for rotation regression to resolve this local-optimum problem.
We uniformly sample rotation anchors in SO(3), and predict a constrained deviation from each anchor to the target, as well as uncertainty scores for selecting the best prediction.
Experiments on two benchmarks: LINEMOD and YCB-Video, show that the proposed method outperforms state-of-the-art approaches.
arXiv Detail & Related papers (2020-02-29T06:24:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.