Point-to-Box Network for Accurate Object Detection via Single Point
Supervision
- URL: http://arxiv.org/abs/2207.06827v1
- Date: Thu, 14 Jul 2022 11:32:00 GMT
- Title: Point-to-Box Network for Accurate Object Detection via Single Point
Supervision
- Authors: Pengfei Chen, Xuehui Yu, Xumeng Han, Najmul Hassan, Kai Wang, Jiachen
Li, Jian Zhao, Humphrey Shi, Zhenjun Han, and Qixiang Ye
- Abstract summary: We introduce a lightweight alternative to the off-the-shelf proposal (OTSP) method.
P2BNet can construct an inter-objects balanced proposal bag by generating proposals in an anchor-like way.
The code will be released at COCO.com/ucas-vg/P2BNet.
- Score: 51.95993495703855
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Object detection using single point supervision has received increasing
attention over the years. In this paper, we attribute such a large performance
gap to the failure of generating high-quality proposal bags which are crucial
for multiple instance learning (MIL). To address this problem, we introduce a
lightweight alternative to the off-the-shelf proposal (OTSP) method and thereby
create the Point-to-Box Network (P2BNet), which can construct an inter-objects
balanced proposal bag by generating proposals in an anchor-like way. By fully
investigating the accurate position information, P2BNet further constructs an
instance-level bag, avoiding the mixture of multiple objects. Finally, a
coarse-to-fine policy in a cascade fashion is utilized to improve the IoU
between proposals and ground-truth (GT). Benefiting from these strategies,
P2BNet is able to produce high-quality instance-level bags for object
detection. P2BNet improves the mean average precision (AP) by more than 50%
relative to the previous best PSOD method on the MS COCO dataset. It also
demonstrates the great potential to bridge the performance gap between point
supervised and bounding-box supervised detectors. The code will be released at
github.com/ucas-vg/P2BNet.
Related papers
- P2P-Bridge: Diffusion Bridges for 3D Point Cloud Denoising [81.92854168911704]
We tackle the task of point cloud denoising through a novel framework that adapts Diffusion Schr"odinger bridges to points clouds.
Experiments on object datasets show that P2P-Bridge achieves significant improvements over existing methods.
arXiv Detail & Related papers (2024-08-29T08:00:07Z) - Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation [15.414518995812754]
We propose a unified framework (NIDS-Net) comprising object proposal generation, embedding creation for both instance templates and proposal regions, and embedding matching for instance label assignment.
Our framework surpasses current state-of-the-art methods, demonstrating notable improvements of 22.3, 46.2, 10.3, and 24.0 in average precision (AP) across four detection datasets.
arXiv Detail & Related papers (2024-05-28T06:16:57Z) - Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision [81.60564776995682]
We present Point2RBox, an end-to-end solution for point-supervised object detection.
Our method uses a lightweight paradigm, yet it achieves a competitive performance among point-supervised alternatives.
In particular, our method uses a lightweight paradigm, yet it achieves a competitive performance among point-supervised alternatives.
arXiv Detail & Related papers (2023-11-23T15:57:41Z) - PointOBB: Learning Oriented Object Detection via Single Point
Supervision [55.88982271340328]
This paper proposes PointOBB, the first single Point-based OBB generation method, for oriented object detection.
PointOBB operates through the collaborative utilization of three distinctive views: an original view, a resized view, and a rotated/flipped (rot/flp) view.
Experimental results on the DIOR-R and DOTA-v1.0 datasets demonstrate that PointOBB achieves promising performance.
arXiv Detail & Related papers (2023-11-23T15:51:50Z) - Learning Remote Sensing Object Detection with Single Point Supervision [17.12725535531483]
Pointly Supervised Object Detection (PSOD) has attracted considerable interests due to its lower labeling cost as compared to box-level supervised object detection.
We make the first attempt to achieve RS object detection with single point supervision, and propose a PSOD method tailored for RS images.
Our method can achieve significantly better performance as compared to state-of-the-art image-level and point-level supervised detection methods, and reduce the performance gap between PSOD and box-level supervised object detection.
arXiv Detail & Related papers (2023-05-23T15:06:04Z) - Plug-and-Play Few-shot Object Detection with Meta Strategy and Explicit
Localization Inference [78.41932738265345]
This paper proposes a plug detector that can accurately detect the objects of novel categories without fine-tuning process.
We introduce two explicit inferences into the localization process to reduce its dependence on annotated data.
It shows a significant lead in both efficiency, precision, and recall under varied evaluation protocols.
arXiv Detail & Related papers (2021-10-26T03:09:57Z) - RSKDD-Net: Random Sample-based Keypoint Detector and Descriptor [11.393546826269372]
This paper proposes Random Sample-based Keypoint Detector and Descriptor Network (RSKDD-Net) for large scale point cloud registration.
The key idea is using random sampling to efficiently select candidate points and using a learning-based method to jointly generate keypoints and descriptors.
Experiments on two large scale outdoor LiDAR datasets show that the proposed RSKDD-Net achieves state-of-the-art performance with more than 15 times faster than existing methods.
arXiv Detail & Related papers (2020-10-23T13:29:29Z) - Corner Proposal Network for Anchor-free, Two-stage Object Detection [174.59360147041673]
The goal of object detection is to determine the class and location of objects in an image.
This paper proposes a novel anchor-free, two-stage framework which first extracts a number of object proposals.
We demonstrate that these two stages are effective solutions for improving recall and precision.
arXiv Detail & Related papers (2020-07-27T19:04:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.