Augmenting Proposals by the Detector Itself
- URL: http://arxiv.org/abs/2101.11789v1
- Date: Thu, 28 Jan 2021 02:48:00 GMT
- Title: Augmenting Proposals by the Detector Itself
- Authors: Xiaopei Wan, Zhenhua Guo, Chao He, Yujiu Yang, Fangbo Tao
- Abstract summary: In this paper, we design a novel training method named APDI, which means augmenting proposals by the detector itself and can generate proposals with higher quality.
Experiments on COCO dataset show that our method brings at least 2.7 AP improvements on Faster R-CNN with various backbones.
It can cooperate with advanced RPNs, such as GA-RPN and Cascade RPN, to obtain extra gains.
- Score: 16.059480514515506
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Lacking enough high quality proposals for RoI box head has impeded two-stage
and multi-stage object detectors for a long time, and many previous works try
to solve it via improving RPN's performance or manually generating proposals
from ground truth. However, these methods either need huge training and
inference costs or bring little improvements. In this paper, we design a novel
training method named APDI, which means augmenting proposals by the detector
itself and can generate proposals with higher quality. Furthermore, APDI makes
it possible to integrate IoU head into RoI box head. And it does not add any
hyperparameter, which is beneficial for future research and downstream tasks.
Extensive experiments on COCO dataset show that our method brings at least 2.7
AP improvements on Faster R-CNN with various backbones, and APDI can cooperate
with advanced RPNs, such as GA-RPN and Cascade RPN, to obtain extra gains.
Furthermore, it brings significant improvements on Cascade R-CNN.
Related papers
- PETDet: Proposal Enhancement for Two-Stage Fine-Grained Object Detection [26.843891792018447]
We present PETDet (Proposal Enhancement for Two-stage fine-grained object detection) to better handle the sub-tasks in two-stage FGOD methods.
An anchor-free Quality Oriented Proposal Network (QOPN) is proposed with dynamic label assignment and attention-based decomposition.
A novel Adaptive Recognition Loss (ARL) offers guidance for the R-CNN head to focus on high-quality proposals.
arXiv Detail & Related papers (2023-12-16T18:04:56Z) - Enhancing Secrecy in UAV RSMA Networks: Deep Unfolding Meets Deep Reinforcement Learning [0.8287206589886881]
We consider the network of the secrecy in multiple unmanned aerial vehicles (UAV) rate trajectory (SMAR)
The proposed deep reinforcement learning (DRL) has shown great performance and outperformed other DRL-based methods in the literature.
arXiv Detail & Related papers (2023-09-30T12:26:24Z) - Retrieval-Augmented Reinforcement Learning [63.32076191982944]
We train a network to map a dataset of past experiences to optimal behavior.
The retrieval process is trained to retrieve information from the dataset that may be useful in the current context.
We show that retrieval-augmented R2D2 learns significantly faster than the baseline R2D2 agent and achieves higher scores.
arXiv Detail & Related papers (2022-02-17T02:44:05Z) - Contrastive Learning from Extremely Augmented Skeleton Sequences for
Self-supervised Action Recognition [23.27198457894644]
A Contrastive Learning framework utilizing Abundant Information Mining for self-supervised action Representation (AimCLR) is proposed.
First, the extreme augmentations and the Energy-based Attention-guided Drop Module (EADM) are proposed to obtain diverse positive samples.
Third, the Nearest Neighbors Mining (NNM) is proposed to further expand positive samples to make the abundant information mining process more reasonable.
arXiv Detail & Related papers (2021-12-07T09:38:37Z) - Anchor-free Oriented Proposal Generator for Object Detection [59.54125119453818]
Oriented object detection is a practical and challenging task in remote sensing image interpretation.
Nowadays, oriented detectors mostly use horizontal boxes as intermedium to derive oriented boxes from them.
We propose a novel Anchor-free Oriented Proposal Generator (AOPG) that abandons the horizontal boxes-related operations from the network architecture.
arXiv Detail & Related papers (2021-10-05T10:45:51Z) - OPANAS: One-Shot Path Aggregation Network Architecture Search for Object
Detection [82.04372532783931]
Recently, neural architecture search (NAS) has been exploited to design feature pyramid networks (FPNs)
We propose a novel One-Shot Path Aggregation Network Architecture Search (OPANAS) algorithm, which significantly improves both searching efficiency and detection accuracy.
arXiv Detail & Related papers (2021-03-08T01:48:53Z) - MRDet: A Multi-Head Network for Accurate Oriented Object Detection in
Aerial Images [51.227489316673484]
We propose an arbitrary-oriented region proposal network (AO-RPN) to generate oriented proposals transformed from horizontal anchors.
To obtain accurate bounding boxes, we decouple the detection task into multiple subtasks and propose a multi-head network.
Each head is specially designed to learn the features optimal for the corresponding task, which allows our network to detect objects accurately.
arXiv Detail & Related papers (2020-12-24T06:36:48Z) - RRPN++: Guidance Towards More Accurate Scene Text Detection [0.30458514384586394]
We propose RRPN++ to exploit the potential of RRPN-based model by several improvements.
Based on RRPN, we propose the Anchor-free Pyramid Proposal Networks (APPN) to generate first-stage proposals.
In our second stage, both the detection branch and the recognition branch are incorporated to perform multi-task learning.
arXiv Detail & Related papers (2020-09-28T08:00:35Z) - Collaborative Training between Region Proposal Localization and
Classification for Domain Adaptive Object Detection [121.28769542994664]
Domain adaptation for object detection tries to adapt the detector from labeled datasets to unlabeled ones for better performance.
In this paper, we are the first to reveal that the region proposal network (RPN) and region proposal classifier(RPC) demonstrate significantly different transferability when facing large domain gap.
arXiv Detail & Related papers (2020-09-17T07:39:52Z) - Optimization-driven Deep Reinforcement Learning for Robust Beamforming
in IRS-assisted Wireless Communications [54.610318402371185]
Intelligent reflecting surface (IRS) is a promising technology to assist downlink information transmissions from a multi-antenna access point (AP) to a receiver.
We minimize the AP's transmit power by a joint optimization of the AP's active beamforming and the IRS's passive beamforming.
We propose a deep reinforcement learning (DRL) approach that can adapt the beamforming strategies from past experiences.
arXiv Detail & Related papers (2020-05-25T01:42:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.