Dynamic Refinement Network for Oriented and Densely Packed Object
Detection
- URL: http://arxiv.org/abs/2005.09973v2
- Date: Wed, 10 Jun 2020 23:59:58 GMT
- Title: Dynamic Refinement Network for Oriented and Densely Packed Object
Detection
- Authors: Xingjia Pan, Yuqiang Ren, Kekai Sheng, Weiming Dong, Haolei Yuan,
Xiaowei Guo, Chongyang Ma, Changsheng Xu
- Abstract summary: We present a dynamic refinement network that consists of two novel components, i.e., a feature selection module (FSM) and a dynamic refinement head (DRH)
Our FSM enables neurons to adjust receptive fields in accordance with the shapes and orientations of target objects, whereas the DRH empowers our model to refine the prediction dynamically in an object-aware manner.
We perform quantitative evaluations on several publicly available benchmarks including DOTA, HRSC2016, SKU110K, and our own SKU110K-R dataset.
- Score: 75.29088991850958
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Object detection has achieved remarkable progress in the past decade.
However, the detection of oriented and densely packed objects remains
challenging because of following inherent reasons: (1) receptive fields of
neurons are all axis-aligned and of the same shape, whereas objects are usually
of diverse shapes and align along various directions; (2) detection models are
typically trained with generic knowledge and may not generalize well to handle
specific objects at test time; (3) the limited dataset hinders the development
on this task. To resolve the first two issues, we present a dynamic refinement
network that consists of two novel components, i.e., a feature selection module
(FSM) and a dynamic refinement head (DRH). Our FSM enables neurons to adjust
receptive fields in accordance with the shapes and orientations of target
objects, whereas the DRH empowers our model to refine the prediction
dynamically in an object-aware manner. To address the limited availability of
related benchmarks, we collect an extensive and fully annotated dataset,
namely, SKU110K-R, which is relabeled with oriented bounding boxes based on
SKU110K. We perform quantitative evaluations on several publicly available
benchmarks including DOTA, HRSC2016, SKU110K, and our own SKU110K-R dataset.
Experimental results show that our method achieves consistent and substantial
gains compared with baseline approaches. The code and dataset are available at
https://github.com/Anymake/DRN_CVPR2020.
Related papers
- SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection [59.868772767818975]
We propose a simple yet effective Semi-supervised Oriented Object Detection method termed SOOD++.
Specifically, we observe that objects from aerial images are usually arbitrary orientations, small scales, and aggregation.
Extensive experiments conducted on various multi-oriented object datasets under various labeled settings demonstrate the effectiveness of our method.
arXiv Detail & Related papers (2024-07-01T07:03:51Z) - SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection [79.23689506129733]
We establish a new benchmark dataset and an open-source method for large-scale SAR object detection.
Our dataset, SARDet-100K, is a result of intense surveying, collecting, and standardizing 10 existing SAR detection datasets.
To the best of our knowledge, SARDet-100K is the first COCO-level large-scale multi-class SAR object detection dataset ever created.
arXiv Detail & Related papers (2024-03-11T09:20:40Z) - Zero-Shot Refinement of Buildings' Segmentation Models using SAM [6.110856077714895]
We present a novel approach to adapt foundation models to address existing models' generalization dropback.
Among several models, our focus centers on the Segment Anything Model (SAM)
SAM does not offer recognition abilities and thus fails to classify and tag localized objects.
This novel approach augments SAM with recognition abilities, a first of its kind.
arXiv Detail & Related papers (2023-10-03T07:19:59Z) - Weakly-supervised Contrastive Learning for Unsupervised Object Discovery [52.696041556640516]
Unsupervised object discovery is promising due to its ability to discover objects in a generic manner.
We design a semantic-guided self-supervised learning model to extract high-level semantic features from images.
We introduce Principal Component Analysis (PCA) to localize object regions.
arXiv Detail & Related papers (2023-07-07T04:03:48Z) - D2Q-DETR: Decoupling and Dynamic Queries for Oriented Object Detection
with Transformers [14.488821968433834]
We propose an end-to-end framework for oriented object detection.
Our framework is based on DETR, with the box regression head replaced with a points prediction head.
Experiments on the largest and challenging DOTA-v1.0 and DOTA-v1.5 datasets show that D2Q-DETR outperforms existing NMS-based and NMS-free oriented object detection methods.
arXiv Detail & Related papers (2023-03-01T14:36:19Z) - Salient Objects in Clutter [130.63976772770368]
This paper identifies and addresses a serious design bias of existing salient object detection (SOD) datasets.
This design bias has led to a saturation in performance for state-of-the-art SOD models when evaluated on existing datasets.
We propose a new high-quality dataset and update the previous saliency benchmark.
arXiv Detail & Related papers (2021-05-07T03:49:26Z) - Novel Human-Object Interaction Detection via Adversarial Domain
Generalization [103.55143362926388]
We study the problem of novel human-object interaction (HOI) detection, aiming at improving the generalization ability of the model to unseen scenarios.
The challenge mainly stems from the large compositional space of objects and predicates, which leads to the lack of sufficient training data for all the object-predicate combinations.
We propose a unified framework of adversarial domain generalization to learn object-invariant features for predicate prediction.
arXiv Detail & Related papers (2020-05-22T22:02:56Z) - Extended Target Tracking and Classification Using Neural Networks [1.2891210250935146]
State-of-the-art ETT algorithms can track the dynamic behaviour of objects and learn their shapes simultaneously.
We propose to use a naively deep neural network, which consists of one input, two hidden and one output layers, to classify dynamic objects regarding their shape estimates.
arXiv Detail & Related papers (2020-02-13T12:02:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.