MultiResolution Attention Extractor for Small Object Detection
- URL: http://arxiv.org/abs/2006.05941v1
- Date: Wed, 10 Jun 2020 16:47:56 GMT
- Title: MultiResolution Attention Extractor for Small Object Detection
- Authors: Fan Zhang, Licheng Jiao, Lingling Li, Fang Liu, and Xu Liu
- Abstract summary: Small objects are difficult to detect because of their low resolution and small size.
Inspired by human vision "attention" mechanism, we exploit two feature extraction methods to mine the most useful information of small objects.
- Score: 40.74232149130456
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Small objects are difficult to detect because of their low resolution and
small size. The existing small object detection methods mainly focus on data
preprocessing or narrowing the differences between large and small objects.
Inspired by human vision "attention" mechanism, we exploit two feature
extraction methods to mine the most useful information of small objects. Both
methods are based on multiresolution feature extraction. We initially design
and explore the soft attention method, but we find that its convergence speed
is slow. Then we present the second method, an attention-based feature
interaction method, called a MultiResolution Attention Extractor (MRAE),
showing significant improvement as a generic feature extractor in small object
detection. After each building block in the vanilla feature extractor, we
append a small network to generate attention weights followed by a weighted-sum
operation to get the final attention maps. Our attention-based feature
extractor is 2.0 times the AP of the "hard" attention counterpart (plain
architecture) on the COCO small object detection benchmark, proving that MRAE
can capture useful location and contextual information through adaptive
learning.
Related papers
- Learning to Borrow Features for Improved Detection of Small Objects in Single-Shot Detectors [0.0]
We propose a novel framework that enables small object representations to "borrow" discriminative features from larger, semantically richer instances within the same class.
Our approach significantly boosts small object detection accuracy over baseline methods, offering a promising direction for robust object detection in complex visual environments.
arXiv Detail & Related papers (2025-04-30T01:18:33Z) - ESOD: Efficient Small Object Detection on High-Resolution Images [36.80623357577051]
Small objects are usually sparsely distributed and locally clustered.
Massive feature extraction computations are wasted on the non-target background area of images.
We propose to reuse the detector's backbone to conduct feature-level object-seeking and patch-slicing.
arXiv Detail & Related papers (2024-07-23T12:21:23Z) - Visible and Clear: Finding Tiny Objects in Difference Map [50.54061010335082]
We introduce a self-reconstruction mechanism in the detection model, and discover the strong correlation between it and the tiny objects.
Specifically, we impose a reconstruction head in-between the neck of a detector, constructing a difference map of the reconstructed image and the input, which shows high sensitivity to tiny objects.
We further develop a Difference Map Guided Feature Enhancement (DGFE) module to make the tiny feature representation more clear.
arXiv Detail & Related papers (2024-05-18T12:22:26Z) - Improving the Detection of Small Oriented Objects in Aerial Images [0.0]
We propose a method to accurately detect small oriented objects in aerial images by enhancing the classification and regression tasks of the oriented object detection model.
We designed the Attention-Points Network consisting of two losses: Guided-Attention Loss (GALoss) and Box-Points Loss (BPLoss)
Experimental results show the effectiveness of our Attention-Points Network on a standard oriented aerial dataset with small object instances.
arXiv Detail & Related papers (2024-01-12T11:00:07Z) - Small Object Detection via Coarse-to-fine Proposal Generation and
Imitation Learning [52.06176253457522]
We propose a two-stage framework tailored for small object detection based on the Coarse-to-fine pipeline and Feature Imitation learning.
CFINet achieves state-of-the-art performance on the large-scale small object detection benchmarks, SODA-D and SODA-A.
arXiv Detail & Related papers (2023-08-18T13:13:09Z) - Weakly-supervised Contrastive Learning for Unsupervised Object Discovery [52.696041556640516]
Unsupervised object discovery is promising due to its ability to discover objects in a generic manner.
We design a semantic-guided self-supervised learning model to extract high-level semantic features from images.
We introduce Principal Component Analysis (PCA) to localize object regions.
arXiv Detail & Related papers (2023-07-07T04:03:48Z) - Focus-and-Detect: A Small Object Detection Framework for Aerial Images [1.911678487931003]
We propose a two-stage object detection framework called "Focus-and-Detect"
The first stage generates clusters of objects constituting the focused regions.
The second stage, which is also an object detector network, predicts objects within the focal regions.
Results indicate that the proposed two-stage framework achieves an AP score of 42.06 on VisDrone validation dataset.
arXiv Detail & Related papers (2022-03-24T10:43:56Z) - Infrared Small-Dim Target Detection with Transformer under Complex
Backgrounds [155.388487263872]
We propose a new infrared small-dim target detection method with the transformer.
We adopt the self-attention mechanism of the transformer to learn the interaction information of image features in a larger range.
We also design a feature enhancement module to learn more features of small-dim targets.
arXiv Detail & Related papers (2021-09-29T12:23:41Z) - Slender Object Detection: Diagnoses and Improvements [74.40792217534]
In this paper, we are concerned with the detection of a particular type of objects with extreme aspect ratios, namely textbfslender objects.
For a classical object detection method, a drastic drop of $18.9%$ mAP on COCO is observed, if solely evaluated on slender objects.
arXiv Detail & Related papers (2020-11-17T09:39:42Z) - Pixel-Semantic Revise of Position Learning A One-Stage Object Detector
with A Shared Encoder-Decoder [5.371825910267909]
We analyze that different methods detect objects adaptively.
Some state-of-the-art detectors combine different feature pyramids with many mechanisms to enhance multi-level semantic information.
This work addresses that by an anchor-free detector with shared encoder-decoder with attention mechanism.
arXiv Detail & Related papers (2020-01-04T08:55:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.