Density Crop-guided Semi-supervised Object Detection in Aerial Images
- URL: http://arxiv.org/abs/2308.05032v1
- Date: Wed, 9 Aug 2023 15:59:42 GMT
- Title: Density Crop-guided Semi-supervised Object Detection in Aerial Images
- Authors: Akhil Meethal, Eric Granger, Marco Pedersoli
- Abstract summary: We propose a density crop-guided semi-supervised detector for object detection in aerial images.
During training, image crops of clusters identified from labeled and unlabeled images are used to augment the training set.
During inference, the detector is able to detect the objects of interest but also regions with a high density of small objects.
- Score: 12.944309759825902
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: One of the important bottlenecks in training modern object detectors is the
need for labeled images where bounding box annotations have to be produced for
each object present in the image. This bottleneck is further exacerbated in
aerial images where the annotators have to label small objects often
distributed in clusters on high-resolution images. In recent days, the
mean-teacher approach trained with pseudo-labels and weak-strong augmentation
consistency is gaining popularity for semi-supervised object detection.
However, a direct adaptation of such semi-supervised detectors for aerial
images where small clustered objects are often present, might not lead to
optimal results. In this paper, we propose a density crop-guided
semi-supervised detector that identifies the cluster of small objects during
training and also exploits them to improve performance at inference. During
training, image crops of clusters identified from labeled and unlabeled images
are used to augment the training set, which in turn increases the chance of
detecting small objects and creating good pseudo-labels for small objects on
the unlabeled images. During inference, the detector is not only able to detect
the objects of interest but also regions with a high density of small objects
(density crops) so that detections from the input image and detections from
image crops are combined, resulting in an overall more accurate object
prediction, especially for small objects. Empirical studies on the popular
benchmarks of VisDrone and DOTA datasets show the effectiveness of our density
crop-guided semi-supervised detector with an average improvement of more than
2\% over the basic mean-teacher method in COCO style AP. Our code is available
at: https://github.com/akhilpm/DroneSSOD.
Related papers
- Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation [58.37525311718006]
We put forth a novel formulation of the aerial object detection problem, namely open-vocabulary aerial object detection (OVAD)
We propose CastDet, a CLIP-activated student-teacher detection framework that serves as the first OVAD detector specifically designed for the challenging aerial scenario.
Our framework integrates a robust localization teacher along with several box selection strategies to generate high-quality proposals for novel objects.
arXiv Detail & Related papers (2024-11-04T12:59:13Z) - SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection [59.868772767818975]
We propose a simple yet effective Semi-supervised Oriented Object Detection method termed SOOD++.
Specifically, we observe that objects from aerial images are usually arbitrary orientations, small scales, and aggregation.
Extensive experiments conducted on various multi-oriented object datasets under various labeled settings demonstrate the effectiveness of our method.
arXiv Detail & Related papers (2024-07-01T07:03:51Z) - YOLC: You Only Look Clusters for Tiny Object Detection in Aerial Images [33.80392696735718]
YOLC (You Only Look Clusters) is an efficient and effective framework that builds on an anchor-free object detector, CenterNet.
To overcome the challenges posed by large-scale images and non-uniform object distribution, we introduce a Local Scale Module (LSM) that adaptively searches cluster regions for zooming in for accurate detection.
We perform extensive experiments on two aerial image datasets, including Visdrone 2019 and UAVDT, to demonstrate the effectiveness and superiority of our proposed approach.
arXiv Detail & Related papers (2024-04-09T10:03:44Z) - Improving the Detection of Small Oriented Objects in Aerial Images [0.0]
We propose a method to accurately detect small oriented objects in aerial images by enhancing the classification and regression tasks of the oriented object detection model.
We designed the Attention-Points Network consisting of two losses: Guided-Attention Loss (GALoss) and Box-Points Loss (BPLoss)
Experimental results show the effectiveness of our Attention-Points Network on a standard oriented aerial dataset with small object instances.
arXiv Detail & Related papers (2024-01-12T11:00:07Z) - Cascaded Zoom-in Detector for High Resolution Aerial Images [12.944309759825902]
We propose an efficient Cascaded Zoom-in (CZ) detector that re-purposes the detector itself for density-guided training and inference.
During training, density crops are located, labeled as a new class, and employed to augment the training dataset.
This approach is easily integrated into any detector, and creates no significant change in the standard detection process.
arXiv Detail & Related papers (2023-03-15T16:39:21Z) - A Coarse to Fine Framework for Object Detection in High Resolution Image [8.316322664637537]
Current approaches of object detection seldom consider detecting tiny object or the large scale variance problem in high resolution images.
We introduce a simple yet efficient approach that improves accuracy of object detection especially for small objects and large scale variance scene.
Our approach can make good use of the sparsity of the objects and the information in high-resolution image, thereby making the detection more efficient.
arXiv Detail & Related papers (2023-03-02T13:04:33Z) - MUS-CDB: Mixed Uncertainty Sampling with Class Distribution Balancing
for Active Annotation in Aerial Object Detection [40.94800050576902]
Recent aerial object detection models rely on a large amount of labeled training data.
Active learning effectively reduces the data labeling cost by selectively querying the informative and representative unlabelled samples.
We propose a novel active learning method for cost-effective aerial object detection.
arXiv Detail & Related papers (2022-12-06T07:50:00Z) - Exploiting Unlabeled Data with Vision and Language Models for Object
Detection [64.94365501586118]
Building robust and generic object detection frameworks requires scaling to larger label spaces and bigger training datasets.
We propose a novel method that leverages the rich semantics available in recent vision and language models to localize and classify objects in unlabeled images.
We demonstrate the value of the generated pseudo labels in two specific tasks, open-vocabulary detection and semi-supervised object detection.
arXiv Detail & Related papers (2022-07-18T21:47:15Z) - Instance Localization for Self-supervised Detection Pretraining [68.24102560821623]
We propose a new self-supervised pretext task, called instance localization.
We show that integration of bounding boxes into pretraining promotes better task alignment and architecture alignment for transfer learning.
Experimental results demonstrate that our approach yields state-of-the-art transfer learning results for object detection.
arXiv Detail & Related papers (2021-02-16T17:58:57Z) - Unsupervised Object Detection with LiDAR Clues [70.73881791310495]
We present the first practical method for unsupervised object detection with the aid of LiDAR clues.
In our approach, candidate object segments based on 3D point clouds are firstly generated.
Then, an iterative segment labeling process is conducted to assign segment labels and to train a segment labeling network.
The labeling process is carefully designed so as to mitigate the issue of long-tailed and open-ended distribution.
arXiv Detail & Related papers (2020-11-25T18:59:54Z) - EHSOD: CAM-Guided End-to-end Hybrid-Supervised Object Detection with
Cascade Refinement [53.69674636044927]
We present EHSOD, an end-to-end hybrid-supervised object detection system.
It can be trained in one shot on both fully and weakly-annotated data.
It achieves comparable results on multiple object detection benchmarks with only 30% fully-annotated data.
arXiv Detail & Related papers (2020-02-18T08:04:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.