A Comparison of Deep Learning Object Detection Models for Satellite
Imagery
- URL: http://arxiv.org/abs/2009.04857v1
- Date: Thu, 10 Sep 2020 13:43:14 GMT
- Title: A Comparison of Deep Learning Object Detection Models for Satellite
Imagery
- Authors: Austen Groener, Gary Chern, Mark Pritt
- Abstract summary: We compare the detection accuracy and speed of several state-of-the-art models for the task of detecting oil and gas fracking wells and small cars in commercial electro-optical satellite imagery.
For the detection of fracking well pads (50m - 250m), we find single-stage detectors provide superior prediction speed while also matching detection performance of their two and multi-stage counterparts.
For detecting small cars, two-stage and multi-stage models provide substantially higher accuracies at the cost of some speed.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this work, we compare the detection accuracy and speed of several
state-of-the-art models for the task of detecting oil and gas fracking wells
and small cars in commercial electro-optical satellite imagery. Several models
are studied from the single-stage, two-stage, and multi-stage object detection
families of techniques. For the detection of fracking well pads (50m - 250m),
we find single-stage detectors provide superior prediction speed while also
matching detection performance of their two and multi-stage counterparts.
However, for detecting small cars, two-stage and multi-stage models provide
substantially higher accuracies at the cost of some speed. We also measure
timing results of the sliding window object detection algorithm to provide a
baseline for comparison. Some of these models have been incorporated into the
Lockheed Martin Globally-Scalable Automated Target Recognition (GATR)
framework.
Related papers
- A Multispectral Automated Transfer Technique (MATT) for machine-driven
image labeling utilizing the Segment Anything Model (SAM) [0.0]
This paper outlines a method we call the Multispectral Automated Transfer Technique (MATT)
By transposing SAM segmentation masks from RGB images we can automatically segment and label multispectral imagery with high precision and efficiency.
This research greatly contributes to the study of multispectral object detection by providing a novel and open-source method to rapidly segment, label, and train multispectral object detection models.
arXiv Detail & Related papers (2024-02-18T01:01:13Z) - A Dual Attentive Generative Adversarial Network for Remote Sensing Image
Change Detection [6.906936669510404]
We propose a dual attentive generative adversarial network for achieving very high-resolution remote sensing image change detection tasks.
The DAGAN framework has better performance with 85.01% mean IoU and 91.48% mean F1 score than advanced methods on the LEVIR dataset.
arXiv Detail & Related papers (2023-10-03T08:26:27Z) - Robust and Fast Vehicle Detection using Augmented Confidence Map [10.261351772602543]
We introduce the concept of augmentation which highlights the region of interest containing the vehicles.
The output of MR-MSER is supplied to fast CNN to generate a confidence map.
Unlike existing models that implement complicated models for vehicle detection, we explore the combination of a rough set and fuzzy-based models.
arXiv Detail & Related papers (2023-04-27T18:41:16Z) - 2nd Place Solution for Waymo Open Dataset Challenge - Real-time 2D
Object Detection [26.086623067939605]
In this report, we introduce a real-time method to detect the 2D objects from images.
We leverage accelerationRT to optimize the inference time of our detection pipeline.
Our framework achieves the latency of 45.8ms/frame on an Nvidia Tesla V100 GPU.
arXiv Detail & Related papers (2021-06-16T11:32:03Z) - Multimodal Object Detection via Bayesian Fusion [59.31437166291557]
We study multimodal object detection with RGB and thermal cameras, since the latter can provide much stronger object signatures under poor illumination.
Our key contribution is a non-learned late-fusion method that fuses together bounding box detections from different modalities.
We apply our approach to benchmarks containing both aligned (KAIST) and unaligned (FLIR) multimodal sensor data.
arXiv Detail & Related papers (2021-04-07T04:03:20Z) - MimicDet: Bridging the Gap Between One-Stage and Two-Stage Object
Detection [65.74032877197844]
One-stage detectors are more efficient owing to straightforward architectures, but the two-stage detectors still take the lead in accuracy.
We propose MimicDet, a novel framework to train a one-stage detector by directly mimicking the two-stage features.
Mimic methods have a shared backbone for one-stage and two-stage detectors, then it branches into two heads which are well designed to have compatible features for mimicking.
arXiv Detail & Related papers (2020-09-24T07:36:58Z) - Anchor-free Small-scale Multispectral Pedestrian Detection [88.7497134369344]
We propose a method for effective and efficient multispectral fusion of the two modalities in an adapted single-stage anchor-free base architecture.
We aim at learning pedestrian representations based on object center and scale rather than direct bounding box predictions.
Results show our method's effectiveness in detecting small-scaled pedestrians.
arXiv Detail & Related papers (2020-08-19T13:13:01Z) - Condensing Two-stage Detection with Automatic Object Key Part Discovery [87.1034745775229]
Two-stage object detectors generally require excessively large models for their detection heads to achieve high accuracy.
We propose that the model parameters of two-stage detection heads can be condensed and reduced by concentrating on object key parts.
Our proposed technique consistently maintains original performance while waiving around 50% of the model parameters of common two-stage detection heads.
arXiv Detail & Related papers (2020-06-10T01:20:47Z) - Concurrent Segmentation and Object Detection CNNs for Aircraft Detection
and Identification in Satellite Images [0.0]
We present a dedicated method to detect and identify aircraft, combining two very different convolutional neural networks (CNNs)
The results we present show that this combination outperforms significantly each unitary model, reducing drastically the false negative rate.
arXiv Detail & Related papers (2020-05-27T07:35:55Z) - One-Shot Object Detection without Fine-Tuning [62.39210447209698]
We introduce a two-stage model consisting of a first stage Matching-FCOS network and a second stage Structure-Aware Relation Module.
We also propose novel training strategies that effectively improve detection performance.
Our method exceeds the state-of-the-art one-shot performance consistently on multiple datasets.
arXiv Detail & Related papers (2020-05-08T01:59:23Z) - EHSOD: CAM-Guided End-to-end Hybrid-Supervised Object Detection with
Cascade Refinement [53.69674636044927]
We present EHSOD, an end-to-end hybrid-supervised object detection system.
It can be trained in one shot on both fully and weakly-annotated data.
It achieves comparable results on multiple object detection benchmarks with only 30% fully-annotated data.
arXiv Detail & Related papers (2020-02-18T08:04:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.