Seeing BDD100K in dark: Single-Stage Night-time Object Detection via
Continual Fourier Contrastive Learning
- URL: http://arxiv.org/abs/2112.02891v1
- Date: Mon, 6 Dec 2021 09:28:45 GMT
- Title: Seeing BDD100K in dark: Single-Stage Night-time Object Detection via
Continual Fourier Contrastive Learning
- Authors: Ujjal Kr Dutta
- Abstract summary: Night-time object detection has been studied only sparsely, that too, via non-uniform evaluation protocols among the limited available papers.
In this paper, we bridge these 3 gaps:.
Lack of an uniform evaluation protocol (using a single-stage detector, due to its efficacy, and efficiency);.
A choice of dataset for benchmarking night-time object detection, and.
A novel method to address the limitations of current alternatives.
- Score: 3.4012007729454816
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: Despite tremendous improvements in state-of-the-art object detectors,
addressing object detection in the night-time has been studied only sparsely,
that too, via non-uniform evaluation protocols among the limited available
papers. In addition to the lack of methods to address this problem, there was
also a lack of an adequately large benchmark dataset to study night-time object
detection. Recently, the large scale BDD100K was introduced, which, in our
opinion, should be chosen as the benchmark, to kickstart research in this area.
Now, coming to the methods, existing approaches (limited in number), are mainly
either generative image translation based, or image enhancement/ illumination
based, neither of which is natural, conforming to how humans see objects in the
night time (by focusing on object contours). In this paper, we bridge these 3
gaps: 1. Lack of an uniform evaluation protocol (using a single-stage detector,
due to its efficacy, and efficiency), 2. Choice of dataset for benchmarking
night-time object detection, and 3. A novel method to address the limitations
of current alternatives. Our method leverages a Contrastive Learning based
feature extractor, borrowing information from the frequency domain via Fourier
transformation, and trained in a continual learning based fashion. The learned
features when used for object detection (after fine-tuning the classification
and regression layers), help achieve a new state-of-the-art empirical
performance, comfortably outperforming an extensive number of competitors.
Related papers
- Bayesian Detector Combination for Object Detection with Crowdsourced Annotations [49.43709660948812]
Acquiring fine-grained object detection annotations in unconstrained images is time-consuming, expensive, and prone to noise.
We propose a novel Bayesian Detector Combination (BDC) framework to more effectively train object detectors with noisy crowdsourced annotations.
BDC is model-agnostic, requires no prior knowledge of the annotators' skill level, and seamlessly integrates with existing object detection models.
arXiv Detail & Related papers (2024-07-10T18:00:54Z) - Proposal-Contrastive Pretraining for Object Detection from Fewer Data [11.416621957617334]
We present Proposal Selection Contrast (ProSeCo), a novel unsupervised overall pretraining approach.
ProSeCo uses the large number of object proposals generated by the detector for contrastive learning.
We show that our method outperforms state of the art in unsupervised pretraining for object detection on standard and novel benchmarks.
arXiv Detail & Related papers (2023-10-25T17:59:26Z) - Improved Region Proposal Network for Enhanced Few-Shot Object Detection [23.871860648919593]
Few-shot object detection (FSOD) methods have emerged as a solution to the limitations of classic object detection approaches.
We develop a semi-supervised algorithm to detect and then utilize unlabeled novel objects as positive samples during the FSOD training stage.
Our improved hierarchical sampling strategy for the region proposal network (RPN) also boosts the perception of the object detection model for large objects.
arXiv Detail & Related papers (2023-08-15T02:35:59Z) - Generalized Few-Shot 3D Object Detection of LiDAR Point Cloud for
Autonomous Driving [91.39625612027386]
We propose a novel task, called generalized few-shot 3D object detection, where we have a large amount of training data for common (base) objects, but only a few data for rare (novel) classes.
Specifically, we analyze in-depth differences between images and point clouds, and then present a practical principle for the few-shot setting in the 3D LiDAR dataset.
To solve this task, we propose an incremental fine-tuning method to extend existing 3D detection models to recognize both common and rare objects.
arXiv Detail & Related papers (2023-02-08T07:11:36Z) - Detecting the unknown in Object Detection [20.84221126313118]
We propose a novel training strategy, called UNKAD, able to predict unknown objects without requiring any annotation.
UNKAD first identifies and pseudo-labels unknown objects and then uses the pseudo-annotations to train an additional unknown class.
While UNKAD can directly detect unknown objects, we further combine it with previous unknown detection techniques, showing that it improves their performance at no costs.
arXiv Detail & Related papers (2022-08-24T16:27:38Z) - Analysis of voxel-based 3D object detection methods efficiency for
real-time embedded systems [93.73198973454944]
Two popular voxel-based 3D object detection methods are studied in this paper.
Our experiments show that these methods mostly fail to detect distant small objects due to the sparsity of the input point clouds at large distances.
Our findings suggest that a considerable part of the computations of existing methods is focused on locations of the scene that do not contribute with successful detection.
arXiv Detail & Related papers (2021-05-21T12:40:59Z) - QuickBrowser: A Unified Model to Detect and Read Simple Object in
Real-time [3.098115480186737]
This work aims to solve this detect-and-read problem in a lightweight way by integrating multi-digit recognition into a one-stage object detection model.
Our choice of backbones and modifications in architecture, loss function, data augmentation and training make the method robust, efficient and speedy.
arXiv Detail & Related papers (2021-02-15T05:47:40Z) - Slender Object Detection: Diagnoses and Improvements [74.40792217534]
In this paper, we are concerned with the detection of a particular type of objects with extreme aspect ratios, namely textbfslender objects.
For a classical object detection method, a drastic drop of $18.9%$ mAP on COCO is observed, if solely evaluated on slender objects.
arXiv Detail & Related papers (2020-11-17T09:39:42Z) - A Self-Training Approach for Point-Supervised Object Detection and
Counting in Crowds [54.73161039445703]
We propose a novel self-training approach that enables a typical object detector trained only with point-level annotations.
During training, we utilize the available point annotations to supervise the estimation of the center points of objects.
Experimental results show that our approach significantly outperforms state-of-the-art point-supervised methods under both detection and counting tasks.
arXiv Detail & Related papers (2020-07-25T02:14:42Z) - Frustratingly Simple Few-Shot Object Detection [98.42824677627581]
We find that fine-tuning only the last layer of existing detectors on rare classes is crucial to the few-shot object detection task.
Such a simple approach outperforms the meta-learning methods by roughly 220 points on current benchmarks.
arXiv Detail & Related papers (2020-03-16T00:29:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.