Integrated Multiscale Domain Adaptive YOLO
- URL: http://arxiv.org/abs/2202.03527v1
- Date: Mon, 7 Feb 2022 21:30:53 GMT
- Title: Integrated Multiscale Domain Adaptive YOLO
- Authors: Mazin Hnewa and Hayder Radha
- Abstract summary: We introduce a novel MultiScale Domain Adaptive YOLO (MS-DAYOLO) framework that employs multiple domain adaptation paths and corresponding domain classifiers at different scales of the recently introduced YOLOv4 object detector.
Our experiments show significant improvements in object detection performance when training YOLOv4 using the proposed MS-DAYOLO architectures and when tested on target data for autonomous driving applications.
- Score: 5.33024001730262
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The area of domain adaptation has been instrumental in addressing the domain
shift problem encountered by many applications. This problem arises due to the
difference between the distributions of source data used for training in
comparison with target data used during realistic testing scenarios. In this
paper, we introduce a novel MultiScale Domain Adaptive YOLO (MS-DAYOLO)
framework that employs multiple domain adaptation paths and corresponding
domain classifiers at different scales of the recently introduced YOLOv4 object
detector. Building on our baseline multiscale DAYOLO framework, we introduce
three novel deep learning architectures for a Domain Adaptation Network (DAN)
that generates domain-invariant features. In particular, we propose a
Progressive Feature Reduction (PFR), a Unified Classifier (UC), and an
Integrated architecture. We train and test our proposed DAN architectures in
conjunction with YOLOv4 using popular datasets. Our experiments show
significant improvements in object detection performance when training YOLOv4
using the proposed MS-DAYOLO architectures and when tested on target data for
autonomous driving applications. Moreover, MS-DAYOLO framework achieves an
order of magnitude real-time speed improvement relative to Faster R-CNN
solutions while providing comparable object detection performance.
Related papers
- Investigating the potential of Sparse Mixtures-of-Experts for multi-domain neural machine translation [59.41178047749177]
We focus on multi-domain Neural Machine Translation, with the goal of developing efficient models which can handle data from various domains seen during training and are robust to domains unseen during training.
We hypothesize that Sparse Mixture-of-Experts (SMoE) models are a good fit for this task, as they enable efficient model scaling.
We conduct a series of experiments aimed at validating the utility of SMoE for the multi-domain scenario, and find that a straightforward width scaling of Transformer is a simpler and surprisingly more efficient approach in practice, and reaches the same performance level as SMoE.
arXiv Detail & Related papers (2024-07-01T09:45:22Z) - YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time
Object Detection [80.11152626362109]
We provide an efficient and performant object detector, termed YOLO-MS.
We train our YOLO-MS on the MS COCO dataset from scratch without relying on any other large-scale datasets.
Our work can also be used as a plug-and-play module for other YOLO models.
arXiv Detail & Related papers (2023-08-10T10:12:27Z) - SSDA-YOLO: Semi-supervised Domain Adaptive YOLO for Cross-Domain Object
Detection [27.02391566687007]
Domain adaptive object detection (DAD) aims to alleviate transfer performance degradation caused by the cross-domain discrepancy.
We propose a novel semi-supervised domain adaptive YOLO (SSDA-YOLO) based method to improve cross-domain detection performance.
arXiv Detail & Related papers (2022-11-04T01:50:13Z) - Rethinking Few-Shot Object Detection on a Multi-Domain Benchmark [28.818423712485504]
Multi-dOmain Few-Shot Object Detection (MoFSOD) benchmark consists of 10 datasets from a wide range of domains.
We analyze the impacts of freezing layers, different architectures, and different pre-training datasets on FSOD performance.
arXiv Detail & Related papers (2022-07-22T16:13:22Z) - Multi-level Consistency Learning for Semi-supervised Domain Adaptation [85.90600060675632]
Semi-supervised domain adaptation (SSDA) aims to apply knowledge learned from a fully labeled source domain to a scarcely labeled target domain.
We propose a Multi-level Consistency Learning framework for SSDA.
arXiv Detail & Related papers (2022-05-09T06:41:18Z) - A lightweight and accurate YOLO-like network for small target detection
in Aerial Imagery [94.78943497436492]
We present YOLO-S, a simple, fast and efficient network for small target detection.
YOLO-S exploits a small feature extractor based on Darknet20, as well as skip connection, via both bypass and concatenation.
YOLO-S has an 87% decrease of parameter size and almost one half FLOPs of YOLOv3, making practical the deployment for low-power industrial applications.
arXiv Detail & Related papers (2022-04-05T16:29:49Z) - Multi-Source Domain Adaptation for Object Detection [52.87890831055648]
We propose a unified Faster R-CNN based framework, termed Divide-and-Merge Spindle Network (DMSN)
DMSN can simultaneously enhance domain innative and preserve discriminative power.
We develop a novel pseudo learning algorithm to approximate optimal parameters of pseudo target subset.
arXiv Detail & Related papers (2021-06-30T03:17:20Z) - Domain Adaptive YOLO for One-Stage Cross-Domain Detection [4.596221278839825]
Domain Adaptive YOLO (DA-YOLO) is proposed to improve cross-domain performance for one-stage detectors.
We evaluate our proposed method on popular datasets like Cityscapes, KITTI, SIM10K and etc.
arXiv Detail & Related papers (2021-06-26T04:17:42Z) - Multiscale Domain Adaptive YOLO for Cross-Domain Object Detection [0.0]
We introduce a novel MultiScale Domain Adaptive YOLO (MS-DAYOLO) framework that employs multiple domain adaptation paths and corresponding domain classifiers at different scales to generate domain-invariant features.
Our experiments show significant improvements in object detection performance when training YOLOv4 using the proposed MS-DAYOLO and when tested on target data representing challenging weather conditions for autonomous driving applications.
arXiv Detail & Related papers (2021-06-02T21:50:25Z) - Multi-path Neural Networks for On-device Multi-domain Visual
Classification [55.281139434736254]
This paper proposes a novel approach to automatically learn a multi-path network for multi-domain visual classification on mobile devices.
The proposed multi-path network is learned from neural architecture search by applying one reinforcement learning controller for each domain to select the best path in the super-network created from a MobileNetV3-like search space.
The determined multi-path model selectively shares parameters across domains in shared nodes while keeping domain-specific parameters within non-shared nodes in individual domain paths.
arXiv Detail & Related papers (2020-10-10T05:13:49Z) - Deeply Aligned Adaptation for Cross-domain Object Detection [33.766468227676214]
Cross-domain object detection has recently attracted more and more attention for real-world applications.
We propose an end-to-end solution based on Faster R-CNN, where ground-truth annotations are available for source images but not for target ones.
arXiv Detail & Related papers (2020-04-05T04:41:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.