ADOD: Adaptive Domain-Aware Object Detection with Residual Attention for
Underwater Environments
- URL: http://arxiv.org/abs/2312.06801v1
- Date: Mon, 11 Dec 2023 19:20:56 GMT
- Title: ADOD: Adaptive Domain-Aware Object Detection with Residual Attention for
Underwater Environments
- Authors: Lyes Saad Saoud, Zhenwei Niu, Atif Sultan, Lakmal Seneviratne and
Irfan Hussain
- Abstract summary: This research presents ADOD, a novel approach to address domain generalization in underwater object detection.
Our method enhances the model's ability to generalize across diverse and unseen domains, ensuring robustness in various underwater environments.
- Score: 1.2624532490634643
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: This research presents ADOD, a novel approach to address domain
generalization in underwater object detection. Our method enhances the model's
ability to generalize across diverse and unseen domains, ensuring robustness in
various underwater environments. The first key contribution is Residual
Attention YOLOv3, a novel variant of the YOLOv3 framework empowered by residual
attention modules. These modules enable the model to focus on informative
features while suppressing background noise, leading to improved detection
accuracy and adaptability to different domains. The second contribution is the
attention-based domain classification module, vital during training. This
module helps the model identify domain-specific information, facilitating the
learning of domain-invariant features. Consequently, ADOD can generalize
effectively to underwater environments with distinct visual characteristics.
Extensive experiments on diverse underwater datasets demonstrate ADOD's
superior performance compared to state-of-the-art domain generalization
methods, particularly in challenging scenarios. The proposed model achieves
exceptional detection performance in both seen and unseen domains, showcasing
its effectiveness in handling domain shifts in underwater object detection
tasks. ADOD represents a significant advancement in adaptive object detection,
providing a promising solution for real-world applications in underwater
environments. With the prevalence of domain shifts in such settings, the
model's strong generalization ability becomes a valuable asset for practical
underwater surveillance and marine research endeavors.
Related papers
- FAFA: Frequency-Aware Flow-Aided Self-Supervision for Underwater Object Pose Estimation [65.01601309903971]
We introduce FAFA, a Frequency-Aware Flow-Aided self-supervised framework for 6D pose estimation of unmanned underwater vehicles (UUVs)
Our framework relies solely on the 3D model and RGB images, alleviating the need for any real pose annotations or other-modality data like depths.
We evaluate the effectiveness of FAFA on common underwater object pose benchmarks and showcase significant performance improvements compared to state-of-the-art methods.
arXiv Detail & Related papers (2024-09-25T03:54:01Z) - A Real-Time Framework for Domain-Adaptive Underwater Object Detection with Image Enhancement [18.936260846385444]
EnYOLO is an integrated real-time framework designed for simultaneous underwater image enhancement (UIE) and object detection (UOD)
Our framework achieves state-of-the-art (SOTA) performance in both UIE and UOD tasks, but also shows superior adaptability when applied to different underwater scenarios.
arXiv Detail & Related papers (2024-03-28T01:00:08Z) - CMDA: Cross-Modal and Domain Adversarial Adaptation for LiDAR-Based 3D
Object Detection [14.063365469339812]
LiDAR-based 3D Object Detection methods often do not generalize well to target domains outside the source (or training) data distribution.
We introduce a novel unsupervised domain adaptation (UDA) method, called CMDA, which leverages visual semantic cues from an image modality.
We also introduce a self-training-based learning strategy, wherein a model is adversarially trained to generate domain-invariant features.
arXiv Detail & Related papers (2024-03-06T14:12:38Z) - Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector [72.05791402494727]
This paper studies the challenging cross-domain few-shot object detection (CD-FSOD)
It aims to develop an accurate object detector for novel domains with minimal labeled examples.
arXiv Detail & Related papers (2024-02-05T15:25:32Z) - Edge-guided Representation Learning for Underwater Object Detection [15.832646455660278]
Underwater object detection is crucial for marine economic development, environmental protection, and the planet's sustainable development.
Main challenges of this task arise from low-contrast, small objects, and mimicry of aquatic organisms.
We propose an Edge-guided Representation Learning Network, termed ERL-Net, that aims to achieve discriminative representation learning and aggregation.
arXiv Detail & Related papers (2023-06-01T08:29:44Z) - Out-of-Domain Robustness via Targeted Augmentations [90.94290420322457]
We study principles for designing data augmentations for out-of-domain generalization.
Motivated by theoretical analysis on a linear setting, we propose targeted augmentations.
We show that targeted augmentations set new states-of-the-art for OOD performance by 3.2-15.2 percentage points.
arXiv Detail & Related papers (2023-02-23T08:59:56Z) - Towards Online Domain Adaptive Object Detection [79.89082006155135]
Existing object detection models assume both the training and test data are sampled from the same source domain.
We propose a novel unified adaptation framework that adapts and improves generalization on the target domain in online settings.
arXiv Detail & Related papers (2022-04-11T17:47:22Z) - SVAM: Saliency-guided Visual Attention Modeling by Autonomous Underwater
Robots [16.242924916178282]
This paper presents a holistic approach to saliency-guided visual attention modeling (SVAM) for use by autonomous underwater robots.
Our proposed model, named SVAM-Net, integrates deep visual features at various scales and semantics for effective salient object detection (SOD) in natural underwater images.
arXiv Detail & Related papers (2020-11-12T08:17:21Z) - Unsupervised Domain Adaptation for Spatio-Temporal Action Localization [69.12982544509427]
S-temporal action localization is an important problem in computer vision.
We propose an end-to-end unsupervised domain adaptation algorithm.
We show that significant performance gain can be achieved when spatial and temporal features are adapted separately or jointly.
arXiv Detail & Related papers (2020-10-19T04:25:10Z) - Domain Conditioned Adaptation Network [90.63261870610211]
We propose a Domain Conditioned Adaptation Network (DCAN) to excite distinct convolutional channels with a domain conditioned channel attention mechanism.
This is the first work to explore the domain-wise convolutional channel activation for deep DA networks.
arXiv Detail & Related papers (2020-05-14T04:23:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.