Improving Buoy Detection with Deep Transfer Learning for Mussel Farm
Automation
- URL: http://arxiv.org/abs/2308.09238v2
- Date: Mon, 26 Feb 2024 08:54:21 GMT
- Title: Improving Buoy Detection with Deep Transfer Learning for Mussel Farm
Automation
- Authors: Carl McMillan, Junhong Zhao, Bing Xue, Ross Vennell, Mengjie Zhang
- Abstract summary: The aquaculture sector in New Zealand is experiencing rapid expansion, with a particular emphasis on mussel exports.
As the demands of mussel farming operations continue to evolve, the integration of artificial intelligence and computer vision techniques is emerging as an effective approach to enhance operational efficiency.
This study delves into advancing buoy detection by leveraging deep learning methodologies for intelligent mussel farm monitoring and management.
- Score: 7.906113472259946
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The aquaculture sector in New Zealand is experiencing rapid expansion, with a
particular emphasis on mussel exports. As the demands of mussel farming
operations continue to evolve, the integration of artificial intelligence and
computer vision techniques, such as intelligent object detection, is emerging
as an effective approach to enhance operational efficiency. This study delves
into advancing buoy detection by leveraging deep learning methodologies for
intelligent mussel farm monitoring and management. The primary objective
centers on improving accuracy and robustness in detecting buoys across a
spectrum of real-world scenarios. A diverse dataset sourced from mussel farms
is captured and labeled for training, encompassing imagery taken from cameras
mounted on both floating platforms and traversing vessels, capturing various
lighting and weather conditions. To establish an effective deep learning model
for buoy detection with a limited number of labeled data, we employ transfer
learning techniques. This involves adapting a pre-trained object detection
model to create a specialized deep learning buoy detection model. We explore
different pre-trained models, including YOLO and its variants, alongside data
diversity to investigate their effects on model performance. Our investigation
demonstrates a significant enhancement in buoy detection performance through
deep learning, accompanied by improved generalization across diverse weather
conditions, highlighting the practical effectiveness of our approach.
Related papers
- Exploring Machine Learning Engineering for Object Detection and Tracking by Unmanned Aerial Vehicle (UAV) [3.600782980481468]
This research effort focuses on the development of a machine learning pipeline emphasizing the inclusion of assurance methods with increasing automation.
A new dataset was created by collecting videos of moving object such as Roomba vacuum cleaner, emulating search and rescue (SAR) for indoor environment.
After the refinement of the dataset it was trained on a second YOLOv4 and a Mask R-CNN model, which is deployed on a Parrot Mambo drone to perform real-time object detection and tracking.
arXiv Detail & Related papers (2024-12-19T19:27:31Z) - Understanding and Improving Training-Free AI-Generated Image Detections with Vision Foundation Models [68.90917438865078]
Deepfake techniques for facial synthesis and editing pose serious risks for generative models.
In this paper, we investigate how detection performance varies across model backbones, types, and datasets.
We introduce Contrastive Blur, which enhances performance on facial images, and MINDER, which addresses noise type bias, balancing performance across domains.
arXiv Detail & Related papers (2024-11-28T13:04:45Z) - LEAP:D - A Novel Prompt-based Approach for Domain-Generalized Aerial Object Detection [2.1233286062376497]
We introduce an innovative vision-language approach using learnable prompts.
This shift from conventional manual prompts aims to reduce domain-specific knowledge interference.
We streamline the training process with a one-step approach, updating the learnable prompt concurrently with model training.
arXiv Detail & Related papers (2024-11-14T04:39:10Z) - A Simple Background Augmentation Method for Object Detection with Diffusion Model [53.32935683257045]
In computer vision, it is well-known that a lack of data diversity will impair model performance.
We propose a simple yet effective data augmentation approach by leveraging advancements in generative models.
Background augmentation, in particular, significantly improves the models' robustness and generalization capabilities.
arXiv Detail & Related papers (2024-08-01T07:40:00Z) - Pose Estimation from Camera Images for Underwater Inspection [0.0]
Visual localization is a cost-effective alternative to inertial navigation systems.
We show that machine learning-based pose estimation from images shows promise in underwater environments.
We employ novel view synthesis models to generate augmented training data, significantly enhancing pose estimation in unexplored regions.
arXiv Detail & Related papers (2024-07-24T03:00:53Z) - Learning Objective-Specific Active Learning Strategies with Attentive
Neural Processes [72.75421975804132]
Learning Active Learning (LAL) suggests to learn the active learning strategy itself, allowing it to adapt to the given setting.
We propose a novel LAL method for classification that exploits symmetry and independence properties of the active learning problem.
Our approach is based on learning from a myopic oracle, which gives our model the ability to adapt to non-standard objectives.
arXiv Detail & Related papers (2023-09-11T14:16:37Z) - SSMTL++: Revisiting Self-Supervised Multi-Task Learning for Video
Anomaly Detection [108.57862846523858]
We revisit the self-supervised multi-task learning framework, proposing several updates to the original method.
We modernize the 3D convolutional backbone by introducing multi-head self-attention modules.
In our attempt to further improve the model, we study additional self-supervised learning tasks, such as predicting segmentation maps.
arXiv Detail & Related papers (2022-07-16T19:25:41Z) - TRAIL: Near-Optimal Imitation Learning with Suboptimal Data [100.83688818427915]
We present training objectives that use offline datasets to learn a factored transition model.
Our theoretical analysis shows that the learned latent action space can boost the sample-efficiency of downstream imitation learning.
To learn the latent action space in practice, we propose TRAIL (Transition-Reparametrized Actions for Imitation Learning), an algorithm that learns an energy-based transition model.
arXiv Detail & Related papers (2021-10-27T21:05:00Z) - Multitask Adaptation by Retrospective Exploration with Learned World
Models [77.34726150561087]
We propose a meta-learned addressing model called RAMa that provides training samples for the MBRL agent taken from task-agnostic storage.
The model is trained to maximize the expected agent's performance by selecting promising trajectories solving prior tasks from the storage.
arXiv Detail & Related papers (2021-10-25T20:02:57Z) - Perceptual underwater image enhancement with deep learning and physical
priors [35.37760003463292]
We propose two perceptual enhancement models, each of which uses a deep enhancement model with a detection perceptor.
Due to the lack of training data, a hybrid underwater image synthesis model, which fuses physical priors and data-driven cues, is proposed to synthesize training data.
Experimental results show the superiority of our proposed method over several state-of-the-art methods on both real-world and synthetic underwater datasets.
arXiv Detail & Related papers (2020-08-21T22:11:34Z) - Building Robust Industrial Applicable Object Detection Models Using
Transfer Learning and Single Pass Deep Learning Architectures [1.1816942730023883]
We explore how deep convolutional neural networks dedicated to the task of object detection can improve our industrial-oriented object detection pipelines.
By using a deep learning architecture that integrates region proposals, classification and probability estimation in a single run, we aim at obtaining real-time performance.
We apply these algorithms to two industrially relevant applications, one being the detection of promotion boards in eye tracking data and the other detecting and recognizing packages of warehouse products for augmented advertisements.
arXiv Detail & Related papers (2020-07-09T09:50:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.