GUNNEL: Guided Mixup Augmentation and Multi-View Fusion for Aquatic
Animal Segmentation
- URL: http://arxiv.org/abs/2112.06193v3
- Date: Thu, 10 Aug 2023 16:03:31 GMT
- Title: GUNNEL: Guided Mixup Augmentation and Multi-View Fusion for Aquatic
Animal Segmentation
- Authors: Minh-Quan Le and Trung-Nghia Le and Tam V. Nguyen and Isao Echizen and
Minh-Triet Tran
- Abstract summary: We build a new dataset dubbed Aquatic Animal Species.
We devise a novel GUided mixup augmeNtatioN and multi-modEl fusion for aquatic animaL segmentation (GUNNEL)
Experiments demonstrated the superiority of our proposed framework over existing state-of-the-art instance segmentation methods.
- Score: 30.759713670293287
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Recent years have witnessed great advances in object segmentation research.
In addition to generic objects, aquatic animals have attracted research
attention. Deep learning-based methods are widely used for aquatic animal
segmentation and have achieved promising performance. However, there is a lack
of challenging datasets for benchmarking. In this work, we build a new dataset
dubbed Aquatic Animal Species. We also devise a novel GUided mixup augmeNtatioN
and multi-modEl fusion for aquatic animaL segmentation (GUNNEL) that leverages
the advantages of multiple segmentation models to effectively segment aquatic
animals and improves the training performance by synthesizing hard samples.
Extensive experiments demonstrated the superiority of our proposed framework
over existing state-of-the-art instance segmentation methods. The code is
available at https://github.com/lmquan2000/mask-mixup. The dataset is available
at https://doi.org/10.5281/zenodo.8208877 .
Related papers
- Benchmarking Large Language Models for Image Classification of Marine Mammals [4.274291455715579]
We build a benchmark dataset with 1,423 images of 65 kinds of marine mammals.
Each animal is uniquely classified into different levels of class, ranging from species-level to medium-level to group-level.
We evaluate several approaches for classifying these marine mammals.
arXiv Detail & Related papers (2024-10-22T01:49:49Z) - OpenAnimals: Revisiting Person Re-Identification for Animals Towards Better Generalization [10.176567936487364]
We conduct a study by revisiting several state-of-the-art person re-identification methods, including BoT, AGW, SBS, and MGN.
We evaluate their effectiveness on animal re-identification benchmarks such as HyenaID, LeopardID, SeaTurtleID, and WhaleSharkID.
Our findings reveal that while some techniques well, many do not generalize, underscoring the significant differences between the two tasks.
We propose ARBase, a strong textbfBase model tailored for textbfAnimal textbfRe-
arXiv Detail & Related papers (2024-09-30T20:07:14Z) - Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset [60.14089302022989]
Underwater vision tasks often suffer from low segmentation accuracy due to the complex underwater circumstances.
We construct the first large-scale underwater salient instance segmentation dataset (USIS10K)
We propose an Underwater Salient Instance architecture based on Segment Anything Model (USIS-SAM) specifically for the underwater domain.
arXiv Detail & Related papers (2024-06-10T06:17:33Z) - Generative Active Learning for Long-tailed Instance Segmentation [55.66158205855948]
We propose BSGAL, a new algorithm that estimates the contribution of generated data based on cache gradient.
Experiments show that BSGAL outperforms the baseline approach and effectually improves the performance of long-tailed segmentation.
arXiv Detail & Related papers (2024-06-04T15:57:43Z) - Fantastic Animals and Where to Find Them: Segment Any Marine Animal with Dual SAM [62.85895749882285]
Marine Animal (MAS) involves segmenting animals within marine environments.
We propose a novel feature learning framework, named Dual-SAM for high-performance MAS.
Our proposed method achieves state-of-the-art performances on five widely-used MAS datasets.
arXiv Detail & Related papers (2024-04-07T15:34:40Z) - WhaleNet: a Novel Deep Learning Architecture for Marine Mammals Vocalizations on Watkins Marine Mammal Sound Database [49.1574468325115]
We introduce textbfWhaleNet (Wavelet Highly Adaptive Learning Ensemble Network), a sophisticated deep ensemble architecture for the classification of marine mammal vocalizations.
We achieve an improvement in classification accuracy by $8-10%$ over existing architectures, corresponding to a classification accuracy of $97.61%$.
arXiv Detail & Related papers (2024-02-20T11:36:23Z) - Transformer-based Self-Supervised Fish Segmentation in Underwater Videos [1.9249287163937976]
We introduce a Transformer-based method that uses self-supervision for high-quality fish segmentation.
We show that when trained on a set of underwater videos from one dataset, the proposed model surpasses previous CNN-based and Transformer-based self-supervised methods.
arXiv Detail & Related papers (2022-06-11T01:20:48Z) - Revisiting Contrastive Methods for Unsupervised Learning of Visual
Representations [78.12377360145078]
Contrastive self-supervised learning has outperformed supervised pretraining on many downstream tasks like segmentation and object detection.
In this paper, we first study how biases in the dataset affect existing methods.
We show that current contrastive approaches work surprisingly well across: (i) object- versus scene-centric, (ii) uniform versus long-tailed and (iii) general versus domain-specific datasets.
arXiv Detail & Related papers (2021-06-10T17:59:13Z) - Reviving Iterative Training with Mask Guidance for Interactive
Segmentation [8.271859911016719]
Recent works on click-based interactive segmentation have demonstrated state-of-the-art results by using various inference-time optimization schemes.
We propose a simple feedforward model for click-based interactive segmentation that employs the segmentation masks from previous steps.
We find that the models trained on a combination of COCO and LVIS with diverse and high-quality annotations show performance superior to all existing models.
arXiv Detail & Related papers (2021-02-12T15:44:31Z) - The Devil is in Classification: A Simple Framework for Long-tail Object
Detection and Instance Segmentation [93.17367076148348]
We investigate performance drop of the state-of-the-art two-stage instance segmentation model Mask R-CNN on the recent long-tail LVIS dataset.
We unveil that a major cause is the inaccurate classification of object proposals.
We propose a simple calibration framework to more effectively alleviate classification head bias with a bi-level class balanced sampling approach.
arXiv Detail & Related papers (2020-07-23T12:49:07Z) - Semantic Segmentation of Underwater Imagery: Dataset and Benchmark [13.456412091502527]
We present the first large-scale dataset for semantic analysis of Underwater IMagery (SUIM)
It contains over 1500 images with pixel annotations for eight object categories: fish (vertebrates), reefs (invertebrates), aquatic plants, wrecks/ruins, human divers, robots, and sea-floor.
We also present a benchmark evaluation of state-of-the-art semantic segmentation approaches based on standard performance metrics.
arXiv Detail & Related papers (2020-04-02T19:53:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.