HCIL: Hierarchical Class Incremental Learning for Longline Fishing
Visual Monitoring
- URL: http://arxiv.org/abs/2202.13018v1
- Date: Fri, 25 Feb 2022 23:53:11 GMT
- Title: HCIL: Hierarchical Class Incremental Learning for Longline Fishing
Visual Monitoring
- Authors: Jie Mei, Suzanne Romain, Craig Rose, Kelsey Magrane, Jenq-Neng Hwang
- Abstract summary: We introduce a Hierarchical Class Incremental Learning (HCIL) model, which significantly improves the state-of-the-art hierarchical classification methods under the CIL scenario.
A CIL system should be able to learn about more and more classes over time from a stream of data, i.e., only the training data for a small number of classes have to be present at the beginning and new classes can be added progressively.
- Score: 30.084499552709183
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The goal of electronic monitoring of longline fishing is to visually monitor
the fish catching activities on fishing vessels based on cameras, either for
regulatory compliance or catch counting. The previous hierarchical
classification method demonstrates efficient fish species identification of
catches from longline fishing, where fishes are under severe deformation and
self-occlusion during the catching process. Although the hierarchical
classification mitigates the laborious efforts of human reviews by providing
confidence scores in different hierarchical levels, its performance drops
dramatically under the class incremental learning (CIL) scenario. A CIL system
should be able to learn about more and more classes over time from a stream of
data, i.e., only the training data for a small number of classes have to be
present at the beginning and new classes can be added progressively. In this
work, we introduce a Hierarchical Class Incremental Learning (HCIL) model,
which significantly improves the state-of-the-art hierarchical classification
methods under the CIL scenario.
Related papers
- Efficient Prompt Tuning of Large Vision-Language Model for Fine-Grained
Ship Classification [62.425462136772666]
Fine-grained ship classification in remote sensing (RS-FGSC) poses a significant challenge due to the high similarity between classes and the limited availability of labeled data.
Recent advancements in large pre-trained Vision-Language Models (VLMs) have demonstrated impressive capabilities in few-shot or zero-shot learning.
This study delves into harnessing the potential of VLMs to enhance classification accuracy for unseen ship categories.
arXiv Detail & Related papers (2024-03-13T05:48:58Z) - Automatic evaluation of herding behavior in towed fishing gear using
end-to-end training of CNN and attention-based networks [0.0]
The paper compares three convolutional and attention-based deep action recognition network architectures trained end-to-end.
A two-stream CNN model, a CNN-transformer hybrid, and a pure transformer model were trained end-to-end to achieve 63%, 54%, and 60% 10-fold classification accuracy.
arXiv Detail & Related papers (2023-03-21T16:52:08Z) - Better Supervisory Signals by Observing Learning Paths [10.044413937134237]
We explain two existing label refining methods, label smoothing and knowledge distillation, in terms of our proposed criterion.
We observe the learning path, i.e., the trajectory of the model's predictions during training, for each training sample.
We find that the model can spontaneously refine "bad" labels through a "zig-zag" learning path, which occurs on both toy and real datasets.
arXiv Detail & Related papers (2022-03-04T18:31:23Z) - Self-Supervised Class Incremental Learning [51.62542103481908]
Existing Class Incremental Learning (CIL) methods are based on a supervised classification framework sensitive to data labels.
When updating them based on the new class data, they suffer from catastrophic forgetting: the model cannot discern old class data clearly from the new.
In this paper, we explore the performance of Self-Supervised representation learning in Class Incremental Learning (SSCIL) for the first time.
arXiv Detail & Related papers (2021-11-18T06:58:19Z) - Calibrating Class Activation Maps for Long-Tailed Visual Recognition [60.77124328049557]
We present two effective modifications of CNNs to improve network learning from long-tailed distribution.
First, we present a Class Activation Map (CAMC) module to improve the learning and prediction of network classifiers.
Second, we investigate the use of normalized classifiers for representation learning in long-tailed problems.
arXiv Detail & Related papers (2021-08-29T05:45:03Z) - Video-based Hierarchical Species Classification for Longline Fishing
Monitoring [17.031967273526803]
Hierarchical classification based on videos allows for inexpensive and efficient fish species identification of catches from longline fishing.
With a known non-overlapping hierarchical data structure provided by fisheries scientists, our method enforces the hierarchical data structure.
Our experiments show that the proposed method outperforms the classic flat classification system significantly.
arXiv Detail & Related papers (2021-02-06T06:10:52Z) - Movement Tracks for the Automatic Detection of Fish Behavior in Videos [63.85815474157357]
We offer a dataset of sablefish (Anoplopoma fimbria) startle behaviors in underwater videos, and investigate the use of deep learning (DL) methods for behavior detection on it.
Our proposed detection system identifies fish instances using DL-based frameworks, determines trajectory tracks, derives novel behavior-specific features, and employs Long Short-Term Memory (LSTM) networks to identify startle behavior in sablefish.
arXiv Detail & Related papers (2020-11-28T05:51:19Z) - Overcoming Classifier Imbalance for Long-tail Object Detection with
Balanced Group Softmax [88.11979569564427]
We provide the first systematic analysis on the underperformance of state-of-the-art models in front of long-tail distribution.
We propose a novel balanced group softmax (BAGS) module for balancing the classifiers within the detection frameworks through group-wise training.
Extensive experiments on the very recent long-tail large vocabulary object recognition benchmark LVIS show that our proposed BAGS significantly improves the performance of detectors.
arXiv Detail & Related papers (2020-06-18T10:24:26Z) - Temperate Fish Detection and Classification: a Deep Learning based
Approach [6.282069822653608]
We propose a two-step deep learning approach for the detection and classification of temperate fishes without pre-filtering.
The first step is to detect each single fish in an image, independent of species and sex.
In the second step, we adopt a Convolutional Neural Network (CNN) with the Squeeze-and-Excitation (SE) architecture for classifying each fish in the image without pre-filtering.
arXiv Detail & Related papers (2020-05-14T12:40:57Z) - Fine-Grained Visual Classification with Efficient End-to-end
Localization [49.9887676289364]
We present an efficient localization module that can be fused with a classification network in an end-to-end setup.
We evaluate the new model on the three benchmark datasets CUB200-2011, Stanford Cars and FGVC-Aircraft.
arXiv Detail & Related papers (2020-05-11T14:07:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.