Feature Fusion from Head to Tail for Long-Tailed Visual Recognition
- URL: http://arxiv.org/abs/2306.06963v3
- Date: Mon, 18 Dec 2023 14:39:46 GMT
- Title: Feature Fusion from Head to Tail for Long-Tailed Visual Recognition
- Authors: Mengke Li, Zhikai Hu, Yang Lu, Weichao Lan, Yiu-ming Cheung, Hui Huang
- Abstract summary: The biased decision boundary caused by inadequate semantic information in tail classes is one of the key factors contributing to their low recognition accuracy.
We propose to augment tail classes by grafting the diverse semantic information from head classes, referred to as head-to-tail fusion (H2T)
Both theoretical analysis and practical experimentation demonstrate that H2T can contribute to a more optimized solution for the decision boundary.
- Score: 39.86973663532936
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The imbalanced distribution of long-tailed data presents a considerable
challenge for deep learning models, as it causes them to prioritize the
accurate classification of head classes but largely disregard tail classes. The
biased decision boundary caused by inadequate semantic information in tail
classes is one of the key factors contributing to their low recognition
accuracy. To rectify this issue, we propose to augment tail classes by grafting
the diverse semantic information from head classes, referred to as head-to-tail
fusion (H2T). We replace a portion of feature maps from tail classes with those
belonging to head classes. These fused features substantially enhance the
diversity of tail classes. Both theoretical analysis and practical
experimentation demonstrate that H2T can contribute to a more optimized
solution for the decision boundary. We seamlessly integrate H2T in the
classifier adjustment stage, making it a plug-and-play module. Its simplicity
and ease of implementation allow for smooth integration with existing
long-tailed recognition methods, facilitating a further performance boost.
Extensive experiments on various long-tailed benchmarks demonstrate the
effectiveness of the proposed H2T. The source code is available at
https://github.com/Keke921/H2T.
Related papers
- Class-Imbalanced Semi-Supervised Learning for Large-Scale Point Cloud
Semantic Segmentation via Decoupling Optimization [64.36097398869774]
Semi-supervised learning (SSL) has been an active research topic for large-scale 3D scene understanding.
The existing SSL-based methods suffer from severe training bias due to class imbalance and long-tail distributions of the point cloud data.
We introduce a new decoupling optimization framework, which disentangles feature representation learning and classifier in an alternative optimization manner to shift the bias decision boundary effectively.
arXiv Detail & Related papers (2024-01-13T04:16:40Z) - LCReg: Long-Tailed Image Classification with Latent Categories based
Recognition [81.5551335554507]
We propose the Latent Categories based long-tail Recognition (LCReg) method.
Our hypothesis is that common latent features shared by head and tail classes can be used to improve feature representation.
Specifically, we learn a set of class-agnostic latent features shared by both head and tail classes, and then use semantic data augmentation on the latent features to implicitly increase the diversity of the training sample.
arXiv Detail & Related papers (2023-09-13T02:03:17Z) - Dual Compensation Residual Networks for Class Imbalanced Learning [98.35401757647749]
We propose Dual Compensation Residual Networks to better fit both tail and head classes.
An important factor causing overfitting is that there is severe feature drift between training and test data on tail classes.
We also propose a Residual Balanced Multi-Proxies classifier to alleviate the under-fitting issue.
arXiv Detail & Related papers (2023-08-25T04:06:30Z) - Head-Tail Cooperative Learning Network for Unbiased Scene Graph
Generation [30.467562472064177]
Current unbiased Scene Graph Generation (SGG) methods ignore the substantial sacrifice in the prediction of head predicates.
We propose a model-agnostic Head-Tail Collaborative Learning network that includes head-prefer and tail-prefer feature representation branches.
Our method achieves higher mean Recall with a minimal sacrifice in Recall and achieves a new state-of-the-art overall performance.
arXiv Detail & Related papers (2023-08-23T10:29:25Z) - Constructing Balance from Imbalance for Long-tailed Image Recognition [50.6210415377178]
The imbalance between majority (head) classes and minority (tail) classes severely skews the data-driven deep neural networks.
Previous methods tackle with data imbalance from the viewpoints of data distribution, feature space, and model design.
We propose a concise paradigm by progressively adjusting label space and dividing the head classes and tail classes.
Our proposed model also provides a feature evaluation method and paves the way for long-tailed feature learning.
arXiv Detail & Related papers (2022-08-04T10:22:24Z) - Dual-branch Hybrid Learning Network for Unbiased Scene Graph Generation [87.13847750383778]
We propose a Dual-branch Hybrid Learning network (DHL) to take care of both head predicates and tail ones for Scene Graph Generation (SGG)
We show that our approach achieves a new state-of-the-art performance on VG and GQA datasets.
arXiv Detail & Related papers (2022-07-16T11:53:50Z) - Deep Representation Learning on Long-tailed Data: A Learnable Embedding
Augmentation Perspective [17.602607883721973]
In the deep feature space, the head classes and the tail classes present different distribution patterns.
We propose to construct each feature into a "feature cloud"
It allows each tail sample to push the samples from other classes far away, recovering the intra-class diversity of tail classes.
arXiv Detail & Related papers (2020-02-25T12:38:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.