Curriculum Learning Meets Weakly Supervised Modality Correlation
Learning
- URL: http://arxiv.org/abs/2212.07619v1
- Date: Thu, 15 Dec 2022 05:11:04 GMT
- Title: Curriculum Learning Meets Weakly Supervised Modality Correlation
Learning
- Authors: Sijie Mai, Ya Sun, Haifeng Hu
- Abstract summary: We introduce curriculum learning into weakly supervised modality correlation learning.
To assist the correlation learning, we feed the training pairs to the model according to difficulty.
The proposed method reaches state-of-the-art performance on multimodal sentiment analysis.
- Score: 26.754095474534534
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In the field of multimodal sentiment analysis (MSA), a few studies have
leveraged the inherent modality correlation information stored in samples for
self-supervised learning. However, they feed the training pairs in a random
order without consideration of difficulty. Without human annotation, the
generated training pairs of self-supervised learning often contain noise. If
noisy or hard pairs are used for training at the easy stage, the model might be
stuck in bad local optimum. In this paper, we inject curriculum learning into
weakly supervised modality correlation learning. The weakly supervised
correlation learning leverages the label information to generate scores for
negative pairs to learn a more discriminative embedding space, where negative
pairs are defined as two unimodal embeddings from different samples. To assist
the correlation learning, we feed the training pairs to the model according to
difficulty by the proposed curriculum learning, which consists of elaborately
designed scoring and feeding functions. The scoring function computes the
difficulty of pairs using pre-trained and current correlation predictors, where
the pairs with large losses are defined as hard pairs. Notably, the hardest
pairs are discarded in our algorithm, which are assumed as noisy pairs.
Moreover, the feeding function takes the difference of correlation losses as
feedback to determine the feeding actions (`stay', `step back', or `step
forward'). The proposed method reaches state-of-the-art performance on MSA.
Related papers
- FaLW: A Forgetting-aware Loss Reweighting for Long-tailed Unlearning [24.734154431191538]
FaLW is a plug-and-play, instance-wise dynamic loss reweighting method.<n>It assesses the unlearning state of each sample by comparing its predictive probability to the distribution of unseen data from the same class.<n>Experiments demonstrate that FaLW achieves superior performance.
arXiv Detail & Related papers (2026-01-26T16:21:01Z) - Contrastive Learning with Nasty Noise [0.0]
This work analyzes the theoretical limits of contrastive learning under nasty noise, where an adversary modifies or replaces training samples.
Data-dependent sample complexity bounds based on the l2-distance function are derived.
arXiv Detail & Related papers (2025-02-25T05:55:15Z) - Deep Bayesian Active Learning-to-Rank with Relative Annotation for Estimation of Ulcerative Colitis Severity [10.153691271169745]
We propose a deep Bayesian active learning-to-rank that automatically selects appropriate pairs for relative annotation.
Our method preferentially annotates unlabeled pairs with high learning efficiency from the model uncertainty of the samples.
arXiv Detail & Related papers (2024-09-08T02:19:40Z) - Probabilistic Contrastive Learning for Long-Tailed Visual Recognition [78.70453964041718]
Longtailed distributions frequently emerge in real-world data, where a large number of minority categories contain a limited number of samples.
Recent investigations have revealed that supervised contrastive learning exhibits promising potential in alleviating the data imbalance.
We propose a novel probabilistic contrastive (ProCo) learning algorithm that estimates the data distribution of the samples from each class in the feature space.
arXiv Detail & Related papers (2024-03-11T13:44:49Z) - Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation [63.180725016463974]
Cross-modal retrieval relies on well-matched large-scale datasets that are laborious in practice.
We introduce a novel noisy correspondence learning framework, namely textbfSelf-textbfReinforcing textbfErrors textbfMitigation (SREM)
arXiv Detail & Related papers (2023-12-27T09:03:43Z) - Uncertainty-Aware Bootstrap Learning for Joint Extraction on
Distantly-Supervised Data [36.54640096189285]
bootstrap learning is motivated by the intuition that the higher uncertainty of an instance, the more likely the model confidence is inconsistent with the ground truths.
We first explore instance-level data uncertainty to create an initial high-confident examples.
During bootstrap learning, we propose self-ensembling as a regularizer to alleviate inter-model uncertainty produced by noisy labels.
arXiv Detail & Related papers (2023-05-05T20:06:11Z) - Uncertainty Estimation for Language Reward Models [5.33024001730262]
Language models can learn a range of capabilities from unsupervised training on text corpora.
It is often easier for humans to choose between options than to provide labeled data, and prior work has achieved state-of-the-art performance by training a reward model from such preference comparisons.
We seek to address these problems via uncertainty estimation, which can improve sample efficiency and robustness using active learning and risk-averse reinforcement learning.
arXiv Detail & Related papers (2022-03-14T20:13:21Z) - BatchFormer: Learning to Explore Sample Relationships for Robust
Representation Learning [93.38239238988719]
We propose to enable deep neural networks with the ability to learn the sample relationships from each mini-batch.
BatchFormer is applied into the batch dimension of each mini-batch to implicitly explore sample relationships during training.
We perform extensive experiments on over ten datasets and the proposed method achieves significant improvements on different data scarcity applications.
arXiv Detail & Related papers (2022-03-03T05:31:33Z) - Examining and Combating Spurious Features under Distribution Shift [94.31956965507085]
We define and analyze robust and spurious representations using the information-theoretic concept of minimal sufficient statistics.
We prove that even when there is only bias of the input distribution, models can still pick up spurious features from their training data.
Inspired by our analysis, we demonstrate that group DRO can fail when groups do not directly account for various spurious correlations.
arXiv Detail & Related papers (2021-06-14T05:39:09Z) - Can Active Learning Preemptively Mitigate Fairness Issues? [66.84854430781097]
dataset bias is one of the prevailing causes of unfairness in machine learning.
We study whether models trained with uncertainty-based ALs are fairer in their decisions with respect to a protected class.
We also explore the interaction of algorithmic fairness methods such as gradient reversal (GRAD) and BALD.
arXiv Detail & Related papers (2021-04-14T14:20:22Z) - Contrastive learning of strong-mixing continuous-time stochastic
processes [53.82893653745542]
Contrastive learning is a family of self-supervised methods where a model is trained to solve a classification task constructed from unlabeled data.
We show that a properly constructed contrastive learning task can be used to estimate the transition kernel for small-to-mid-range intervals in the diffusion case.
arXiv Detail & Related papers (2021-03-03T23:06:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.