Related papers: HPCR: Holistic Proxy-based Contrastive Replay for Online Continual Learning

URL: http://arxiv.org/abs/2309.15038v2
Date: Fri, 03 Jan 2025 04:44:02 GMT
Title: HPCR: Holistic Proxy-based Contrastive Replay for Online Continual Learning
Authors: Huiwei Lin, Shanshan Feng, Baoquan Zhang, Xutao Li, Yunming Ye
Abstract summary: Online continual learning is aimed at developing a neural network that continuously learns new data from a single pass over an online data stream. Existing replay-based methods alleviate forgetting by replaying partial old data in a proxy-based or contrastive-based replay manner.
Score: 23.70942253222081
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Online continual learning, aimed at developing a neural network that continuously learns new data from a single pass over an online data stream, generally suffers from catastrophic forgetting. Existing replay-based methods alleviate forgetting by replaying partial old data in a proxy-based or contrastive-based replay manner, each with its own shortcomings. Our previous work proposes a novel replay-based method called proxy-based contrastive replay (PCR), which handles the shortcomings by achieving complementary advantages of both replay manners. In this work, we further conduct gradient and limitation analysis of PCR. The analysis results show that PCR still can be further improved in feature extraction, generalization, and anti-forgetting capabilities of the model. Hence, we develop a more advanced method named holistic proxy-based contrastive replay (HPCR). HPCR consists of three components, each tackling one of the limitations of PCR. The contrastive component conditionally incorporates anchor-to-sample pairs to PCR, improving the feature extraction ability. The second is a temperature component that decouples the temperature coefficient into two parts based on their gradient impacts and sets different values for them to enhance the generalization ability. The third is a distillation component that constrains the learning process with additional loss terms to improve the anti-forgetting ability. Experiments on four datasets consistently demonstrate the superiority of HPCR over various state-of-the-art methods.

Related papers

A Plug-and-Play Method for Guided Multi-contrast MRI Reconstruction based on Content/Style Modeling [1.1622133377827824]
We propose a modular two-stage approach for guided reconstruction. In a radiological task, MUNIT allowed 33.3% more acceleration over clinical reconstruction at diagnostic quality.
arXiv Detail & Related papers (2024-09-20T13:08:51Z)
INN-PAR: Invertible Neural Network for PPG to ABP Reconstruction [9.127220498800645]
We introduce an invertible neural network for PPG to ABP reconstruction (INN-PAR). INN-PAR efficiently captures both forward and inverse mappings simultaneously, thereby preventing information loss. We propose a multi-scale convolution module (MSCM) within the invertible block, enabling the model to learn features across multiple scales effectively.
arXiv Detail & Related papers (2024-09-13T17:48:48Z)
REAL: Representation Enhanced Analytic Learning for Exemplar-free Class-incremental Learning [12.197327462627912]
We propose representation enhanced analytic learning (REAL) for exemplar-free class-incremental learning (EFCIL). REAL constructs a dual-stream base pretraining (DS-BPT) and a representation enhancing distillation (RED) process to enhance the representation of the extractor. Our method addresses the issue of insufficient discriminability in representations of unseen data caused by a frozen backbone in the existing AL-based CIL.
arXiv Detail & Related papers (2024-03-20T11:48:10Z)
Retrosynthesis prediction enhanced by in-silico reaction data augmentation [66.5643280109899]
We present RetroWISE, a framework that employs a base model inferred from real paired data to perform in-silico reaction generation and augmentation. On three benchmark datasets, RetroWISE achieves the best overall performance against state-of-the-art models.
arXiv Detail & Related papers (2024-01-31T07:40:37Z)
Dealing with Cross-Task Class Discrimination in Online Continual Learning [54.31411109376545]
This paper argues for another challenge in class-incremental learning (CIL): how to establish decision boundaries between the classes of the new task and old tasks with no (or limited) access to the old task data. A replay method saves a small amount of data (replay data) from previous tasks. When a batch of current task data arrives, the system jointly trains on the new data and some sampled replay data. This paper argues that the replay approach also has a dynamic training bias issue, which reduces the effectiveness of the replay data in solving the cross-task class discrimination (CTCD) problem.
arXiv Detail & Related papers (2023-05-24T02:52:30Z)
PCR: Proxy-based Contrastive Replay for Online Class-Incremental Continual Learning [16.67238259139417]
Existing replay-based methods effectively alleviate catastrophic forgetting by saving and replaying part of old data in a proxy-based or contrastive-based replay manner. We propose a novel replay-based method called proxy-based contrastive replay (PCR).
arXiv Detail & Related papers (2023-04-10T06:35:19Z)
PartMix: Regularization Strategy to Learn Part Discovery for Visible-Infrared Person Re-identification [76.40417061480564]
We present a novel data augmentation technique, dubbed PartMix, for part-based Visible-Infrared person Re-IDentification (VI-ReID) models. We synthesize the augmented samples by mixing the part descriptors across the modalities to improve the performance of part-based VI-ReID models.
arXiv Detail & Related papers (2023-04-04T05:21:23Z)
Dataset Distillation via Factorization [58.8114016318593]
We introduce a dataset factorization approach, termed HaBa, which is a plug-and-play strategy portable to any existing dataset distillation (DD) baseline. HaBa explores decomposing a dataset into two components: data Hallucination networks and Bases. Our method can yield significant improvement on downstream classification tasks compared with the previous state of the art, while reducing the total number of compressed parameters by up to 65%.
arXiv Detail & Related papers (2022-10-30T08:36:19Z)
Few-Shot Class-Incremental Learning via Entropy-Regularized Data-Free Replay [52.251188477192336]
Few-shot class-incremental learning (FSCIL) has been proposed aiming to enable a deep learning system to incrementally learn new classes with limited data. We show through empirical results that adopting the data replay is surprisingly favorable. We propose using data-free replay that can synthesize data by a generator without accessing real data.
arXiv Detail & Related papers (2022-07-22T17:30:51Z)
Look Back When Surprised: Stabilizing Reverse Experience Replay for Neural Approximation [7.6146285961466]
We consider the recently developed and theoretically rigorous reverse experience replay (RER). We show via experiments that this performs better than techniques like prioritized experience replay (PER) on various tasks.
arXiv Detail & Related papers (2022-06-07T10:42:02Z)
Face2PPG: An unsupervised pipeline for blood volume pulse extraction from faces [0.456877715768796]
Photoplethysmography (PPG) signals have become a key technology in many fields, such as medicine, well-being, or sports. Our work proposes a set of pipelines to extract PPG signals from the face robustly and reliably.
arXiv Detail & Related papers (2022-02-08T19:06:20Z)
Adaptive Contrast for Image Regression in Computer-Aided Disease Assessment [22.717658723840255]
We propose the first contrastive learning framework for deep image regression, namely AdaCon. AdaCon consists of a feature learning branch via a novel adaptive-margin contrastive loss and a regression prediction branch. We demonstrate the effectiveness of AdaCon on two medical image regression tasks.
arXiv Detail & Related papers (2021-12-22T07:13:02Z)
Self-Supervised Learning for MRI Reconstruction with a Parallel Network Training Framework [24.46388892324129]
The proposed method is flexible and can be employed in any existing deep learning-based method. The effectiveness of the method is evaluated on an open brain MRI dataset.
arXiv Detail & Related papers (2021-09-26T06:09:56Z)
Cross-Site Severity Assessment of COVID-19 from CT Images via Domain Adaptation [64.59521853145368]
Early and accurate severity assessment of Coronavirus disease 2019 (COVID-19) based on computed tomography (CT) images offers a great help to the estimation of intensive care unit event. To augment the labeled data and improve the generalization ability of the classification model, it is necessary to aggregate data from multiple sites. This task faces several challenges including class imbalance between mild and severe infections, domain distribution discrepancy between sites, and presence of heterogeneous features.
arXiv Detail & Related papers (2021-09-08T07:56:51Z)
Always Be Dreaming: A New Approach for Data-Free Class-Incremental Learning [73.24988226158497]
We consider the high-impact problem of Data-Free Class-Incremental Learning (DFCIL). We propose a novel incremental distillation strategy for DFCIL, contributing a modified cross-entropy training and importance-weighted feature distillation. Our method results in up to a 25.1% increase in final task accuracy (absolute difference) compared to SOTA DFCIL methods for common class-incremental benchmarks.
arXiv Detail & Related papers (2021-06-17T17:56:08Z)
Predicting the Binding of SARS-CoV-2 Peptides to the Major Histocompatibility Complex with Recurrent Neural Networks [0.40040974874482094]
We adapt and extend USMPep, a proposed, conceptually simple prediction algorithm based on recurrent neural networks. We evaluate the performance on a recently released SARS-CoV-2 dataset with binding stability measurements.
arXiv Detail & Related papers (2021-04-16T17:16:35Z)
Understanding Self-supervised Learning with Dual Deep Networks [74.92916579635336]
We propose a novel framework to understand contrastive self-supervised learning (SSL) methods that employ dual pairs of deep ReLU networks. We prove that in each SGD update of SimCLR with various loss functions, the weights at each layer are updated by a covariance operator. To further study what role the covariance operator plays and which features are learned in such a process, we model data generation and augmentation processes through a hierarchical latent tree model (HLTM).
arXiv Detail & Related papers (2020-10-01T17:51:49Z)
Simultaneous Estimation of X-ray Back-Scatter and Forward-Scatter using Multi-Task Learning [59.17383024536595]
Back-scatter significantly contributes to patient (skin) dose during complicated interventions. Forward-scattered radiation reduces contrast in projection images and introduces artifacts in 3-D reconstructions. We propose a novel approach combining conventional techniques with learning-based methods to simultaneously estimate the forward-scatter reaching the detector.
arXiv Detail & Related papers (2020-07-08T10:47:37Z)
AutoHR: A Strong End-to-end Baseline for Remote Heart Rate Measurement with Neural Searching [76.4844593082362]
We investigate the reason why existing end-to-end networks perform poorly in challenging conditions and establish a strong baseline for remote HR measurement with neural architecture search (NAS). Comprehensive experiments are performed on three benchmark datasets on both intra-temporal and cross-dataset testing.
arXiv Detail & Related papers (2020-04-26T05:43:21Z)

This list is automatically generated from the titles and abstracts of the papers on this site.