A Survey of Incremental Transfer Learning: Combining Peer-to-Peer
  Federated Learning and Domain Incremental Learning for Multicenter
  Collaboration
        - URL: http://arxiv.org/abs/2309.17192v1
- Date: Fri, 29 Sep 2023 12:43:21 GMT
- Title: A Survey of Incremental Transfer Learning: Combining Peer-to-Peer
  Federated Learning and Domain Incremental Learning for Multicenter
  Collaboration
- Authors: Yixing Huang, Christoph Bert, Ahmed Gomaa, Rainer Fietkau, Andreas
  Maier, Florian Putz
- Abstract summary: Data privacy constraints impede the development of high performance deep learning models from multicenter collaboration.
Weight transfer methods share intermediate model weights without raw data and hence can bypass data privacy restrictions.
Performance drops are observed when the model is transferred from one center to the next because of forgetting the problem.
In this work, a conventional domain/task incremental learning framework is adapted for incremental transfer learning.
- Score: 6.064986446665161
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   Due to data privacy constraints, data sharing among multiple clinical centers
is restricted, which impedes the development of high performance deep learning
models from multicenter collaboration. Naive weight transfer methods share
intermediate model weights without raw data and hence can bypass data privacy
restrictions. However, performance drops are typically observed when the model
is transferred from one center to the next because of the forgetting problem.
Incremental transfer learning, which combines peer-to-peer federated learning
and domain incremental learning, can overcome the data privacy issue and
meanwhile preserve model performance by using continual learning techniques. In
this work, a conventional domain/task incremental learning framework is adapted
for incremental transfer learning. A comprehensive survey on the efficacy of
different regularization-based continual learning methods for multicenter
collaboration is performed. The influences of data heterogeneity, classifier
head setting, network optimizer, model initialization, center order, and weight
transfer type have been investigated thoroughly. Our framework is publicly
accessible to the research community for further development.
 
      
        Related papers
        - Multi-Stage Knowledge Integration of Vision-Language Models for   Continual Learning [79.46570165281084]
 We propose a Multi-Stage Knowledge Integration network (MulKI) to emulate the human learning process in distillation methods.
MulKI achieves this through four stages, including Eliciting Ideas, Adding New Ideas, Distinguishing Ideas, and Making Connections.
Our method demonstrates significant improvements in maintaining zero-shot capabilities while supporting continual learning across diverse downstream tasks.
 arXiv  Detail & Related papers  (2024-11-11T07:36:19Z)
- DiffClass: Diffusion-Based Class Incremental Learning [30.514281721324853]
 Class Incremental Learning (CIL) is challenging due to catastrophic forgetting.
Recent exemplar-free CIL methods attempt to mitigate catastrophic forgetting by synthesizing previous task data.
We propose a novel exemplar-free CIL method to overcome these issues.
 arXiv  Detail & Related papers  (2024-03-08T03:34:18Z)
- Federated Pruning: Improving Neural Network Efficiency with Federated
  Learning [24.36174705715827]
 We propose Federated Pruning to train a reduced model under the federated setting.
We explore different pruning schemes and provide empirical evidence of the effectiveness of our methods.
 arXiv  Detail & Related papers  (2022-09-14T00:48:37Z)
- FedILC: Weighted Geometric Mean and Invariant Gradient Covariance for
  Federated Learning on Non-IID Data [69.0785021613868]
 Federated learning is a distributed machine learning approach which enables a shared server model to learn by aggregating the locally-computed parameter updates with the training data from spatially-distributed client silos.
We propose the Federated Invariant Learning Consistency (FedILC) approach, which leverages the gradient covariance and the geometric mean of Hessians to capture both inter-silo and intra-silo consistencies.
This is relevant to various fields such as medical healthcare, computer vision, and the Internet of Things (IoT)
 arXiv  Detail & Related papers  (2022-05-19T03:32:03Z)
- DQRE-SCnet: A novel hybrid approach for selecting users in Federated
  Learning with Deep-Q-Reinforcement Learning based on Spectral Clustering [1.174402845822043]
 Machine learning models based on sensitive data in the real-world promise advances in areas ranging from medical screening to disease outbreaks, agriculture, industry, defense science, and more.
In many applications, learning participant communication rounds benefit from collecting their own private data sets, teaching detailed machine learning models on the real data, and sharing the benefits of using these models.
Due to existing privacy and security concerns, most people avoid sensitive data sharing for training. Without each user demonstrating their local data to a central server, Federated Learning allows various parties to train a machine learning algorithm on their shared data jointly.
 arXiv  Detail & Related papers  (2021-11-07T15:14:29Z)
- Quasi-Global Momentum: Accelerating Decentralized Deep Learning on
  Heterogeneous Data [77.88594632644347]
 Decentralized training of deep learning models is a key element for enabling data privacy and on-device learning over networks.
In realistic learning scenarios, the presence of heterogeneity across different clients' local datasets poses an optimization challenge.
We propose a novel momentum-based method to mitigate this decentralized training difficulty.
 arXiv  Detail & Related papers  (2021-02-09T11:27:14Z)
- CosSGD: Nonlinear Quantization for Communication-efficient Federated
  Learning [62.65937719264881]
 Federated learning facilitates learning across clients without transferring local data on these clients to a central server.
We propose a nonlinear quantization for compressed gradient descent, which can be easily utilized in federated learning.
Our system significantly reduces the communication cost by up to three orders of magnitude, while maintaining convergence and accuracy of the training process.
 arXiv  Detail & Related papers  (2020-12-15T12:20:28Z)
- Siloed Federated Learning for Multi-Centric Histopathology Datasets [0.17842332554022694]
 This paper proposes a novel federated learning approach for deep learning architectures in the medical domain.
Local-statistic batch normalization (BN) layers are introduced, resulting in collaboratively-trained, yet center-specific models.
We benchmark the proposed method on the classification of tumorous histopathology image patches extracted from the Camelyon16 and Camelyon17 datasets.
 arXiv  Detail & Related papers  (2020-08-17T15:49:30Z)
- Minimax Lower Bounds for Transfer Learning with Linear and One-hidden
  Layer Neural Networks [27.44348371795822]
 We develop a statistical minimax framework to characterize the limits of transfer learning.
We derive a lower-bound for the target generalization error achievable by any algorithm as a function of the number of labeled source and target data.
 arXiv  Detail & Related papers  (2020-06-16T22:49:26Z)
- Task-Feature Collaborative Learning with Application to Personalized
  Attribute Prediction [166.87111665908333]
 We propose a novel multi-task learning method called Task-Feature Collaborative Learning (TFCL)
Specifically, we first propose a base model with a heterogeneous block-diagonal structure regularizer to leverage the collaborative grouping of features and tasks.
As a practical extension, we extend the base model by allowing overlapping features and differentiating the hard tasks.
 arXiv  Detail & Related papers  (2020-04-29T02:32:04Z)
- Federated Residual Learning [53.77128418049985]
 We study a new form of federated learning where the clients train personalized local models and make predictions jointly with the server-side shared model.
Using this new federated learning framework, the complexity of the central shared model can be minimized while still gaining all the performance benefits that joint training provides.
 arXiv  Detail & Related papers  (2020-03-28T19:55:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.