Density-Aware Personalized Training for Risk Prediction in Imbalanced
Medical Data
- URL: http://arxiv.org/abs/2207.11382v1
- Date: Sat, 23 Jul 2022 00:39:53 GMT
- Title: Density-Aware Personalized Training for Risk Prediction in Imbalanced
Medical Data
- Authors: Zepeng Huo, Xiaoning Qian, Shuai Huang, Zhangyang Wang, Bobak
Mortazavi
- Abstract summary: Training models with imbalance rate (class density discrepancy) may lead to suboptimal prediction.
We propose a framework for training models for this imbalance issue.
We demonstrate our model's improved performance in real-world medical datasets.
- Score: 89.79617468457393
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Medical events of interest, such as mortality, often happen at a low rate in
electronic medical records, as most admitted patients survive. Training models
with this imbalance rate (class density discrepancy) may lead to suboptimal
prediction. Traditionally this problem is addressed through ad-hoc methods such
as resampling or reweighting but performance in many cases is still limited. We
propose a framework for training models for this imbalance issue: 1) we first
decouple the feature extraction and classification process, adjusting training
batches separately for each component to mitigate bias caused by class density
discrepancy; 2) we train the network with both a density-aware loss and a
learnable cost matrix for misclassifications. We demonstrate our model's
improved performance in real-world medical datasets (TOPCAT and MIMIC-III) to
show improved AUC-ROC, AUC-PRC, Brier Skill Score compared with the baselines
in the domain.
Related papers
- Addressing Imbalance for Class Incremental Learning in Medical Image Classification [14.242875524728495]
We introduce two plug-in methods to mitigate the adverse effects of imbalance.
First, we propose a CIL-balanced classification loss to mitigate the classification bias toward majority classes.
Second, we propose a distribution margin loss that not only alleviates the inter-class overlap in embedding space but also enforces the intra-class compactness.
arXiv Detail & Related papers (2024-07-18T17:59:44Z) - Towards Clinician-Preferred Segmentation: Leveraging Human-in-the-Loop for Test Time Adaptation in Medical Image Segmentation [10.65123164779962]
Deep learning-based medical image segmentation models often face performance degradation when deployed across various medical centers.
We propose a novel Human-in-the-loop TTA framework that capitalizes on the largely overlooked potential of clinician-corrected predictions.
Our framework conceives a divergence loss, designed specifically to diminish the prediction divergence instigated by domain disparities.
arXiv Detail & Related papers (2024-05-14T02:02:15Z) - MCRAGE: Synthetic Healthcare Data for Fairness [3.0089659534785853]
We propose Minority Class Rebalancing through Augmentation by Generative modeling (MCRAGE) to augment imbalanced datasets.
MCRAGE involves training a Denoising Diffusion Probabilistic Model (CDDPM) capable of generating high-quality synthetic EHR samples from underrepresented classes.
We use this synthetic data to augment the existing imbalanced dataset, resulting in a more balanced distribution across all classes.
arXiv Detail & Related papers (2023-10-27T19:02:22Z) - TWINS: A Fine-Tuning Framework for Improved Transferability of
Adversarial Robustness and Generalization [89.54947228958494]
This paper focuses on the fine-tuning of an adversarially pre-trained model in various classification tasks.
We propose a novel statistics-based approach, Two-WIng NormliSation (TWINS) fine-tuning framework.
TWINS is shown to be effective on a wide range of image classification datasets in terms of both generalization and robustness.
arXiv Detail & Related papers (2023-03-20T14:12:55Z) - Machine Learning Performance Analysis to Predict Stroke Based on
Imbalanced Medical Dataset [0.0]
Cerebral stroke, the second most substantial cause of death universally, has been a primary public health concern over the last few years.
Medical dataset are frequently unbalanced in their class label, with a tendency to poorly predict minority classes.
In this paper, the potential risk factors for stroke are investigated.
Four distinctive approaches are applied to improve the classification of the minority class in the imbalanced stroke dataset.
arXiv Detail & Related papers (2022-11-14T17:36:46Z) - Competence-based Multimodal Curriculum Learning for Medical Report
Generation [98.10763792453925]
We propose a Competence-based Multimodal Curriculum Learning framework ( CMCL) to alleviate the data bias and make best use of available data.
Specifically, CMCL simulates the learning process of radiologists and optimize the model in a step by step manner.
Experiments on the public IU-Xray and MIMIC-CXR datasets show that CMCL can be incorporated into existing models to improve their performance.
arXiv Detail & Related papers (2022-06-24T08:16:01Z) - Deep Reinforcement Learning for Multi-class Imbalanced Training [64.9100301614621]
We introduce an imbalanced classification framework, based on reinforcement learning, for training extremely imbalanced data sets.
We formulate a custom reward function and episode-training procedure, specifically with the added capability of handling multi-class imbalanced training.
Using real-world clinical case studies, we demonstrate that our proposed framework outperforms current state-of-the-art imbalanced learning methods.
arXiv Detail & Related papers (2022-05-24T13:39:59Z) - Estimating and Improving Fairness with Adversarial Learning [65.99330614802388]
We propose an adversarial multi-task training strategy to simultaneously mitigate and detect bias in the deep learning-based medical image analysis system.
Specifically, we propose to add a discrimination module against bias and a critical module that predicts unfairness within the base classification model.
We evaluate our framework on a large-scale public-available skin lesion dataset.
arXiv Detail & Related papers (2021-03-07T03:10:32Z) - The unreasonable effectiveness of Batch-Norm statistics in addressing
catastrophic forgetting across medical institutions [8.244654685687054]
We investigate trade-off between model refinement and retention of previously learned knowledge.
We propose a simple yet effective approach, adapting Elastic weight consolidation (EWC) using the global batch normalization statistics of the original dataset.
arXiv Detail & Related papers (2020-11-16T16:57:05Z) - Predictive Modeling of ICU Healthcare-Associated Infections from
Imbalanced Data. Using Ensembles and a Clustering-Based Undersampling
Approach [55.41644538483948]
This work is focused on both the identification of risk factors and the prediction of healthcare-associated infections in intensive-care units.
The aim is to support decision making addressed at reducing the incidence rate of infections.
arXiv Detail & Related papers (2020-05-07T16:13:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.