A Study of Age and Sex Bias in Multiple Instance Learning based
Classification of Acute Myeloid Leukemia Subtypes
- URL: http://arxiv.org/abs/2308.12675v1
- Date: Thu, 24 Aug 2023 09:32:46 GMT
- Title: A Study of Age and Sex Bias in Multiple Instance Learning based
Classification of Acute Myeloid Leukemia Subtypes
- Authors: Ario Sadafi, Matthias Hehr, Nassir Navab, Carsten Marr
- Abstract summary: We train multiple MIL models using different levels of sex imbalance in the training set and excluding certain age groups.
We find a significant effect of sex and age bias on the performance of the model for AML subtype classification.
- Score: 44.077241051884926
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurate classification of Acute Myeloid Leukemia (AML) subtypes is crucial
for clinical decision-making and patient care. In this study, we investigate
the potential presence of age and sex bias in AML subtype classification using
Multiple Instance Learning (MIL) architectures. To that end, we train multiple
MIL models using different levels of sex imbalance in the training set and
excluding certain age groups. To assess the sex bias, we evaluate the
performance of the models on male and female test sets. For age bias, models
are tested against underrepresented age groups in the training data. We find a
significant effect of sex and age bias on the performance of the model for AML
subtype classification. Specifically, we observe that females are more likely
to be affected by sex imbalance dataset and certain age groups, such as
patients with 72 to 86 years of age with the RUNX1::RUNX1T1 genetic subtype,
are significantly affected by an age bias present in the training data.
Ensuring inclusivity in the training data is thus essential for generating
reliable and equitable outcomes in AML genetic subtype classification,
ultimately benefiting diverse patient populations.
Related papers
- Dataset Distribution Impacts Model Fairness: Single vs. Multi-Task Learning [2.9530211066840417]
We evaluate the performance of skin lesion classification using ResNet-based CNNs.
We present a linear programming method for generating datasets with varying patient sex and class labels.
arXiv Detail & Related papers (2024-07-24T15:23:26Z) - Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z) - Fairness Evolution in Continual Learning for Medical Imaging [47.52603262576663]
We study the behavior of Continual Learning (CL) strategies in medical imaging regarding classification performance.
We evaluate the Replay, Learning without Forgetting (LwF), LwF, and Pseudo-Label strategies.
LwF and Pseudo-Label exhibit optimal classification performance, but when including fairness metrics in the evaluation, it is clear that Pseudo-Label is less biased.
arXiv Detail & Related papers (2024-04-10T09:48:52Z) - Studying the Effects of Sex-related Differences on Brain Age Prediction
using brain MR Imaging [0.3958317527488534]
We study biases related to sex when developing a machine learning model based on brain magnetic resonance images (MRI)
We investigate the effects of sex by performing brain age prediction considering different experimental designs.
We found disparities in the performance of brain age prediction models when trained on distinct sex subgroups and datasets.
arXiv Detail & Related papers (2023-10-17T20:55:53Z) - Analysing race and sex bias in brain age prediction [18.68533487971233]
We analyse the commonly used ResNet-34 model by conducting a subgroup performance analysis and feature inspection.
Our results reveal statistically significant differences in predictive performance between Black and White, Black and Asian, and male and female subjects.
arXiv Detail & Related papers (2023-09-19T14:40:19Z) - Adapting Machine Learning Diagnostic Models to New Populations Using a Small Amount of Data: Results from Clinical Neuroscience [21.420302408947194]
We develop a weighted empirical risk minimization approach that optimally combines data from a source group to make predictions on a target group.
We apply this method to multi-source data of 15,363 individuals from 20 neuroimaging studies to build ML models for diagnosis of Alzheimer's disease and estimation of brain age.
arXiv Detail & Related papers (2023-08-06T18:05:39Z) - Assessing Social Determinants-Related Performance Bias of Machine
Learning Models: A case of Hyperchloremia Prediction in ICU Population [6.8473641147443995]
We evaluated four classifiers built to predict Hyperchloremia, a condition that often results from aggressive fluids administration in the ICU population.
We observed that adding social determinants features in addition to the lab-based ones improved model performance on all patients.
We urge future researchers to design models that proactively adjust for potential biases and include subgroup reporting.
arXiv Detail & Related papers (2021-11-18T03:58:50Z) - Improving Gender Fairness of Pre-Trained Language Models without
Catastrophic Forgetting [88.83117372793737]
Forgetting information in the original training data may damage the model's downstream performance by a large margin.
We propose GEnder Equality Prompt (GEEP) to improve gender fairness of pre-trained models with less forgetting.
arXiv Detail & Related papers (2021-10-11T15:52:16Z) - Relational Subsets Knowledge Distillation for Long-tailed Retinal
Diseases Recognition [65.77962788209103]
We propose class subset learning by dividing the long-tailed data into multiple class subsets according to prior knowledge.
It enforces the model to focus on learning the subset-specific knowledge.
The proposed framework proved to be effective for the long-tailed retinal diseases recognition task.
arXiv Detail & Related papers (2021-04-22T13:39:33Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.