From Majority to Minority: A Diffusion-based Augmentation for Underrepresented Groups in Skin Lesion Analysis
- URL: http://arxiv.org/abs/2406.18375v2
- Date: Tue, 30 Jul 2024 13:43:54 GMT
- Title: From Majority to Minority: A Diffusion-based Augmentation for Underrepresented Groups in Skin Lesion Analysis
- Authors: Janet Wang, Yunsung Chung, Zhengming Ding, Jihun Hamm,
- Abstract summary: AI-based diagnoses have demonstrated dermatologist-level performance in classifying skin cancer.
Such systems are prone to under-performing when tested on data from minority groups that lack sufficient representation in the training sets.
We propose an effective diffusion-based augmentation framework that maximizes the use of rich information from majority groups to benefit minority groups.
- Score: 43.1078084014722
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: AI-based diagnoses have demonstrated dermatologist-level performance in classifying skin cancer. However, such systems are prone to under-performing when tested on data from minority groups that lack sufficient representation in the training sets. Although data collection and annotation offer the best means for promoting minority groups, these processes are costly and time-consuming. Prior works have suggested that data from majority groups may serve as a valuable information source to supplement the training of diagnosis tools for minority groups. In this work, we propose an effective diffusion-based augmentation framework that maximizes the use of rich information from majority groups to benefit minority groups. Using groups with different skin types as a case study, our results show that the proposed framework can generate synthetic images that improve diagnostic results for the minority groups, even when there is little or no reference data from these target groups. The practical value of our work is evident in medical imaging analysis, where under-diagnosis persists as a problem for certain groups due to insufficient representation.
Related papers
- Unsupervised Search for Ethnic Minorities' Medical Segmentation Training Set [5.880582406602758]
This article investigates the critical issue of dataset bias in medical imaging, with a particular emphasis on racial disparities.
Our analysis reveals that medical segmentation datasets are significantly biased, primarily influenced by the demographic composition of their collection sites.
We propose a novel training set search strategy aimed at reducing these biases by focusing on underrepresented racial groups.
arXiv Detail & Related papers (2025-01-05T05:04:47Z) - How does promoting the minority fraction affect generalization? A theoretical study of the one-hidden-layer neural network on group imbalance [64.1656365676171]
Group imbalance has been a known problem in empirical risk minimization.
This paper quantifies the impact of individual groups on the sample complexity, the convergence rate, and the average and group-level testing performance.
arXiv Detail & Related papers (2024-03-12T04:38:05Z) - Adversarial Reweighting Guided by Wasserstein Distance for Bias
Mitigation [24.160692009892088]
Under-representation of minorities in the data makes the disparate treatment of subpopulations difficult to deal with during learning.
We propose a novel adversarial reweighting method to address such emphrepresentation bias.
arXiv Detail & Related papers (2023-11-21T15:46:11Z) - Group Distributionally Robust Knowledge Distillation [16.058258572636376]
Sub-population shifts are a common scenario in medical imaging analysis.
In this paper, we propose a group-aware distillation loss.
We empirically validate our method, GroupDistil on two benchmark datasets.
arXiv Detail & Related papers (2023-11-01T12:25:02Z) - Ambiguous Medical Image Segmentation using Diffusion Models [60.378180265885945]
We introduce a single diffusion model-based approach that produces multiple plausible outputs by learning a distribution over group insights.
Our proposed model generates a distribution of segmentation masks by leveraging the inherent sampling process of diffusion.
Comprehensive results show that our proposed approach outperforms existing state-of-the-art ambiguous segmentation networks.
arXiv Detail & Related papers (2023-04-10T17:58:22Z) - Outlier-Robust Group Inference via Gradient Space Clustering [50.87474101594732]
Existing methods can improve the worst-group performance, but they require group annotations, which are often expensive and sometimes infeasible to obtain.
We address the problem of learning group annotations in the presence of outliers by clustering the data in the space of gradients of the model parameters.
We show that data in the gradient space has a simpler structure while preserving information about minority groups and outliers, making it suitable for standard clustering methods like DBSCAN.
arXiv Detail & Related papers (2022-10-13T06:04:43Z) - How does overparametrization affect performance on minority groups? [39.54853544590893]
We show that over parameterization always improves minority group performance.
In a setting in which the regression functions for the majority and minority groups are different, we show that over parameterization always improves minority group performance.
arXiv Detail & Related papers (2022-06-07T18:00:52Z) - Towards Group Robustness in the presence of Partial Group Labels [61.33713547766866]
spurious correlations between input samples and the target labels wrongly direct the neural network predictions.
We propose an algorithm that optimize for the worst-off group assignments from a constraint set.
We show improvements in the minority group's performance while preserving overall aggregate accuracy across groups.
arXiv Detail & Related papers (2022-01-10T22:04:48Z) - Inclusive GAN: Improving Data and Minority Coverage in Generative Models [101.67587566218928]
We formalize the problem of minority inclusion as one of data coverage.
We then propose to improve data coverage by harmonizing adversarial training with reconstructive generation.
We develop an extension that allows explicit control over the minority subgroups that the model should ensure to include.
arXiv Detail & Related papers (2020-04-07T13:31:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.