Algorithmic Bias, Generalist Models,and Clinical Medicine
- URL: http://arxiv.org/abs/2305.04008v1
- Date: Sat, 6 May 2023 10:48:51 GMT
- Title: Algorithmic Bias, Generalist Models,and Clinical Medicine
- Authors: Geoff Keeling
- Abstract summary: The dominant paradigm in clinical machine learning is narrow in the sense that models are trained on biomedical datasets for particular clinical tasks.
The emerging paradigm is generalist in the sense that general-purpose language models such as Google's BERT and PaLM are increasingly being adapted for clinical use cases.
Many of these next-generation models provide substantial performance gains over prior clinical models, but at the same time introduce novel kinds of algorithmic bias.
This paper articulates how and in what respects biases in generalist models differ from biases in prior clinical models, and draws out practical recommendations for algorithmic bias mitigation.
- Score: 1.9143819780453073
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The technical landscape of clinical machine learning is shifting in ways that
destabilize pervasive assumptions about the nature and causes of algorithmic
bias. On one hand, the dominant paradigm in clinical machine learning is narrow
in the sense that models are trained on biomedical datasets for particular
clinical tasks such as diagnosis and treatment recommendation. On the other
hand, the emerging paradigm is generalist in the sense that general-purpose
language models such as Google's BERT and PaLM are increasingly being adapted
for clinical use cases via prompting or fine-tuning on biomedical datasets.
Many of these next-generation models provide substantial performance gains over
prior clinical models, but at the same time introduce novel kinds of
algorithmic bias and complicate the explanatory relationship between
algorithmic biases and biases in training data. This paper articulates how and
in what respects biases in generalist models differ from biases in prior
clinical models, and draws out practical recommendations for algorithmic bias
mitigation.
Related papers
- Learning Personalized Treatment Decisions in Precision Medicine: Disentangling Treatment Assignment Bias in Counterfactual Outcome Prediction and Biomarker Identification [0.05277756703318046]
This study models various types of treatment assignment biases using mutual information.
By incorporating empirical biological mechanisms, we create a more realistic benchmark that reflects the complexities of real-world data.
arXiv Detail & Related papers (2024-10-01T08:47:29Z) - Towards a clinically accessible radiology foundation model: open-access and lightweight, with automated evaluation [113.5002649181103]
Training open-source small multimodal models (SMMs) to bridge competency gaps for unmet clinical needs in radiology.
For training, we assemble a large dataset of over 697 thousand radiology image-text pairs.
For evaluation, we propose CheXprompt, a GPT-4-based metric for factuality evaluation, and demonstrate its parity with expert evaluation.
The inference of LlaVA-Rad is fast and can be performed on a single V100 GPU in private settings, offering a promising state-of-the-art tool for real-world clinical applications.
arXiv Detail & Related papers (2024-03-12T18:12:02Z) - Patient Aware Active Learning for Fine-Grained OCT Classification [12.89552245538411]
We propose a framework that incorporates clinical insights into the sample selection process of active learning.
Our medically interpretable active learning framework captures diverse disease manifestations from patients to improve performance of OCT classification.
arXiv Detail & Related papers (2022-06-23T05:47:51Z) - Benchmarking Heterogeneous Treatment Effect Models through the Lens of
Interpretability [82.29775890542967]
Estimating personalized effects of treatments is a complex, yet pervasive problem.
Recent developments in the machine learning literature on heterogeneous treatment effect estimation gave rise to many sophisticated, but opaque, tools.
We use post-hoc feature importance methods to identify features that influence the model's predictions.
arXiv Detail & Related papers (2022-06-16T17:59:05Z) - Pseudo Bias-Balanced Learning for Debiased Chest X-ray Classification [57.53567756716656]
We study the problem of developing debiased chest X-ray diagnosis models without knowing exactly the bias labels.
We propose a novel algorithm, pseudo bias-balanced learning, which first captures and predicts per-sample bias labels.
Our proposed method achieved consistent improvements over other state-of-the-art approaches.
arXiv Detail & Related papers (2022-03-18T11:02:18Z) - TrialGraph: Machine Intelligence Enabled Insight from Graph Modelling of
Clinical Trials [0.0]
We introduce a curated clinical trial data set compiled from the CT.gov, AACT and TrialTrove databases (n=1191 trials; representing one million patients)
We then detail the mathematical basis and implementation of a selection of graph machine learning algorithms.
We trained these models to predict side effect information for a clinical trial given information on the disease, existing medical conditions, and treatment.
arXiv Detail & Related papers (2021-12-15T15:36:57Z) - What Do You See in this Patient? Behavioral Testing of Clinical NLP
Models [69.09570726777817]
We introduce an extendable testing framework that evaluates the behavior of clinical outcome models regarding changes of the input.
We show that model behavior varies drastically even when fine-tuned on the same data and that allegedly best-performing models have not always learned the most medically plausible patterns.
arXiv Detail & Related papers (2021-11-30T15:52:04Z) - Clinical Validation of Single-Chamber Model-Based Algorithms Used to
Estimate Respiratory Compliance [2.9511531830032083]
We establish an open, clinically validated dataset of mechanical lungs and nearly 40,000 breaths from 18 intubated patients.
Next, we evaluate 15 different algorithms that use the "single chamber" model of estimating respiratory compliance.
In particular, we explore algorithm performance under four different types of patient ventilator asynchrony.
arXiv Detail & Related papers (2021-09-19T07:34:15Z) - Adversarial Sample Enhanced Domain Adaptation: A Case Study on
Predictive Modeling with Electronic Health Records [57.75125067744978]
We propose a data augmentation method to facilitate domain adaptation.
adversarially generated samples are used during domain adaptation.
Results confirm the effectiveness of our method and the generality on different tasks.
arXiv Detail & Related papers (2021-01-13T03:20:20Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - "Name that manufacturer". Relating image acquisition bias with task
complexity when training deep learning models: experiments on head CT [0.0]
We analyze how the distribution of scanner manufacturers in a dataset can contribute to the overall bias of deep learning models.
We demonstrate that CNNs can learn to distinguish the imaging scanner manufacturer and that this bias can substantially impact model performance.
arXiv Detail & Related papers (2020-08-19T16:05:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.