Learning Optimal Predictive Checklists
- URL: http://arxiv.org/abs/2112.01020v1
- Date: Thu, 2 Dec 2021 07:15:28 GMT
- Title: Learning Optimal Predictive Checklists
- Authors: Haoran Zhang, Quaid Morris, Berk Ustun, Marzyeh Ghassemi
- Abstract summary: We represent predictive checklists as discrete linear classifiers with binary features and unit weights.
We then learn globally optimal predictive checklists from data by solving an integer programming problem.
Our results show that our method can fit simple predictive checklists that perform well and that can easily be customized to obey a rich class of custom constraints.
- Score: 22.91829410102425
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Checklists are simple decision aids that are often used to promote safety and
reliability in clinical applications. In this paper, we present a method to
learn checklists for clinical decision support. We represent predictive
checklists as discrete linear classifiers with binary features and unit
weights. We then learn globally optimal predictive checklists from data by
solving an integer programming problem. Our method allows users to customize
checklists to obey complex constraints, including constraints to enforce group
fairness and to binarize real-valued features at training time. In addition, it
pairs models with an optimality gap that can inform model development and
determine the feasibility of learning sufficiently accurate checklists on a
given dataset. We pair our method with specialized techniques that speed up its
ability to train a predictive checklist that performs well and has a small
optimality gap. We benchmark the performance of our method on seven clinical
classification problems, and demonstrate its practical benefits by training a
short-form checklist for PTSD screening. Our results show that our method can
fit simple predictive checklists that perform well and that can easily be
customized to obey a rich class of custom constraints.
Related papers
- Learning Predictive Checklists with Probabilistic Logic Programming [17.360186431981592]
We propose a novel method for learning predictive checklists from diverse data modalities, such as images and time series.
Our approach relies on probabilistic logic programming, a learning paradigm that enables matching the discrete nature of checklist with continuous-valued data.
We demonstrate that our method outperforms various explainable machine learning techniques on prediction tasks involving image sequences, time series, and clinical notes.
arXiv Detail & Related papers (2024-11-25T09:07:19Z) - Probably Approximately Precision and Recall Learning [62.912015491907994]
Precision and Recall are foundational metrics in machine learning.
One-sided feedback--where only positive examples are observed during training--is inherent in many practical problems.
We introduce a PAC learning framework where each hypothesis is represented by a graph, with edges indicating positive interactions.
arXiv Detail & Related papers (2024-11-20T04:21:07Z) - Socratic Pretraining: Question-Driven Pretraining for Controllable
Summarization [89.04537372465612]
Socratic pretraining is a question-driven, unsupervised pretraining objective designed to improve controllability in summarization tasks.
Our results show that Socratic pretraining cuts task-specific labeled data requirements in half.
arXiv Detail & Related papers (2022-12-20T17:27:10Z) - Learning predictive checklists from continuous medical data [5.37133760455631]
Checklists are highly popular in daily clinical practice due to their combined effectiveness and great interpretability.
Recent works have taken a step in that direction by learning predictive checklists from categorical data.
We show that this extension outperforms a range of explainable machine learning baselines on the prediction of sepsis.
arXiv Detail & Related papers (2022-11-14T02:51:04Z) - Query-Adaptive Predictive Inference with Partial Labels [0.0]
We propose a new methodology to construct predictive sets using only partially labeled data on top of black-box predictive models.
Our experiments highlight the validity of our predictive set construction as well as the attractiveness of a more flexible user-dependent loss framework.
arXiv Detail & Related papers (2022-06-15T01:48:42Z) - Leveraging Time Irreversibility with Order-Contrastive Pre-training [3.1848820580333737]
We explore an "order-contrastive" method for self-supervised pre-training on longitudinal data.
We prove a finite-sample guarantee for the downstream error of a representation learned with order-contrastive pre-training.
Our results indicate that pre-training methods designed for particular classes of distributions and downstream tasks can improve the performance of self-supervised learning.
arXiv Detail & Related papers (2021-11-04T02:56:52Z) - Predictive machine learning for prescriptive applications: a coupled
training-validating approach [77.34726150561087]
We propose a new method for training predictive machine learning models for prescriptive applications.
This approach is based on tweaking the validation step in the standard training-validating-testing scheme.
Several experiments with synthetic data demonstrate promising results in reducing the prescription costs in both deterministic and real models.
arXiv Detail & Related papers (2021-10-22T15:03:20Z) - Active learning for medical code assignment [55.99831806138029]
We demonstrate the effectiveness of Active Learning (AL) in multi-label text classification in the clinical domain.
We apply a set of well-known AL methods to help automatically assign ICD-9 codes on the MIMIC-III dataset.
Our results show that the selection of informative instances provides satisfactory classification with a significantly reduced training set.
arXiv Detail & Related papers (2021-04-12T18:11:17Z) - Towards Model-Agnostic Post-Hoc Adjustment for Balancing Ranking
Fairness and Algorithm Utility [54.179859639868646]
Bipartite ranking aims to learn a scoring function that ranks positive individuals higher than negative ones from labeled data.
There have been rising concerns on whether the learned scoring function can cause systematic disparity across different protected groups.
We propose a model post-processing framework for balancing them in the bipartite ranking scenario.
arXiv Detail & Related papers (2020-06-15T10:08:39Z) - Self-Training with Improved Regularization for Sample-Efficient Chest
X-Ray Classification [80.00316465793702]
We present a deep learning framework that enables robust modeling in challenging scenarios.
Our results show that using 85% lesser labeled data, we can build predictive models that match the performance of classifiers trained in a large-scale data setting.
arXiv Detail & Related papers (2020-05-03T02:36:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.