Related papers: Text Classification of Cancer Clinical Trial Eligibility Criteria

URL: http://arxiv.org/abs/2309.07812v2
Date: Fri, 15 Sep 2023 21:59:56 GMT
Title: Text Classification of Cancer Clinical Trial Eligibility Criteria
Authors: Yumeng Yang, Soumya Jayaraj, Ethan B Ludmir, Kirk Roberts
Abstract summary: We focus on seven common exclusion criteria in cancer trials: prior malignancy, human immunodeficiency virus, hepatitis B, hepatitis C, psychiatric illness, drug/substance abuse, and autoimmune illness. Our dataset consists of 764 phase III cancer trials with these exclusions annotated at the trial level. Our results demonstrate the feasibility of automatically classifying common exclusion criteria.
Score: 3.372747046563984
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Automatic identification of clinical trials for which a patient is eligible is complicated by the fact that trial eligibility is stated in natural language. A potential solution to this problem is to employ text classification methods for common types of eligibility criteria. In this study, we focus on seven common exclusion criteria in cancer trials: prior malignancy, human immunodeficiency virus, hepatitis B, hepatitis C, psychiatric illness, drug/substance abuse, and autoimmune illness. Our dataset consists of 764 phase III cancer trials with these exclusions annotated at the trial level. We experiment with common transformer models as well as a new pre-trained clinical trial BERT model. Our results demonstrate the feasibility of automatically classifying common exclusion criteria. Additionally, we demonstrate the value of a pre-trained language model specifically for clinical trials, which yields the highest average performance across all criteria.
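To make the task framing concrete, the following is a minimal sketch of trial-level, multi-label classification over the seven exclusion criteria named in the abstract. It is not the authors' method: the paper uses transformer models, whereas this sketch is a simple keyword baseline. The label names, the regular expressions, the `classify_trial` helper, and the example text are all hypothetical and chosen only for illustration.

```python
import re

# One pattern per exclusion criterion listed in the abstract.
# The patterns are illustrative assumptions, not taken from the paper.
CRITERIA_PATTERNS = {
    "prior_malignancy": r"\b(prior|previous|other|second)\s+(malignanc(y|ies)|cancers?)\b",
    "hiv": r"\b(hiv|human immunodeficiency virus)\b",
    "hepatitis_b": r"\b(hepatitis b|hbv)\b",
    "hepatitis_c": r"\b(hepatitis c|hcv)\b",
    "psychiatric_illness": r"\bpsychiatric\b",
    "substance_abuse": r"\b(drug|substance|alcohol)\s+(abuse|dependence)\b",
    "autoimmune_illness": r"\bautoimmune\b",
}


def classify_trial(exclusion_text: str) -> dict[str, bool]:
    """Return one binary label per criterion for a whole trial's exclusion text."""
    text = exclusion_text.lower()
    return {
        name: re.search(pattern, text) is not None
        for name, pattern in CRITERIA_PATTERNS.items()
    }


# Hypothetical exclusion text, written for this example only.
example = (
    "Exclusion criteria: known HIV infection; active hepatitis B or hepatitis C; "
    "history of autoimmune disease; prior malignancy within 5 years."
)
labels = classify_trial(example)
for name, present in labels.items():
    print(f"{name}: {present}")
```

A keyword baseline like this misses paraphrases such as "hepatitis B or C" or "any other cancer diagnosis", which is one reason learned classifiers are attractive for this task.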

Related papers

Towards Regulatory-Confirmed Adaptive Clinical Trials: Machine Learning Opportunities and Solutions [59.28853595868749]
We introduce two new objectives for future clinical trials that integrate regulatory constraints and treatment policy value for both the entire population and under-served populations. We formulate Randomize First Augment Next (RFAN), a new framework for designing Phase III clinical trials. Our framework consists of a standard randomized component followed by an adaptive one, jointly meant to efficiently and safely acquire and assign patients into treatment arms during the trial.
arXiv Detail & Related papers (2025-03-12T10:17:54Z)
TrialBench: Multi-Modal Artificial Intelligence-Ready Clinical Trial Datasets [57.067409211231244]
This paper presents meticulously curated AI-ready datasets covering multi-modal data (e.g., drug molecule, disease code, text, categorical/numerical features) and 8 crucial prediction challenges in clinical trial design. We provide basic validation methods for each task to ensure the datasets' usability and reliability. We anticipate that the availability of such open-access datasets will catalyze the development of advanced AI approaches for clinical trial design.
arXiv Detail & Related papers (2024-06-30T09:13:10Z)
Exploring the Generalization of Cancer Clinical Trial Eligibility Classifiers Across Diseases [3.087385668501741]
This study aims to evaluate the generalizability of eligibility classification across a broad spectrum of clinical trials. We have compiled eligibility criteria data for five types of trials: (1) additional phase 3 cancer trials, (2) phase 1 and 2 cancer trials, (3) heart disease trials, (4) type 2 diabetes trials, and (5) observational trials for any disease. Our results show that models trained on the extensive cancer dataset can effectively handle criteria commonly found in non-cancer trials, such as autoimmune diseases.
arXiv Detail & Related papers (2024-03-25T19:17:59Z)
TREEMENT: Interpretable Patient-Trial Matching via Personalized Dynamic Tree-Based Memory Network [54.332862955411656]
Clinical trials are critical for drug development but often suffer from expensive and inefficient patient recruitment. In recent years, machine learning models have been proposed for speeding up patient recruitment via automatically matching patients with clinical trials. We introduce a dynamic tree-based memory network model named TREEMENT to provide accurate and interpretable patient trial matching.
arXiv Detail & Related papers (2023-07-19T12:35:09Z)
AutoTrial: Prompting Language Models for Clinical Trial Design [53.630479619856516]
We present a method named AutoTrial to aid the design of clinical eligibility criteria using language models. Experiments on over 70K clinical trials verify that AutoTrial generates high-quality criteria texts.
arXiv Detail & Related papers (2023-05-19T01:04:16Z)
Improving Patient Pre-screening for Clinical Trials: Assisting Physicians with Large Language Models [0.0]
Large Language Models (LLMs) have been shown to perform well for clinical information extraction and clinical reasoning. This paper investigates the use of InstructGPT to assist physicians in determining eligibility for clinical trials based on a patient's summarised medical profile.
arXiv Detail & Related papers (2023-04-14T21:19:46Z)
Towards Fair Patient-Trial Matching via Patient-Criterion Level Fairness Constraint [50.35075018041199]
This work proposes a fair patient-trial matching framework by generating a patient-criterion level fairness constraint. The experimental results on real-world patient-trial and patient-criterion matching tasks demonstrate that the proposed framework can successfully mitigate biased predictions.
arXiv Detail & Related papers (2023-03-24T03:59:19Z)
The Leaf Clinical Trials Corpus: a new resource for query generation from clinical trial eligibility criteria [1.7205106391379026]
We introduce the Leaf Clinical Trials (LCT) corpus, a human-annotated corpus of over 1,000 clinical trial eligibility criteria descriptions. We provide details of our schema, annotation process, corpus quality, and statistics.
arXiv Detail & Related papers (2022-07-27T19:22:24Z)
Clinical trial site matching with improved diversity using fair policy learning [56.01170456417214]
We learn a model that maps a clinical trial description to a ranked list of potential trial sites. Unlike existing fairness frameworks, the group membership of each trial site is non-binary. We propose fairness criteria based on demographic parity to address such a multi-group membership scenario.
arXiv Detail & Related papers (2022-04-13T16:35:28Z)
A Scalable AI Approach for Clinical Trial Cohort Optimization [6.076017404694899]
The FDA has been promoting enrollment practices that could enhance the diversity of clinical trial populations. We propose an AI approach to Cohort Optimization (AICO) through transformer-based natural language processing. A case study on breast cancer trial design demonstrates the utility of the method in improving trial generalizability.
arXiv Detail & Related papers (2021-09-07T01:49:05Z)
Malignancy Prediction and Lesion Identification from Clinical Dermatological Images [65.1629311281062]
We consider machine-learning-based malignancy prediction and lesion identification from clinical dermatological images. The approach first identifies all lesions present in the image regardless of sub-type or likelihood of malignancy, then estimates their likelihood of malignancy, and, through aggregation, also generates an image-level likelihood of malignancy.
arXiv Detail & Related papers (2021-04-02T20:52:05Z)

This list is automatically generated from the titles and abstracts of the papers on this site.