Validating GAN-BioBERT: A Methodology For Assessing Reporting Trends In
Clinical Trials
- URL: http://arxiv.org/abs/2106.00665v1
- Date: Tue, 1 Jun 2021 17:51:54 GMT
- Title: Validating GAN-BioBERT: A Methodology For Assessing Reporting Trends In
Clinical Trials
- Authors: Joshua J Myszewski, Emily Klossowski, Patrick Meyer, Kristin Bevil,
Lisa Klesius, Kristopher M Schroeder
- Abstract summary: This study develops a sentiment classification algorithm for clinical trial abstracts using a semi-supervised natural language process model.
The use of this algorithm was found to have a classification accuracy of 91.3%, with a macro F1-Score of 0.92, which is a significant improvement in accuracy when compared to previous methods.
- Score: 3.164363223464948
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: In the past decade, there has been much discussion about the issue of biased
reporting in clinical research. Despite this attention, there have been limited
tools developed for the systematic assessment of qualitative statements made in
clinical research, with most studies assessing qualitative statements relying
on the use of manual expert raters, which limits their size. Also, previous
attempts to develop larger scale tools, such as those using natural language
processing, were limited by both their accuracy and the number of categories
used for the classification of their findings. With these limitations in mind,
this study's goal was to develop a classification algorithm that was both
suitably accurate and finely grained to be applied on a large scale for
assessing the qualitative sentiment expressed in clinical trial abstracts.
Additionally, this study seeks to compare the performance of the proposed
algorithm, GAN-BioBERT, to previous studies as well as to expert manual rating
of clinical trial abstracts. This study develops a three-class sentiment
classification algorithm for clinical trial abstracts using a semi-supervised
natural language process model based on the Bidirectional Encoder
Representation from Transformers (BERT) model, from a series of clinical trial
abstracts annotated by a group of experts in academic medicine. Results: The
use of this algorithm was found to have a classification accuracy of 91.3%,
with a macro F1-Score of 0.92, which is a significant improvement in accuracy
when compared to previous methods and expert ratings, while also making the
sentiment classification finer grained than previous studies. The proposed
algorithm, GAN-BioBERT, is a suitable classification model for the large-scale
assessment of qualitative statements in clinical trial literature, providing an
accurate, reproducible tool for the large-scale study of clinical publication
trends.
Related papers
- Towards Efficient Patient Recruitment for Clinical Trials: Application of a Prompt-Based Learning Model [0.7373617024876725]
Clinical trials are essential for advancing pharmaceutical interventions, but they face a bottleneck in selecting eligible participants.
The complex nature of unstructured medical texts presents challenges in efficiently identifying participants.
In this study, we aimed to evaluate the performance of a prompt-based large language model for the cohort selection task.
arXiv Detail & Related papers (2024-04-24T20:42:28Z) - TrialDura: Hierarchical Attention Transformer for Interpretable Clinical Trial Duration Prediction [19.084936647082632]
We propose TrialDura, a machine learning-based method that estimates the duration of clinical trials using multimodal data.
We encode them into Bio-BERT embeddings specifically tuned for biomedical contexts to provide a deeper and more relevant semantic understanding.
Our proposed model demonstrated superior performance with a mean absolute error (MAE) of 1.04 years and a root mean square error (RMSE) of 1.39 years compared to the other models.
arXiv Detail & Related papers (2024-04-20T02:12:59Z) - Uncertainty Quantification on Clinical Trial Outcome Prediction [37.238845949535616]
We propose incorporating uncertainty quantification into clinical trial outcome predictions.
Our main goal is to enhance the model's ability to discern nuanced differences.
We have adopted a selective classification approach to fulfill our objective.
arXiv Detail & Related papers (2024-01-07T13:48:05Z) - Multi-task Explainable Skin Lesion Classification [54.76511683427566]
We propose a few-shot-based approach for skin lesions that generalizes well with few labelled data.
The proposed approach comprises a fusion of a segmentation network that acts as an attention module and classification network.
arXiv Detail & Related papers (2023-10-11T05:49:47Z) - Machine Learning-Assisted Pattern Recognition Algorithms for Estimating
Ultimate Tensile Strength in Fused Deposition Modeled Polylactic Acid
Specimens [0.0]
We investigate the application of supervised machine learning algorithms for estimating the Ultimate Tensile Strength (UTS) of Polylactic Acid (PLA) specimens fabricated using the Fused Deposition Modeling (FDM) process.
The primary objective was to assess the accuracy and effectiveness of four distinct supervised classification algorithms, namely Logistic Classification, Gradient Boosting Classification, Decision Tree, and K-Nearest Neighbor.
The results revealed that while the Decision Tree and K-Nearest Neighbor algorithms both achieved an F1 score of 0.71, the KNN algorithm exhibited a higher Area Under the Curve (AUC) score of 0.79, outperforming the other algorithms
arXiv Detail & Related papers (2023-07-13T11:10:22Z) - AutoTrial: Prompting Language Models for Clinical Trial Design [53.630479619856516]
We present a method named AutoTrial to aid the design of clinical eligibility criteria using language models.
Experiments on over 70K clinical trials verify that AutoTrial generates high-quality criteria texts.
arXiv Detail & Related papers (2023-05-19T01:04:16Z) - SPOT: Sequential Predictive Modeling of Clinical Trial Outcome with
Meta-Learning [67.8195828626489]
Clinical trials are essential to drug development but time-consuming, costly, and prone to failure.
We propose Sequential Predictive mOdeling of clinical Trial outcome (SPOT) that first identifies trial topics to cluster the multi-sourced trial data into relevant trial topics.
With the consideration of each trial sequence as a task, it uses a meta-learning strategy to achieve a point where the model can rapidly adapt to new tasks with minimal updates.
arXiv Detail & Related papers (2023-04-07T23:04:27Z) - Development and validation of a natural language processing algorithm to
pseudonymize documents in the context of a clinical data warehouse [53.797797404164946]
The study highlights the difficulties faced in sharing tools and resources in this domain.
We annotated a corpus of clinical documents according to 12 types of identifying entities.
We build a hybrid system, merging the results of a deep learning model as well as manual rules.
arXiv Detail & Related papers (2023-03-23T17:17:46Z) - CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark [51.38557174322772]
We present the first Chinese Biomedical Language Understanding Evaluation benchmark.
It is a collection of natural language understanding tasks including named entity recognition, information extraction, clinical diagnosis normalization, single-sentence/sentence-pair classification.
We report empirical results with the current 11 pre-trained Chinese models, and experimental results show that state-of-the-art neural models perform by far worse than the human ceiling.
arXiv Detail & Related papers (2021-06-15T12:25:30Z) - AI Driven Knowledge Extraction from Clinical Practice Guidelines:
Turning Research into Practice [2.803896166632835]
Clinical Practice Guidelines (CPGs) represent the foremost methodology for sharing state-of-the-art research findings in the healthcare domain with medical practitioners.
However, extracting relevant knowledge from the plethora of CPGs is not feasible for already burdened healthcare professionals.
This research presents a novel methodology for knowledge extraction from CPGs to reduce the gap and turn the latest research findings into clinical practice.
arXiv Detail & Related papers (2020-12-10T07:23:02Z) - Semi-supervised Medical Image Classification with Relation-driven
Self-ensembling Model [71.80319052891817]
We present a relation-driven semi-supervised framework for medical image classification.
It exploits the unlabeled data by encouraging the prediction consistency of given input under perturbations.
Our method outperforms many state-of-the-art semi-supervised learning methods on both single-label and multi-label image classification scenarios.
arXiv Detail & Related papers (2020-05-15T06:57:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.