Automating the Compilation of Potential Core-Outcomes for Clinical
Trials
- URL: http://arxiv.org/abs/2101.04076v1
- Date: Mon, 11 Jan 2021 18:14:49 GMT
- Title: Automating the Compilation of Potential Core-Outcomes for Clinical
Trials
- Authors: Shwetha Bharadwaj, Melanie Laffin
- Abstract summary: The objective of this paper is to describe an automated method utilizing natural language processing in order to describe the probable core outcomes of clinical trials.
In addition to BioBERT, an unsupervised feature-based approach making use of only the encoder output embedding representations was utilized.
This method was able to both harness the domain-specific context of each of the tokens from the learned embeddings of the BioBERT model as well as a more stable metric of sentence similarity.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Due to increased access to clinical trial outcomes and analysis, researchers
and scientists are able to iterate or improve upon relevant approaches more
effectively. However, the metrics and related results of clinical trials
typically do not follow any standardization in their reports, making it more
difficult for researchers to parse the results of different trials. The
objective of this paper is to describe an automated method utilizing natural
language processing in order to describe the probable core outcomes of clinical
trials, in order to alleviate the issues around disparate clinical trial
outcomes. As the nature of this process is domain specific, BioBERT was
employed in order to conduct a multi-class entity normalization task. In
addition to BioBERT, an unsupervised feature-based approach making use of only
the encoder output embedding representations for the outcomes and labels was
utilized. Finally, cosine similarity was calculated across the vectors to
obtain the semantic similarity. This method was able to both harness the
domain-specific context of each of the tokens from the learned embeddings of
the BioBERT model as well as a more stable metric of sentence similarity. Some
common outcomes identified using the Jaccard similarity in each of the
classifications were compiled, and while some are untenable, a pipeline for
which this automation process could be conducted was established.
Related papers
- Subgroup analysis methods for time-to-event outcomes in heterogeneous
randomized controlled trials [7.940293148084845]
Non-significant randomized control trials can hide subgroups of good responders to experimental drugs.
We provide an open source Python package, available on Github, containing our generation process and our comprehensive benchmark framework.
arXiv Detail & Related papers (2024-01-22T11:00:49Z) - Multi-task Explainable Skin Lesion Classification [54.76511683427566]
We propose a few-shot-based approach for skin lesions that generalizes well with few labelled data.
The proposed approach comprises a fusion of a segmentation network that acts as an attention module and classification network.
arXiv Detail & Related papers (2023-10-11T05:49:47Z) - A reproducible experimental survey on biomedical sentence similarity: a
string-based method sets the state of the art [0.0]
This report introduces the largest, and for the first time, reproducible experimental survey on biomedical sentence similarity.
Our aim is to elucidate the state of the art of the problem and to solve some problems preventing the evaluation of most of current methods.
Our experiments confirm that the pre-processing stages, and the choice of the NER tool, have a significant impact on the performance of the sentence similarity methods.
arXiv Detail & Related papers (2022-05-18T06:20:42Z) - Assessment of contextualised representations in detecting outcome
phrases in clinical trials [14.584741378279316]
We introduce "EBM-COMET", a dataset in which 300 PubMed abstracts are expertly annotated for clinical outcomes.
To extract outcomes, we fine-tune a variety of pre-trained contextualized representations.
We observe our best model (BioBERT) achieve 81.5% F1, 81.3% sensitivity and 98.0% specificity.
arXiv Detail & Related papers (2022-02-13T15:08:00Z) - Scalable Intervention Target Estimation in Linear Models [52.60799340056917]
Current approaches to causal structure learning either work with known intervention targets or use hypothesis testing to discover the unknown intervention targets.
This paper proposes a scalable and efficient algorithm that consistently identifies all intervention targets.
The proposed algorithm can be used to also update a given observational Markov equivalence class into the interventional Markov equivalence class.
arXiv Detail & Related papers (2021-11-15T03:16:56Z) - On Inductive Biases for Heterogeneous Treatment Effect Estimation [91.3755431537592]
We investigate how to exploit structural similarities of an individual's potential outcomes (POs) under different treatments.
We compare three end-to-end learning strategies to overcome this problem.
arXiv Detail & Related papers (2021-06-07T16:30:46Z) - Validating GAN-BioBERT: A Methodology For Assessing Reporting Trends In
Clinical Trials [3.164363223464948]
This study develops a sentiment classification algorithm for clinical trial abstracts using a semi-supervised natural language process model.
The use of this algorithm was found to have a classification accuracy of 91.3%, with a macro F1-Score of 0.92, which is a significant improvement in accuracy when compared to previous methods.
arXiv Detail & Related papers (2021-06-01T17:51:54Z) - Neural networks for Anatomical Therapeutic Chemical (ATC) [83.73971067918333]
We propose combining multiple multi-label classifiers trained on distinct sets of features, including sets extracted from a Bidirectional Long Short-Term Memory Network (BiLSTM)
Experiments demonstrate the power of this approach, which is shown to outperform the best methods reported in the literature.
arXiv Detail & Related papers (2021-01-22T19:49:47Z) - Interaction Matching for Long-Tail Multi-Label Classification [57.262792333593644]
We present an elegant and effective approach for addressing limitations in existing multi-label classification models.
By performing soft n-gram interaction matching, we match labels with natural language descriptions.
arXiv Detail & Related papers (2020-05-18T15:27:55Z) - Semi-supervised Medical Image Classification with Relation-driven
Self-ensembling Model [71.80319052891817]
We present a relation-driven semi-supervised framework for medical image classification.
It exploits the unlabeled data by encouraging the prediction consistency of given input under perturbations.
Our method outperforms many state-of-the-art semi-supervised learning methods on both single-label and multi-label image classification scenarios.
arXiv Detail & Related papers (2020-05-15T06:57:54Z) - Exemplar Auditing for Multi-Label Biomedical Text Classification [0.4873362301533824]
We generalize a recently proposed zero-shot sequence labeling method, "supervised labeling via a convolutional decomposition"
The approach yields classification with "introspection", relating the fine-grained features of an inference-time prediction to their nearest neighbors.
Our proposed approach yields both a competitively effective classification model and an interrogation mechanism to aid healthcare workers in understanding the salient features that drive the model's predictions.
arXiv Detail & Related papers (2020-04-07T02:54:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.