Distinguishing Transformative from Incremental Clinical Evidence: A
Classifier of Clinical Research using Textual features from Abstracts and
Citing Sentences
- URL: http://arxiv.org/abs/2112.12996v1
- Date: Fri, 24 Dec 2021 08:35:18 GMT
- Title: Distinguishing Transformative from Incremental Clinical Evidence: A
Classifier of Clinical Research using Textual features from Abstracts and
Citing Sentences
- Authors: Xuanyu Shi, Jian Du
- Abstract summary: In clinical research, it is important to know if a study changes or only supports the current standards of care for specific disease management.
In this study, a machine learning approach is proposed to distinguishing transformative from incremental clinical evidence.
- Score: 1.3135234328352885
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: In clinical research and clinical decision-making, it is important to know if
a study changes or only supports the current standards of care for specific
disease management. We define such a change as transformative and a support as
incremental research. It usually requires a huge amount of domain expertise and
time for humans to finish such tasks. Faculty Opinions provides us with a
well-annotated corpus on whether a research challenges or only confirms
established research. In this study, a machine learning approach is proposed to
distinguishing transformative from incremental clinical evidence. The texts
from both abstract and a 2-year window of citing sentences are collected for a
training set of clinical studies recommended and labeled by Faculty Opinions
experts. We achieve the best performance with an average AUC of 0.755
(0.705-0.875) using Random Forest as the classifier and citing sentences as the
feature. The results showed that transformative research has typical language
patterns in citing sentences unlike abstract sentences. We provide an efficient
tool for identifying those clinical evidence challenging or only confirming
established claims for clinicians and researchers.
Related papers
- Making the Most Out of the Limited Context Length: Predictive Power
Varies with Clinical Note Type and Note Section [70.37720062263176]
We propose a framework to analyze the sections with high predictive power.
Using MIMIC-III, we show that: 1) predictive power distribution is different between nursing notes and discharge notes and 2) combining different types of notes could improve performance when the context length is large.
arXiv Detail & Related papers (2023-07-13T20:04:05Z) - Self-Verification Improves Few-Shot Clinical Information Extraction [73.6905567014859]
Large language models (LLMs) have shown the potential to accelerate clinical curation via few-shot in-context learning.
They still struggle with issues regarding accuracy and interpretability, especially in mission-critical domains such as health.
Here, we explore a general mitigation framework using self-verification, which leverages the LLM to provide provenance for its own extraction and check its own outputs.
arXiv Detail & Related papers (2023-05-30T22:05:11Z) - Applying unsupervised keyphrase methods on concepts extracted from
discharge sheets [7.102620843620572]
It is necessary to identify the section in which each content is recorded and also to identify key concepts to extract meaning from clinical texts.
In this study, these challenges have been addressed by using clinical natural language processing techniques.
A set of popular unsupervised key phrase extraction methods has been verified and evaluated.
arXiv Detail & Related papers (2023-03-15T20:55:25Z) - Adaptive Identification of Populations with Treatment Benefit in
Clinical Trials: Machine Learning Challenges and Solutions [78.31410227443102]
We study the problem of adaptively identifying patient subpopulations that benefit from a given treatment during a confirmatory clinical trial.
We propose AdaGGI and AdaGCPI, two meta-algorithms for subpopulation construction.
arXiv Detail & Related papers (2022-08-11T14:27:49Z) - Cross-Lingual Knowledge Transfer for Clinical Phenotyping [55.92262310716537]
We investigate cross-lingual knowledge transfer strategies to execute this task for clinics that do not use the English language.
We evaluate these strategies for a Greek and a Spanish clinic leveraging clinical notes from different clinical domains.
Our results show that using multilingual data overall improves clinical phenotyping models and can compensate for data sparseness.
arXiv Detail & Related papers (2022-08-03T08:33:21Z) - Clinical Named Entity Recognition using Contextualized Token
Representations [49.036805795072645]
This paper introduces the technique of contextualized word embedding to better capture the semantic meaning of each word based on its context.
We pre-train two deep contextualized language models, Clinical Embeddings from Language Model (C-ELMo) and Clinical Contextual String Embeddings (C-Flair)
Explicit experiments show that our models gain dramatic improvements compared to both static word embeddings and domain-generic language models.
arXiv Detail & Related papers (2021-06-23T18:12:58Z) - Validating GAN-BioBERT: A Methodology For Assessing Reporting Trends In
Clinical Trials [3.164363223464948]
This study develops a sentiment classification algorithm for clinical trial abstracts using a semi-supervised natural language process model.
The use of this algorithm was found to have a classification accuracy of 91.3%, with a macro F1-Score of 0.92, which is a significant improvement in accuracy when compared to previous methods.
arXiv Detail & Related papers (2021-06-01T17:51:54Z) - A Practical Approach towards Causality Mining in Clinical Text using
Active Transfer Learning [2.6125458645126907]
Causality mining is an active research area, which requires the application of state-of-the-art natural language processing techniques.
This research work is to create a framework, which can convert clinical text into causal knowledge.
arXiv Detail & Related papers (2020-12-10T06:51:13Z) - Benchmarking Automated Clinical Language Simplification: Dataset,
Algorithm, and Evaluation [48.87254340298189]
We construct a new dataset named MedLane to support the development and evaluation of automated clinical language simplification approaches.
We propose a new model called DECLARE that follows the human annotation procedure and achieves state-of-the-art performance.
arXiv Detail & Related papers (2020-12-04T06:09:02Z) - Artificial Intelligence, speech and language processing approaches to
monitoring Alzheimer's Disease: a systematic review [5.635607414700482]
This paper summarises current findings on the use of artificial intelligence, speech and language processing to predict cognitive decline in Alzheimer's Disease.
We conducted a systematic review of original research between 2000 and 2019 registered in PROSPERO.
arXiv Detail & Related papers (2020-10-12T21:43:04Z) - Knowledge-guided Text Structuring in Clinical Trials [0.38073142980733]
We propose a knowledge-guided text structuring framework with an automatically generated knowledge base.
Experimental results show that our method can achieve overall high precision and recall.
arXiv Detail & Related papers (2019-12-28T01:12:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.