Impact of detecting clinical trial elements in exploration of COVID-19
literature
- URL: http://arxiv.org/abs/2105.12261v1
- Date: Tue, 25 May 2021 23:41:24 GMT
- Title: Impact of detecting clinical trial elements in exploration of COVID-19
literature
- Authors: Simon \v{S}uster, Karin Verspoor, Timothy Baldwin, Jey Han Lau,
Antonio Jimeno Yepes, David Martinez, Yulia Otmakhova
- Abstract summary: We compare the results retrieved by a standard search engine with those filtered using clinically-relevant concepts and their relations.
We find that the relational concept selection filters the original retrieved collection in a way that decreases the proportion of unjudged documents.
- Score: 29.027162080682643
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The COVID-19 pandemic has driven ever-greater demand for tools which enable
efficient exploration of biomedical literature. Although semi-structured
information resulting from concept recognition and detection of the defining
elements of clinical trials (e.g. PICO criteria) has been commonly used to
support literature search, the contributions of this abstraction remain poorly
understood, especially in relation to text-based retrieval. In this study, we
compare the results retrieved by a standard search engine with those filtered
using clinically-relevant concepts and their relations. With analysis based on
the annotations from the TREC-COVID shared task, we obtain quantitative as well
as qualitative insights into characteristics of relational and concept-based
literature exploration. Most importantly, we find that the relational concept
selection filters the original retrieved collection in a way that decreases the
proportion of unjudged documents and increases the precision, which means that
the user is likely to be exposed to a larger number of relevant documents.
Related papers
- Improving Retrieval in Theme-specific Applications using a Corpus
Topical Taxonomy [52.426623750562335]
We introduce ToTER (Topical taxonomy Enhanced Retrieval) framework.
ToTER identifies the central topics of queries and documents with the guidance of the taxonomy, and exploits their topical relatedness to supplement missing contexts.
As a plug-and-play framework, ToTER can be flexibly employed to enhance various PLM-based retrievers.
arXiv Detail & Related papers (2024-03-07T02:34:54Z) - DiscoverPath: A Knowledge Refinement and Retrieval System for
Interdisciplinarity on Biomedical Research [96.10765714077208]
Traditional keyword-based search engines fall short in assisting users who may not be familiar with specific terminologies.
We present a knowledge graph-based paper search engine for biomedical research to enhance the user experience.
The system, dubbed DiscoverPath, employs Named Entity Recognition (NER) and part-of-speech (POS) tagging to extract terminologies and relationships from article abstracts to create a KG.
arXiv Detail & Related papers (2023-09-04T20:52:33Z) - Self-Verification Improves Few-Shot Clinical Information Extraction [73.6905567014859]
Large language models (LLMs) have shown the potential to accelerate clinical curation via few-shot in-context learning.
They still struggle with issues regarding accuracy and interpretability, especially in mission-critical domains such as health.
Here, we explore a general mitigation framework using self-verification, which leverages the LLM to provide provenance for its own extraction and check its own outputs.
arXiv Detail & Related papers (2023-05-30T22:05:11Z) - Development and validation of a natural language processing algorithm to
pseudonymize documents in the context of a clinical data warehouse [53.797797404164946]
The study highlights the difficulties faced in sharing tools and resources in this domain.
We annotated a corpus of clinical documents according to 12 types of identifying entities.
We build a hybrid system, merging the results of a deep learning model as well as manual rules.
arXiv Detail & Related papers (2023-03-23T17:17:46Z) - Applying unsupervised keyphrase methods on concepts extracted from
discharge sheets [7.102620843620572]
It is necessary to identify the section in which each content is recorded and also to identify key concepts to extract meaning from clinical texts.
In this study, these challenges have been addressed by using clinical natural language processing techniques.
A set of popular unsupervised key phrase extraction methods has been verified and evaluated.
arXiv Detail & Related papers (2023-03-15T20:55:25Z) - A Marker-based Neural Network System for Extracting Social Determinants
of Health [12.6970199179668]
Social determinants of health (SDoH) on patients' healthcare quality and the disparity is well-known.
Many SDoH items are not coded in structured forms in electronic health records.
We explore a multi-stage pipeline involving named entity recognition (NER), relation classification (RC), and text classification methods to extract SDoH information from clinical notes automatically.
arXiv Detail & Related papers (2022-12-24T18:40:23Z) - EBOCA: Evidences for BiOmedical Concepts Association Ontology [55.41644538483948]
This paper proposes EBOCA, an ontology that describes (i) biomedical domain concepts and associations between them, and (ii) evidences supporting these associations.
Test data coming from a subset of DISNET and automatic association extractions from texts has been transformed to create a Knowledge Graph that can be used in real scenarios.
arXiv Detail & Related papers (2022-08-01T18:47:03Z) - Self-supervised Answer Retrieval on Clinical Notes [68.87777592015402]
We introduce CAPR, a rule-based self-supervision objective for training Transformer language models for domain-specific passage matching.
We apply our objective in four Transformer-based architectures: Contextual Document Vectors, Bi-, Poly- and Cross-encoders.
We report that CAPR outperforms strong baselines in the retrieval of domain-specific passages and effectively generalizes across rule-based and human-labeled passages.
arXiv Detail & Related papers (2021-08-02T10:42:52Z) - COVID-19 therapy target discovery with context-aware literature mining [5.839799877302573]
We propose a system for contextualization of empirical expression data by approximating relations between entities.
In order to exploit a larger scientific context by transfer learning, we propose a novel embedding generation technique.
arXiv Detail & Related papers (2020-07-30T18:37:36Z) - On the Combined Use of Extrinsic Semantic Resources for Medical
Information Search [0.0]
We develop a framework to highlight and expand head medical concepts in verbose medical queries.
We also build semantically enhanced inverted index documents.
To demonstrate the effectiveness of the proposed approach, we conducted several experiments over the CLEF 2014 dataset.
arXiv Detail & Related papers (2020-05-17T14:18:04Z) - Knowledge-guided Text Structuring in Clinical Trials [0.38073142980733]
We propose a knowledge-guided text structuring framework with an automatically generated knowledge base.
Experimental results show that our method can achieve overall high precision and recall.
arXiv Detail & Related papers (2019-12-28T01:12:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.