On the Combined Use of Extrinsic Semantic Resources for Medical
Information Search
- URL: http://arxiv.org/abs/2005.08259v1
- Date: Sun, 17 May 2020 14:18:04 GMT
- Title: On the Combined Use of Extrinsic Semantic Resources for Medical
Information Search
- Authors: Mohammed Maree, Israa Noor, Khaled Rabayah, Mohammed Belkhatir, and
Saadat M. Alhashmi
- Abstract summary: We develop a framework to highlight and expand head medical concepts in verbose medical queries.
We also build semantically enhanced inverted index documents.
To demonstrate the effectiveness of the proposed approach, we conducted several experiments over the CLEF 2014 dataset.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Semantic concepts and relations encoded in domain-specific ontologies and
other medical semantic resources play a crucial role in deciphering terms in
medical queries and documents. The exploitation of these resources for tackling
the semantic gap issue has been widely studied in the literature. However,
there are challenges that hinder their widespread use in real-world
applications. Among these challenges is the insufficient knowledge individually
encoded in existing medical ontologies, which is magnified when users express
their information needs using long-winded natural language queries. In this
context, many of the users query terms are either unrecognized by the used
ontologies, or cause retrieving false positives that degrade the quality of
current medical information search approaches. In this article, we explore the
combination of multiple extrinsic semantic resources in the development of a
full-fledged medical information search framework to: i) highlight and expand
head medical concepts in verbose medical queries (i.e. concepts among query
terms that significantly contribute to the informativeness and intent of a
given query), ii) build semantically enhanced inverted index documents, iii)
contribute to a heuristical weighting technique in the query document matching
process. To demonstrate the effectiveness of the proposed approach, we
conducted several experiments over the CLEF eHealth 2014 dataset. Findings
indicate that the proposed method combining several extrinsic semantic
resources proved to be more effective than related approaches in terms of
precision measure.
Related papers
- Improving Retrieval in Theme-specific Applications using a Corpus
Topical Taxonomy [52.426623750562335]
We introduce ToTER (Topical taxonomy Enhanced Retrieval) framework.
ToTER identifies the central topics of queries and documents with the guidance of the taxonomy, and exploits their topical relatedness to supplement missing contexts.
As a plug-and-play framework, ToTER can be flexibly employed to enhance various PLM-based retrievers.
arXiv Detail & Related papers (2024-03-07T02:34:54Z) - Zero-Shot Medical Information Retrieval via Knowledge Graph Embedding [27.14794371879541]
This paper introduces MedFusionRank, a novel approach to zero-shot medical information retrieval (MIR)
The proposed approach leverages a pre-trained BERT-style model to extract compact yet informative keywords.
These keywords are then enriched with domain knowledge by linking them to conceptual entities within a medical knowledge graph.
arXiv Detail & Related papers (2023-10-31T16:26:33Z) - DiscoverPath: A Knowledge Refinement and Retrieval System for
Interdisciplinarity on Biomedical Research [96.10765714077208]
Traditional keyword-based search engines fall short in assisting users who may not be familiar with specific terminologies.
We present a knowledge graph-based paper search engine for biomedical research to enhance the user experience.
The system, dubbed DiscoverPath, employs Named Entity Recognition (NER) and part-of-speech (POS) tagging to extract terminologies and relationships from article abstracts to create a KG.
arXiv Detail & Related papers (2023-09-04T20:52:33Z) - HiPrompt: Few-Shot Biomedical Knowledge Fusion via Hierarchy-Oriented
Prompting [33.1455954220194]
HiPrompt is a supervision-efficient knowledge fusion framework.
It elicits the few-shot reasoning ability of large language models through hierarchy-oriented prompts.
Empirical results on the collected KG-Hi-BKF benchmark datasets demonstrate the effectiveness of HiPrompt.
arXiv Detail & Related papers (2023-04-12T16:54:26Z) - EBOCA: Evidences for BiOmedical Concepts Association Ontology [55.41644538483948]
This paper proposes EBOCA, an ontology that describes (i) biomedical domain concepts and associations between them, and (ii) evidences supporting these associations.
Test data coming from a subset of DISNET and automatic association extractions from texts has been transformed to create a Knowledge Graph that can be used in real scenarios.
arXiv Detail & Related papers (2022-08-01T18:47:03Z) - Semantic Search for Large Scale Clinical Ontologies [63.71950996116403]
We present a deep learning approach to build a search system for large clinical vocabularies.
We propose a Triplet-BERT model and a method that generates training data based on semantic training data.
The model is evaluated using five real benchmark data sets and the results show that our approach achieves high results on both free text to concept and concept to searching concept vocabularies.
arXiv Detail & Related papers (2022-01-01T05:15:42Z) - Clinical Named Entity Recognition using Contextualized Token
Representations [49.036805795072645]
This paper introduces the technique of contextualized word embedding to better capture the semantic meaning of each word based on its context.
We pre-train two deep contextualized language models, Clinical Embeddings from Language Model (C-ELMo) and Clinical Contextual String Embeddings (C-Flair)
Explicit experiments show that our models gain dramatic improvements compared to both static word embeddings and domain-generic language models.
arXiv Detail & Related papers (2021-06-23T18:12:58Z) - A Meta-embedding-based Ensemble Approach for ICD Coding Prediction [64.42386426730695]
International Classification of Diseases (ICD) are the de facto codes used globally for clinical coding.
These codes enable healthcare providers to claim reimbursement and facilitate efficient storage and retrieval of diagnostic information.
Our proposed approach enhances the performance of neural models by effectively training word vectors using routine medical data as well as external knowledge from scientific articles.
arXiv Detail & Related papers (2021-02-26T17:49:58Z) - Knowledge-Base Enriched Word Embeddings for Biomedical Domain [5.086571902225929]
We propose a new word embedding based model for biomedical domain that jointly leverages the information from available corpora and domain knowledge.
Unlike existing approaches, the proposed methodology is simple but adept at capturing the precise knowledge available in domain resources in an accurate way.
arXiv Detail & Related papers (2021-02-20T18:18:51Z) - Natural language technology and query expansion: issues,
state-of-the-art and perspectives [0.0]
Linguistic characteristics that cause ambiguity and misinterpretation of queries as well as additional factors affect the users ability to accurately represent their information needs.
We lay down the anatomy of a generic linguistic based query expansion framework and propose its module-based decomposition.
For each of the modules we review the state-of-the-art solutions in the literature and categorized under the light of the techniques used.
arXiv Detail & Related papers (2020-04-23T11:39:07Z) - Coupling semantic and statistical techniques for dynamically enriching
web ontologies [0.0]
We propose an automatic coupled statistical/semantic framework for dynamically enriching large-scale generic from the World Wide Web.
The benefits of our approach are: (i) proposing the dynamic enrichment of large-scale semantic patterns with missing background knowledge, and thus, enabling the reuse of such knowledge.
arXiv Detail & Related papers (2020-04-23T11:21:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.