Medical Entity Linking using Triplet Network
- URL: http://arxiv.org/abs/2012.11164v1
- Date: Mon, 21 Dec 2020 07:44:37 GMT
- Title: Medical Entity Linking using Triplet Network
- Authors: Ishani Mondal, Sukannya Purkayastha, Sudeshna Sarkar, Pawan Goyal,
Jitesh Pillai, Amitava Bhattacharyya, Mahanandeeshwar Gattu
- Abstract summary: We present an approach to rank the candidate Knowledge Base entries based on their similarity with disease mention.
We introduce a robust and portable candidate generation scheme that does not make use of the hand-crafted rules.
Experimental results on the standard benchmark NCBI dataset demonstrate that our system outperforms the prior methods by a significant margin.
- Score: 7.18342554344254
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Entity linking (or Normalization) is an essential task in text mining that
maps the entity mentions in the medical text to standard entities in a given
Knowledge Base (KB). This task is of great importance in the medical domain. It
can also be used for merging different medical and clinical ontologies. In this
paper, we center around the problem of disease linking or normalization. This
task is executed in two phases: candidate generation and candidate scoring. In
this paper, we present an approach to rank the candidate Knowledge Base entries
based on their similarity with disease mention. We make use of the Triplet
Network for candidate ranking. While the existing methods have used carefully
generated sieves and external resources for candidate generation, we introduce
a robust and portable candidate generation scheme that does not make use of the
hand-crafted rules. Experimental results on the standard benchmark NCBI disease
dataset demonstrate that our system outperforms the prior methods by a
significant margin.
Related papers
- Medical Vision-Language Pre-Training for Brain Abnormalities [96.1408455065347]
We show how to automatically collect medical image-text aligned data for pretraining from public resources such as PubMed.
In particular, we present a pipeline that streamlines the pre-training process by initially collecting a large brain image-text dataset.
We also investigate the unique challenge of mapping subfigures to subcaptions in the medical domain.
arXiv Detail & Related papers (2024-04-27T05:03:42Z) - Bi-Encoders based Species Normalization -- Pairwise Sentence Learning to
Rank [0.0]
We present a novel deep learning approach for named entity normalization, treating it as a pair-wise learning to rank problem.
We conduct experiments on species entity types and evaluate our method against state-of-the-art techniques.
arXiv Detail & Related papers (2023-10-22T17:30:16Z) - EBOCA: Evidences for BiOmedical Concepts Association Ontology [55.41644538483948]
This paper proposes EBOCA, an ontology that describes (i) biomedical domain concepts and associations between them, and (ii) evidences supporting these associations.
Test data coming from a subset of DISNET and automatic association extractions from texts has been transformed to create a Knowledge Graph that can be used in real scenarios.
arXiv Detail & Related papers (2022-08-01T18:47:03Z) - Improving Candidate Retrieval with Entity Profile Generation for
Wikidata Entity Linking [76.00737707718795]
We propose a novel candidate retrieval paradigm based on entity profiling.
We use the profile to query the indexed search engine to retrieve candidate entities.
Our approach complements the traditional approach of using a Wikipedia anchor-text dictionary.
arXiv Detail & Related papers (2022-02-27T17:38:53Z) - Discovering Drug-Target Interaction Knowledge from Biomedical Literature [107.98712673387031]
The Interaction between Drugs and Targets (DTI) in human body plays a crucial role in biomedical science and applications.
As millions of papers come out every year in the biomedical domain, automatically discovering DTI knowledge from literature becomes an urgent demand in the industry.
We explore the first end-to-end solution for this task by using generative approaches.
We regard the DTI triplets as a sequence and use a Transformer-based model to directly generate them without using the detailed annotations of entities and relations.
arXiv Detail & Related papers (2021-09-27T17:00:14Z) - Self-supervised Answer Retrieval on Clinical Notes [68.87777592015402]
We introduce CAPR, a rule-based self-supervision objective for training Transformer language models for domain-specific passage matching.
We apply our objective in four Transformer-based architectures: Contextual Document Vectors, Bi-, Poly- and Cross-encoders.
We report that CAPR outperforms strong baselines in the retrieval of domain-specific passages and effectively generalizes across rule-based and human-labeled passages.
arXiv Detail & Related papers (2021-08-02T10:42:52Z) - A Lightweight Neural Model for Biomedical Entity Linking [1.8047694351309205]
We propose a lightweight neural method for biomedical entity linking.
Our method uses a simple alignment layer with attention mechanisms to capture the variations between mention and entity names.
Our model is competitive with previous work on standard evaluation benchmarks.
arXiv Detail & Related papers (2020-12-16T10:34:37Z) - Text Mining to Identify and Extract Novel Disease Treatments From
Unstructured Datasets [56.38623317907416]
We use Google Cloud to transcribe podcast episodes of an NPR radio show.
We then build a pipeline for systematically pre-processing the text.
Our model successfully identified that Omeprazole can help treat heartburn.
arXiv Detail & Related papers (2020-10-22T19:52:49Z) - Information Extraction of Clinical Trial Eligibility Criteria [6.192164049563104]
This paper investigates an information extraction (IE) approach for grounding criteria from trials in ClinicalTrials(dot)gov to a shared knowledge base.
We frame the problem as a novel knowledge base population task, and implement a solution combining machine learning and context free grammar.
To our knowledge, this work is the first criteria extraction system to apply attention-based conditional random field architecture for named entity recognition.
arXiv Detail & Related papers (2020-06-12T16:25:45Z) - Improving Broad-Coverage Medical Entity Linking with Semantic Type
Prediction and Large-Scale Datasets [12.131050765159145]
MedType is a fully modular system that prunes out irrelevant candidate concepts based on the predicted semantic type of an entity mention.
We present WikiMed and PubMedDS, two large-scale medical entity linking datasets, and demonstrate that pre-training MedType on these datasets further improves entity linking performance.
arXiv Detail & Related papers (2020-05-01T15:55:50Z) - Exemplar Auditing for Multi-Label Biomedical Text Classification [0.4873362301533824]
We generalize a recently proposed zero-shot sequence labeling method, "supervised labeling via a convolutional decomposition"
The approach yields classification with "introspection", relating the fine-grained features of an inference-time prediction to their nearest neighbors.
Our proposed approach yields both a competitively effective classification model and an interrogation mechanism to aid healthcare workers in understanding the salient features that drive the model's predictions.
arXiv Detail & Related papers (2020-04-07T02:54:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.