Automatic Extraction of Ranked SNP-Phenotype Associations from
Literature through Detecting Neural Candidates, Negation and Modality Markers
- URL: http://arxiv.org/abs/2012.00902v1
- Date: Wed, 2 Dec 2020 00:03:07 GMT
- Title: Automatic Extraction of Ranked SNP-Phenotype Associations from
Literature through Detecting Neural Candidates, Negation and Modality Markers
- Authors: Behrouz Bokharaeian, Alberto Diaz
- Abstract summary: There is no available method for extracting the association of SNP-phenotype from text.
The experiments show that negation cues and scope as well as detecting neutral candidates can be employed for implementing a superior relation extraction method.
A modality based approach is proposed to estimate the confidence level of the extracted association.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Genome-wide association (GWA) constitutes a prominent portion of studies
which have been conducted on personalized medicine and pharmacogenomics.
Recently, very few methods have been developed for extracting mutation-diseases
associations. However, there is no available method for extracting the
association of SNP-phenotype from text which considers degree of confidence in
associations. In this study, first a relation extraction method relying on
linguistic-based negation detection and neutral candidates is proposed. The
experiments show that negation cues and scope as well as detecting neutral
candidates can be employed for implementing a superior relation extraction
method which outperforms the kernel-based counterparts due to a uniform innate
polarity of sentences and small number of complex sentences in the corpus.
Moreover, a modality based approach is proposed to estimate the confidence
level of the extracted association which can be used to assess the reliability
of the reported association. Keywords: SNP, Phenotype, Biomedical Relation
Extraction, Negation Detection.
Related papers
- Entangled Relations: Leveraging NLI and Meta-analysis to Enhance Biomedical Relation Extraction [35.320291731292286]
We introduce MetaEntail-RE, a novel adaptation method that harnesses NLI principles to enhance relation extraction.
Our approach follows past works by verbalizing relation classes into class-indicative hypotheses.
Our experimental results underscore the versatility of MetaEntail-RE, demonstrating performance gains across both biomedical and general domains.
arXiv Detail & Related papers (2024-05-31T23:05:04Z) - HyPoradise: An Open Baseline for Generative Speech Recognition with
Large Language Models [81.56455625624041]
We introduce the first open-source benchmark to utilize external large language models (LLMs) for ASR error correction.
The proposed benchmark contains a novel dataset, HyPoradise (HP), encompassing more than 334,000 pairs of N-best hypotheses.
LLMs with reasonable prompt and its generative capability can even correct those tokens that are missing in N-best list.
arXiv Detail & Related papers (2023-09-27T14:44:10Z) - Insights Into the Nutritional Prevention of Macular Degeneration based
on a Comparative Topic Modeling Approach [0.0]
This work proposes a comparative topic modeling approach to analyze reports of contradictory results on the same general research question.
The proposed method was tested on broad-scope studies addressing whether supplemental nutritional compounds significantly benefit macular degeneration (MD)
arXiv Detail & Related papers (2023-09-01T07:53:28Z) - Plugin estimators for selective classification with out-of-distribution
detection [67.28226919253214]
Real-world classifiers can benefit from abstaining from predicting on samples where they have low confidence.
These settings have been the subject of extensive but disjoint study in the selective classification (SC) and out-of-distribution (OOD) detection literature.
Recent work on selective classification with OOD detection has argued for the unified study of these problems.
We propose new plugin estimators for SCOD that are theoretically grounded, effective, and generalise existing approaches.
arXiv Detail & Related papers (2023-01-29T07:45:17Z) - Can NLI Provide Proper Indirect Supervision for Low-resource Biomedical
Relation Extraction? [39.17663642263077]
Two key obstacles in biomedical relation extraction (RE) are the scarcity of annotations and the prevalence of instances without explicitly pre-defined labels.
Existing approaches, which treat biomedical RE as a multi-class classification task, often result in poor generalization in low-resource settings.
We present NBR, which converts biomedical RE as natural language inference formulation through indirect supervision.
arXiv Detail & Related papers (2022-12-21T05:49:08Z) - Self-Normalized Importance Sampling for Neural Language Modeling [97.96857871187052]
In this work, we propose self-normalized importance sampling. Compared to our previous work, the criteria considered in this work are self-normalized and there is no need to further conduct a correction step.
We show that our proposed self-normalized importance sampling is competitive in both research-oriented and production-oriented automatic speech recognition tasks.
arXiv Detail & Related papers (2021-11-11T16:57:53Z) - SAIS: Supervising and Augmenting Intermediate Steps for Document-Level
Relation Extraction [51.27558374091491]
We propose to explicitly teach the model to capture relevant contexts and entity types by supervising and augmenting intermediate steps (SAIS) for relation extraction.
Based on a broad spectrum of carefully designed tasks, our proposed SAIS method not only extracts relations of better quality due to more effective supervision, but also retrieves the corresponding supporting evidence more accurately.
arXiv Detail & Related papers (2021-09-24T17:37:35Z) - Challenges in Automated Debiasing for Toxic Language Detection [81.04406231100323]
Biased associations have been a challenge in the development of classifiers for detecting toxic language.
We investigate recently introduced debiasing methods for text classification datasets and models, as applied to toxic language detection.
Our focus is on lexical (e.g., swear words, slurs, identity mentions) and dialectal markers (specifically African American English)
arXiv Detail & Related papers (2021-01-29T22:03:17Z) - Decoding and Diversity in Machine Translation [90.33636694717954]
We characterize differences between cost diversity paid for the BLEU scores enjoyed by NMT.
Our study implicates search as a salient source of known bias when translating gender pronouns.
arXiv Detail & Related papers (2020-11-26T21:09:38Z) - Active Testing: An Unbiased Evaluation Method for Distantly Supervised
Relation Extraction [23.262284507381757]
We propose a novel evaluation method named active testing through utilizing both the noisy test set and a few manual annotations.
Experiments on a widely used benchmark show that our proposed approach can yield approximately unbiased evaluations for distantly supervised relation extractors.
arXiv Detail & Related papers (2020-10-17T12:29:09Z) - Considering Likelihood in NLP Classification Explanations with Occlusion
and Language Modeling [11.594541142399223]
Occlusion is a well established method that provides explanations on discrete language data.
We argue that current Occlusion-based methods often produce invalid or syntactically incorrect language data.
We propose OLM: a novel explanation method that combines Occlusion and language models to sample valid and syntactically correct replacements.
arXiv Detail & Related papers (2020-04-21T10:37:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.