Neural language models for text classification in evidence-based
medicine
- URL: http://arxiv.org/abs/2012.00584v1
- Date: Tue, 1 Dec 2020 15:53:44 GMT
- Title: Neural language models for text classification in evidence-based
medicine
- Authors: Andres Carvallo, Denis Parra, Gabriel Rada, Daniel Perez, Juan Ignacio
Vasquez and Camilo Vergara
- Abstract summary: Evidence-based medicine (EBM) is being challenged as never before due to the high volume of research articles published and pre-prints posted daily.
In this article, we report the results of an applied research project to classify scientific articles to support Epistemonikos.
We test several methods, and the best one, based on the XLNet neural language model, improves the current approach by 93% on average F1-score.
- Score: 3.5770353345663044
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The COVID-19 has brought about a significant challenge to the whole of
humanity, but with a special burden upon the medical community. Clinicians must
keep updated continuously about symptoms, diagnoses, and effectiveness of
emergent treatments under a never-ending flood of scientific literature. In
this context, the role of evidence-based medicine (EBM) for curating the most
substantial evidence to support public health and clinical practice turns
essential but is being challenged as never before due to the high volume of
research articles published and pre-prints posted daily. Artificial
Intelligence can have a crucial role in this situation. In this article, we
report the results of an applied research project to classify scientific
articles to support Epistemonikos, one of the most active foundations worldwide
conducting EBM. We test several methods, and the best one, based on the XLNet
neural language model, improves the current approach by 93\% on average
F1-score, saving valuable time from physicians who volunteer to curate COVID-19
research articles manually.
Related papers
- De-identification of clinical free text using natural language
processing: A systematic review of current approaches [48.343430343213896]
Natural language processing has repeatedly demonstrated its feasibility in automating the de-identification process.
Our study aims to provide systematic evidence on how the de-identification of clinical free text has evolved in the last thirteen years.
arXiv Detail & Related papers (2023-11-28T13:20:41Z) - Retrieval-Augmented and Knowledge-Grounded Language Models for Faithful Clinical Medicine [68.7814360102644]
We propose the Re$3$Writer method with retrieval-augmented generation and knowledge-grounded reasoning.
We demonstrate the effectiveness of our method in generating patient discharge instructions.
arXiv Detail & Related papers (2022-10-23T16:34:39Z) - Understanding COVID-19 News Coverage using Medical NLP [5.161531917413708]
The dataset includes more than 36,000 articles, analyzed using the clinical and biomedical Natural Language Processing (NLP) models from the Spark NLP for Healthcare library.
The analysis covers key entities and phrases, observed biases, and change over time in news coverage.
Another analysis is of extracted Adverse Drug Events about drug and vaccine manufacturers, which when reported by major news outlets has an impact on vaccine hesitancy.
arXiv Detail & Related papers (2022-03-19T15:07:46Z) - The Prominence of Artificial Intelligence in COVID-19 [0.5437050212139087]
In December 2019, a novel virus called COVID-19 had caused an enormous number of causalities to date.
This survey paper explores the methodologies proposed that can aid doctors and researchers in early and inexpensive methods of diagnosis of the disease.
Most developing countries have difficulties carrying out tests using the conventional manner, but a significant way can be adopted with Machine and Deep Learning.
arXiv Detail & Related papers (2021-11-18T06:11:45Z) - Domain-Specific Pretraining for Vertical Search: Case Study on
Biomedical Literature [67.4680600632232]
Self-supervised learning has emerged as a promising direction to overcome the annotation bottleneck.
We propose a general approach for vertical search based on domain-specific pretraining.
Our system can scale to tens of millions of articles on PubMed and has been deployed as Microsoft Biomedical Search.
arXiv Detail & Related papers (2021-06-25T01:02:55Z) - CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark [51.38557174322772]
We present the first Chinese Biomedical Language Understanding Evaluation benchmark.
It is a collection of natural language understanding tasks including named entity recognition, information extraction, clinical diagnosis normalization, single-sentence/sentence-pair classification.
We report empirical results with the current 11 pre-trained Chinese models, and experimental results show that state-of-the-art neural models perform by far worse than the human ceiling.
arXiv Detail & Related papers (2021-06-15T12:25:30Z) - COVIDScholar: An automated COVID-19 research aggregation and analysis
platform [0.0]
As of October 2020, over 81,000 COVID-19 related scientific papers have been released, at a rate of over 250 per day.
This has created a challenge to traditional methods of engagement with the research literature.
We present an analysis of trends in COVID-19 research over the course of 2020.
arXiv Detail & Related papers (2020-12-07T18:17:11Z) - Predicting Clinical Diagnosis from Patients Electronic Health Records
Using BERT-based Neural Networks [62.9447303059342]
We show the importance of this problem in medical community.
We present a modification of Bidirectional Representations from Transformers (BERT) model for classification sequence.
We use a large-scale Russian EHR dataset consisting of about 4 million unique patient visits.
arXiv Detail & Related papers (2020-07-15T09:22:55Z) - Automatic Text Summarization of COVID-19 Medical Research Articles using
BERT and GPT-2 [8.223517872575712]
We take advantage of the recent advances in pre-trained NLP models, BERT and OpenAI GPT-2.
Our model provides abstractive and comprehensive information based on keywords extracted from the original articles.
Our work can help the the medical community, by providing succinct summaries of articles for which the abstract are not already available.
arXiv Detail & Related papers (2020-06-03T00:54:44Z) - Evidence Inference 2.0: More Data, Better Models [22.53884716373888]
The Evidence Inference dataset was recently released to facilitate research toward this end.
This paper collects additional annotations to expand the Evidence Inference dataset by 25%.
The updated corpus, documentation, and code for new baselines and evaluations are available at http://evidence-inference.ebm-nlp.com/.
arXiv Detail & Related papers (2020-05-08T17:16:35Z) - Mapping the Landscape of Artificial Intelligence Applications against
COVID-19 [59.30734371401316]
COVID-19, the disease caused by the SARS-CoV-2 virus, has been declared a pandemic by the World Health Organization.
We present an overview of recent studies using Machine Learning and, more broadly, Artificial Intelligence to tackle many aspects of the COVID-19 crisis.
arXiv Detail & Related papers (2020-03-25T12:30:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.