Accelerating COVID-19 research with graph mining and transformer-based
learning
- URL: http://arxiv.org/abs/2102.07631v1
- Date: Wed, 10 Feb 2021 15:11:36 GMT
- Title: Accelerating COVID-19 research with graph mining and transformer-based
learning
- Authors: Ilya Tyagin and Ankit Kulshrestha and Justin Sybrandt and Krish Matta
and Michael Shtutman and Ilya Safro
- Abstract summary: We present an automated general purpose hypothesis generation systems AGATHA-C and AGATHA-GP for COVID-19 research.
Both systems achieve high-quality predictions across domains (in some domains up to 0.97% ROC AUC) in fast computational time.
We show that the systems are able to discover on-going research findings such as the relationship between COVID-19 and oxytocin hormone.
- Score: 2.493740042317776
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In 2020, the White House released the, "Call to Action to the Tech Community
on New Machine Readable COVID-19 Dataset," wherein artificial intelligence
experts are asked to collect data and develop text mining techniques that can
help the science community answer high-priority scientific questions related to
COVID-19. The Allen Institute for AI and collaborators announced the
availability of a rapidly growing open dataset of publications, the COVID-19
Open Research Dataset (CORD-19). As the pace of research accelerates,
biomedical scientists struggle to stay current. To expedite their
investigations, scientists leverage hypothesis generation systems, which can
automatically inspect published papers to discover novel implicit connections.
We present an automated general purpose hypothesis generation systems AGATHA-C
and AGATHA-GP for COVID-19 research. The systems are based on graph-mining and
the transformer model. The systems are massively validated using retrospective
information rediscovery and proactive analysis involving human-in-the-loop
expert analysis. Both systems achieve high-quality predictions across domains
(in some domains up to 0.97% ROC AUC) in fast computational time and are
released to the broad scientific community to accelerate biomedical research.
In addition, by performing the domain expert curated study, we show that the
systems are able to discover on-going research findings such as the
relationship between COVID-19 and oxytocin hormone.
Related papers
- Research Trends and Applications of Data Augmentation Algorithms [77.34726150561087]
We identify the main areas of application of data augmentation algorithms, the types of algorithms used, significant research trends, their progression over time and research gaps in data augmentation literature.
We expect readers to understand the potential of data augmentation, as well as identify future research directions and open questions within data augmentation research.
arXiv Detail & Related papers (2022-07-18T11:38:32Z) - Machine learning applications for COVID-19: A state-of-the-art review [0.0]
The COVID-19 pandemic has galvanized the machine learning community to create new solutions.
This article presents the latest advances in machine learning research applied to COVID-19.
arXiv Detail & Related papers (2021-01-19T19:12:45Z) - COVIDScholar: An automated COVID-19 research aggregation and analysis
platform [0.0]
As of October 2020, over 81,000 COVID-19 related scientific papers have been released, at a rate of over 250 per day.
This has created a challenge to traditional methods of engagement with the research literature.
We present an analysis of trends in COVID-19 research over the course of 2020.
arXiv Detail & Related papers (2020-12-07T18:17:11Z) - Covidex: Neural Ranking Models and Keyword Search Infrastructure for the
COVID-19 Open Research Dataset [87.47567807116204]
Covidex is a search engine that exploits the latest neural ranking models.
It provides access to the COVID-19 Open Research dataset curated by the Allen Institute for AI.
arXiv Detail & Related papers (2020-07-14T16:26:01Z) - A Survey on Applications of Artificial Intelligence in Fighting Against
COVID-19 [75.84689958489724]
The COVID-19 pandemic caused by the SARS-CoV-2 virus has spread rapidly worldwide, leading to a global outbreak.
As a powerful tool against COVID-19, artificial intelligence (AI) technologies are widely used in combating this pandemic.
This survey presents medical and AI researchers with a comprehensive view of the existing and potential applications of AI technology in combating COVID-19.
arXiv Detail & Related papers (2020-07-04T22:48:15Z) - Coronavirus Knowledge Graph: A Case Study [4.646516629534201]
We use several Machine Learning, Deep Learning, and Knowledge Graph construction and mining techniques to identify COVID-19 related experts and bio-entities.
We suggest possible techniques to predict related diseases, drug candidates, gene, gene mutations, and related compounds.
arXiv Detail & Related papers (2020-07-04T03:55:31Z) - CO-Search: COVID-19 Information Retrieval with Semantic Search, Question
Answering, and Abstractive Summarization [53.67205506042232]
CO-Search is a retriever-ranker semantic search engine designed to handle complex queries over the COVID-19 literature.
To account for the domain-specific and relatively limited dataset, we generate a bipartite graph of document paragraphs and citations.
We evaluate our system on the data of the TREC-COVID information retrieval challenge.
arXiv Detail & Related papers (2020-06-17T01:32:48Z) - Visualising COVID-19 Research [4.664989082015335]
We develop a novel automated theme-based visualisation method.
It combines advanced data modelling of large corpora, information mapping and trend analysis.
It provides a top-down and bottom-up browsing and search interface for quick discovery of topics and research resources.
arXiv Detail & Related papers (2020-05-13T15:45:14Z) - Rapidly Deploying a Neural Search Engine for the COVID-19 Open Research
Dataset: Preliminary Thoughts and Lessons Learned [88.42878484408469]
We present the Neural Covidex, a search engine that exploits the latest neural ranking architectures.
This paper describes our initial efforts and offers a few thoughts about lessons we have learned along the way.
arXiv Detail & Related papers (2020-04-10T17:12:29Z) - Mapping the Landscape of Artificial Intelligence Applications against
COVID-19 [59.30734371401316]
COVID-19, the disease caused by the SARS-CoV-2 virus, has been declared a pandemic by the World Health Organization.
We present an overview of recent studies using Machine Learning and, more broadly, Artificial Intelligence to tackle many aspects of the COVID-19 crisis.
arXiv Detail & Related papers (2020-03-25T12:30:33Z) - AGATHA: Automatic Graph-mining And Transformer based Hypothesis
generation Approach [1.7954335118363964]
We present a hypothesis generation system that can introduce data-driven insights earlier in the discovery process.
AGATHA prioritizes plausible term-pairs among entity sets, allowing us to recommend new research directions.
This system achieves best-in-class performance on an established benchmark.
arXiv Detail & Related papers (2020-02-13T17:06:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.