COVID-19 Knowledge Graph: Accelerating Information Retrieval and
Discovery for Scientific Literature
- URL: http://arxiv.org/abs/2007.12731v1
- Date: Fri, 24 Jul 2020 18:29:43 GMT
- Title: COVID-19 Knowledge Graph: Accelerating Information Retrieval and
Discovery for Scientific Literature
- Authors: Colby Wise, Vassilis N. Ioannidis, Miguel Romero Calvo, Xiang Song,
George Price, Ninad Kulkarni, Ryan Brand, Parminder Bhatia, George Karypis
- Abstract summary: coronavirus disease (COVID-19) has claimed the lives of over 350,000 people and infected more than 6 million people worldwide.
Several search engines have surfaced to provide researchers with additional tools to find and retrieve information from the rapidly growing corpora on COVID-19.
We present the COVID-19 Knowledge Graph (CKG), a heterogeneous graph for extracting and visualizing complex relationships between COVID-19 articles.
- Score: 23.279540233851993
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The coronavirus disease (COVID-19) has claimed the lives of over 350,000
people and infected more than 6 million people worldwide. Several search
engines have surfaced to provide researchers with additional tools to find and
retrieve information from the rapidly growing corpora on COVID-19. These
engines lack extraction and visualization tools necessary to retrieve and
interpret complex relations inherent to scientific literature. Moreover,
because these engines mainly rely upon semantic information, their ability to
capture complex global relationships across documents is limited, which reduces
the quality of similarity-based article recommendations for users. In this
work, we present the COVID-19 Knowledge Graph (CKG), a heterogeneous graph for
extracting and visualizing complex relationships between COVID-19 scientific
articles. The CKG combines semantic information with document topological
information for the application of similar document retrieval. The CKG is
constructed using the latent schema of the data, and then enriched with
biomedical entity information extracted from the unstructured text of articles
using scalable AWS technologies to form relations in the graph. Finally, we
propose a document similarity engine that leverages low-dimensional graph
embeddings from the CKG with semantic embeddings for similar article retrieval.
Analysis demonstrates the quality of relationships in the CKG and shows that it
can be used to uncover meaningful information in COVID-19 scientific articles.
The CKG helps power www.cord19.aws and is publicly available.
Related papers
- Content Aware Analysis of Scholarly Networks: A Case Study on CORD19 Dataset [0.0]
We introduce a novel approach to use semantic information through the HITS algorithm-based propagation of topic information in the network.
We show that incorporating topic data significantly influences article rankings, revealing deeper insights into the structure of the academic community.
arXiv Detail & Related papers (2024-10-31T23:41:09Z) - An Information Retrieval and Extraction Tool for Covid-19 Related Papers [0.0]
The main focus of this paper is to provide researchers with a better search tool for COVID-19 related papers.
Our tool has shown the potential to assist researchers by automating a topic-based search of CORD-19 papers.
arXiv Detail & Related papers (2024-01-20T01:34:50Z) - A Review on Knowledge Graphs for Healthcare: Resources, Applications, and Promises [52.31710895034573]
This work provides the first comprehensive review of healthcare knowledge graphs (HKGs)
It summarizes the pipeline and key techniques for HKG construction, as well as the common utilization approaches.
At the application level, we delve into the successful integration of HKGs across various health domains.
arXiv Detail & Related papers (2023-06-07T21:51:56Z) - SKG: A Versatile Information Retrieval and Analysis Framework for
Academic Papers with Semantic Knowledge Graphs [9.668240269886413]
We propose a Semantic Knowledge Graph that integrates semantic concepts from abstracts and other meta-information to represent the corpus.
The SKG can support various semantic queries in academic literature thanks to the high diversity and rich information content stored within.
arXiv Detail & Related papers (2023-06-07T20:16:08Z) - COVID-19 Multidimensional Kaggle Literature Organization [3.201839066679614]
We show that factorization is a powerful unsupervised learning method capable of discovering hidden patterns in a document corpus.
We show that a higher-order representation of the corpus allows for the simultaneous grouping of similar articles, relevant journals, authors with similar research interests, and topic keywords.
arXiv Detail & Related papers (2021-07-17T06:16:36Z) - Scientific Language Models for Biomedical Knowledge Base Completion: An
Empirical Study [62.376800537374024]
We study scientific LMs for KG completion, exploring whether we can tap into their latent knowledge to enhance biomedical link prediction.
We integrate the LM-based models with KG embedding models, using a router method that learns to assign each input example to either type of model and provides a substantial boost in performance.
arXiv Detail & Related papers (2021-06-17T17:55:33Z) - Enhancing Scientific Papers Summarization with Citation Graph [78.65955304229863]
We redefine the task of scientific papers summarization by utilizing their citation graph.
We construct a novel scientific papers summarization dataset Semantic Scholar Network (SSN) which contains 141K research papers in different domains.
Our model can achieve competitive performance when compared with the pretrained models.
arXiv Detail & Related papers (2021-04-07T11:13:35Z) - Navigating the landscape of COVID-19 research through literature
analysis: A bird's eye view [11.362549790802483]
We analyze the LitCovid collection, 13,369 COVID-19 related articles found in PubMed as of May 15th, 2020.
We do that by applying state-of-the-art named entity recognition, classification, clustering and other NLP techniques.
Our clustering algorithm identifies topics represented by groups of related terms, and computes clusters corresponding to documents associated with the topic terms.
arXiv Detail & Related papers (2020-08-07T23:39:29Z) - COVID-19 Literature Knowledge Graph Construction and Drug Repurposing
Report Generation [79.33545724934714]
We have developed a novel and comprehensive knowledge discovery framework, COVID-KG, to extract fine-grained multimedia knowledge elements from scientific literature.
Our framework also provides detailed contextual sentences, subfigures, and knowledge subgraphs as evidence.
arXiv Detail & Related papers (2020-07-01T16:03:20Z) - CO-Search: COVID-19 Information Retrieval with Semantic Search, Question
Answering, and Abstractive Summarization [53.67205506042232]
CO-Search is a retriever-ranker semantic search engine designed to handle complex queries over the COVID-19 literature.
To account for the domain-specific and relatively limited dataset, we generate a bipartite graph of document paragraphs and citations.
We evaluate our system on the data of the TREC-COVID information retrieval challenge.
arXiv Detail & Related papers (2020-06-17T01:32:48Z) - ENT-DESC: Entity Description Generation by Exploring Knowledge Graph [53.03778194567752]
In practice, the input knowledge could be more than enough, since the output description may only cover the most significant knowledge.
We introduce a large-scale and challenging dataset to facilitate the study of such a practical scenario in KG-to-text.
We propose a multi-graph structure that is able to represent the original graph information more comprehensively.
arXiv Detail & Related papers (2020-04-30T14:16:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.