Content Aware Analysis of Scholarly Networks: A Case Study on CORD19 Dataset
- URL: http://arxiv.org/abs/2411.00262v2
- Date: Thu, 21 Nov 2024 00:10:23 GMT
- Title: Content Aware Analysis of Scholarly Networks: A Case Study on CORD19 Dataset
- Authors: Mehmet Emre Akbulut, Yusuf Erdem Nacar,
- Abstract summary: We introduce a novel approach to use semantic information through the HITS algorithm-based propagation of topic information in the network.
We show that incorporating topic data significantly influences article rankings, revealing deeper insights into the structure of the academic community.
- Score: 0.0
- License:
- Abstract: This paper investigates the relationships among key elements of the scientific research network, namely articles, researchers, and journals. We introduce a novel approach to use semantic information through the HITS algorithm-based propagation of topic information in the network. The topic information is derived by using the Named Entity Recognition and Entity Linkage. In our case, MedCAT is used to extract the topics from the CORD19 Dataset, which is a corpus of academic articles about COVID-19 and the coronavirus scientific network. Our approach focuses on the COVID-19 domain, utilizing the CORD-19 dataset to demonstrate the efficacy of integrating topic-related information within the citation framework. Through the application of a hybrid HITS algorithm, we show that incorporating topic data significantly influences article rankings, revealing deeper insights into the structure of the academic community.
Related papers
- An Information Retrieval and Extraction Tool for Covid-19 Related Papers [0.0]
The main focus of this paper is to provide researchers with a better search tool for COVID-19 related papers.
Our tool has shown the potential to assist researchers by automating a topic-based search of CORD-19 papers.
arXiv Detail & Related papers (2024-01-20T01:34:50Z) - DiscoverPath: A Knowledge Refinement and Retrieval System for
Interdisciplinarity on Biomedical Research [96.10765714077208]
Traditional keyword-based search engines fall short in assisting users who may not be familiar with specific terminologies.
We present a knowledge graph-based paper search engine for biomedical research to enhance the user experience.
The system, dubbed DiscoverPath, employs Named Entity Recognition (NER) and part-of-speech (POS) tagging to extract terminologies and relationships from article abstracts to create a KG.
arXiv Detail & Related papers (2023-09-04T20:52:33Z) - Covidia: COVID-19 Interdisciplinary Academic Knowledge Graph [99.28342534985146]
Existing literature and knowledge platforms on COVID-19 only focus on collecting papers on biology and medicine.
We propose Covidia, COVID-19 interdisciplinary academic knowledge graph to bridge the gap between knowledge of COVID-19 on different domains.
arXiv Detail & Related papers (2023-04-14T16:45:38Z) - Assessing Scientific Contributions in Data Sharing Spaces [64.16762375635842]
This paper introduces the SCIENCE-index, a blockchain-based metric measuring a researcher's scientific contributions.
To incentivize researchers to share their data, the SCIENCE-index is augmented to include a data-sharing parameter.
Our model is evaluated by comparing the distribution of its output for geographically diverse researchers to that of the h-index.
arXiv Detail & Related papers (2023-03-18T19:17:47Z) - Research Topic Flows in Co-Authorship Networks [0.0]
We propose a graph structure for the analysis of research topic flows between scientific authors and their respective research fields.
Our method requires for the construction of a TFN solely a corpus of publications (i.e., author and abstract information)
We demonstrate the utility of TFNs by applying our method to two comprehensive corpora of altogether 20 Mio. publications spanning more than 60 years of research in the fields computer science and mathematics.
arXiv Detail & Related papers (2022-06-16T07:45:53Z) - A Weakly-Supervised Iterative Graph-Based Approach to Retrieve COVID-19
Misinformation Topics [2.1471398891979647]
We introduce a weakly-supervised iterative graph-based approach to detect keywords, topics, and themes related to misinformation.
Our approach can successfully detect specific topics from general misinformation-related seed words in a few seed texts.
arXiv Detail & Related papers (2022-05-19T09:30:39Z) - COVID-19 Multidimensional Kaggle Literature Organization [3.201839066679614]
We show that factorization is a powerful unsupervised learning method capable of discovering hidden patterns in a document corpus.
We show that a higher-order representation of the corpus allows for the simultaneous grouping of similar articles, relevant journals, authors with similar research interests, and topic keywords.
arXiv Detail & Related papers (2021-07-17T06:16:36Z) - Enhancing Scientific Papers Summarization with Citation Graph [78.65955304229863]
We redefine the task of scientific papers summarization by utilizing their citation graph.
We construct a novel scientific papers summarization dataset Semantic Scholar Network (SSN) which contains 141K research papers in different domains.
Our model can achieve competitive performance when compared with the pretrained models.
arXiv Detail & Related papers (2021-04-07T11:13:35Z) - Extracting a Knowledge Base of Mechanisms from COVID-19 Papers [50.17242035034729]
We pursue the construction of a knowledge base (KB) of mechanisms.
We develop a broad, unified schema that strikes a balance between relevance and breadth.
Experiments demonstrate the utility of our KB in supporting interdisciplinary scientific search over COVID-19 literature.
arXiv Detail & Related papers (2020-10-08T07:54:14Z) - COVID-19 Knowledge Graph: Accelerating Information Retrieval and
Discovery for Scientific Literature [23.279540233851993]
coronavirus disease (COVID-19) has claimed the lives of over 350,000 people and infected more than 6 million people worldwide.
Several search engines have surfaced to provide researchers with additional tools to find and retrieve information from the rapidly growing corpora on COVID-19.
We present the COVID-19 Knowledge Graph (CKG), a heterogeneous graph for extracting and visualizing complex relationships between COVID-19 articles.
arXiv Detail & Related papers (2020-07-24T18:29:43Z) - CAiRE-COVID: A Question Answering and Query-focused Multi-Document
Summarization System for COVID-19 Scholarly Information Management [48.251211691263514]
We present CAiRE-COVID, a real-time question answering (QA) and multi-document summarization system, which won one of the 10 tasks in the Kaggle COVID-19 Open Research dataset Challenge.
Our system aims to tackle the recent challenge of mining the numerous scientific articles being published on COVID-19 by answering high priority questions from the community.
arXiv Detail & Related papers (2020-05-04T15:07:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.