Early Indicators of Scientific Impact: Predicting Citations with
Altmetrics
- URL: http://arxiv.org/abs/2012.13599v1
- Date: Fri, 25 Dec 2020 16:25:07 GMT
- Title: Early Indicators of Scientific Impact: Predicting Citations with
Altmetrics
- Authors: Akhil Pandey Akella, Hamed Alhoori, Pavan Ravikanth Kondamudi, Cole
Freeman, Haiming Zhou
- Abstract summary: We use altmetrics to predict the short-term and long-term citations that a scholarly publication could receive.
We build various classification and regression models and evaluate their performance, finding neural networks and ensemble models to perform best for these tasks.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Identifying important scholarly literature at an early stage is vital to the
academic research community and other stakeholders such as technology companies
and government bodies. Due to the sheer amount of research published and the
growth of ever-changing interdisciplinary areas, researchers need an efficient
way to identify important scholarly work. The number of citations a given
research publication has accrued has been used for this purpose, but these take
time to occur and longer to accumulate. In this article, we use altmetrics to
predict the short-term and long-term citations that a scholarly publication
could receive. We build various classification and regression models and
evaluate their performance, finding neural networks and ensemble models to
perform best for these tasks. We also find that Mendeley readership is the most
important factor in predicting the early citations, followed by other factors
such as the academic status of the readers (e.g., student, postdoc, professor),
followers on Twitter, online post length, author count, and the number of
mentions on Twitter, Wikipedia, and across different countries.
Related papers
- ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models [56.08917291606421]
ResearchAgent is a large language model-powered research idea writing agent.
It generates problems, methods, and experiment designs while iteratively refining them based on scientific literature.
We experimentally validate our ResearchAgent on scientific publications across multiple disciplines.
arXiv Detail & Related papers (2024-04-11T13:36:29Z) - Mapping the Increasing Use of LLMs in Scientific Papers [99.67983375899719]
We conduct the first systematic, large-scale analysis across 950,965 papers published between January 2020 and February 2024 on the arXiv, bioRxiv, and Nature portfolio journals.
Our findings reveal a steady increase in LLM usage, with the largest and fastest growth observed in Computer Science papers.
arXiv Detail & Related papers (2024-04-01T17:45:15Z) - Change Summarization of Diachronic Scholarly Paper Collections by
Semantic Evolution Analysis [10.554831859741851]
We demonstrate a novel approach to analyze the collections of research papers published over longer time periods.
Our approach is based on comparing word semantic representations over time and aims to support users in a better understanding of large domain-focused archives of scholarly publications.
arXiv Detail & Related papers (2021-12-07T11:15:19Z) - Scientometric engineering: Exploring citation dynamics via arXiv eprints [0.0]
We investigate the citation data of more than 1.5 million eprints on arXiv.
We find that the typical growth and obsolescence patterns vary across disciplines.
We derive a model consistent with the observed quantitative and temporal characteristics of citation growth and obsolescence.
arXiv Detail & Related papers (2021-06-09T12:38:44Z) - CitationIE: Leveraging the Citation Graph for Scientific Information
Extraction [89.33938657493765]
We use the citation graph of referential links between citing and cited papers.
We observe a sizable improvement in end-to-end information extraction over the state-of-the-art.
arXiv Detail & Related papers (2021-06-03T03:00:12Z) - A Measure of Research Taste [91.3755431537592]
We present a citation-based measure that rewards both productivity and taste.
The presented measure, CAP, balances the impact of publications and their quantity.
We analyze the characteristics of CAP for highly-cited researchers in biology, computer science, economics, and physics.
arXiv Detail & Related papers (2021-05-17T18:01:47Z) - Semantic Analysis for Automated Evaluation of the Potential Impact of
Research Articles [62.997667081978825]
This paper presents a novel method for vector representation of text meaning based on information theory.
We show how this informational semantics is used for text classification on the basis of the Leicester Scientific Corpus.
We show that an informational approach to representing the meaning of a text has offered a way to effectively predict the scientific impact of research papers.
arXiv Detail & Related papers (2021-04-26T20:37:13Z) - A Graph Convolutional Neural Network based Framework for Estimating
Future Citations Count of Research Articles [0.03937354192623676]
We propose a Graph Convolutional Network (GCN) based framework for estimating future research publication citations for both the short-term (1-year) and long-term (for 5-years and 10-years) duration.
We have tested our proposed approach over the AMiner dataset, specifically on research articles from the computer science domain, consisting of more than 0.8 million articles.
arXiv Detail & Related papers (2021-04-11T07:20:53Z) - Enhancing Scientific Papers Summarization with Citation Graph [78.65955304229863]
We redefine the task of scientific papers summarization by utilizing their citation graph.
We construct a novel scientific papers summarization dataset Semantic Scholar Network (SSN) which contains 141K research papers in different domains.
Our model can achieve competitive performance when compared with the pretrained models.
arXiv Detail & Related papers (2021-04-07T11:13:35Z) - Utilizing Citation Network Structure to Predict Citation Counts: A Deep
Learning Approach [0.0]
This paper proposes an end-to-end deep learning network, DeepCCP, which combines the effect of information cascade and looks at the citation counts prediction problem.
According to experiments on 6 real data sets, DeepCCP is superior to the state-of-the-art methods in terms of the accuracy of citation count prediction.
arXiv Detail & Related papers (2020-09-06T05:27:50Z) - Attention: to Better Stand on the Shoulders of Giants [34.5017808610466]
This paper develops an attention mechanism for the long-term scientific impact prediction.
It validates the method based on a real large-scale citation data set.
arXiv Detail & Related papers (2020-05-27T00:25:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.