Attention: to Better Stand on the Shoulders of Giants
- URL: http://arxiv.org/abs/2005.14256v1
- Date: Wed, 27 May 2020 00:25:51 GMT
- Title: Attention: to Better Stand on the Shoulders of Giants
- Authors: Sha Yuan, Zhou Shao, Yu Zhang, Xingxing Wei, Tong Xiao, Yifan Wang,
Jie Tang
- Abstract summary: This paper develops an attention mechanism for the long-term scientific impact prediction.
It validates the method based on a real large-scale citation data set.
- Score: 34.5017808610466
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Science of science (SciSci) is an emerging discipline wherein science is used
to study the structure and evolution of science itself using large data sets.
The increasing availability of digital data on scholarly outcomes offers
unprecedented opportunities to explore SciSci. In the progress of science, the
previously discovered knowledge principally inspires new scientific ideas, and
citation is a reasonably good reflection of this cumulative nature of
scientific research. The researches that choose potentially influential
references will have a lead over the emerging publications. Although the peer
review process is the mainly reliable way of predicting a paper's future
impact, the ability to foresee the lasting impact based on citation records is
increasingly essential in the scientific impact analysis in the era of big
data. This paper develops an attention mechanism for the long-term scientific
impact prediction and validates the method based on a real large-scale citation
data set. The results break conventional thinking. Instead of accurately
simulating the original power-law distribution, emphasizing the limited
attention can better stand on the shoulders of giants.
Related papers
- Smoke and Mirrors in Causal Downstream Tasks [59.90654397037007]
This paper looks at the causal inference task of treatment effect estimation, where the outcome of interest is recorded in high-dimensional observations.
We compare 6 480 models fine-tuned from state-of-the-art visual backbones, and find that the sampling and modeling choices significantly affect the accuracy of the causal estimate.
Our results suggest that future benchmarks should carefully consider real downstream scientific questions, especially causal ones.
arXiv Detail & Related papers (2024-05-27T13:26:34Z) - Forecasting high-impact research topics via machine learning on evolving
knowledge graphs [0.8158530638728501]
We show how to predict the impact of onsets of ideas that have never been published by researchers.
We developed a large evolving knowledge graph built from more than 21 million scientific papers.
arXiv Detail & Related papers (2024-02-13T18:09:38Z) - Modeling Information Change in Science Communication with Semantically
Matched Paraphrases [50.67030449927206]
SPICED is the first paraphrase dataset of scientific findings annotated for degree of information change.
SPICED contains 6,000 scientific finding pairs extracted from news stories, social media discussions, and full texts of original papers.
Models trained on SPICED improve downstream performance on evidence retrieval for fact checking of real-world scientific claims.
arXiv Detail & Related papers (2022-10-24T07:44:38Z) - Citation Trajectory Prediction via Publication Influence Representation
Using Temporal Knowledge Graph [52.07771598974385]
Existing approaches mainly rely on mining temporal and graph data from academic articles.
Our framework is composed of three modules: difference-preserved graph embedding, fine-grained influence representation, and learning-based trajectory calculation.
Experiments are conducted on both the APS academic dataset and our contributed AIPatent dataset.
arXiv Detail & Related papers (2022-10-02T07:43:26Z) - Robust Causality and False Attribution in Data-Driven Earth Science
Discoveries [0.3503794925747607]
Causal and attribution studies are essential for earth scientific discoveries and informing climate, ecology, and water policies.
Here we show that transfer entropy-based causal graphs can be spurious even when augmented with statistical significance.
We develop a subsample-based ensemble approach for robust causality analysis.
arXiv Detail & Related papers (2022-09-26T10:45:48Z) - Change Summarization of Diachronic Scholarly Paper Collections by
Semantic Evolution Analysis [10.554831859741851]
We demonstrate a novel approach to analyze the collections of research papers published over longer time periods.
Our approach is based on comparing word semantic representations over time and aims to support users in a better understanding of large domain-focused archives of scholarly publications.
arXiv Detail & Related papers (2021-12-07T11:15:19Z) - CitationIE: Leveraging the Citation Graph for Scientific Information
Extraction [89.33938657493765]
We use the citation graph of referential links between citing and cited papers.
We observe a sizable improvement in end-to-end information extraction over the state-of-the-art.
arXiv Detail & Related papers (2021-06-03T03:00:12Z) - Semantic Analysis for Automated Evaluation of the Potential Impact of
Research Articles [62.997667081978825]
This paper presents a novel method for vector representation of text meaning based on information theory.
We show how this informational semantics is used for text classification on the basis of the Leicester Scientific Corpus.
We show that an informational approach to representing the meaning of a text has offered a way to effectively predict the scientific impact of research papers.
arXiv Detail & Related papers (2021-04-26T20:37:13Z) - Early Indicators of Scientific Impact: Predicting Citations with
Altmetrics [0.0]
We use altmetrics to predict the short-term and long-term citations that a scholarly publication could receive.
We build various classification and regression models and evaluate their performance, finding neural networks and ensemble models to perform best for these tasks.
arXiv Detail & Related papers (2020-12-25T16:25:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.