Utilizing Citation Network Structure to Predict Citation Counts: A Deep
Learning Approach
- URL: http://arxiv.org/abs/2009.02647v1
- Date: Sun, 6 Sep 2020 05:27:50 GMT
- Title: Utilizing Citation Network Structure to Predict Citation Counts: A Deep
Learning Approach
- Authors: Qihang Zhao
- Abstract summary: This paper proposes an end-to-end deep learning network, DeepCCP, which combines the effect of information cascade and looks at the citation counts prediction problem.
According to experiments on 6 real data sets, DeepCCP is superior to the state-of-the-art methods in terms of the accuracy of citation count prediction.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: With the advancement of science and technology, the number of academic papers
published in the world each year has increased almost exponentially. While a
large number of research papers highlight the prosperity of science and
technology, they also give rise to some problems. As we all know, academic
papers are the most intuitive embodiment of the research results of scholars,
which can reflect the level of researchers. It is also the evaluation standard
for decision-making such as promotion and allocation of funds. Therefore, how
to measure the quality of an academic paper is very important. The most common
standard for measuring academic papers is the number of citation counts of
papers, because this indicator is widely used in the evaluation of scientific
publications, and it also serves as the basis for many other indicators (such
as the h-index). Therefore, it is very important to be able to accurately
predict the citation counts of academic papers.
This paper proposes an end-to-end deep learning network, DeepCCP, which
combines the effect of information cascade and looks at the citation counts
prediction problem from the perspective of information cascade prediction.
DeepCCP directly uses the citation network formed in the early stage of the
paper as the input, and the output is the citation counts of the corresponding
paper after a period of time. DeepCCP only uses the structure and temporal
information of the citation network, and does not require other additional
information, but it can still achieve outstanding performance. According to
experiments on 6 real data sets, DeepCCP is superior to the state-of-the-art
methods in terms of the accuracy of citation count prediction.
Related papers
- CiMaTe: Citation Count Prediction Effectively Leveraging the Main Text [14.279848166377667]
Main text is an important factor for citation count prediction, but it is difficult to handle in machine learning models because the main text is typically very long.
We propose a BERT-based citation count prediction model, called CiMaTe, that leverages the main text by explicitly capturing a paper's sectional structure.
arXiv Detail & Related papers (2024-10-06T08:39:13Z) - CausalCite: A Causal Formulation of Paper Citations [80.82622421055734]
CausalCite is a new way to measure the significance of a paper by assessing the causal impact of the paper on its follow-up papers.
It is based on a novel causal inference method, TextMatch, which adapts the traditional matching framework to high-dimensional text embeddings.
We demonstrate the effectiveness of CausalCite on various criteria, such as high correlation with paper impact as reported by scientific experts.
arXiv Detail & Related papers (2023-11-05T23:09:39Z) - Forgotten Knowledge: Examining the Citational Amnesia in NLP [63.13508571014673]
We show how far back in time do we tend to go to cite papers? How has that changed over time, and what factors correlate with this citational attention/amnesia?
We show that around 62% of cited papers are from the immediate five years prior to publication, whereas only about 17% are more than ten years old.
We show that the median age and age diversity of cited papers were steadily increasing from 1990 to 2014, but since then, the trend has reversed, and current NLP papers have an all-time low temporal citation diversity.
arXiv Detail & Related papers (2023-05-29T18:30:34Z) - Tag-Aware Document Representation for Research Paper Recommendation [68.8204255655161]
We propose a hybrid approach that leverages deep semantic representation of research papers based on social tags assigned by users.
The proposed model is effective in recommending research papers even when the rating data is very sparse.
arXiv Detail & Related papers (2022-09-08T09:13:07Z) - Deep Graph Learning for Anomalous Citation Detection [55.81334139806342]
We propose a novel deep graph learning model, namely GLAD (Graph Learning for Anomaly Detection), to identify anomalies in citation networks.
Within the GLAD framework, we propose an algorithm called CPU (Citation PUrpose) to discover the purpose of citation based on citation texts.
arXiv Detail & Related papers (2022-02-23T09:05:28Z) - CitationIE: Leveraging the Citation Graph for Scientific Information
Extraction [89.33938657493765]
We use the citation graph of referential links between citing and cited papers.
We observe a sizable improvement in end-to-end information extraction over the state-of-the-art.
arXiv Detail & Related papers (2021-06-03T03:00:12Z) - Enhancing Scientific Papers Summarization with Citation Graph [78.65955304229863]
We redefine the task of scientific papers summarization by utilizing their citation graph.
We construct a novel scientific papers summarization dataset Semantic Scholar Network (SSN) which contains 141K research papers in different domains.
Our model can achieve competitive performance when compared with the pretrained models.
arXiv Detail & Related papers (2021-04-07T11:13:35Z) - Chronological Citation Recommendation with Time Preference [20.179186544131337]
This paper predicts the time preference based on user queries, which is a probability distribution of citing papers published in different time slices.
We use this time preference to re-rank the initial citation list obtained by content-based filtering.
arXiv Detail & Related papers (2021-01-19T13:18:05Z) - Early Indicators of Scientific Impact: Predicting Citations with
Altmetrics [0.0]
We use altmetrics to predict the short-term and long-term citations that a scholarly publication could receive.
We build various classification and regression models and evaluate their performance, finding neural networks and ensemble models to perform best for these tasks.
arXiv Detail & Related papers (2020-12-25T16:25:07Z) - Examining Citations of Natural Language Processing Literature [31.87319293259599]
We show that only about 56% of the papers in AA are cited ten or more times.
CL Journal has the most cited papers, but its citation dominance has lessened in recent years.
papers on sentiment classification, anaphora resolution, and entity recognition have the highest median citations.
arXiv Detail & Related papers (2020-05-02T20:01:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.