ImpactCite: An XLNet-based method for Citation Impact Analysis
- URL: http://arxiv.org/abs/2005.06611v1
- Date: Tue, 5 May 2020 08:31:54 GMT
- Title: ImpactCite: An XLNet-based method for Citation Impact Analysis
- Authors: Dominique Mercier, Syed Tahseen Raza Rizvi, Vikas Rajashekar, Andreas
Dengel, Sheraz Ahmed
- Abstract summary: Impact analysis enables us to quantify the quality of the citations.
XLNet-based solution ImpactCite achieves a new state-of-the-art performance for both citation intent and sentiment classification.
Additional efforts have been performed to come up with CSC-Clean corpus, which is a clean and reliable dataset for citation sentiment classification.
- Score: 4.526582372434088
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Citations play a vital role in understanding the impact of scientific
literature. Generally, citations are analyzed quantitatively whereas
qualitative analysis of citations can reveal deeper insights into the impact of
a scientific artifact in the community. Therefore, citation impact analysis
(which includes sentiment and intent classification) enables us to quantify the
quality of the citations which can eventually assist us in the estimation of
ranking and impact. The contribution of this paper is two-fold. First, we
benchmark the well-known language models like BERT and ALBERT along with
several popular networks for both tasks of sentiment and intent classification.
Second, we provide ImpactCite, which is XLNet-based method for citation impact
analysis. All evaluations are performed on a set of publicly available citation
analysis datasets. Evaluation results reveal that ImpactCite achieves a new
state-of-the-art performance for both citation intent and sentiment
classification by outperforming the existing approaches by 3.44% and 1.33% in
F1-score. Therefore, we emphasize ImpactCite (XLNet-based solution) for both
tasks to better understand the impact of a citation. Additional efforts have
been performed to come up with CSC-Clean corpus, which is a clean and reliable
dataset for citation sentiment classification.
Related papers
- Why do you cite? An investigation on citation intents and decision-making classification processes [1.7812428873698407]
This study emphasizes the importance of trustfully classifying citation intents.
We present a study utilizing advanced Ensemble Strategies for Citation Intent Classification (CIC)
One of our models sets as a new state-of-the-art (SOTA) with an 89.46% Macro-F1 score on the SciCite benchmark.
arXiv Detail & Related papers (2024-07-18T09:29:33Z) - Towards Fine-Grained Citation Evaluation in Generated Text: A Comparative Analysis of Faithfulness Metrics [22.041561519672456]
Large language models (LLMs) often produce unsupported or unverifiable content, known as "hallucinations"
We propose a comparative evaluation framework that assesses the metric effectiveness in distinguishing citations between three-category support levels.
Our results show no single metric consistently excels across all evaluations, revealing the complexity of assessing fine-grained support.
arXiv Detail & Related papers (2024-06-21T15:57:24Z) - ALiiCE: Evaluating Positional Fine-grained Citation Generation [54.19617927314975]
We propose ALiiCE, the first automatic evaluation framework for fine-grained citation generation.
Our framework first parses the sentence claim into atomic claims via dependency analysis and then calculates citation quality at the atomic claim level.
We evaluate the positional fine-grained citation generation performance of several Large Language Models on two long-form QA datasets.
arXiv Detail & Related papers (2024-06-19T09:16:14Z) - CausalCite: A Causal Formulation of Paper Citations [80.82622421055734]
CausalCite is a new way to measure the significance of a paper by assessing the causal impact of the paper on its follow-up papers.
It is based on a novel causal inference method, TextMatch, which adapts the traditional matching framework to high-dimensional text embeddings.
We demonstrate the effectiveness of CausalCite on various criteria, such as high correlation with paper impact as reported by scientific experts.
arXiv Detail & Related papers (2023-11-05T23:09:39Z) - Predicting Long-Term Citations from Short-Term Linguistic Influence [20.78217545537925]
A standard measure of the influence of a research paper is the number of times it is cited.
We propose a novel method to quantify linguistic influence in timestamped document collections.
arXiv Detail & Related papers (2022-10-24T22:03:26Z) - Deep Graph Learning for Anomalous Citation Detection [55.81334139806342]
We propose a novel deep graph learning model, namely GLAD (Graph Learning for Anomaly Detection), to identify anomalies in citation networks.
Within the GLAD framework, we propose an algorithm called CPU (Citation PUrpose) to discover the purpose of citation based on citation texts.
arXiv Detail & Related papers (2022-02-23T09:05:28Z) - Semantic Analysis for Automated Evaluation of the Potential Impact of
Research Articles [62.997667081978825]
This paper presents a novel method for vector representation of text meaning based on information theory.
We show how this informational semantics is used for text classification on the basis of the Leicester Scientific Corpus.
We show that an informational approach to representing the meaning of a text has offered a way to effectively predict the scientific impact of research papers.
arXiv Detail & Related papers (2021-04-26T20:37:13Z) - Enhancing Scientific Papers Summarization with Citation Graph [78.65955304229863]
We redefine the task of scientific papers summarization by utilizing their citation graph.
We construct a novel scientific papers summarization dataset Semantic Scholar Network (SSN) which contains 141K research papers in different domains.
Our model can achieve competitive performance when compared with the pretrained models.
arXiv Detail & Related papers (2021-04-07T11:13:35Z) - Weakly-Supervised Aspect-Based Sentiment Analysis via Joint
Aspect-Sentiment Topic Embedding [71.2260967797055]
We propose a weakly-supervised approach for aspect-based sentiment analysis.
We learn sentiment, aspect> joint topic embeddings in the word embedding space.
We then use neural models to generalize the word-level discriminative information.
arXiv Detail & Related papers (2020-10-13T21:33:24Z) - Fine-tuning Pre-trained Contextual Embeddings for Citation Content
Analysis in Scholarly Publication [0.3997680012976965]
We propose to fine-tune pre-trained contextual embeddings ULMFiT, BERT, and XLNet for the task.
For citation function identification, the XLNet model achieves 87.2%, 86.90%, and 81.6% on DFKI, UMICH, and TKDE 2019 datasets respectively.
Our method can be used to enhance the influence analysis of scholars and scholarly publications.
arXiv Detail & Related papers (2020-09-12T17:46:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.