ILCiteR: Evidence-grounded Interpretable Local Citation Recommendation
- URL: http://arxiv.org/abs/2403.08737v1
- Date: Wed, 13 Mar 2024 17:38:05 GMT
- Title: ILCiteR: Evidence-grounded Interpretable Local Citation Recommendation
- Authors: Sayar Ghosh Roy, Jiawei Han
- Abstract summary: We introduce the evidence-grounded local citation recommendation task, where the target latent space comprises evidence spans for recommending specific papers.
Unlike past formulations that simply output recommendations, ILCiteR retrieves ranked lists of evidence span and recommended paper pairs.
We contribute a novel dataset for the evidence-grounded local citation recommendation task and demonstrate the efficacy of our proposed conditional neural rank-ensembling approach for re-ranking evidence spans.
- Score: 31.259805200946175
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Existing Machine Learning approaches for local citation recommendation
directly map or translate a query, which is typically a claim or an entity
mention, to citation-worthy research papers. Within such a formulation, it is
challenging to pinpoint why one should cite a specific research paper for a
particular query, leading to limited recommendation interpretability. To
alleviate this, we introduce the evidence-grounded local citation
recommendation task, where the target latent space comprises evidence spans for
recommending specific papers. Using a distantly-supervised evidence retrieval
and multi-step re-ranking framework, our proposed system, ILCiteR, recommends
papers to cite for a query grounded on similar evidence spans extracted from
the existing research literature. Unlike past formulations that simply output
recommendations, ILCiteR retrieves ranked lists of evidence span and
recommended paper pairs. Secondly, previously proposed neural models for
citation recommendation require expensive training on massive labeled data,
ideally after every significant update to the pool of candidate papers. In
contrast, ILCiteR relies solely on distant supervision from a dynamic evidence
database and pre-trained Transformer-based Language Models without any model
training. We contribute a novel dataset for the evidence-grounded local
citation recommendation task and demonstrate the efficacy of our proposed
conditional neural rank-ensembling approach for re-ranking evidence spans.
Related papers
- Contrastive Learning to Improve Retrieval for Real-world Fact Checking [84.57583869042791]
We present Contrastive Fact-Checking Reranker (CFR), an improved retriever for fact-checking complex claims.
We leverage the AVeriTeC dataset, which annotates subquestions for claims with human written answers from evidence documents.
We find a 6% improvement in veracity classification accuracy on the dataset.
arXiv Detail & Related papers (2024-10-07T00:09:50Z) - Ground Every Sentence: Improving Retrieval-Augmented LLMs with Interleaved Reference-Claim Generation [51.8188846284153]
RAG has been widely adopted to enhance Large Language Models (LLMs)
Attributed Text Generation (ATG) has attracted growing attention, which provides citations to support the model's responses in RAG.
This paper proposes a fine-grained ATG method called ReClaim(Refer & Claim), which alternates the generation of references and answers step by step.
arXiv Detail & Related papers (2024-07-01T20:47:47Z) - ALiiCE: Evaluating Positional Fine-grained Citation Generation [54.19617927314975]
We propose ALiiCE, the first automatic evaluation framework for fine-grained citation generation.
Our framework first parses the sentence claim into atomic claims via dependency analysis and then calculates citation quality at the atomic claim level.
We evaluate the positional fine-grained citation generation performance of several Large Language Models on two long-form QA datasets.
arXiv Detail & Related papers (2024-06-19T09:16:14Z) - Tag-Aware Document Representation for Research Paper Recommendation [68.8204255655161]
We propose a hybrid approach that leverages deep semantic representation of research papers based on social tags assigned by users.
The proposed model is effective in recommending research papers even when the rating data is very sparse.
arXiv Detail & Related papers (2022-09-08T09:13:07Z) - Towards generating citation sentences for multiple references with
intent control [86.53829532976303]
We build a novel generation model with the Fusion-in-Decoder approach to cope with multiple long inputs.
Experiments demonstrate that the proposed approaches provide much more comprehensive features for generating citation sentences.
arXiv Detail & Related papers (2021-12-02T15:32:24Z) - Local Citation Recommendation with Hierarchical-Attention Text Encoder
and SciBERT-based Reranking [6.456347800676685]
BM25 has been found to be a tough-to-beat approach to prefetching.
In this paper, we explore prefetching with nearest neighbor search among text embeddings constructed by a hierarchical attention network.
When coupled with a SciBERT reranker fine-tuned on local citation recommendation tasks, our hierarchical Attention encoder (HAtten) achieves high prefetch recall for a given number of candidates to be reranked.
arXiv Detail & Related papers (2021-12-02T13:20:26Z) - Recommending Multiple Positive Citations for Manuscript via
Content-Dependent Modeling and Multi-Positive Triplet [6.7854900381386845]
We propose a novel scientific paper modeling for citation recommendations, namely Multi-Positive BERT Model for Citation Recommendation (MP-BERT4CR)
The proposed approach has the following advantages: First, the proposed multi-positive objectives are effective to recommend multiple positive candidates.
MP-BERT4CR are also effective in retrieving the full list of co-citations, and historically low-frequent co-citation pairs compared with the prior works.
arXiv Detail & Related papers (2021-11-25T04:09:31Z) - Learning Neural Textual Representations for Citation Recommendation [7.227232362460348]
We propose a novel approach to citation recommendation using a deep sequential representation of the documents (Sentence-BERT) cascaded with Siamese and triplet networks in a submodular scoring function.
To the best of our knowledge, this is the first approach to combine deep representations and submodular selection for a task of citation recommendation.
arXiv Detail & Related papers (2020-07-08T12:38:50Z) - Context-Based Quotation Recommendation [60.93257124507105]
We propose a novel context-aware quote recommendation system.
It generates a ranked list of quotable paragraphs and spans of tokens from a given source document.
We conduct experiments on a collection of speech transcripts and associated news articles.
arXiv Detail & Related papers (2020-05-17T17:49:53Z) - Citation Recommendation: Approaches and Datasets [20.47628019708079]
Citation recommendation describes the task of recommending citations for a given text.
In recent years, several approaches and evaluation data sets have been presented.
No literature survey has been conducted explicitly on citation recommendation.
arXiv Detail & Related papers (2020-02-17T13:59:50Z) - HybridCite: A Hybrid Model for Context-Aware Citation Recommendation [0.0]
We develop citation recommendation approaches based on embeddings, topic modeling, and information retrieval techniques.
We combine, for the first time to the best of our knowledge, the best-performing algorithms into a semi-genetic hybrid recommender system.
Our evaluation results show that a hybrid model containing embedding and information retrieval-based components outperforms its individual components and further algorithms by a large margin.
arXiv Detail & Related papers (2020-02-15T16:19:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.