Semi-Siamese Bi-encoder Neural Ranking Model Using Lightweight
Fine-Tuning
- URL: http://arxiv.org/abs/2110.14943v1
- Date: Thu, 28 Oct 2021 08:26:46 GMT
- Title: Semi-Siamese Bi-encoder Neural Ranking Model Using Lightweight
Fine-Tuning
- Authors: Euna Jung, Jaekeol Choi, Wonjong Rhee
- Abstract summary: We show two approaches for improving the performance of BERT-based bi-encoders.
The first approach is to replace the full fine-tuning step with a lightweight fine-tuning.
The second approach is to develop semi-Siamese models where queries and documents are handled with a limited amount of difference.
- Score: 4.38301148531795
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A BERT-based Neural Ranking Model (NRM) can be either a cross-encoder or a
bi-encoder. Between the two, bi-encoder is highly efficient because all the
documents can be pre-processed before the actual query time. Although query and
document are independently encoded, the existing bi-encoder NRMs are Siamese
models where a single language model is used for consistently encoding both of
query and document. In this work, we show two approaches for improving the
performance of BERT-based bi-encoders. The first approach is to replace the
full fine-tuning step with a lightweight fine-tuning. We examine lightweight
fine-tuning methods that are adapter-based, prompt-based, and hybrid of the
two. The second approach is to develop semi-Siamese models where queries and
documents are handled with a limited amount of difference. The limited
difference is realized by learning two lightweight fine-tuning modules, where
the main language model of BERT is kept common for both query and document. We
provide extensive experiment results for monoBERT, TwinBERT, and ColBERT where
three performance metrics are evaluated over Robust04, ClueWeb09b, and MS-MARCO
datasets. The results confirm that both lightweight fine-tuning and
semi-Siamese are considerably helpful for improving BERT-based bi-encoders. In
fact, lightweight fine-tuning is helpful for cross-encoder, too.
Related papers
- Improving Code Search with Hard Negative Sampling Based on Fine-tuning [15.341959871682981]
We introduce a cross-encoder architecture for code search that jointly encodes the concatenation of query and code.
We also introduce a Retriever-Ranker (RR) framework that cascades the dual-encoder and cross-encoder to promote the efficiency of evaluation and online serving.
arXiv Detail & Related papers (2023-05-08T07:04:28Z) - Revisiting Code Search in a Two-Stage Paradigm [67.02322603435628]
TOSS is a two-stage fusion code search framework.
It first uses IR-based and bi-encoder models to efficiently recall a small number of top-k code candidates.
It then uses fine-grained cross-encoders for finer ranking.
arXiv Detail & Related papers (2022-08-24T02:34:27Z) - Learning Diverse Document Representations with Deep Query Interactions
for Dense Retrieval [79.37614949970013]
We propose a new dense retrieval model which learns diverse document representations with deep query interactions.
Our model encodes each document with a set of generated pseudo-queries to get query-informed, multi-view document representations.
arXiv Detail & Related papers (2022-08-08T16:00:55Z) - ED2LM: Encoder-Decoder to Language Model for Faster Document Re-ranking
Inference [70.36083572306839]
This paper proposes a new training and inference paradigm for re-ranking.
We finetune a pretrained encoder-decoder model using in the form of document to query generation.
We show that this encoder-decoder architecture can be decomposed into a decoder-only language model during inference.
arXiv Detail & Related papers (2022-04-25T06:26:29Z) - UniXcoder: Unified Cross-Modal Pre-training for Code Representation [65.6846553962117]
We present UniXcoder, a unified cross-modal pre-trained model for programming language.
We propose a one-to-one mapping method to transform AST in a sequence structure that retains all structural information from the tree.
We evaluate UniXcoder on five code-related tasks over nine datasets.
arXiv Detail & Related papers (2022-03-08T04:48:07Z) - Hierarchical Neural Network Approaches for Long Document Classification [3.6700088931938835]
We employ pre-trained Universal Sentence (USE) and Bidirectional Representations from Transformers (BERT) in a hierarchical setup to capture better representations efficiently.
Our proposed models are conceptually simple where we divide the input data into chunks and then pass this through base models of BERT and USE.
We show that USE + CNN/LSTM performs better than its stand-alone baseline. Whereas the BERT + CNN/LSTM performs on par with its stand-alone counterpart.
arXiv Detail & Related papers (2022-01-18T07:17:40Z) - BinaryBERT: Pushing the Limit of BERT Quantization [74.65543496761553]
We propose BinaryBERT, which pushes BERT quantization to the limit with weight binarization.
We find that a binary BERT is hard to be trained directly than a ternary counterpart due to its complex and irregular loss landscapes.
Empirical results show that BinaryBERT has negligible performance drop compared to the full-precision BERT-base.
arXiv Detail & Related papers (2020-12-31T16:34:54Z) - Incorporating BERT into Parallel Sequence Decoding with Adapters [82.65608966202396]
We propose to take two different BERT models as the encoder and decoder respectively, and fine-tune them by introducing simple and lightweight adapter modules.
We obtain a flexible and efficient model which is able to jointly leverage the information contained in the source-side and target-side BERT models.
Our framework is based on a parallel sequence decoding algorithm named Mask-Predict considering the bi-directional and conditional independent nature of BERT.
arXiv Detail & Related papers (2020-10-13T03:25:15Z) - TwinBERT: Distilling Knowledge to Twin-Structured BERT Models for
Efficient Retrieval [11.923682816611716]
We present TwinBERT model for effective and efficient retrieval.
It has twin-structured BERT-like encoders to represent query and document respectively.
It allows document embeddings to be pre-computed offline and cached in memory.
arXiv Detail & Related papers (2020-02-14T22:44:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.