Fugu-MT 論文翻訳(概要): Hypencoder Revisited: Reproducibility and Analysis of Non-Linear Scoring for First-Stage Retrieval

論文の概要: Hypencoder Revisited: Reproducibility and Analysis of Non-Linear Scoring for First-Stage Retrieval

arxiv url: http://arxiv.org/abs/2604.27037v1
Date: Wed, 29 Apr 2026 17:05:53 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-01 16:31:53.731981
Title: Hypencoder Revisited: Reproducibility and Analysis of Non-Linear Scoring for First-Stage Retrieval
Title（参考訳）: ハイペンコーダの再検討:1段階検索のための非線形スコーリングの再現性と解析
Authors: Arne Eichholtz, Yongkang Li, Jutte Vijverberg, Tobias Groot, Mohammad Aliannejadi,
Abstract要約: Hypencoderは、標準的なバイエンコーダで使用される固定内積スコアリング機能を、クエリ固有のニューラルネットワークに置き換える検索フレームワークである。我々は、ハイペンコーダの研究を行い、元の解析を3方向に拡張する。
参考スコア（独自算出の注目度）: 12.49873774352119
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The Hypencoder, proposed by Killingback et al., is a retrieval framework that replaces the fixed inner-product scoring function used in standard bi-encoders with a query-specific neural network (the $q$-net), whose weights are generated by a hypernetwork from the contextualized query embeddings. This design enables more expressive relevance estimation while preserving independent query and document encoding. In this work, we conduct a reproducibility study of the Hypencoder and extend the original analysis in three directions. Our reproduction confirms that the Hypencoder outperforms a similarly trained bi-encoder baseline on in-domain and out-of-domain benchmarks, and that the proposed efficient search algorithm substantially reduces query latency with minimal performance loss. On hard retrieval tasks, we find partial support: the Hypencoder outperforms the baseline on DL-Hard and FollowIR, but not on TREC TOT, where checkpoint incompatibility and fine-tuning sensitivity complicate full verification. Beyond reproduction, we investigate three extensions: (i)~integrating alternative pre-trained encoders into the Hypencoder framework, where we find that performance gains depend on the encoder and fine-tuning strategy; (ii)~comparing query latency against a Faiss-based bi-encoder pipeline, revealing that standard bi-encoder retrieval remains faster under both exhaustive and efficient search settings; and (iii)~evaluating adversarial robustness, where we find that the $q$-net's non-linear scoring does not provide a consistent robustness disadvantage over inner-product scoring. Our code is publicly available at https://github.com/arneeichholtz/Hypencoder-reprod.
Abstract（参考訳）: Killingbackらによって提案されたHypencoderは、標準的なバイエンコーダで使用される固定内積スコアリング関数をクエリ固有のニューラルネットワーク($q$-net)に置き換える検索フレームワークである。この設計は、独立したクエリと文書エンコーディングを保持しながら、より表現力のある関連性推定を可能にする。本研究では,ハイペンコーダの再現性について検討し,元の解析を3方向に拡張する。我々の再現では、Hypencoderは、ドメイン内およびドメイン外ベンチマークで同様に訓練されたバイエンコーダベースラインより優れており、提案アルゴリズムは、性能損失を最小限に抑えて、クエリレイテンシを大幅に低減することを確認した。ハード検索タスクでは、HypencoderはDL-HardとFollowIRのベースラインより優れているが、TREC TOTでは性能が良くない。再生以外の3つの拡張について調べる。 i) 代替のトレーニング済みエンコーダをHypencoderフレームワークに統合すると、パフォーマンス向上はエンコーダと微調整戦略に依存していることがわかった。 (ii)– Faiss ベースのバイエンコーダパイプラインに対してクエリレイテンシを比較した結果,標準的なバイエンコーダ検索は,排他的かつ効率的な検索設定下においても高速であることが明らかとなった。例えば、$q$-netの非線形スコアは、内積スコアよりも一貫した頑健さを損なわない。私たちのコードはhttps://github.com/arneeichholtz/Hypencoder-reprod.comで公開されています。

論文の概要: Hypencoder Revisited: Reproducibility and Analysis of Non-Linear Scoring for First-Stage Retrieval

関連論文リスト