Fugu-MT 論文翻訳(概要): Learning Retrieval Models with Sparse Autoencoders

論文の概要: Learning Retrieval Models with Sparse Autoencoders

arxiv url: http://arxiv.org/abs/2603.13277v1
Date: Fri, 27 Feb 2026 10:50:54 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-23 08:17:42.257358
Title: Learning Retrieval Models with Sparse Autoencoders
Title（参考訳）: スパースオートエンコーダを用いた検索モデル学習
Authors: Thibault Formal, Maxime Louis, Hervé Dejean, Stéphane Clinchant,
Abstract要約: SPLAREは、SAEベースのLSRモデルをトレーニングする方法である。 SPLARE は語彙ベースの LSR を多言語およびドメイン外設定で一貫して上回ることを示す。
参考スコア（独自算出の注目度）: 9.69714089726174
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Sparse autoencoders (SAEs) provide a powerful mechanism for decomposing the dense representations produced by Large Language Models (LLMs) into interpretable latent features. We posit that SAEs constitute a natural foundation for Learned Sparse Retrieval (LSR), whose objective is to encode queries and documents into high-dimensional sparse representations optimized for efficient retrieval. In contrast to existing LSR approaches that project input sequences into the vocabulary space, SAE-based representations offer the potential to produce more semantically structured, expressive, and language-agnostic features. Building on this insight, we introduce SPLARE, a method to train SAE-based LSR models. Our experiments, relying on recently released open-source SAEs, demonstrate that this technique consistently outperforms vocabulary-based LSR in multilingual and out-of-domain settings. SPLARE-7B, a multilingual retrieval model capable of producing generalizable sparse latent embeddings for a wide range of languages and domains, achieves top results on MMTEB's multilingual and English retrieval tasks. We also developed a 2B-parameter variant with a significantly lighter footprint.
Abstract（参考訳）: スパースオートエンコーダ (SAE) は、Large Language Models (LLM) が生み出す高密度表現を解釈可能な潜在機能に分解する強力なメカニズムを提供する。我々は,SAEがLSR(Learned Sparse Retrieval)の自然な基盤であり,その目的が,効率的な検索に最適化された高次元スパース表現にクエリやドキュメントをエンコードすることにあることを示唆する。入力シーケンスを語彙空間に投影する既存のLSRアプローチとは対照的に、SAEベースの表現は、より意味的に構造化され、表現され、言語に依存しない特徴を生み出す可能性がある。本稿では,この知見に基づいて,SAEに基づくLSRモデルの学習手法であるSPLAREを紹介する。我々の実験は、最近リリースされたオープンソースのSAEを頼りにしており、この手法が多言語およびドメイン外設定において語彙ベースのLSRを一貫して上回っていることを実証している。 SPLARE-7Bは、多言語および英語検索タスクにおいて、多言語およびドメインに対して一般化可能なスパース潜伏埋め込みを生成することができる多言語検索モデルである。また,フットプリントが大幅に軽量な2Bパラメータも開発した。

論文の概要: Learning Retrieval Models with Sparse Autoencoders

関連論文リスト