Fugu-MT 論文翻訳(概要): DIVE: Embedding Compression via Self-Limiting Gradient Updates

論文の概要: DIVE: Embedding Compression via Self-Limiting Gradient Updates

arxiv url: http://arxiv.org/abs/2605.20689v1
Date: Wed, 20 May 2026 04:35:28 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-21 19:19:56.475982
Title: DIVE: Embedding Compression via Self-Limiting Gradient Updates
Title（参考訳）: DIVE: 自己制限のグラディエントアップデートによる圧縮の埋め込み
Authors: Dongfang Zhao,
Abstract要約: 大規模言語モデルからの高次元埋め込みは、ベクトル探索システムにかなりのストレージと計算コストを課す。最近の組込み圧縮法であるMatryoshka-Adaptor (EMNLP 2024) は, 軽量残像アダプタによる寸法の低減を実現するが, ラベル付きデータが少ない場合, トレーニング目的が過度に過度なオーバーフィッティングを引き起こす。 textbfImplicit textbfView textbfEnsembles を用いたtextscDIVE (textbf dimensionity reduction with textbfImplicit textbfView textbfEnsembles) を提案する。
参考スコア（独自算出の注目度）: 0.9179857807576733
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: High-dimensional embeddings from large language models impose significant storage and computational costs on vector search systems. Recent embedding compression methods, including Matryoshka-Adaptor (EMNLP 2024), Search-Adaptor (ACL 2024), and SMEC (EMNLP 2025), enable dimensionality reduction through lightweight residual adapters, but their training objectives cause severe overfitting when labeled data is scarce, degrading retrieval performance below the frozen baseline. We propose \textsc{DIVE} (\textbf{D}imensionality reduction with \textbf{I}mplicit \textbf{V}iew \textbf{E}nsembles), a compression adapter that addresses this failure through two mechanisms. First, a self-limiting hinge-based triplet loss produces zero gradient once a triplet satisfies the margin constraint, bounding the total perturbation applied to the pretrained embedding space. Second, a head-wise NT-Xent contrastive loss treats multiple learned projections of each embedding as implicit views, providing dense self-supervised gradients that compensate for the sparsity of the triplet signal on small datasets. Across six BEIR datasets, \textsc{DIVE} outperforms all three baseline adapters on every dataset and at every evaluated compression ratio, with a 14M-parameter open-source implementation.
Abstract（参考訳）: 大規模言語モデルからの高次元埋め込みは、ベクトル探索システムにかなりのストレージと計算コストを課す。近年,Mtryoshka-Adaptor (EMNLP 2024), Search-Adaptor (ACL 2024), SMEC (EMNLP 2025), などの埋め込み圧縮法では, 軽量残差アダプタによる寸法の低減が可能となっているが, ラベル付きデータが不足している場合, 学習目標が大幅に過適合し, 冷凍ベースラインより下方に検索性能が低下する。本稿では,この障害に対処する圧縮アダプタであるtextbf{DIVE} (\textbf{D}imensionality reduction with \textbf{I}mplicit \textbf{V}iew \textbf{E}nsembles)を提案する。第一に、自己制限ヒンジベースの三重項損失は、三重項がマージン制約を満たすとゼロ勾配を生じ、予め訓練された埋め込み空間に適用される総摂動を束縛する。第二に、頭回りのNT-Xentコントラスト損失は、各埋め込みの複数の学習されたプロジェクションを暗黙の視点として扱い、小さなデータセット上の三重項信号の間隔を補う、密集した自己教師付き勾配を提供する。 6つのBEIRデータセット全体で、 \textsc{DIVE}は、すべてのデータセットと評価された圧縮比の3つのベースラインアダプタを、14Mパラメータのオープンソース実装で上回っている。

論文の概要: DIVE: Embedding Compression via Self-Limiting Gradient Updates

関連論文リスト