Fugu-MT 論文翻訳(概要): EvoEmbedding: Evolvable Representations for Long-Context Retrieval and Agentic Memory

論文の概要: EvoEmbedding: Evolvable Representations for Long-Context Retrieval and Agentic Memory

arxiv url: http://arxiv.org/abs/2606.21649v1
Date: Fri, 19 Jun 2026 17:59:58 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-26 04:18:34.150266
Title: EvoEmbedding: Evolvable Representations for Long-Context Retrieval and Agentic Memory
Title（参考訳）: EvoEmbedding: 長期検索とエージェント記憶のための進化可能な表現
Authors: Chang Nie, Chaoyou Fu, Junlan Feng, Caifeng Shan,
Abstract要約: EvoEmbeddingは、検索のための進化可能な表現を生成する新しい埋め込みモデルである。情報は動的でシーケンシャルで、継続的な状態追跡を必要とする、長期コンテキストシナリオ用に調整されている。 EvoEmbeddingは、パフォーマンスを高めるためにエージェントにシームレスに統合される。
参考スコア（独自算出の注目度）: 46.411965241105804
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Existing embedding models are inherently static: they encode text segments in isolation, ignoring their surrounding context and temporal order. This paper introduces EvoEmbedding, a novel embedding model that generates evolvable representations for retrieval. It is tailored for long-context scenarios, where information is dynamic, sequential, and requires continuous state tracking. Our design is simple: EvoEmbedding maintains a continuously updated latent memory as it sequentially processes inputs, and uses it alongside the raw content to jointly generate evolvable embeddings. Consequently, for the same query, our model adapts its representation to retrieve distinct targets based on the evolving context, going beyond static semantic search. To equip the model with this capability, we construct EvoTrain-180K, a diverse dataset for the joint optimization of latent memory and retrieval. Furthermore, we introduce a memory queue to prevent representation collapse during recurrent encoding, alongside segment-batching techniques that tackle significant length variance and accelerate training by 3.8$\times$. Extensive experiments show that our model not only outperforms larger-scale specialists (e.g., Qwen3-Embedding-8B and KaLM-Embedding-Gemma3-12B) across a range of long-context retrieval benchmarks, but also generalizes well to downstream tasks (e.g., personalization) with contexts 10$\times$ longer than its training window. Notably, EvoEmbedding seamlessly integrates into agentic workflows to boost performance. For instance, a naive RAG pipeline equipped with our model surpasses dedicated agentic memory systems. Project Page: https://clare-nie.github.io/EvoEmbedding.
Abstract（参考訳）: 既存の埋め込みモデルは本質的に静的で、テキストセグメントを分離してエンコードし、周囲のコンテキストや時間順を無視します。本稿では,進化可能な表現を生成する新しい埋め込みモデルであるEvoEmbeddingを紹介する。情報は動的でシーケンシャルで、継続的な状態追跡を必要とする、長期コンテキストシナリオ用に調整されている。 EvoEmbeddingは、連続的に更新された潜時メモリを維持し、入力を逐次処理し、それを生のコンテンツと一緒に使用して、進化可能な埋め込みを生成する。したがって、同じクエリに対して、我々のモデルは、静的なセマンティックサーチを超えて、進化するコンテキストに基づいて異なるターゲットを検索するために、その表現に適応する。この能力をモデルとして,潜在記憶と検索の協調最適化のための多様なデータセットであるEvoTrain-180Kを構築した。さらに,逐次符号化における表現の崩壊を防止するためのメモリキューを導入し,セグメントバッチ技術により,大幅な長さの分散に対処し,3.8$\times$のトレーニングを高速化する。広範囲な実験により,我々のモデルは,大規模専門家(例えば Qwen3-Embedding-8B や KaLM-Embedding-Gemma3-12B など)よりも長いコンテキストの検索ベンチマークに優れるだけでなく,トレーニングウィンドウよりも10$\times$長いコンテキストで,下流タスク(例えばパーソナライズ)に最適化できることがわかった。特に、EvoEmbeddingは、パフォーマンスを高めるためにエージェントワークフローにシームレスに統合される。例えば、当社のモデルを備えたナイーブなRAGパイプラインは、専用のエージェントメモリシステムを超えています。 Project Page: https://clare-nie.github.io/EvoEmbedding.com

論文の概要: EvoEmbedding: Evolvable Representations for Long-Context Retrieval and Agentic Memory

関連論文リスト