Fugu-MT 論文翻訳(概要): MonaVec: A Training-Free Embedded Vector Search Kernel for Edge and Offline AI Systems

論文の概要: MonaVec: A Training-Free Embedded Vector Search Kernel for Edge and Offline AI Systems

arxiv url: http://arxiv.org/abs/2606.19458v1
Date: Wed, 17 Jun 2026 18:00:54 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-19 18:23:39.465925
Title: MonaVec: A Training-Free Embedded Vector Search Kernel for Edge and Offline AI Systems
Title（参考訳）: MonaVec:エッジとオフラインAIシステムのためのトレーニング不要な埋め込みベクトル検索カーネル
Authors: Oğuzhan Yenen,
Abstract要約: MonaVecはエッジとオフラインAIのための決定論的、組み込みベクター検索カーネルである。デバイス上のRAG,オフラインの組込み検索 -- リレーショナルデータのニッチ – をターゲットとしています。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We present MonaVec, a deterministic, embedded vector-search kernel for edge and offline AI -- settings where server infrastructure, network connectivity, and training data are all unavailable. Existing vector-search systems assume a persistent server, gigabytes of RAM, or a training pass over the corpus; MonaVec instead targets the deployment profile of SQLite: one file, one function call, runs anywhere. Its quantization core is training-free by default and data-oblivious: a Randomized Hadamard Transform (RHDH) conditions any input distribution toward N(0,1), so precomputed Lloyd-Max tables quantize to 4 bits (8x smaller) with no learned codebook and no data pass. The index persists as a single .mvec file whose embedded ChaCha20 rotation seed makes results reproducible across architectures and byte-identical within a build -- a determinism guarantee that parallel-build graph libraries cannot offer. On semantic embeddings (AG News, 45K x 1024-dim BGE-M3, cosine), MonaVec 4-bit BruteForce reaches 0.960 Recall@10 in 27 MB -- leading float32 FAISS-IVF and 8-bit usearch on recall -- while trading peak throughput for byte-identical determinism. A single-pass global standardization (fit()) extends the same data-oblivious pipeline to magnitude-sensitive L2 data, and optional IvfFlat and HNSW backends carry it to million-vector corpora. MonaVec is implemented in pure Rust with Python bindings and runtime SIMD dispatch (AVX-512/AVX2/NEON/scalar). It targets on-device RAG, offline agents, and embedded retrieval -- the niche SQLite occupies for relational data: one file, one call, runs anywhere.
Abstract（参考訳）: 私たちは、エッジとオフラインAIのための決定論的で組み込みベクタ検索カーネルであるMonaVecを紹介します。既存のベクター検索システムは、永続サーバ、RAM、またはコーパス上のトレーニングパスを前提としています。 RHDH (Randomized Hadamard Transform) は N(0,1) に対して任意の入力分布を条件付けているので、事前計算された Lloyd-Max テーブルは、学習コードブックが無く、データパスも無い4ビット (8 倍小さい) まで量子化する。インデックスは 1 つの . として持続する。組み込みのChaCha20ローテーションシードを持つmvecファイルは、ビルド内のアーキテクチャやバイト単位の成果を再現する -- 並列ビルドグラフライブラリが提供できないことを決定論的に保証する。セマンティック埋め込み(AG News, 45K x 1024-dim BGE-M3, cosine)では、MonaVec 4-bit BruteForceが0.960 Recall@10 in 27 MBに達した。シングルパスグローバル標準化(fit())は、同じデータ公開パイプラインをグレードに敏感なL2データに拡張し、オプションのIvfFlatとHNSWバックエンドはそれを100万ベクタコーパスに転送する。 MonaVecは、PythonバインディングとランタイムSIMDディスパッチ(AVX-512/AVX2/NEON/scalar)を備えた、純粋なRustで実装されている。デバイス上のRAG、オフラインエージェント、組み込み検索をターゲットとしています -- ニッチなSQLiteはリレーショナルデータを占有しています。

論文の概要: MonaVec: A Training-Free Embedded Vector Search Kernel for Edge and Offline AI Systems

関連論文リスト