Fugu-MT 論文翻訳(概要): $π$-RAG: Oblivious Retrieval via Semantic Quantization and Transcendental Addressing for Large Language Models

論文の概要: $π$-RAG: Oblivious Retrieval via Semantic Quantization and Transcendental Addressing for Large Language Models

arxiv url: http://arxiv.org/abs/2606.22153v1
Date: Sat, 20 Jun 2026 17:22:36 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-25 22:30:47.270742
Title: $π$-RAG: Oblivious Retrieval via Semantic Quantization and Transcendental Addressing for Large Language Models
Title（参考訳）: $π$-RAG:大規模言語モデルに対する意味的量子化と超越的アドレス化による未知の検索
Authors: Aniket Wattamwar, Mrunal Kakirwar,
Abstract要約: $-RAGは、意味理解を犠牲にすることなく、機密データストレージから大規模言語モデル(LLM)を分離する。我々は超越エントロピーの源として$$の桁を使い、LLMとプライベートレコードの間に不変な間接層を作る。このアーキテクチャは決定論的ランダム性、監査可能性、および差分プライバシーを統一し、金融や医療などの高コンプライアンス分野に高い効果を示す。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: This paper introduces $π$-RAG, a novel architecture for oblivious retrieval that decouples Large Language Models (LLMs) from sensitive data storage without sacrificing semantic understanding. Traditional Retrieval-Augmented Generation (RAG) architectures expose raw vector embeddings to potential inversion attacks and nondeterministic retrieval failures. To address this, we utilize the digits of $π$ as a source of transcendental entropy, creating an immutable indirection layer between the LLM and private records. The value $π$ provides immutability, is uneditable and math governs it. The architecture also introduces a Semantic Quantization Layer. This layer projects user inputs onto a pre-computed manifold of Canonical Intent Centroids. RAG performs vector cosine similarity but here it maps the centroids to deterministic offsets via cryptographic salt. The resulting $π$-key is a pointer to standardized payload from the actual datastore. By replacing direct access to the datastore via LLM with this transcendental layer, $π$-RAG mathematically guarantees that the inference remains oblivious to the data. This architecture unifies deterministic randomness, auditability, and differential privacy, demonstrating high efficacy for high-compliance sectors such as finance and healthcare.
Abstract（参考訳）: 本稿では,Large Language Models (LLM) を意味的理解を犠牲にすることなく機密データストレージから切り離す,難解な検索のための新しいアーキテクチャである$π$-RAGを紹介する。従来のRetrieval-Augmented Generation (RAG)アーキテクチャは、潜在的な反転攻撃や非決定論的検索障害に生ベクトル埋め込みを公開している。この問題に対処するために、超越エントロピーの源として$π$の桁を使い、LLMとプライベートレコードの間に不変な間接層を作る。 π$の値は不変性を提供し、計算不可能であり、数学がそれを支配している。アーキテクチャにはSemantic Quantization Layerも導入されている。このレイヤは、ユーザが予め計算されたCanonical Intent Centroidの多様体に入力する。 RAGはベクトルコサイン類似性を実行するが、ここでは、セントロイドを暗号塩を介して決定論的オフセットにマッピングする。その結果得られる$π$-keyは、実際のデータストアから標準化されたペイロードへのポインタである。 LLMによるデータストアへの直接アクセスをこの超越層に置き換えることにより、$π$-RAGは、推論がデータに不利なままであることを数学的に保証する。このアーキテクチャは決定論的ランダム性、監査可能性、および差分プライバシーを統一し、金融や医療などの高コンプライアンス分野に高い効果を示す。

論文の概要: $π$-RAG: Oblivious Retrieval via Semantic Quantization and Transcendental Addressing for Large Language Models

関連論文リスト