Fugu-MT 論文翻訳(概要): REWA: Witness-Overlap Theory -- Foundations for Composable Binary Similarity Systems

論文の概要: REWA: Witness-Overlap Theory -- Foundations for Composable Binary Similarity Systems

arxiv url: http://arxiv.org/abs/2511.19998v1
Date: Tue, 25 Nov 2025 07:04:44 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-26 17:37:04.324135
Title: REWA: Witness-Overlap Theory -- Foundations for Composable Binary Similarity Systems
Title（参考訳）: REWA: Witness-Overlap Theory -- Composable Binary similarity Systemsの基礎
Authors: Nikit Phadke,
Abstract要約: REWAは、目撃者のオーバーラップ構造に基づく類似性に関する一般的な理論を導入する。概念間の類似性を単調な証人オーバーラップとして表すことができれば、コンパクトな符号化に還元できることを示す。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: REWA introduces a general theory of similarity based on witness-overlap structures. We show that whenever similarity between concepts can be expressed as monotone witness overlap -- whether arising from graph neighborhoods, causal relations, temporal structure, topological features, symbolic patterns, or embedding-based neighborhoods -- it admits a reduction to compact encodings with provable ranking preservation guarantees. REWA systems consist of: (1) finite witness sets $W(v)$, (2) semi-random bit assignments generated from each witness, and (3) monotonicity of expected similarity in the overlap $Δ(u, v) = |W(u) \cap W(v)|$. We prove that under an overlap-gap condition on the final witness sets -- independent of how they were constructed -- top-$k$ rankings are preserved using $m = O(\log(|V|/δ))$ bits. The witness-set formulation is compositional: any sequence of structural, temporal, causal, topological, information-theoretic, or learned transformations can be combined into pipelines that terminate in discrete witness sets. The theory applies to the final witness overlap, enabling modular construction of similarity systems from reusable primitives. This yields a vast design space: millions of composable similarity definitions inherit logarithmic encoding complexity. REWA subsumes and unifies Bloom filters, minhash, LSH bitmaps, random projections, sketches, and hierarchical filters as special cases. It provides a principled foundation for similarity systems whose behavior is governed by witness overlap rather than hash-function engineering. This manuscript presents the axioms, the main reducibility theorem, complete proofs with explicit constants, and a detailed discussion of compositional design, limitations, and future extensions including multi-bit encodings, weighted witnesses, and non-set representations.
Abstract（参考訳）: REWAは、目撃者のオーバーラップ構造に基づく類似性に関する一般的な理論を導入する。グラフ近傍,因果関係,時間的構造,トポロジ的特徴,象徴的パターン,あるいは埋め込み型近傍から生じる概念間の類似性は,証明可能なランク維持保証を備えたコンパクトエンコーディングへの縮小を認める。 REWAシステム:(1)有限証人集合$W (v)$, (2) それぞれの証人から生成される半ランダムビット割り当てと(3) 重複する$Δ(u,)における期待類似性の単調性 v) = |W (u) キャップW (v)|$。最終的な目撃者集合の重複ギャップ条件の下では、構築方法とは無関係に、トップ$k$ランキングは$m = O(\log(|V|/δ))$ bits で保存される。構造的、時間的、因果的、位相的、情報理論的、あるいは学習的な変換の任意の列は、個別の証人集合で終了するパイプラインに結合することができる。この理論は最後の目撃者の重複に当てはまり、再利用可能なプリミティブから類似システムのモジュラー構成を可能にする。数百万の構成可能な類似性の定義が対数エンコーディングの複雑さを継承する。 REWAは特別なケースとしてブルームフィルタ、minhash、LSHビットマップ、ランダムプロジェクション、スケッチ、階層フィルタを仮定して統一する。それは、ハッシュ関数工学ではなく、目撃者の重複によって支配される、類似性システムのための原則化された基盤を提供する。この写本は公理、主再現性定理、明示的な定数を持つ完全証明、および多ビット符号化、重み付けされた証人、非集合表現を含む構成設計、制限、将来の拡張に関する詳細な議論を提示する。

論文の概要: REWA: Witness-Overlap Theory -- Foundations for Composable Binary Similarity Systems

関連論文リスト