Fugu-MT 論文翻訳(概要): Sherpa.ai Privacy-Preserving Multi-Party Entity Alignment without Intersection Disclosure for Noisy Identifiers

論文の概要: Sherpa.ai Privacy-Preserving Multi-Party Entity Alignment without Intersection Disclosure for Noisy Identifiers

arxiv url: http://arxiv.org/abs/2604.19219v1
Date: Tue, 21 Apr 2026 08:24:07 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-22 22:41:49.687302
Title: Sherpa.ai Privacy-Preserving Multi-Party Entity Alignment without Intersection Disclosure for Noisy Identifiers
Title（参考訳）: Sherpa.aiプライバシ保護によるノイズ検出のためのインターセクション開示のない多人数エンティティアライメント
Authors: Daniel M. Jimenez-Gutierrez, Enrique Zuazua, Georgios Kellaris, Joaquin Del Rio, Oleksii Sliusarenko, Xabi Uribe-Etxebarria,
Abstract要約: フェデレートラーニング(FL)は、生データを集中することなく、複数のパーティ間で協調的なモデルトレーニングを可能にする。 VFLトレーニングの前提条件はプライバシ保護エンティティアライメント(PPEA)である。 PPEAは、どのサンプルを共有するかを明らかにすることなく、パーティー全体(アライメント)に共通するサンプルのインデックスを確立する。 VFLのためのSherpa.aiマルチパーティPSUプロトコルを導入する。これはPPEA方式で、交差メンバシップを隠蔽し、正確かつノイズの多いマッチングを可能にする。
参考スコア（独自算出の注目度）: 0.44333156307871574
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Federated Learning (FL) enables collaborative model training among multiple parties without centralizing raw data. There are two main paradigms in FL: Horizontal FL (HFL), where all participants share the same feature space but hold different samples, and Vertical FL (VFL), where parties possess complementary features for the same set of samples. A prerequisite for VFL training is privacy-preserving entity alignment (PPEA), which establishes a common index of samples across parties (alignment) without revealing which samples are shared between them. Conventional private set intersection (PSI) achieves alignment but leaks intersection membership, exposing sensitive relationships between datasets. The standard private set union (PSU) mitigates this risk by aligning on the union of identifiers rather than the intersection. However, existing approaches are often limited to two parties or lack support for typo-tolerant matching. In this paper, we introduce the Sherpa.ai multi-party PSU protocol for VFL, a PPEA method that hides intersection membership and enables both exact and noisy matching. The protocol generalizes two-party approaches to multiple parties with low communication overhead and offers two variants: an order-preserving version for exact alignment and an unordered version tolerant to typographical and formatting discrepancies. We prove correctness and privacy, analyze communication and computational (exponentiation) complexity, and formalize a universal index mapping from local records to a shared index space. This multi-party PSU offers a scalable, mathematically grounded protocol for PPEA in real-world VFL deployments, such as multi-institutional healthcare disease detection, collaborative risk modeling between banks and insurers, and cross-domain fraud detection between telecommunications and financial institutions, while preserving intersection privacy.
Abstract（参考訳）: フェデレートラーニング(FL)は、生データを集中することなく、複数のパーティ間で協調的なモデルトレーニングを可能にする。 FLには2つの主要なパラダイムがある: 水平FL(Horizontal FL)、すべての参加者が同じ特徴空間を共有して異なるサンプルを保持する、垂直FL(Vertical FL)。 VFLトレーニングの前提条件は、プライバシ保護エンティティアライメント(PPEA)であり、どのサンプルを共有するかを明らかにすることなく、パーティ全体(アライメント)のサンプルの共通インデックスを確立する。従来のプライベートセット交差点(PSI)は、アライメントを実現するが、アライメントメンバシップをリークし、データセット間のセンシティブな関係を露呈する。標準プライベート・セット・ユニオン(PSU)は、このリスクを、交差点ではなく識別子の結合に合わせることで軽減する。しかし、既存のアプローチは2つに制限される場合もあれば、タイポトトレラントマッチングのサポートが欠如している場合も少なくない。本稿では,VFLのためのSherpa.aiマルチパーティPSUプロトコルについて紹介する。このプロトコルは、通信オーバーヘッドの少ない複数のパーティに対する2つのアプローチを一般化し、正確なアライメントのための順序保存バージョンと、タイポグラフィとフォーマットの相違に耐性のない非順序バージョンという2つのバリエーションを提供する。我々は、正当性とプライバシを証明し、通信と計算の複雑さを分析し、ローカルレコードから共有インデックス空間への普遍的なインデックスマッピングを定式化する。このマルチパーティのPSUは、実世界のVFL展開におけるPPEAのスケーラブルで数学的基盤となるプロトコルを提供する。例えば、多施設の医療疾患の検出、銀行と保険会社間の協調リスクモデリング、通信と金融機関間のクロスドメイン詐欺検出、そして交差プライバシーの保護である。

論文の概要: Sherpa.ai Privacy-Preserving Multi-Party Entity Alignment without Intersection Disclosure for Noisy Identifiers

関連論文リスト