Fugu-MT 論文翻訳(概要): From Recency Bias to Stable Convergence Block Kaczmarz Methods for Online Preference Learning in Matchmaking Applications

論文の概要: From Recency Bias to Stable Convergence Block Kaczmarz Methods for Online Preference Learning in Matchmaking Applications

arxiv url: http://arxiv.org/abs/2604.09964v1
Date: Sat, 11 Apr 2026 00:18:20 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-14 20:13:15.76916
Title: From Recency Bias to Stable Convergence Block Kaczmarz Methods for Online Preference Learning in Matchmaking Applications
Title（参考訳）: 適合性バイアスから安定収束ブロック Kaczmarz 法によるマッチング学習への応用
Authors: James Nguyen,
Abstract要約: 本稿では,コメンテータシステムにおけるリアルタイムの個人化マッチングのための,Kaczmarzに基づく選好学習アルゴリズムのファミリーを提案する。 BlockNKは、バッチGram解決とセッション後L2正規化を組み合わせることで、最も優先度の高いアライメントを実現する。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/publicdomain/zero/1.0/
Abstract: We present a family of Kaczmarz-based preference learning algorithms for real-time personalized matchmaking in reciprocal recommender systems. Post-step L2 normalization, common in Kaczmarz-inspired online learners, induces exponential recency bias: the influence of the t-th interaction decays as eta^(n - t), reaching approximately 1e-6 after just 20 swipes at eta = 0.5. We resolve this by replacing the normalization step with a Tikhonov-regularized projection denominator that bounds step size analytically without erasing interaction history. When candidate tag vectors are not pre-normalized, as in realistic deployments where candidates vary in tag density, the Tikhonov denominator ||a||^2 + alpha produces genuinely per-candidate adaptive step sizes, making it structurally distinct from online gradient descent with any fixed learning rate. We further derive a block variant that processes full swipe sessions as a single Gram matrix solve. Population-scale simulation over 6,400 swipes reveals that Block Normalized Kaczmarz (BlockNK), which combines the batch Gram solve with post-session L2 normalization, achieves the highest preference alignment (Align@20 = 0.698), the strongest inter-session direction stability (delta = 0.994), and the flattest degradation profile under label noise across flip ratios p_flip in [0.10, 0.35]. Experiments under cosine similarity subsampling further show that adaptively filtering the candidate pool toward the current preference direction substantially improves asymptotic alignment, at the cost of introducing a feedback loop that may slow recovery from miscalibration. The sequential Tikhonov-Kaczmarz method performs comparably to K-NoNorm under our simulation conditions, suggesting the dominant practical gain over normalized Kaczmarz is the removal of per-step normalization rather than the Tikhonov constant alpha itself.
Abstract（参考訳）: 本稿では, 相互推薦システムにおいて, リアルタイムにパーソナライズされたマッチングのための選好学習アルゴリズムのファミリーを提示する。 Kaczmarzにインスパイアされたオンライン学習者に共通するステップ後L2正規化は指数的回帰バイアスを誘導する: t-th相互作用の影響はeta^(n - t)として崩壊し、eta = 0.5で20回のスワイプで約1e-6に達する。我々は、通常の化ステップを、相互作用履歴を消去することなくステップサイズを解析的に束縛するTikhonov-regularized projection denominatorに置き換えることで、これを解決する。タグ密度の異なる現実的な配置のように、候補タグベクトルが事前正規化されない場合、Tikhonov denominator ||a||^2 + α は真に候補ごとの適応的なステップサイズを生成し、一定の学習速度でオンライン勾配勾配から構造的に区別する。さらに、1つのGram行列が解けるように、完全なスワイプセッションを処理するブロックバリアントを導出します。 6,400回以上の集団規模のシュミレーションにより、Block Normalized Kaczmarz (BlockNK) は、Gram のバッチ解と後次 L2 の正規化を組み合わせた最高優先度アライメント(Align@20 = 0.698)、最強のセッション間方向安定性(delta = 0.994)、および[0.10, 0.35] のフリップ比 p_flip のラベルノイズ下での最も平坦な劣化プロファイルが得られることが明らかになった。さらに、コサイン類似性のサブサンプリングによる実験では、誤校正から回復を遅らせるフィードバックループを導入するコストで、候補プールを現在の選好方向に向けて適応的にフィルタリングすることで、漸近的なアライメントが大幅に向上することが示された。逐次的Tikhonov-Kaczmarz法は、我々のシミュレーション条件下でK-Normと相補的に実行し、正規化Kaczmarzよりも支配的な実践的利得は、Tikhonov定数α自身よりもステップごとの正規化を除去することであることを示唆している。

論文の概要: From Recency Bias to Stable Convergence Block Kaczmarz Methods for Online Preference Learning in Matchmaking Applications

関連論文リスト