Fugu-MT paper translation (abstract): When Symbol Names Should Not Matter: A Logistic Theory of Fresh-Symbol Classification

Paper overview: When Symbol Names Should Not Matter: A Logistic Theory of Fresh-Symbol Classification

arxiv url: http://arxiv.org/abs/2605.07120v1
Date: Fri, 08 May 2026 01:50:52 GMT
Status: translation complete
Last updated in system: 2026-05-11 19:43:38.731141
Title: When Symbol Names Should Not Matter: A Logistic Theory of Fresh-Symbol Classification
Authors: Wenjie Guan, Jelena Bradic
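Abstract summary: We study the fixed-label classification version of this problem, where train and test examples share latent templates but may use disjoint vocabularies. Unlike next-token prediction, the model need not emit unseen symbols. We encode accidental token overlaps by a colored collision graph and prove high-probability margin-transfer guarantees for fresh-symbol classification.
Reference score (independently computed attention score): 0.08594140167290097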
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Template tasks have emerged as a clean testbed for asking whether transformers reason with abstract symbols rather than concrete token names. We study the fixed-label classification version of this problem, where train and test examples share latent templates but may use disjoint vocabularies. Unlike next-token prediction, the model need not emit unseen symbols; it must learn a decision rule invariant to symbol renaming. We analyze regularized kernel logistic classification in the transformer-kernel regime. Our main result decomposes the learned predictor into an ideal template-level classifier and a finite-sample perturbation caused by accidental token overlaps in the training data. We encode these overlaps by a colored collision graph and prove high-probability margin-transfer guarantees for fresh-symbol classification. This perspective extends template-based analyses to logistic classification and refines scalar diversity conditions: vocabulary size controls the average rate of collisions, but collision geometry controls whether the ideal classification margin is preserved. More broadly, the same perturbation framework applies to abstraction-augmented inputs, yielding a general margin-versus-collision criterion for identifying when prompting strategies improve fresh-symbol generalization. Synthetic template experiments illustrate the predicted roles of regularization, sample size, and transformer-kernel structure.
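The fresh-symbol setup described in the abstract can be illustrated with a small toy sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the two placeholder templates `ABA` and `ABB`, the helper names, and in particular the graph construction, which simply links two training examples when they share a token and colors the edge by whether their labels agree. The paper's actual colored collision graph may be defined differently. The `canonical_form` helper only demonstrates what invariance to symbol renaming means; it is not the kernel logistic predictor the authors analyze.

```python
import random
from itertools import combinations

# Hypothetical toy templates (assumption): placeholder pattern -> fixed label.
TEMPLATES = {"ABA": 1, "ABB": 0}

def instantiate(template, vocab, rng):
    """Fill each distinct placeholder with a distinct token drawn from vocab."""
    placeholders = sorted(set(template))
    tokens = rng.sample(vocab, len(placeholders))
    mapping = dict(zip(placeholders, tokens))
    return tuple(mapping[p] for p in template)

def make_dataset(n, vocab, rng):
    """Draw n (token sequence, label) pairs from randomly chosen templates."""
    data = []
    for _ in range(n):
        template = rng.choice(sorted(TEMPLATES))
        data.append((instantiate(template, vocab, rng), TEMPLATES[template]))
    return data

def collision_edges(data):
    """Link examples that share a token; color the edge by label agreement.

    This is an illustrative guess at a collision graph, not the paper's definition.
    """
    edges = []
    for (i, (xi, yi)), (j, (xj, yj)) in combinations(enumerate(data), 2):
        if set(xi) & set(xj):
            edges.append((i, j, "same" if yi == yj else "cross"))
    return edges

def canonical_form(x):
    """Rename tokens by order of first occurrence, discarding token names."""
    first_seen = {}
    return tuple(first_seen.setdefault(tok, len(first_seen)) for tok in x)

rng = random.Random(0)
train_vocab = [f"tr{k}" for k in range(50)]
test_vocab = [f"te{k}" for k in range(50)]  # disjoint from train_vocab
train = make_dataset(20, train_vocab, rng)
test = make_dataset(20, test_vocab, rng)
edges = collision_edges(train)

print("accidental collisions among training examples:", len(edges))
print("canonical form of a test example:", canonical_form(test[0][0]))
```

Shrinking `train_vocab` makes accidental overlaps more frequent, which matches the abstract's remark that vocabulary size controls the average collision rate. Which edges appear, and between which labels, is a separate question that this toy graph only records and does not analyze.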
