Fugu-MT 論文翻訳(概要): From Generalist to Specialist Representation

論文の概要: From Generalist to Specialist Representation

arxiv url: http://arxiv.org/abs/2605.12733v1
Date: Tue, 12 May 2026 20:34:53 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-14 23:30:27.679817
Title: From Generalist to Specialist Representation
Title（参考訳）: ジェネラリストからスペシャリストへ
Authors: Yujia Zheng, Fan Feng, Yuke Li, Shaoan Xie, Kevin Murphy, Kun Zhang,
Abstract要約: 一般論モデルを考えると、ダウンストリームアプリケーションにはタスク関連の専門家表現の学習が不可欠である。我々は、時間ステップとタスクの間の構造が、完全に教師なしの方法で識別可能であることを証明した。次に、各時間ステップにおいて、タスク関連潜在表現が無関係部分から切り離される可能性があることを証明します。
参考スコア（独自算出の注目度）: 35.95654356089404
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Given a generalist model, learning a task-relevant specialist representation is fundamental for downstream applications. Identifiability, the asymptotic guarantee of recovering the ground-truth representation, is critical because it sets the ultimate limit of any model, even with infinite data and computation. We study this problem in a completely nonparametric setting, without relying on interventions, parametric forms, or structural constraints. We first prove that the structure between time steps and tasks is identifiable in a fully unsupervised manner, even when sequences lack strict temporal dependence and may exhibit disconnections, and task assignments can follow arbitrarily complex and interleaving structures. We then prove that, within each time step, the task-relevant latent representation can be disentangled from the irrelevant part under a simple sparsity regularization, without any additional information or parametric constraints. Together, these results establish a hierarchical foundation: task structure is identifiable across time steps, and task-relevant latent representations are identifiable within each step. To our knowledge, each result provides a first general nonparametric identifiability guarantee, and together they mark a step toward provably moving from generalist to specialist models.
Abstract（参考訳）: 一般論モデルを考えると、ダウンストリームアプリケーションにはタスク関連の専門家表現の学習が不可欠である。 Identifiability, asymptotic guarantee to recovery the ground-truth representation, because it set the Ultimate limit of any model, with infinite data and compute。我々はこの問題を、介入、パラメトリック形式、構造的制約に頼ることなく、完全に非パラメトリックな環境で研究する。まず、時間ステップとタスク間の構造が、厳密な時間的依存を欠き、切断を示す可能性がある場合でも、完全に教師のない方法で識別可能であることを証明し、タスク割り当ては任意に複雑でインターリーブな構造に従うことができる。次に、各時間ステップにおいて、タスク関連潜在表現が、余分な情報やパラメトリック制約を伴わずに、単純な疎正則化の下で非関連部分から切り離され得ることを証明した。タスク構造は時間ステップで識別可能であり、タスク関連潜在表現は各ステップで識別可能である。我々の知る限り、各結果は最初の一般の非パラメトリック識別可能性保証を提供し、それらと共に、ジェネラリストからスペシャリストモデルへ確実に移行するためのステップを示す。

論文の概要: From Generalist to Specialist Representation

関連論文リスト