Fugu-MT 論文翻訳(概要): Subspace Control: Turning Constrained Model Steering into Controllable Spectral Optimization

論文の概要: Subspace Control: Turning Constrained Model Steering into Controllable Spectral Optimization

arxiv url: http://arxiv.org/abs/2604.04231v1
Date: Sun, 05 Apr 2026 19:27:24 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-07 15:49:18.996786
Title: Subspace Control: Turning Constrained Model Steering into Controllable Spectral Optimization
Title（参考訳）: 部分空間制御:制約付きモデルステアリングを制御可能なスペクトル最適化に変換する
Authors: Yancheng Huang, Changsheng Wang, Chongyu Fan, Yicheng Lang, Bingqi Shang, Yang Zhang, Mingyi Hong, Qing Qu, Alvaro Velasquez, Sijia Liu,
Abstract要約: 制約付きモデルトレーニングのためのサブスペース制御フレームワークを提案する。 SIFT (spectral interference-free training) は、すべてのタスクにおいて、実質的で堅牢なパフォーマンス改善を一貫して達成する。
参考スコア（独自算出の注目度）: 27.280175507074322
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Foundation models, such as large language models (LLMs), are powerful but often require customization before deployment to satisfy practical constraints such as safety, privacy, and task-specific requirements, leading to "constrained" optimization problems for model steering and adaptation. However, solving such problems remains largely underexplored and is particularly challenging due to interference between the primary objective and constraint objectives during optimization. In this paper, we propose a subspace control framework for constrained model training. Specifically, (i) we first analyze, from a model merging perspective, how spectral cross-task interference arises and show that it can be resolved via a one-shot solution that orthogonalizes the merged subspace; (ii) we establish a connection between this solution and gradient orthogonalization in the spectral optimizer Muon; and (iii) building on these insights, we introduce SIFT (spectral interference-free training), which leverages a localization scheme to selectively intervene during optimization, enabling controllable updates that mitigate objective-constraint conflicts. We evaluate SIFT across four representative applications: (a) machine unlearning, (b) safety alignment, (c) text-to-speech adaptation, and (d) hallucination mitigation. Compared to both control-based and control-free baselines, SIFT consistently achieves substantial and robust performance improvements across all tasks. Code is available at https://github.com/OPTML-Group/SIFT.
Abstract（参考訳）: 大規模言語モデル(LLM)のような基礎モデルは強力だが、安全性やプライバシー、タスク固有の要件といった現実的な制約を満たすために、デプロイ前にカスタマイズを必要とすることが多く、モデルステアリングと適応のための"制約付き"最適化問題に繋がる。しかし、そのような問題の解法は未解明のままであり、最適化時の主目的と制約目的との干渉により特に困難である。本稿では,制約付きモデルトレーニングのためのサブスペース制御フレームワークを提案する。具体的には i) モデルマージの観点から、まず、どのようにスペクトルクロスタスクの干渉が起こるかを分析し、それが統合された部分空間を直交するワンショット解によって解決可能であることを示す。 (II)スペクトルオプティマイザMuonにおけるこの解と勾配直交化の接続を確立する。 3)これらの知見に基づいて、最適化中に選択的に介入するための局所化スキームを活用するSIFT(spectral interference-free training)を導入し、客観的制約の衝突を緩和する制御可能な更新を可能にする。 SIFTを4つの代表的なアプリケーションで評価する。 a) 機械学習; 機械学習; 機械学習; 機械学習; 機械学習; 機械学習; 機械学習; 機械学習; 機械学習; 機械学習; 機械学習; 機械学習; 機械学習; 機械学習; 機械学習; 機械学習; 機械教育; 機械 (b)安全アライメント (c)テキストから音声への適応、及び (d)幻覚の緩和。コントロールベースベースラインとコントロールフリーベースラインの両方と比較して、SIFTは、すべてのタスクにおいて、実質的で堅牢なパフォーマンス改善を一貫して達成している。コードはhttps://github.com/OPTML-Group/SIFTで入手できる。

論文の概要: Subspace Control: Turning Constrained Model Steering into Controllable Spectral Optimization

関連論文リスト