Fugu-MT 論文翻訳(概要): Exploring Diverse Generation Paths via Inference-time Stiefel Activation Steering

論文の概要: Exploring Diverse Generation Paths via Inference-time Stiefel Activation Steering

arxiv url: http://arxiv.org/abs/2601.22010v1
Date: Thu, 29 Jan 2026 17:17:04 GMT
ステータス: 翻訳完了
システム内更新日: 2026-01-30 16:22:50.035939
Title: Exploring Diverse Generation Paths via Inference-time Stiefel Activation Steering
Title（参考訳）: 推論時スティフェル活性化ステアリングによる逆生成経路の探索
Authors: Dongxuan Zhu, Ly Tran Ho Khanh, Andy Yat-Ming Cheung, Man-Chung Yue, Viet Anh Nguyen,
Abstract要約: 言語モデルは、しばしば、高確率出力の狭いセットにデフォルトされ、生成パスは均質でモード崩壊しがちである。本研究では,STARSを導入した。STARSは,アクティベーションステアリングを探索エンジンに変換する,トレーニング不要な推論時間介入手法である。テストケース生成と科学的発見ベンチマークでは、STARSは標準サンプリング法より一貫して優れている。
参考スコア（独自算出の注目度）: 15.160204778653302
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Language models often default to a narrow set of high-probability outputs, leaving their generation paths homogeneous and prone to mode collapse. Sampling-based strategies inject randomness but still struggle to guarantee diversity across multiple concurrent generation runs. We address this limitation by introducing STARS ($\textbf{St}$iefel-based $\textbf{A}$ctivation Steering for Diverse $\textbf{R}$ea$\textbf{S}$oning), a training-free, inference-time intervention method that transforms activation steering into an exploration engine. At each token, STARS collects the hidden activations of concurrent generation runs and optimizes multiple additive steering directions jointly on the Stiefel manifold. STARS maximizes the geometric volume of the steered activations, while the Stiefel manifold induces orthogonality of the steering interventions. This formulation explicitly promotes divergent activation vectors of concurrent generation runs, and implicitly promotes divergent generation trajectories. This manifold optimization formulation can be solved using a Riemannian gradient descent algorithm with convergence guarantees, but this algorithm is too time-consuming for real-time inference. To guarantee low latency, we further design a lightweight one-step update with an aggressive, closed-form stepsize. For test case generation and scientific discovery benchmarks, STARS consistently outperforms standard sampling methods, achieving greater diversity without sacrificing qualitative performance.
Abstract（参考訳）: 言語モデルは、しばしば、高確率出力の狭いセットにデフォルトされ、生成パスは均質でモード崩壊しがちである。サンプリングベースの戦略は、ランダム性を注入するが、複数の同時生成実行における多様性を保証するのに苦慮している。 STARS$\textbf{St}$iefel-based $\textbf{A}$ctivation Steering for Diverse $\textbf{R}$ea$\textbf{S}$oning, この制限に対処する。各トークンにおいて、STARSは同時生成の実行の隠れ活性化を収集し、スティーフェル多様体上で複数の加算ステアリング方向を最適化する。 STARSはステアリング活性化の幾何体積を最大化し、スティフェル多様体はステアリング介入の直交性を誘導する。この定式化は、並列生成実行の発散活性化ベクトルを明示的に促進し、暗黙的に発散生成軌跡を促進させる。この多様体最適化の定式化は収束保証付きリーマン勾配降下アルゴリズムを用いて解くことができるが、このアルゴリズムはリアルタイム推論には時間がかかりすぎる。低レイテンシを保証するため、攻撃的でクローズドなステップサイズで軽量なワンステップアップデートを設計する。テストケースの生成と科学的発見のベンチマークでは、STARSは定性的な性能を犠牲にすることなく、標準サンプリング法を一貫して上回り、さらなる多様性を達成する。

論文の概要: Exploring Diverse Generation Paths via Inference-time Stiefel Activation Steering

関連論文リスト