Fugu-MT Paper Translation (Abstract): Thoughtbubbles: an Unsupervised Method for Parallel Thinking in Latent Space

Paper overview: Thoughtbubbles: an Unsupervised Method for Parallel Thinking in Latent Space

arxiv url: http://arxiv.org/abs/2510.00219v1
Date: Tue, 30 Sep 2025 19:49:15 GMT
Status: Translation complete
System update date: 2025-10-03 16:59:20.228739
Title: Thoughtbubbles: an Unsupervised Method for Parallel Thinking in Latent Space
Title (reference translation): Thoughtbubbles: an Unsupervised Method for Parallel Thinking in Latent Space
Authors: Houjun Liu, Shikhar Murty, Christopher D. Manning, Róbert Csordás
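Abstract summary: Current approaches for scaling inference-time compute in transformers rely on training them to emit explicit chain-of-thought tokens before producing an answer. Thoughtbubbles is a transformer that performs parallel adaptive computation in latent space by learning to fork or delete residual streams. Thoughtbubbles outperforms both standard decoder LMs and non-adaptive parallel computation approaches on OpenWebText and peS2o perplexity and in zero-shot evaluations such as HellaSwag and LAMBADA.
Reference score (independently computed attention measure): 38.50132130644233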
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Current approaches for scaling inference-time compute in transformers rely on training them to emit explicit chain-of-thought tokens before producing an answer. While these methods are powerful, they are limited because they cannot be applied during pretraining and are limited to only serially-generated, natural-language verbalization to scale inference-time compute. In this work, we propose Thoughtbubbles, a transformer variant that natively performs parallel adaptive computation in latent space by learning to fork or delete residual streams. Thus, tokens that require a large amount of computation can form a "bubble" of cloned residuals in the middle of the network for additional thinking. Crucially, this behavior is learned during pretraining with only language modeling loss. Thoughtbubbles outperforms both standard decoder LMs as well as non-adaptive parallel computation approaches on OpenWebText and peS2o perplexity and in zero-shot evaluations such as HellaSwag and LAMBADA after pretraining across 150M to 772M parameter scales. The implicit nature of our method enables adaptive computation to be learned starting at pretraining time, paving the way to unify train and test-time behavior for reasoning models.
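Abstract (reference translation): Current approaches to scaling inference-time compute in transformers rely on training the model to emit explicit chain-of-thought tokens before generating an answer. These methods are powerful but limited: they cannot be applied during pretraining, and they scale inference-time compute only through serially generated natural-language verbalization. This work proposes Thoughtbubbles, a transformer that natively performs parallel adaptive computation in latent space by learning to fork or delete residual streams. Tokens that require a large amount of computation can thus form a "bubble" of cloned residuals in the middle of the network for additional thinking. Importantly, this behavior is learned during pretraining with only the language modeling loss. After pretraining at parameter scales from 150M to 772M, Thoughtbubbles outperforms both standard decoder LMs and non-adaptive parallel computation approaches on OpenWebText and peS2o perplexity and in zero-shot evaluations such as HellaSwag and LAMBADA. The implicit nature of the method allows adaptive computation to be learned from pretraining time onward, paving the way to unifying train-time and test-time behavior for reasoning models.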

Related papers