Fugu-MT 論文翻訳(概要): Diffuse Thinking: Exploring Diffusion Language Models as Efficient Thought Proposers for Reasoning

論文の概要: Diffuse Thinking: Exploring Diffusion Language Models as Efficient Thought Proposers for Reasoning

arxiv url: http://arxiv.org/abs/2510.27469v1
Date: Fri, 31 Oct 2025 13:41:30 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-03 17:52:16.111983
Title: Diffuse Thinking: Exploring Diffusion Language Models as Efficient Thought Proposers for Reasoning
Title（参考訳）: Diffuse Thinking: 推論のための効果的な思考プロポーラとしての拡散言語モデルを探る
Authors: Chenyang Shao, Sijian Ren, Fengli Xu, Yong Li,
Abstract要約: そこで我々は,DLMを有効活用した効率的な協調推論フレームワークを提案し,その品質を評価するために,候補思考とLLMを生成する。我々のフレームワークは複雑な推論タスクにおいて高いパフォーマンスを達成し、将来の研究に有望な方向性を提供する。
参考スコア（独自算出の注目度）: 11.437063355666593
License: http://creativecommons.org/licenses/by/4.0/
Abstract: In recent years, large language models (LLMs) have witnessed remarkable advancements, with the test-time scaling law consistently enhancing the reasoning capabilities. Through systematic evaluation and exploration of a diverse spectrum of intermediate thoughts, LLMs demonstrate the potential to generate deliberate reasoning steps, thereby substantially enhancing reasoning accuracy. However, LLMs' autoregressive generation paradigm results in reasoning performance scaling sub-optimally with test-time computation, often requiring excessive computational overhead to propose thoughts while yielding only marginal performance gains. In contrast, diffusion language models (DLMs) can efficiently produce diverse samples through parallel denoising in a single forward pass, inspiring us to leverage them for proposing intermediate thoughts, thereby alleviating the computational burden associated with autoregressive generation while maintaining quality. In this work, we propose an efficient collaborative reasoning framework, leveraging DLMs to generate candidate thoughts and LLMs to evaluate their quality. Experiments across diverse benchmarks demonstrate that our framework achieves strong performance in complex reasoning tasks, offering a promising direction for future research. Our code is open-source at https://anonymous.4open.science/r/Diffuse-Thinking-EC60.
Abstract（参考訳）: 近年、大規模言語モデル (LLM) は目覚ましい進歩を見せており、テストタイムスケーリング法則は推論能力を継続的に強化している。様々な中間的思考のスペクトルの体系的な評価と探索を通じて、LSMは意図的な推論ステップを生成する可能性を示し、それによって推論精度を大幅に向上させる。しかし、LSMの自己回帰生成パラダイムは、性能スケーリングをテスト時間計算と準最適に推し進め、しばしば計算の過大なオーバーヘッドを伴って、限界的な性能向上しか得られない思想を提案する。対照的に、拡散言語モデル(DLM)は、1つの前方通過を並列に認知することで多種多様なサンプルを効率よく生成し、中間思考の提案にそれらを活用することで、品質を維持しながら自己回帰生成に伴う計算負担を軽減することができる。そこで本研究では, DLM を利用した効率的な協調推論手法を提案する。様々なベンチマーク実験により、我々のフレームワークは複雑な推論タスクにおいて高いパフォーマンスを達成し、将来の研究に有望な方向性を提供することが示された。私たちのコードはhttps://anonymous.4open.science/r/Diffuse-Thinking-EC60.comで公開されている。

論文の概要: Diffuse Thinking: Exploring Diffusion Language Models as Efficient Thought Proposers for Reasoning

関連論文リスト