Fugu-MT 論文翻訳(概要): Coevolutionary Continuous Discrete Diffusion: Make Your Diffusion Language Model a Latent Reasoner

論文の概要: Coevolutionary Continuous Discrete Diffusion: Make Your Diffusion Language Model a Latent Reasoner

arxiv url: http://arxiv.org/abs/2510.03206v1
Date: Fri, 03 Oct 2025 17:44:41 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-06 16:35:52.518759
Title: Coevolutionary Continuous Discrete Diffusion: Make Your Diffusion Language Model a Latent Reasoner
Title（参考訳）: Coevolutionary Continuous Discrete Diffusion: 拡散言語モデルを遅延推論器にする
Authors: Cai Zhou, Chenxiao Yang, Yi Hu, Chenyu Wang, Chubin Zhang, Muhan Zhang, Lester Mackey, Tommi Jaakkola, Stephen Bates, Dinghuai Zhang,
Abstract要約: 拡散言語モデルは必ずしも離散空間にある必要はないと主張する。特に、連続拡散モデルが離散拡散やループ変換器よりも強い表現性を持つことを示す。本稿では,連続表現空間と離散トークン空間の結合に関する共同多モード拡散過程を定義する共進化連続拡散法(CCDD)を提案する。
参考スコア（独自算出の注目度）: 66.86440230599656
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Diffusion language models, especially masked discrete diffusion models, have achieved great success recently. While there are some theoretical and primary empirical results showing the advantages of latent reasoning with looped transformers or continuous chain-of-thoughts, continuous diffusion models typically underperform their discrete counterparts. In this paper, we argue that diffusion language models do not necessarily need to be in the discrete space. In particular, we prove that continuous diffusion models have stronger expressivity than discrete diffusions and looped transformers. We attribute the contradiction between the theoretical expressiveness and empirical performance to their practical trainability: while continuous diffusion provides intermediate supervision that looped transformers lack, they introduce additional difficulty decoding tokens into the discrete token space from the continuous representation space. We therefore propose Coevolutionary Continuous Discrete Diffusion (CCDD), which defines a joint multimodal diffusion process on the union of a continuous representation space and a discrete token space, leveraging a single model to simultaneously denoise in the joint space. By combining two modalities, CCDD is expressive with rich semantics in the latent space, as well as good trainability and sample quality with the help of explicit discrete tokens. We also propose effective architectures and advanced training/sampling techniques for CCDD, which reveals strong empirical performance in extensive language modeling experiments on real-world tasks.
Abstract（参考訳）: 拡散言語モデル、特にマスク付き離散拡散モデルは近年大きな成功を収めている。ループ変換器や連続的なチェーン・オブ・シントによる潜在推論の利点を示す理論的および一次的な実験的な結果もあるが、連続拡散モデルは典型的にはそれらの離散的推論よりも優れる。本稿では,拡散言語モデルが必ずしも離散空間に存在する必要はないことを論じる。特に、連続拡散モデルが離散拡散やループ変換器よりも強い表現性を持つことを示す。連続拡散はループ変換器に欠ける中間的な監督を提供するが、連続表現空間から離散トークン空間にトークンを復号化することがさらに困難である。そこで我々は,連続表現空間と離散トークン空間の結合に関する連立多モード拡散過程を定義する共進化連続離散拡散法(CCDD)を提案する。 2つのモダリティを組み合わせることで、CCDDは潜在空間におけるリッチなセマンティクスと表現され、明確な離散トークンの助けを借りて、優れたトレーニング容易性とサンプル品質を持つ。また,CCDDのための効果的なアーキテクチャと高度なトレーニング/サンプリング手法を提案し,実世界のタスクにおける広範囲な言語モデリング実験において,強力な経験的性能を示す。

論文の概要: Coevolutionary Continuous Discrete Diffusion: Make Your Diffusion Language Model a Latent Reasoner

関連論文リスト