Fugu-MT 論文翻訳(概要): The Expressivity Boundary of Probabilistic Circuits: A Comparison with Large Language Models

論文の概要: The Expressivity Boundary of Probabilistic Circuits: A Comparison with Large Language Models

arxiv url: http://arxiv.org/abs/2605.12940v1
Date: Wed, 13 May 2026 03:22:10 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-14 23:30:27.783016
Title: The Expressivity Boundary of Probabilistic Circuits: A Comparison with Large Language Models
Title（参考訳）: 確率回路の表現性境界:大規模言語モデルとの比較
Authors: Zhiyu Zhao, Xuejie Liu, Muhan Zhang, Anji Liu,
Abstract要約: 自己回帰型言語モデリングにおいて、確率回路(PC)はTransformer-based large language model(LLM)より遅れている構造化分解可能なPCは, 変圧器分離ランクをvtree整列パーティションに合わせることができるが, この容量は固定されたルーティング構造に整列したパーティションに限られる。さらに、分解可能なPCは構造化された分解可能なPCよりも厳密に表現可能であることを証明していますが、効果的に最適化することはオープンな課題です。
参考スコア（独自算出の注目度）: 49.67598869075125
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Probabilistic Circuits (PCs) are deep generative models that support exact and efficient probabilistic inference. Yet in autoregressive language modeling, PCs still lag behind Transformer-based large language models (LLMs), suggesting an important expressivity gap. In this work, we compare PCs and LLMs under a unified autoregressive formulation. First, an output bottleneck: PCs parameterize predictions as convex combinations in probability space, which struggles to represent the sharp distributions typical of language; adopting a logit-space parameterization substantially narrows this gap. Second, a context-encoding bottleneck: we prove that structured-decomposable PCs can match Transformer separation rank on vtree-aligned partitions, but show, both theoretically and empirically, that this capacity is limited to partitions aligned with the fixed routing structure, leading to severe degradation when the data exhibits heterogeneous dependency topologies. We further prove that decomposable PCs are strictly more expressive than structured-decomposable ones, though effectively optimizing them remains an open challenge.
Abstract（参考訳）: 確率回路(PC)は、正確かつ効率的な確率的推論をサポートする深層生成モデルである。しかし、自動回帰言語モデリングでは、PCはトランスフォーマーベースの大規模言語モデル(LLM)より遅れており、重要な表現力のギャップを示唆している。本研究では,PC と LLM を統合自己回帰式で比較する。第一に、出力ボトルネック:PCは確率空間における凸結合として予測をパラメータ化し、言語に典型的な鋭い分布を表現するのに苦労する。第2に、構造化分解可能なPCは、vtree-alignedパーティション上のトランスフォーマー分離ランクと一致することを証明しますが、理論的にも経験的にも、この容量は固定されたルーティング構造に沿ったパーティションに限られており、データが不均一な依存トポロジーを示す場合、大幅に劣化することを示す。さらに、分解可能なPCは構造化された分解可能なPCよりも厳密に表現可能であることを証明していますが、効果的に最適化することはオープンな課題です。

論文の概要: The Expressivity Boundary of Probabilistic Circuits: A Comparison with Large Language Models

関連論文リスト