Fugu-MT 論文翻訳(概要): Constraints-of-Thought: A Framework for Constrained Reasoning in Language-Model-Guided Search

論文の概要: Constraints-of-Thought: A Framework for Constrained Reasoning in Language-Model-Guided Search

arxiv url: http://arxiv.org/abs/2510.08992v1
Date: Fri, 10 Oct 2025 04:21:18 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-14 00:38:48.109339
Title: Constraints-of-Thought: A Framework for Constrained Reasoning in Language-Model-Guided Search
Title（参考訳）: Constraints-of-ought:Language-Model-Guided Searchにおける制約推論のためのフレームワーク
Authors: Kamel Alrashedy, Vriksha Srihari, Zulfiqar Zaidi, Ridam Srivastava, Pradyumna Tambwekar, Matthew Gombolay,
Abstract要約: Constraints-of-Thought (Const-o-T) はモンテカルロ木探索(MCTS)が意味論的に意味のある経路を探索できるようにするフレームワークである。我々は、Const-o-Tが制約誘導推論の一般化可能な基盤を提供し、より効率的、制約整合性、ドメイン適応型プランニングを可能にすることを実証した。
参考スコア（独自算出の注目度）: 3.0130126601831235
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: While researchers have made significant progress in enabling large language models (LLMs) to perform multi-step planning, LLMs struggle to ensure that those plans align with high-level user intent and satisfy symbolic constraints, especially in complex, multi-step domains. Existing reasoning approaches such as Chain-of-Thought (CoT), Tree-of-Thought (ToT), and verifier-augmented methods, expand the search space but often yield infeasible actions or hallucinated steps. To overcome these limitations, we propose Constraints-of-Thought (Const-o-T), a framework that provides a structured prior that enables Monte Carlo Tree Search (MCTS) focus search on semantically meaningful paths. Each reasoning step is represented as an (intent, constraint) pair, which serves both to compress the search space and enforce validity. Unlike prior methods that merely generate reasoning traces or validate outputs post hoc, Const-o-T uses (intent, constraint)pairs to actively focus the search toward feasible and meaningful plans. We integrate Const-o-T into MCTS using a structured representation of intent-constraint pairs constraints prune infeasible branches and guide exploration toward semantically valid actions, improving planning efficiency and verifiable decision-making. We demonstrate across three domains Risk game, CAD code generation, and arithmetic reasoning that our approach outperforms baselines, yielding higher accuracy and stronger structural alignment. Our contribution is to demonstrate that Const-of-T offers a generalizable foundation for constraint-guided reasoning, enabling more efficient, constraint-aligned, and domain-adaptable planning with LLMs.
Abstract（参考訳）: 研究者は大規模言語モデル(LLM)を多段階計画で実行可能にするために大きな進歩を遂げてきたが、LSMはこれらの計画が高レベルのユーザ意図と整合し、特に複雑で多段階のドメインにおいて象徴的な制約を満たすことを保証するのに苦労している。 Chain-of-Thought (CoT) や Tree-of-Thought (ToT) といった既存の推論手法や検証対象拡張手法は、探索空間を拡大するが、しばしば実現不可能な行動や幻覚的なステップをもたらす。この制限を克服するために,モンテカルロ木探索(MCTS)が意味論的に意味のある経路を探索できる構造化された事前情報を提供するフレームワークであるConstraints-of-Thought(Const-o-T)を提案する。各推論ステップは(意図的、制約的な)ペアとして表現され、探索空間を圧縮し、妥当性を強制する。単に推論トレースを生成したり、アウトプットを評価したりする従来の方法とは異なり、Const-o-Tは(意図的、制約的)ペアを使用して、実行可能で有意義な計画への探索に積極的に焦点を合わせている。本研究では,MCTS に Const-o-T を組み込むため,意図制約対の制約を構造化した表現を用いて,意味論的に有効な行動への探索,計画効率の向上,意思決定の検証を行う。我々は,リスクゲーム,CADコード生成,算術的推論の3分野にまたがって,我々のアプローチがベースラインを上回り,より高い精度とより強い構造アライメントをもたらすことを実証する。私たちの貢献は、Const-of-Tが制約誘導推論の一般化可能な基盤を提供し、LLMによるより効率的、制約整合性、ドメイン適応型プランニングを可能にしていることを示すことです。

論文の概要: Constraints-of-Thought: A Framework for Constrained Reasoning in Language-Model-Guided Search

関連論文リスト