Fugu-MT 論文翻訳(概要): Thinker: Training LLMs in Hierarchical Thinking for Deep Search via Multi-Turn Interaction

論文の概要: Thinker: Training LLMs in Hierarchical Thinking for Deep Search via Multi-Turn Interaction

arxiv url: http://arxiv.org/abs/2511.07943v1
Date: Wed, 12 Nov 2025 01:29:44 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-12 20:17:03.561173
Title: Thinker: Training LLMs in Hierarchical Thinking for Deep Search via Multi-Turn Interaction
Title（参考訳）: 思考:多軸相互作用による深層探索のための階層的思考におけるLLMの訓練
Authors: Jun Xu, Xinkai Du, Yu Ao, Peilong Zhao, Yang Li, Ling Zhong, Lin Yuan, Zhongpu Bo, Xiaorui Wang, Mengshu Sun, Zhengke Gui, Dalong Zhang, Zhaoyang Wang, Qiwei Wang, Yangyang Hou, Zhiying Yin, Haofen Wang, Huajun Chen, Lei Liang, Jun Zhou,
Abstract要約: Thinkerはマルチターンインタラクションによるディープ検索のための階層的思考モデルである。複素問題を独立に解ける部分確率に分解する。サブプロブレム間の依存関係は、これらの論理関数を介してパラメータとして渡される。
参考スコア（独自算出の注目度）: 57.67217258741752
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Efficient retrieval of external knowledge bases and web pages is crucial for enhancing the reasoning abilities of LLMs. Previous works on training LLMs to leverage external retrievers for solving complex problems have predominantly employed end-to-end reinforcement learning. However, these approaches neglect supervision over the reasoning process, making it difficult to guarantee logical coherence and rigor. To address these limitations, we propose Thinker, a hierarchical thinking model for deep search through multi-turn interaction, making the reasoning process supervisable and verifiable. It decomposes complex problems into independently solvable sub-problems, each dually represented in both natural language and an equivalent logical function to support knowledge base and web searches. Concurrently, dependencies between sub-problems are passed as parameters via these logical functions, enhancing the logical coherence of the problem-solving process. To avoid unnecessary external searches, we perform knowledge boundary determination to check if a sub-problem is within the LLM's intrinsic knowledge, allowing it to answer directly. Experimental results indicate that with as few as several hundred training samples, the performance of Thinker is competitive with established baselines. Furthermore, when scaled to the full training set, Thinker significantly outperforms these methods across various datasets and model sizes. The source code is available at https://github.com/OpenSPG/KAG-Thinker.
Abstract（参考訳）: 外部知識ベースと Web ページの効率的な検索は LLM の推論能力の向上に不可欠である。複雑な問題を解決するために外部レトリバーを活用するLLMのトレーニング作業は、主にエンドツーエンドの強化学習を採用してきた。しかし、これらのアプローチは推論過程の監督を無視しており、論理的一貫性と厳密性を保証することは困難である。これらの制約に対処するため,多ターンインタラクションによる深層探索のための階層的思考モデルであるThinkerを提案する。複雑な問題を独立に解けるサブプロブレムに分解し、それぞれが自然言語と等価論理関数の両方で表現され、知識ベースとWeb検索をサポートする。同時に、サブプロブレム間の依存関係はこれらの論理関数を介してパラメータとして渡され、問題解決プロセスの論理的一貫性が向上する。不要な外部探索を避けるために,サブプロブレムがLLMの内在的知識内にあるかどうかを確認する知識境界決定を行い,直接答えることを可能にする。実験結果から,数百のトレーニングサンプルで,Thinkerの性能は確立したベースラインと競合することがわかった。さらに、完全なトレーニングセットにスケールすると、Thinkerはさまざまなデータセットやモデルサイズでこれらのメソッドよりも大幅にパフォーマンスが向上します。ソースコードはhttps://github.com/OpenSPG/KAG-Thinkerで入手できる。

論文の概要: Thinker: Training LLMs in Hierarchical Thinking for Deep Search via Multi-Turn Interaction

関連論文リスト