Fugu-MT 論文翻訳(概要): CAP-CoT: Cycle Adversarial Prompt for Improving Chain of Thoughts in LLM Reasoning

論文の概要: CAP-CoT: Cycle Adversarial Prompt for Improving Chain of Thoughts in LLM Reasoning

arxiv url: http://arxiv.org/abs/2604.23270v1
Date: Sat, 25 Apr 2026 12:24:04 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-28 17:12:07.238734
Title: CAP-CoT: Cycle Adversarial Prompt for Improving Chain of Thoughts in LLM Reasoning
Title（参考訳）: CAP-CoT:LCM推論における思考の連鎖改善のためのサイクル逆転プロンプト
Authors: Shuxu Chen, Yitian Zhou, Jiaquan Zhang, Haoyu Bian, Aming Wu, Sungyoung Lee, Chaoning Zhang, Hyundong Shin,
Abstract要約: CoT(Chain-of-Thought)プロンプトは,大規模言語モデル(LLM)からステップバイステップのソリューションを引き出すための,シンプルかつ効果的な方法として登場した。本研究では,CoTの推理精度と1つのデプロイされたソルバの安定性を両立させるため,Cycle Adversarial Prompt最適化フレームワークであるCAP-CoTを提案する。 CAP-CoTは,摂動を誘導するための推論精度とロバスト性を改善しつつ,ランの変動性を一貫して低減することを示す。
参考スコア（独自算出の注目度）: 32.954077252169995
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Chain-of-Thought (CoT) prompting has emerged as a simple and effective way to elicit step-by-step solutions from large language models (LLMs). However, CoT reasoning can be unstable across runs on long, multi-step problems, leading to inconsistent answers for unchanged task. Most prior work focuses on improving the forward reasoning chain within a single pass, with less attention to iterative and contrastive correction. To address this gap, we propose CAP-CoT, a Cycle Adversarial Prompt optimization framework designed to improve both CoT reasoning accuracy and stability of a single deployed solver. In each cycle, a forward solver generates candidate reasoning chains, an adversarial challenger constructs plausible but deliberately flawed chains using targeted error strategies, and a feedback agent contrasts the two chains and produces step-aligned structured feedback. This feedback closes the optimization loop in two directions, including updating the solver prompt based on errors exposed by the challenger, and updating the challenger prompt to generate increasingly targeted errors in subsequent cycles. Unlike safety-oriented adversarial prompting such as jailbreak or prompt-injection attacks, our adversarial component is task-semantic and aims to expose logical vulnerabilities in reasoning chains. Experiments across six benchmarks and four LLM backbones demonstrate that within two to three adversarial prompt optimization cycles, CAP-CoT consistently reduces variability across runs while improving reasoning accuracy and robustness to prompt perturbations.
Abstract（参考訳）: CoT(Chain-of-Thought)プロンプトは,大規模言語モデル(LLM)からステップバイステップのソリューションを引き出すための,シンプルかつ効果的な方法として登場した。しかし、CoT推論は、長い複数のステップの問題で実行中に不安定になり、不連続なタスクに対する不整合な答えにつながる。これまでのほとんどの研究は、1回のパスで前方の推論連鎖を改善することに重点を置いており、反復的かつコントラスト的な修正にはあまり注意を払わない。このギャップに対処するために、単一デプロイソルバのCoT推論精度と安定性の両方を改善するために設計されたCycle Adversarial Prompt最適化フレームワークであるCAP-CoTを提案する。各サイクルにおいて、フォワードソルバは、候補推論チェーンを生成し、敵対的チャレンジャーは、目標とするエラー戦略を用いて、プラプティブルで意図的に欠陥のあるチェーンを構築し、フィードバックエージェントは、2つのチェーンを対比し、ステップ整列された構造化されたフィードバックを生成する。このフィードバックは、2つの方向に最適化ループをクローズする。例えば、チャレンジャーが露出したエラーに基づいてソルバプロンプトを更新し、チャレンジャープロンプトを更新することで、次のサイクルでますますターゲットとなるエラーを発生させる。ジェイルブレイクやインジェクション攻撃のような安全指向の敵攻撃とは異なり、我々の敵コンポーネントはタスクセマンティックであり、推論チェーンにおける論理的脆弱性を明らかにすることを目的としている。 6つのベンチマークと4つのLCMバックボーンによる実験により、CAP-CoTは2～3つの逆のプロンプト最適化サイクルにおいて、連続的にランの変動を低減し、推論精度とロバスト性を改善して摂動を誘導することを示した。

論文の概要: CAP-CoT: Cycle Adversarial Prompt for Improving Chain of Thoughts in LLM Reasoning

関連論文リスト