Fugu-MT 論文翻訳(概要): CCQA: Generating Question from Solution Can Improve Inference-Time Reasoning in SLMs

論文の概要: CCQA: Generating Question from Solution Can Improve Inference-Time Reasoning in SLMs

arxiv url: http://arxiv.org/abs/2509.18536v1
Date: Tue, 23 Sep 2025 02:01:03 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-24 20:41:27.644173
Title: CCQA: Generating Question from Solution Can Improve Inference-Time Reasoning in SLMs
Title（参考訳）: CCQA: SLMにおける推論時間推論を改善するソリューションから質問を生成する
Authors: Jin Young Kim, Ji Won Yoon,
Abstract要約: textbfQuestion textbfAnswering (CCQA)におけるtextbfCycle-textbf一貫性を提案する。 CCQAは、サイクル一貫性に着想を得て、各推論経路から質問を生成し、それぞれが元の質問と類似度で評価し、次に、最も類似度の高い候補解を最終応答として選択する。 CCQAは数学および常識推論ベンチマークにおいて8つのモデルで既存の最先端(SOTA)手法を一貫して上回っていることが確認された。
参考スコア（独自算出の注目度）: 14.97707719362011
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recently, inference-time reasoning strategies have further improved the accuracy of large language models (LLMs), but their effectiveness on smaller models remains unclear. Based on the observation that conventional approaches often fail to improve performance in this context, we propose \textbf{C}ycle-\textbf{C}onsistency in \textbf{Q}uestion \textbf{A}nswering (CCQA), a novel reasoning method that can be effectively applied to SLMs. Inspired by cycle consistency, CCQA generates a question from each reasoning path and answer, evaluates each by its similarity to the original question, and then selects the candidate solution with the highest similarity score as the final response. Since conventional SLMs struggle to generate accurate questions from their own reasoning paths and answers, we employ a lightweight Flan-T5 model specialized for question generation to support this process efficiently. From the experimental results, it is verified that CCQA consistently outperforms existing state-of-the-art (SOTA) methods across eight models on mathematical and commonsense reasoning benchmarks. Furthermore, our method establishes a new practical baseline for efficient reasoning in SLMs. Source code can be found at https://github.com/scai-research/ccqa_official.
Abstract（参考訳）: 近年,大規模言語モデル (LLM) の精度は,推論時推論手法により向上している。そこで本研究では,従来の手法が性能向上に失敗するケースが多いことを踏まえ,SLMに効果的に適用可能な新しい推論手法である,<textbf{C}ycle-\textbf{C}onsistency in \textbf{Q}uestion \textbf{A}nswering (CCQA)を提案する。 CCQAは、サイクル一貫性に着想を得て、各推論経路から質問を生成し、それぞれが元の質問と類似度で評価し、次に、最も類似度の高い候補解を最終応答として選択する。従来のSLMでは,独自の推論経路や解答から正確な質問を生成するのに苦労しているため,このプロセスを効率的に支援するために,質問生成に特化した軽量なFlan-T5モデルを採用している。実験結果から,CCQAは数学および常識推論ベンチマークにおいて8つのモデルにわたる既存の最先端(SOTA)手法を一貫して上回っていることを確認した。さらに,本手法は,SLMにおける効率的な推論のための新しい実践的ベースラインを確立する。ソースコードはhttps://github.com/scai-research/ccqa_officialにある。

論文の概要: CCQA: Generating Question from Solution Can Improve Inference-Time Reasoning in SLMs

関連論文リスト