Fugu-MT Paper Translation (Abstract): Executable Counterfactuals: Improving LLMs' Causal Reasoning Through Code

Paper overview: Executable Counterfactuals: Improving LLMs' Causal Reasoning Through Code

arxiv url: http://arxiv.org/abs/2510.01539v1
Date: Thu, 02 Oct 2025 00:26:35 GMT
Status: Translation complete
Last updated in system: 2025-10-03 16:59:20.917867
Title: Executable Counterfactuals: Improving LLMs' Causal Reasoning Through Code
Title (reference translation): Executable Counterfactuals: Improving LLMs' Causal Reasoning Through Code
Authors: Aniket Vashishtha, Qirun Dai, Hongyuan Mei, Amit Sharma, Chenhao Tan, Hao Peng
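Abstract summary: The paper introduces executable counterfactuals, a framework that operationalizes causal reasoning through code and math problems. Results show a substantial accuracy drop (25-40%) from interventional to counterfactual reasoning for SOTA models such as o4-mini and Claude-4-Sonnet. The authors also test whether a model trained on code generalizes to counterfactual math word problems.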
Reference score (independently computed attention score): 29.382261465478248
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Counterfactual reasoning, a hallmark of intelligence, consists of three steps: inferring latent variables from observations (abduction), constructing alternatives (interventions), and predicting their outcomes (prediction). This skill is essential for advancing LLMs' causal understanding and expanding their applications in high-stakes domains such as scientific research. However, existing efforts in assessing LLMs' counterfactual reasoning capabilities tend to skip the abduction step, effectively reducing to interventional reasoning and leading to overestimation of LLM performance. To address this, we introduce executable counterfactuals, a novel framework that operationalizes causal reasoning through code and math problems. Our framework explicitly requires all three steps of counterfactual reasoning and enables scalable synthetic data creation with varying difficulty, creating a frontier for evaluating and improving LLMs' reasoning. Our results reveal a substantial drop in accuracy (25-40%) from interventional to counterfactual reasoning for SOTA models like o4-mini and Claude-4-Sonnet. To address this gap, we construct a training set comprising counterfactual code problems with if-else conditions and test on out-of-domain code structures (e.g., while-loops); we also test whether a model trained on code would generalize to counterfactual math word problems. While supervised finetuning on stronger models' reasoning traces improves in-domain performance of Qwen models, it leads to a decrease in accuracy on OOD tasks such as counterfactual math problems. In contrast, reinforcement learning induces the core cognitive behaviors and generalizes to new domains, yielding gains over the base model on both code (improvement of 1.5x-2x) and math problems. Analysis of the reasoning traces reinforces these findings and highlights the promise of RL for improving LLMs' counterfactual reasoning.
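Abstract (reference translation): Counterfactual reasoning consists of three steps: inferring latent variables from observations (abduction), constructing alternatives (intervention), and predicting their outcomes (prediction). This skill is essential for advancing LLMs' causal understanding and for extending their use to high-stakes domains such as scientific research. However, existing efforts to evaluate LLMs' counterfactual reasoning tend to skip the abduction step, which effectively reduces the task to interventional reasoning and overestimates LLM performance. To address this, the authors introduce executable counterfactuals, a new framework that operationalizes causal reasoning through code and math problems. The framework explicitly requires all three steps of counterfactual reasoning and enables scalable synthetic data creation at varying difficulty, creating a frontier for evaluating and improving LLM reasoning. The results show a substantial accuracy drop (25-40%) from interventional to counterfactual reasoning for SOTA models such as o4-mini and Claude-4-Sonnet. To address this gap, the authors build a training set of counterfactual code problems with if-else conditions, test on out-of-domain code structures (e.g., while-loops), and test whether a model trained on code generalizes to counterfactual math word problems. Supervised finetuning on stronger models' reasoning traces improves the in-domain performance of Qwen models but lowers accuracy on OOD tasks such as counterfactual math problems. In contrast, reinforcement learning induces the core cognitive behaviors and generalizes to new domains, yielding gains over the base model on both code (a 1.5x-2x improvement) and math problems. Analysis of the reasoning traces reinforces these findings and highlights the promise of RL for improving LLMs' counterfactual reasoning.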
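
The three steps named in the abstract (abduction, intervention, prediction) can be illustrated with a small executable example. This is a minimal sketch under stated assumptions, not the paper's benchmark or data format: the toy function `program`, the latent variable `noise`, and the candidate range `range(6)` are all hypothetical and chosen only for illustration. It also contrasts a counterfactual query with a purely interventional one that skips abduction.

```python
from typing import Iterable, List


def program(x: int, noise: int) -> int:
    """Hypothetical toy program with an if-else branch.

    `noise` plays the role of a latent variable that the solver never observes.
    """
    if x + noise > 10:
        return 2 * x + noise
    return x - noise


def abduct(x_obs: int, y_obs: int, candidates: Iterable[int]) -> List[int]:
    """Abduction: keep every latent value consistent with the observed run."""
    return [n for n in candidates if program(x_obs, n) == y_obs]


def counterfactual(x_obs: int, y_obs: int, x_new: int,
                   candidates: Iterable[int]) -> List[int]:
    """Abduction, then intervention (x := x_new), then prediction."""
    consistent = abduct(x_obs, y_obs, candidates)
    return sorted({program(x_new, n) for n in consistent})


def interventional(x_new: int, candidates: Iterable[int]) -> List[int]:
    """Intervention and prediction only: the observation is ignored,
    so every candidate latent value remains possible."""
    return sorted({program(x_new, n) for n in candidates})


if __name__ == "__main__":
    latent_space = range(6)  # assumed support of the latent variable

    # Observed run: x = 7 produced y = 18. Only noise = 4 explains it.
    print(abduct(7, 18, latent_space))             # [4]

    # Counterfactual: "what would y have been if x had been 3?"
    print(counterfactual(7, 18, 3, latent_space))  # [-1]

    # Interventional: "what is y if x is set to 3?" (no abduction)
    print(interventional(3, latent_space))         # [-2, -1, 0, 1, 2, 3]
```

In this toy setting the counterfactual query has a single answer because the observation pins down the latent value, while the interventional query leaves six possible outcomes. That gap is the kind of distinction the abstract says is lost when the abduction step is skipped.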


Related papers