Fugu-MT 論文翻訳(概要): Iterative LLM-Based Generation and Refinement of Distracting Conditions in Math Word Problems

論文の概要: Iterative LLM-Based Generation and Refinement of Distracting Conditions in Math Word Problems

arxiv url: http://arxiv.org/abs/2510.08615v2
Date: Wed, 15 Oct 2025 16:08:14 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-16 13:11:49.448476
Title: Iterative LLM-Based Generation and Refinement of Distracting Conditions in Math Word Problems
Title（参考訳）: 数学語問題における繰り返しLLMに基づく抽出条件の生成と再定義
Authors: Kaiqi Yang, Hang Li, Yucheng Chu, Zitao Liu, Mi Tian, Hui Liu,
Abstract要約: 数学語問題(MWP)は最も広く使われている形式の一つである。既存のMWPデータセットの多くは必要な情報のみを含む。我々は,複数の視点と認知レベルからMWPを改訂する一連のプロンプトを開発する。私たちのフレームワークの重要な利点は、オリジナルの問題と修正された問題の間の共有ソリューションの保存です。
参考スコア（独自算出の注目度）: 10.872987322381894
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Mathematical reasoning serves as a crucial testbed for evaluating the intelligence of large language models (LLMs), and math word problems (MWPs) represent one of the most widely used formats. Most existing MWP datasets contain only the necessary information, while problems with distracting or excessive conditions are often overlooked. Prior studies have shown that popular LLMs experience a dramatic performance drop when such distracting conditions are introduced. However, available datasets of MWPs with distracting conditions remain limited, and most exhibit low difficulty and out-of-context expressions. These shortcomings make the distracting conditions easy to detect and disregard, thereby reducing the credibility of benchmarking on these datasets. Moreover, when distracting conditions are added, the reasoning process and answers may change, requiring intensive manual effort to check and rewrite solutions. To address these issues, we design an iterative framework that leverages LLMs to generate distracting conditions automatically. We develop a set of prompts to revise MWPs from multiple perspectives and cognitive levels, encouraging the creation of meaningful distracting conditions as well as suggestions for further refinement. A key advantage of our framework is the preservation of shared solutions between the original and revised problems: the LLMs are explicitly guided to generate distractions that do not alter the original solution, thus eliminating the need to produce new answers. This framework is efficient and easy to deploy, substantially reducing the effort required to generate MWPs with distracting conditions while maintaining high data quality.
Abstract（参考訳）: 数学的推論は、大規模言語モデル(LLM)の知性を評価する上で重要なテストベッドとして機能し、数学用語問題(MWP)は最も広く使われているフォーマットの1つである。既存のMWPデータセットの多くは必要な情報しか含まないが、乱れや過剰な条件の問題は見過ごされがちである。以前の研究では、人気のあるLLMはそのような気を散らす条件が導入されたときに劇的な性能低下を経験していることが示されている。しかし、不注意な条件で利用可能なMWPのデータセットは限定的であり、ほとんどの場合、難易度が低く、文脈外表現である。これらの欠点は、気を散らす条件を検出および無視しやすくすることで、これらのデータセット上でのベンチマークの信頼性を低下させる。さらに、気を散らす条件が加えられると、推論プロセスと答えが変わり、ソリューションのチェックと書き直しに集中的な手作業が必要になる。これらの問題に対処するために, LLM を利用した反復的フレームワークを設計し, 自動的に気を散らす条件を生成する。我々は,複数の視点と認知レベルからMWPを改訂する一連のプロンプトを開発し,有意義な気晴らし条件の創出を奨励するとともに,さらなる改良を提案する。我々のフレームワークの主な利点は、元の問題と修正された問題の共有ソリューションの保存である。このフレームワークは効率的でデプロイが容易で、高いデータ品質を維持しながら、邪魔な条件でMWPを生成するのに必要な労力を大幅に削減する。

論文の概要: Iterative LLM-Based Generation and Refinement of Distracting Conditions in Math Word Problems

関連論文リスト