Fugu-MT 論文翻訳(概要): MARCO: Meta-Reflection with Cross-Referencing for Code Reasoning

論文の概要: MARCO: Meta-Reflection with Cross-Referencing for Code Reasoning

arxiv url: http://arxiv.org/abs/2505.17481v1
Date: Fri, 23 May 2025 05:21:11 GMT
ステータス: 翻訳完了
システム内更新日: 2025-05-26 18:08:33.831261
Title: MARCO: Meta-Reflection with Cross-Referencing for Code Reasoning
Title（参考訳）: MARCO: コード推論のためのクロスリファレンスを備えたメタリフレクション
Authors: Yusheng Zhao, Xiao Luo, Weizhi Zhang, Wei Ju, Zhiping Xiao, Philip S. Yu, Ming Zhang,
Abstract要約: 我々は、LLMエージェントが提案するソリューションごとにコード推論において徐々に賢くなる方法を探る。認知進化的視点を取り入れたメタリフレクション(Meta-Reflection with Cross-Reference, MARCO)という新しいフレームワークを提案する。コード推論において,様々なデータセットに対して実験を行い,MARCOの有効性を実証した。
参考スコア（独自算出の注目度）: 37.0077348617437
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: The ability to reason is one of the most fundamental capabilities of large language models (LLMs), enabling a wide range of downstream tasks through sophisticated problem-solving. A critical aspect of this is code reasoning, which involves logical reasoning with formal languages (i.e., programming code). In this paper, we enhance this capability of LLMs by exploring the following question: how can an LLM agent become progressively smarter in code reasoning with each solution it proposes, thereby achieving substantial cumulative improvement? Most existing research takes a static perspective, focusing on isolated problem-solving using frozen LLMs. In contrast, we adopt a cognitive-evolving perspective and propose a novel framework named Meta-Reflection with Cross-Referencing (MARCO) that enables the LLM to evolve dynamically during inference through self-improvement. From the perspective of human cognitive development, we leverage both knowledge accumulation and lesson sharing. In particular, to accumulate knowledge during problem-solving, we propose meta-reflection that reflects on the reasoning paths of the current problem to obtain knowledge and experience for future consideration. Moreover, to effectively utilize the lessons from other agents, we propose cross-referencing that incorporates the solution and feedback from other agents into the current problem-solving process. We conduct experiments across various datasets in code reasoning, and the results demonstrate the effectiveness of MARCO.
Abstract（参考訳）: この能力は大規模言語モデル(LLM)の最も基本的な機能の一つであり、洗練された問題解決を通じて幅広いダウンストリームタスクを可能にする。コード推論は形式言語(プログラミングコード)による論理的推論を含む。本稿では,LLMエージェントが提案する各ソリューションでコード推論を段階的に賢くし,累積的な改善を実現するにはどうすればよいのか,という疑問を提起することによって,LLMのこの能力を向上させる。既存のほとんどの研究は、凍結LDMを用いた独立した問題解決に焦点を当て、静的な視点を採っている。対照的に、認知進化的な視点を採用し、自己改善を通じてLLMが動的に進化することを可能にするMeta-Reflection with Cross-Reference (MARCO) という新しいフレームワークを提案する。人間の認知発達の観点からは、知識蓄積と授業共有の両方を活用する。特に,問題解決の過程で知識を蓄積するために,現在の課題の推論経路を反映したメタリフレクションを提案し,今後の考察のために知識と経験を得る。さらに、他のエージェントからの学習を効果的に活用するために、他のエージェントからのソリューションとフィードバックを現在の問題解決プロセスに組み込んだ相互参照を提案する。コード推論において,様々なデータセットに対して実験を行い,MARCOの有効性を実証した。

論文の概要: MARCO: Meta-Reflection with Cross-Referencing for Code Reasoning

関連論文リスト