Fugu-MT Paper Translation (Abstract): Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models

Paper overview: Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models

arxiv url: http://arxiv.org/abs/2404.02575v1
Date: Wed, 3 Apr 2024 08:49:11 GMT
Status: Translation complete
Last updated in system: 2024-04-04 18:00:28.420400
Title: Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models
Title (reference translation): Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models
Authors: Hyungjoo Chae, Yeonghyeon Kim, Seungone Kim, Kai Tzu-iunn Ong, Beong-woo Kwak, Moohyeon Kim, Seonghwan Kim, Taeyoon Kwon, Jiwan Chung, Youngjae Yu, Jinyoung Yeo
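Abstract summary: This paper presents Think-and-Execute, a framework that decomposes the reasoning process of language models into two steps. Extensive experiments on seven algorithmic reasoning tasks demonstrate the effectiveness of Think-and-Execute.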
Reference score (independently computed attention score): 17.76252625790628
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Algorithmic reasoning refers to the ability to understand the complex patterns behind the problem and decompose them into a sequence of reasoning steps towards the solution. Such nature of algorithmic reasoning makes it a challenge for large language models (LLMs), even though they have demonstrated promising performance in other reasoning tasks. Within this context, some recent studies use programming languages (e.g., Python) to express the necessary logic for solving a given instance/question (e.g., Program-of-Thought) as inspired by their strict and precise syntaxes. However, it is non-trivial to write an executable code that expresses the correct logic on the fly within a single inference call. Also, the code generated specifically for an instance cannot be reused for others, even if they are from the same task and might require identical logic to solve. This paper presents Think-and-Execute, a novel framework that decomposes the reasoning process of language models into two steps. (1) In Think, we discover a task-level logic that is shared across all instances for solving a given task and then express the logic with pseudocode; (2) In Execute, we further tailor the generated pseudocode to each instance and simulate the execution of the code. With extensive experiments on seven algorithmic reasoning tasks, we demonstrate the effectiveness of Think-and-Execute. Our approach better improves LMs' reasoning compared to several strong baselines performing instance-specific reasoning (e.g., CoT and PoT), suggesting the helpfulness of discovering task-level logic. Also, we show that compared to natural language, pseudocode can better guide the reasoning of LMs, even though they are trained to follow natural language instructions.
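Abstract (reference translation): Algorithmic reasoning is the ability to understand the complex patterns behind a problem and decompose them into a sequence of reasoning steps. This nature of algorithmic reasoning makes it difficult for large language models (LLMs), even though they have shown promising performance on other reasoning tasks. In this context, several recent studies use programming languages (e.g., Python), inspired by their strict and precise syntax, to express the logic needed to solve a given instance or question (e.g., Program-of-Thought). However, it is not easy to write executable code that expresses the correct logic on the fly within a single inference call. In addition, code generated specifically for one instance cannot be reused for other instances, even when they come from the same task and require the same logic to solve. This paper proposes Think-and-Execute, a new framework that decomposes the reasoning process of language models into two steps. (1) In Think, a task-level logic shared across all instances is discovered for solving a given task, and that logic is expressed as pseudocode; (2) in Execute, the generated pseudocode is further tailored to each instance and the execution of the code is simulated. Extensive experiments on seven algorithmic reasoning tasks demonstrate the effectiveness of Think-and-Execute. The proposed method improves LMs' reasoning compared with several strong baselines that perform instance-specific reasoning (e.g., CoT and PoT), suggesting that discovering task-level logic is helpful. The paper also shows that, compared with natural language, pseudocode can better guide the reasoning of LMs, even though they are trained to follow natural language instructions.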
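
The two-step flow described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the `llm` callable (prompt in, completion out), the function names, and all prompt wording are assumptions made for this example. The point it shows is the one the abstract stresses: the pseudocode is produced once per task in Think and then reused for every instance in Execute.

```python
from typing import Callable, Dict, List

# Hypothetical model interface: takes a prompt string, returns a completion string.
LLM = Callable[[str], str]


def think(llm: LLM, task_description: str, examples: List[str]) -> str:
    """Think: ask the model for task-level pseudocode shared by all instances."""
    prompt = (
        "Task: " + task_description + "\n"
        "Example instances:\n" + "\n".join(examples) + "\n"
        "Write pseudocode that solves any instance of this task."
    )
    return llm(prompt)


def execute(llm: LLM, pseudocode: str, instance: str) -> str:
    """Execute: ask the model to simulate the pseudocode on a single instance."""
    prompt = (
        "Pseudocode:\n" + pseudocode + "\n"
        "Input: " + instance + "\n"
        "Simulate the execution step by step and give the final output."
    )
    return llm(prompt)


def think_and_execute(
    llm: LLM,
    task_description: str,
    examples: List[str],
    instances: List[str],
) -> Dict[str, str]:
    """Run Think once for the task, then Execute once per instance."""
    pseudocode = think(llm, task_description, examples)  # one call per task
    return {inst: execute(llm, pseudocode, inst) for inst in instances}
```

Any function that maps a prompt to a completion can stand in for `llm`, including a stub for offline experiments. In this sketch the tailoring of the pseudocode to each instance is left to the model inside the Execute prompt; a fuller version might make that a separate step.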

Related papers