Fugu-MT Paper Translation (Summary): VerifyLLM: LLM-Based Pre-Execution Task Plan Verification for Robots

Paper summary: VerifyLLM: LLM-Based Pre-Execution Task Plan Verification for Robots

arxiv url: http://arxiv.org/abs/2507.05118v1
Date: Mon, 07 Jul 2025 15:31:36 GMT
Status: Translation complete
Last updated in system: 2025-07-08 15:46:35.483494
Title: VerifyLLM: LLM-Based Pre-Execution Task Plan Verification for Robots
Title (reference translation): VerifyLLM: Pre-Execution Task Plan Verification for Robots Using LLMs
Authors: Danil S. Grigorev, Alexey K. Kovalev, Aleksandr I. Panov
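Abstract summary: This work proposes an architecture that automatically verifies task plans before they are executed in a simulator or in a real-world environment. The module uses the reasoning capabilities of Large Language Models to evaluate logical coherence and identify potential gaps in the plan. We contribute to improving the reliability and efficiency of task planning and address the need for robust pre-execution verification in autonomous systems.
Reference score (independently computed attention score): 44.99833362998488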
License: http://creativecommons.org/licenses/by/4.0/
Abstract: In the field of robotics, researchers face a critical challenge in ensuring reliable and efficient task planning. Verifying high-level task plans before execution significantly reduces errors and enhances the overall performance of these systems. In this paper, we propose an architecture for automatically verifying high-level task plans before their execution in a simulator or in real-world environments. Leveraging Large Language Models (LLMs), our approach consists of two key steps: first, the conversion of natural language instructions into Linear Temporal Logic (LTL), followed by a comprehensive analysis of action sequences. The module uses the reasoning capabilities of the LLM to evaluate logical coherence and identify potential gaps in the plan. Rigorous testing on datasets of varying complexity demonstrates the broad applicability of the module to household tasks. We contribute to improving the reliability and efficiency of task planning and address the critical need for robust pre-execution verification in autonomous systems. The code is available at https://verifyllm.github.io.
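Abstract (reference translation): In robotics, researchers face the important challenge of ensuring reliable and efficient task planning. Verifying high-level task plans before execution significantly reduces errors and improves overall system performance. This paper proposes an architecture that automatically verifies high-level task plans before they are executed in a simulator or in real-world environments. The approach first converts natural language instructions into Linear Temporal Logic (LTL) and then performs a comprehensive analysis of the action sequences. The module uses the reasoning capabilities of the LLM to evaluate logical coherence and identify potential gaps in the plan. Rigorous testing on datasets of varying complexity shows that the module is broadly applicable to household tasks. We contribute to improving the reliability and efficiency of task planning and address the need for robust pre-execution verification in autonomous systems. The code is available at https://verifyllm.github.io.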
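
The abstract describes checking an action sequence against LTL constraints derived from an instruction. The sketch below is only an illustration of that idea, not the authors' implementation: it skips the LLM translation step entirely and assumes the constraints are already given as two hand-written patterns, `Eventually(a)` (LTL `F a`) and `Before(a, b)` (LTL `(!b) U a`), evaluated over a finite plan. The class names, the `violations` helper, and the household action names are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass(frozen=True)
class Eventually:
    """F action: the action must appear somewhere in the plan."""
    action: str


@dataclass(frozen=True)
class Before:
    """(!second) U first: `first` must appear, and `second` must not
    appear before the first occurrence of `first`."""
    first: str
    second: str


Constraint = Union[Eventually, Before]


def violations(plan: List[str], constraints: List[Constraint]) -> List[str]:
    """Return a human-readable message for each constraint the plan breaks."""
    problems = []
    for c in constraints:
        if isinstance(c, Eventually):
            if c.action not in plan:
                problems.append(f"missing action: {c.action}")
        elif isinstance(c, Before):
            if c.first not in plan:
                problems.append(f"missing action: {c.first}")
            elif c.second in plan[: plan.index(c.first)]:
                problems.append(f"{c.second} occurs before {c.first}")
    return problems


# Hypothetical constraints for an instruction like "get the milk from the fridge".
constraints = [Before("open_fridge", "grab_milk"), Eventually("close_fridge")]

good_plan = ["open_fridge", "grab_milk", "close_fridge"]
bad_plan = ["grab_milk", "open_fridge"]

print(violations(good_plan, constraints))  # no violations reported
print(violations(bad_plan, constraints))   # ordering error and a missing step
```

A checker like this only covers constraints that can be written down symbolically; the abstract assigns the broader judgment of logical coherence and missing steps to the LLM.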


Related papers