Fugu-MT 論文翻訳(概要): Interactive LLM-assisted Curriculum Learning for Multi-Task Evolutionary Policy Search

論文の概要: Interactive LLM-assisted Curriculum Learning for Multi-Task Evolutionary Policy Search

arxiv url: http://arxiv.org/abs/2602.10891v1
Date: Wed, 11 Feb 2026 14:21:52 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-12 21:44:01.993259
Title: Interactive LLM-assisted Curriculum Learning for Multi-Task Evolutionary Policy Search
Title（参考訳）: マルチタスク進化的ポリシー探索のための対話型LCM支援カリキュラム学習
Authors: Berfin Sakallioglu, Giorgia Nadizar, Eric Medvet,
Abstract要約: 進化的最適化プロセスからリアルタイムにフィードバックを得られるオンラインカリキュラム生成のための対話型LCM支援フレームワークを提案する。対話型カリキュラム生成は,プログレッシブプロットとビヘイビアビジュアライゼーションの両方のマルチモーダルフィードバックによって,専門家が設計したキュリキュラと競合する性能を実現するため,静的なアプローチよりも優れていることを示す。
参考スコア（独自算出の注目度）: 0.20734358984482407
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Multi-task policy search is a challenging problem because policies are required to generalize beyond training cases. Curriculum learning has proven to be effective in this setting, as it introduces complexity progressively. However, designing effective curricula is labor-intensive and requires extensive domain expertise. LLM-based curriculum generation has only recently emerged as a potential solution, but was limited to operate in static, offline modes without leveraging real-time feedback from the optimizer. Here we propose an interactive LLM-assisted framework for online curriculum generation, where the LLM adaptively designs training cases based on real-time feedback from the evolutionary optimization process. We investigate how different feedback modalities, ranging from numeric metrics alone to combinations with plots and behavior visualizations, influence the LLM ability to generate meaningful curricula. Through a 2D robot navigation case study, tackled with genetic programming as optimizer, we evaluate our approach against static LLM-generated curricula and expert-designed baselines. We show that interactive curriculum generation outperforms static approaches, with multimodal feedback incorporating both progression plots and behavior visualizations yielding performance competitive with expert-designed curricula. This work contributes to understanding how LLMs can serve as interactive curriculum designers for embodied AI systems, with potential extensions to broader evolutionary robotics applications.
Abstract（参考訳）: マルチタスクポリシーサーチは、トレーニングケースを超えて、ポリシーを一般化する必要があるため、難しい問題である。カリキュラム学習は、複雑さを徐々に導入するので、この設定で効果的であることが証明されている。しかし、効果的なカリキュラムを設計するには労働集約的であり、幅広い分野の専門知識が必要である。 LLMベースのカリキュラム生成は、潜在的な解決策として最近登場したばかりであるが、オプティマイザからのリアルタイムフィードバックを活用せずに、静的なオフラインモードで動作するように制限されていた。本稿では、オンラインカリキュラム生成のための対話型LLM支援フレームワークを提案し、LLMは進化最適化プロセスからのリアルタイムフィードバックに基づいて学習事例を適応的に設計する。本研究では,数値メトリクスのみからプロットとビヘイビアビジュアライゼーションの組み合わせに至るまで,異なるフィードバックのモダリティが,有意義なキュリキュラを生成するLLM能力にどのように影響するかを検討する。遺伝的プログラミングを最適化として取り組んだ2次元ロボットナビゲーションケーススタディを通じて,静的LLM生成カリキュラムとエキスパート設計ベースラインに対するアプローチを評価した。対話型カリキュラム生成は,プログレッシブプロットとビヘイビアビジュアライゼーションの両方を取り入れたマルチモーダルフィードバックによって,専門家が設計したカリキュラムと競合する性能を実現することで,静的なアプローチよりも優れることを示す。この研究は、LLMがAIシステムのためのインタラクティブなカリキュラムデザイナとして機能し、より広範な進化的ロボティクスアプリケーションの拡張に寄与する。

論文の概要: Interactive LLM-assisted Curriculum Learning for Multi-Task Evolutionary Policy Search

関連論文リスト