Fugu-MT 論文翻訳(概要): Continual Quadruped Robots Coordination via Semantic Skill Discovery

論文の概要: Continual Quadruped Robots Coordination via Semantic Skill Discovery

arxiv url: http://arxiv.org/abs/2606.08102v2
Date: Wed, 10 Jun 2026 11:06:40 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-11 14:23:44.347666
Title: Continual Quadruped Robots Coordination via Semantic Skill Discovery
Title（参考訳）: セマンティックスキル発見による4足歩行ロボットの協調
Authors: Daoqing Wang, Yuchen Xiao, Weixuan Huang, Zhilong Zhang, Shenghua Wan, Meng Li, Lei Yuan, Yang Yu,
Abstract要約: マルチクワッドループ調整は、ペイロード容量の増強、コンタクト範囲の拡大、課題への適応性の向上などにより、注目を集めている。既存のマルチクワッドループ操作の方法は、通常、事前に定義されたタスクファミリやクローズドタスクファミリにフォーカスする。本稿では,検索・適応・更新プロセスとして連続的な多重四分法協調を定式化するセマンティック・スキル・ライブラリー・フレームワークであるConquerを提案する。
参考スコア（独自算出の注目度）: 17.82598716904032
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Multi-quadruped coordination has attracted increasing attention due to its enhanced payload capacity, broader contact coverage, and improved adaptability to challenging tasks. Existing methods for multi-quadruped manipulation typically focus on predefined or closed task families, often relying on multi-agent reinforcement learning (MARL) to train task-specific coordination policies. However, such methods struggle in open-ended continual learning settings, where tasks arrive sequentially and robots are expected to acquire new coordination skills while reusing previously learned ones without catastrophic forgetting. To address this challenge, we propose Conquer, a semantic skill-library framework that formulates continual multi-quadruped coordination as a retrieve-adapt-update process. First, to accommodate varying team sizes across tasks, we design a team-structured Self-Allies-Goal (SAG) backbone that supports variable-cardinality robot teams by explicitly modeling each robot's own state, teammate context, and task goal. For each incoming task, Conquer constructs a task-level semantic descriptor from pre-execution information and retrieves a relevant skill from the library for adaptation. After successful execution, Conquer updates the skill library by extracting trajectory-level semantic descriptors and organizing them according to semantic distance, thereby enabling continual skill accumulation and cross-task knowledge transfer. Simulation experiments show that Conquer achieves a final average success rate of 95.6%, demonstrating strong forward transfer and negligible catastrophic forgetting. Real-world rollouts on Unitree Go2 teams further validate the deployment feasibility of Conquer for practical multi-quadruped coordination. Simulation and real-robot demonstration videos are available at: https://conquer-project.pages.dev/.
Abstract（参考訳）: マルチクワッドループ調整は、ペイロード容量の増強、コンタクト範囲の拡大、課題への適応性の向上などにより、注目を集めている。既存のマルチクワッドループ操作の方法は、通常、事前に定義されたまたはクローズドなタスクファミリに焦点を合わせ、多くの場合、タスク固有の調整ポリシーを訓練するためにマルチエージェント強化学習(MARL)に依存している。しかし、これらの手法は、タスクが順次到着し、ロボットは、破滅的な忘れをすることなく、以前学習したものを再利用しながら、新しい調整スキルを身につけることが期待される、オープンエンドな継続的学習環境に苦慮している。この課題に対処するために,検索・適応・更新プロセスとして連続的な多重四分法協調を定式化するセマンティック・スキル・ライブラリー・フレームワークであるConquerを提案する。まず、タスク全体にわたるさまざまなチームサイズに対応するために、各ロボットの状態、チームメイトコンテキスト、タスク目標を明示的にモデル化することで、可変カード型ロボットチームをサポートする、チーム構成のセルフアライアンスゴール(SAG)バックボーンを設計します。それぞれのタスクに対して、Conquerは事前実行情報からタスクレベルのセマンティック記述子を構築し、ライブラリから関連するスキルを取得して適応する。実行が成功した後、Conquerは、トラジェクトリレベルのセマンティック記述子を抽出し、セマンティック距離に応じてそれらを整理することで、スキルライブラリを更新し、継続的なスキル蓄積とクロスタスクの知識伝達を可能にする。シミュレーション実験により、コンカーは95.6%の最終的な平均成功率を達成し、強い前方移動と無視可能な破滅的な忘れを証明した。 Unitree Go2チームの実際のロールアウトは、実用的なマルチクワッドラップ調整のためのConquerのデプロイメント実現可能性をさらに検証する。シミュレーションと実ロボットのデモビデオは、https://conquer-project.pages.dev/.comで公開されている。

論文の概要: Continual Quadruped Robots Coordination via Semantic Skill Discovery

関連論文リスト