Fugu-MT 論文翻訳(概要): MemER: Scaling Up Memory for Robot Control via Experience Retrieval

論文の概要: MemER: Scaling Up Memory for Robot Control via Experience Retrieval

arxiv url: http://arxiv.org/abs/2510.20328v1
Date: Thu, 23 Oct 2025 08:26:17 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-25 03:08:17.542399
Title: MemER: Scaling Up Memory for Robot Control via Experience Retrieval
Title（参考訳）: MemER: 経験検索によるロボット制御のためのメモリスケールアップ
Authors: Ajay Sridhar, Jennifer Pan, Satvik Sharma, Chelsea Finn,
Abstract要約: 人間は日常的にメモリをタスクに頼っているが、ほとんどのロボットポリシーはこの機能を欠いている。本稿では,その経験から過去の関連事項を選択し,追跡するために,ハイレベルな政策を訓練する階層的な政策枠組みを提案する。我々のアプローチであるMemERは、数分のメモリを必要とする3つの現実世界の長距離ロボット操作タスクにおいて、従来の手法よりも優れています。
参考スコア（独自算出の注目度）: 46.5398413633767
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Humans routinely rely on memory to perform tasks, yet most robot policies lack this capability; our goal is to endow robot policies with the same ability. Naively conditioning on long observation histories is computationally expensive and brittle under covariate shift, while indiscriminate subsampling of history leads to irrelevant or redundant information. We propose a hierarchical policy framework, where the high-level policy is trained to select and track previous relevant keyframes from its experience. The high-level policy uses selected keyframes and the most recent frames when generating text instructions for a low-level policy to execute. This design is compatible with existing vision-language-action (VLA) models and enables the system to efficiently reason over long-horizon dependencies. In our experiments, we finetune Qwen2.5-VL-7B-Instruct and $\pi_{0.5}$ as the high-level and low-level policies respectively, using demonstrations supplemented with minimal language annotations. Our approach, MemER, outperforms prior methods on three real-world long-horizon robotic manipulation tasks that require minutes of memory. Videos and code can be found at https://jen-pan.github.io/memer/.
Abstract（参考訳）: 人間は日常的にメモリを頼りにタスクを実行するが、ほとんどのロボットポリシーはこの機能を欠いている。長期の観測履歴に内在的な条件付けは計算的に高価であり、共変量シフトの下では不安定であるが、歴史の非差別的なサブサンプリングは無関係または冗長な情報をもたらす。本稿では,その経験から関連するキーフレームを選択し,追跡するために,ハイレベルなポリシをトレーニングする階層型ポリシフレームワークを提案する。高レベルポリシーは、選択されたキーフレームと最新のフレームを使用して、低レベルポリシーを実行するためのテキスト命令を生成する。この設計は、既存の視覚言語アクション(VLA)モデルと互換性があり、システムは長期の依存関係を効率的に推論することができる。実験では,Qwen2.5-VL-7B-インストラクトと$\pi_{0.5}$を,最小限の言語アノテーションで補足されたデモを用いて,それぞれ高レベルかつ低レベルなポリシーとして精査した。我々のアプローチであるMemERは、数分のメモリを必要とする3つの現実世界の長距離ロボット操作タスクにおいて、従来の手法よりも優れています。ビデオとコードはhttps://jen-pan.github.io/memer/.com/で見ることができる。

論文の概要: MemER: Scaling Up Memory for Robot Control via Experience Retrieval

関連論文リスト