Fugu-MT 論文翻訳(概要): Diversity You Can Actually Measure: A Fast, Model-Free Diversity Metric for Robotics Datasets

論文の概要: Diversity You Can Actually Measure: A Fast, Model-Free Diversity Metric for Robotics Datasets

arxiv url: http://arxiv.org/abs/2603.11634v1
Date: Thu, 12 Mar 2026 07:54:43 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-21 18:33:56.715271
Title: Diversity You Can Actually Measure: A Fast, Model-Free Diversity Metric for Robotics Datasets
Title（参考訳）: ロボットデータセットのための高速でモデルなしの多様性メトリクス
Authors: Sreevardhan Sirigiri, Nathan Samuel de Lara, Christopher Agia, Florian Shkurti, Fabio Ramos,
Abstract要約: ロボット模倣学習におけるデータセットの多様性が一般化性能に与える影響について検討する。本稿では、エントロピーを最大化するデモのサブセットを選択するデータキュレーションアルゴリズムであるFAKTUALを紹介する。 FAKTUALは,ロボット模倣学習において,データセットの多様性を理解し,改善するための実用的なツールであることが示唆された。
参考スコア（独自算出の注目度）: 12.45101201787042
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Robotics datasets for imitation learning typically consist of long-horizon trajectories of different lengths over states, actions, and high-dimensional observations (e.g., RGB video), making it non-trivial to quantify diversity in a way that respects the underlying trajectory structure and geometry. We extend Shannon and von Neumann entropy to this setting by defining signature transform-based entropy on the Gram matrix of a signature kernel over demonstrations, yielding entropy and diversity metrics that operate directly on the demonstration dataset. Building on these metrics, we study how dataset diversity affects generalization performance in robot imitation learning and propose a simple, model-free way to curate diverse demonstrations. We introduce FAKTUAL (FAst trajectory Kernel enTropy cUration for imitation Learning), a data curation algorithm that selects a subset of demonstrations maximizing entropy given a subset-size budget. FAKTUAL is fully model-free, requires no access to the imitation policy or rollouts, and adds negligible overhead relative to policy training. We evaluate our approach on image and state-based RoboMimic and MetaWorld benchmarks, as well as four real-world manipulation tasks. Across tasks and architectures, diversity-aware curation with FAKTUAL consistently improves downstream success rates over random selection, while being substantially more computationally efficient compared to recent robot data curation methods. Our results suggest that the entropy of demonstration datasets is a practical tool for understanding and improving dataset diversity in robot imitation learning.
Abstract（参考訳）: 模倣学習のためのロボットデータセットは、通常、状態、行動、高次元の観察(例えばRGBビデオ)に対して異なる長さの長い水平軌道から成り、基礎となる軌道構造と幾何学を尊重する方法で多様性を定量化することは簡単ではない。我々は、シグネチャカーネルのグラム行列上で、シグネチャ変換に基づくエントロピーを定義することで、シグネチャカーネルのエントロピーとフォン・ノイマンエントロピーをこの設定に拡張し、デモデータセット上で直接動作するエントロピーと多様性のメトリクスを得る。これらの指標に基づいて、ロボット模倣学習におけるデータセットの多様性が一般化性能にどのように影響するかを考察し、多様な実演をカリキュラム化するための単純でモデルなしの方法を提案する。本稿では,データキュレーションアルゴリズムであるFAKTUAL(FAst trajectory Kernel enTropy cUration for mimicion Learning)を紹介する。 FAKTUALは完全にモデルフリーで、模倣ポリシーやロールアウトへのアクセスを必要としない。画像および状態に基づくRoboMimicおよびMetaWorldベンチマークおよび4つの実世界の操作タスクに対するアプローチを評価した。タスクやアーキテクチャ全体にわたって、FAKTUALによる多様性を意識したキュレーションは、最近のロボットデータキュレーション法に比べてはるかに計算効率が良く、ランダム選択よりも下流の成功率を一貫して改善する。この結果から,ロボット模倣学習におけるデータセットの多様性の理解と改善のための実践的ツールとして,実証データセットのエントロピーが示唆された。

論文の概要: Diversity You Can Actually Measure: A Fast, Model-Free Diversity Metric for Robotics Datasets

関連論文リスト