Fugu-MT 論文翻訳(概要): WINFlowNets: Warm-up Integrated Networks Training of Generative Flow Networks for Robotics and Machine Fault Adaptation

論文の概要: WINFlowNets: Warm-up Integrated Networks Training of Generative Flow Networks for Robotics and Machine Fault Adaptation

arxiv url: http://arxiv.org/abs/2603.17301v1
Date: Wed, 18 Mar 2026 02:55:02 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-19 18:32:57.482841
Title: WINFlowNets: Warm-up Integrated Networks Training of Generative Flow Networks for Robotics and Machine Fault Adaptation
Title（参考訳）: WINFlowNets: ロボットと機械故障適応のための生成フローネットワークのトレーニング
Authors: Zahin Sufiyan, Shadan Golestan, Yoshihiro Mitsuka, Shotaro Miwa, Osmar Zaiane,
Abstract要約: WINFlowNetsは、フローと検索ネットワークの協調トレーニングを可能にする新しいフレームワークである。 WINFlowNetsは、平均報酬とトレーニング安定性の観点から、CFlowNetsや最先端のRLアルゴリズムを上回っている。これらの知見は、WINFlowNetsが動的および機能不全を引き起こすロボットシステムに展開する可能性を示している。
参考スコア（独自算出の注目度）: 1.7094085866965287
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Generative Flow Networks for continuous scenarios (CFlowNets) have shown promise in solving sequential decision-making tasks by learning stochastic policies using a flow and a retrieval network. Despite their demonstrated efficiency compared to state-of-the-art Reinforcement Learning (RL) algorithms, their practical application in robotic control tasks is constrained by the reliance on pre-training the retrieval network. This dependency poses challenges in dynamic robotic environments, where pre-training data may not be readily available or representative of the current environment. This paper introduces WINFlowNets, a novel CFlowNets framework that enables the co-training of flow and retrieval networks. WINFlowNets begins with a warm-up phase for the retrieval network to bootstrap its policy, followed by a shared training architecture and a shared replay buffer for co-training both networks. Experiments in simulated robotic environments demonstrate that WINFlowNets surpasses CFlowNets and state-of-the-art RL algorithms in terms of average reward and training stability. Furthermore, WINFlowNets exhibits strong adaptive capability in fault environments, making it suitable for tasks that demand quick adaptation with limited sample data. These findings highlight WINFlowNets' potential for deployment in dynamic and malfunction-prone robotic systems, where traditional pre-training or sample inefficient data collection may be impractical.
Abstract（参考訳）: 連続シナリオのためのジェネレーティブフローネットワーク(CFlowNets)は、フローと検索ネットワークを用いて確率的ポリシーを学習することで、シーケンシャルな意思決定タスクを解くことを約束している。最新のReinforcement Learning (RL)アルゴリズムと比較して,その効率性が実証されているにもかかわらず,ロボット制御タスクの実践的応用は,検索ネットワークの事前学習に依存している。この依存関係は、トレーニング済みデータが簡単に利用できないり、現在の環境を代表したりできないような、動的ロボット環境において課題となる。本稿では,フローと検索ネットワークの協調学習を可能にする新しいCFlowNetsフレームワークであるWINFlowNetsを紹介する。 WINFlowNetsは、検索ネットワークがポリシーをブートストラップするウォームアップフェーズから始まり、その後、共有トレーニングアーキテクチャと、両方のネットワークを共同トレーニングするための共有リプレイバッファが続く。シミュレーションロボット環境での実験では、WINFlowNetsが平均報酬とトレーニング安定性の点でCFlowNetsや最先端のRLアルゴリズムを上回ることが示されている。さらに、WINFlowNetsはフォールト環境において強い適応性を示し、限られたサンプルデータによる迅速な適応を必要とするタスクに適している。これらの知見は、WINFlowNetsが、従来の事前学習やサンプル非効率なデータ収集が現実的でない、動的および機能不良のロボットシステムに展開する可能性を強調している。

論文の概要: WINFlowNets: Warm-up Integrated Networks Training of Generative Flow Networks for Robotics and Machine Fault Adaptation

関連論文リスト