Fugu-MT 論文翻訳(概要): Federated Reinforcement Learning for Efficient Mobile Crowdsensing under Incomplete Information

論文の概要: Federated Reinforcement Learning for Efficient Mobile Crowdsensing under Incomplete Information

arxiv url: http://arxiv.org/abs/2605.02705v1
Date: Mon, 04 May 2026 15:13:16 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-05 20:33:50.361582
Title: Federated Reinforcement Learning for Efficient Mobile Crowdsensing under Incomplete Information
Title（参考訳）: 不完全情報に基づく効率的なモバイルクラウドセンシングのためのフェデレーション強化学習
Authors: Sumedh J. Dongare, Patrick Weber, Andrea Ortiz, Walid Saad, Oliver Hinz, Anja Klein,
Abstract要約: 移動群集センシング(Mobile crowdsensing, MCS)は, 移動体(MU)上の既存のセンサを用いて, センサタスクを実行する分散センシングアーキテクチャである。モバイルクラウドセンシングプラットフォーム(MCSP)は、センシングタスクを公開し、MUが金銭と引き換えに参加するかを決定する。 MUは、MCSPが完了したタスク数の最大化に重点を置いている間、彼らの収入を最大化するための効率的なタスク参加戦略を見つけることを目的としている。 FDRL-PPOという完全分散化した深層強化学習アルゴリズムが提案されている。
参考スコア（独自算出の注目度）: 37.05598500605439
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Mobile crowdsensing (MCS) is a distributed sensing architecture that utilizes existing sensors on mobile units (MUs) to perform sensing tasks. A mobile crowdsensing platform (MCSP) publishes the sensing tasks and the MUs decide whether to participate in exchange for money. The MCS system is dynamic: the task requirements, the MUs' availability, and their available resources change over time. The MUs aim to find an efficient task participation strategy to maximize their income while the MCSP focuses on maximizing the number of completed tasks. As optimal strategies require perfect non-causal information about the MCS system, which is unavailable in realistic scenarios, the main challenge is to find an efficient task participation strategy for the MUs under incomplete information. To this end, a novel fully decentralized federated deep reinforcement learning algorithm, FDRL-PPO, is proposed. FDRL-PPO enables every MU to learn its own task participation strategy based on its experiences, available resources, and preferences, without relying on perfect non-causal information about the MCS system. To replenish their batteries, the MUs rely on energy harvesting. As a result, their available energy varies over time, leading to varying availability and fragmented learning experiences. To mitigate these challenges, the proposed approach leverages federated learning, enabling MUs to collaboratively improve their models without sharing private raw data like their own experiences. By exchanging only learned models, MUs collectively compensate for individual limitations, and find more scalable, robust, and efficient task participation strategies. Comprehensive evaluations on both synthetic and real-world datasets show that FDRL-PPO consistently outperforms benchmark algorithms in terms of task completion ratio, fairness in task completion, energy consumption, and number of conflicting proposals.
Abstract（参考訳）: 移動群集センシング(Mobile crowdsensing, MCS)は, 移動体(MU)上の既存のセンサを用いて, センサタスクを実行する分散センシングアーキテクチャである。モバイルクラウドセンシングプラットフォーム(MCSP)は、センシングタスクを公開し、MUが金銭と引き換えに参加するかを決定する。 MCSシステムは動的で、タスク要求、MUの可用性、利用可能なリソースは時間とともに変化する。 MUは、MCSPが完了したタスク数の最大化に重点を置いている間、彼らの収入を最大化するための効率的なタスク参加戦略を見つけることを目的としている。最適戦略は、現実的なシナリオでは利用できないMCSシステムに関する完全な非因果的情報を必要とするため、主要な課題は、不完全な情報の下でMUの効率的なタスク参加戦略を見つけることである。そこで本研究では,FDRL-PPOという完全分散型深層強化学習アルゴリズムを提案する。 FDRL-PPOは、すべてのMUが、MCSシステムに関する完全な非因果的情報に頼ることなく、その経験、利用可能なリソース、好みに基づいて、独自のタスク参加戦略を学ぶことができる。電池を補充するため、MUはエネルギー収穫に依存している。その結果、利用可能なエネルギーは時間とともに変化し、様々な可用性と断片化された学習体験をもたらす。これらの課題を軽減するため、提案されたアプローチでは、フェデレートされた学習を活用し、MUが自身の経験のようにプライベートな生データを共有することなく、モデルを改善することができる。学習したモデルのみを交換することで、MUは個々の制限をまとめて補償し、よりスケーラブルで堅牢で効率的なタスク参加戦略を見つける。 FDRL-PPOは, タスク完了率, タスク完了の公平性, エネルギー消費, 競合する提案数の観点から, ベンチマークアルゴリズムを一貫して上回っている。

論文の概要: Federated Reinforcement Learning for Efficient Mobile Crowdsensing under Incomplete Information

関連論文リスト