Fugu-MT 論文翻訳(概要): Learning Shared Representations for Multi-Task Linear Bandits

論文の概要: Learning Shared Representations for Multi-Task Linear Bandits

arxiv url: http://arxiv.org/abs/2604.00531v1
Date: Wed, 01 Apr 2026 06:18:17 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-02 16:44:31.86749
Title: Learning Shared Representations for Multi-Task Linear Bandits
Title（参考訳）: マルチタスク線形帯域における共有表現の学習
Authors: Jiabin Lin, Shana Moothedath,
Abstract要約: マルチタスク表現学習は、関連するタスク間で共有潜在表現を学習するアプローチである。特徴次元 d を持つ T の並列線型バンドイットタスクは、次元 r ll Mind,T$ の共通潜在表現を共有する。我々は、共有低ランク表現を利用して意思決定を強化する、顔の不確かさリニア(OFUL)アルゴリズムの新たな最適化を提案する。
参考スコア（独自算出の注目度）: 7.978226788634307
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Multi-task representation learning is an approach that learns shared latent representations across related tasks, facilitating knowledge transfer and improving sample efficiency. This paper introduces a novel approach to multi-task representation learning in linear bandits. We consider a setting with T concurrent linear bandit tasks, each with feature dimension d, that share a common latent representation of dimension r \ll min{d,T}$, capturing their underlying relatedness. We propose a new Optimism in the Face of Uncertainty Linear (OFUL) algorithm that leverages shared low-rank representations to enhance decision-making in a sample-efficient manner. Our algorithm first collects data through an exploration phase, estimates the shared model via spectral initialization, and then conducts OFUL based learning over a newly constructed confidence set. We provide theoretical guarantees for the confidence set and prove that the unknown reward vectors lie within the confidence set with high probability. We derive cumulative regret bounds and show that the proposed approach achieves \tilde{O}(\sqrt{drNT}), a significant improvement over solving the T tasks independently, resulting in a regret of \tilde{O}(dT\sqrt{N}). We performed numerical simulations to validate the performance of our algorithm for different problem sizes.
Abstract（参考訳）: マルチタスク表現学習は、関連するタスク間で共有潜在表現を学習し、知識伝達を容易にし、サンプル効率を向上させるアプローチである。本稿では,線形帯域におけるマルチタスク表現学習に対する新しいアプローチを提案する。特徴次元 d を持つ T の並列線型バンドイットタスクは、その基礎となる関係性を捉えながら、次元 r \ll min{d,T}$ の共通の潜在表現を共有する。我々は,共有低ランク表現を利用した顔不確かさ線形(OFUL)アルゴリズムを提案する。提案アルゴリズムは,まず探索段階からデータを収集し,スペクトル初期化により共有モデルを推定し,新たに構築された信頼度集合上でOFULに基づく学習を行う。我々は、信頼集合の理論的な保証を提供し、未知の報酬ベクトルが高い確率で信頼集合の中に存在することを証明する。累積的後悔境界を導出し、提案手法が T のタスクを独立に解くよりも顕著な改善である \tilde{O}(\sqrt{drNT}) を達成し、結果として \tilde{O}(dT\sqrt{N}) の後悔をもたらすことを示す。我々は,異なる問題サイズに対するアルゴリズムの性能評価のために,数値シミュレーションを行った。

論文の概要: Learning Shared Representations for Multi-Task Linear Bandits

関連論文リスト