Fugu-MT Paper Translation (Abstract): On the ERM Principle in Meta-Learning

Paper overview: On the ERM Principle in Meta-Learning

arxiv url: http://arxiv.org/abs/2411.17898v1
Date: Tue, 26 Nov 2024 21:27:14 GMT
Status: Translation complete
Last updated in system: 2024-12-01 15:52:53.459198
Title: On the ERM Principle in Meta-Learning
Authors: Yannay Alon, Steve Hanneke, Shay Moran, Uri Shalit
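Abstract summary: The paper characterizes cases in which a small number of examples per task suffices for successful learning. It also identifies, for each $\varepsilon$, how many examples per task are needed to achieve an error of $\varepsilon$. The setting applies to many modern problems such as in-context learning, hypernetworks, and learning-to-learn.
Reference score (attention score computed independently by this site): 35.32637037177801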
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Classic supervised learning involves algorithms trained on $n$ labeled examples to produce a hypothesis $h \in \mathcal{H}$ aimed at performing well on unseen examples. Meta-learning extends this by training across $n$ tasks, with $m$ examples per task, producing a hypothesis class $\mathcal{H}$ within some meta-class $\mathbb{H}$. This setting applies to many modern problems such as in-context learning, hypernetworks, and learning-to-learn. A common method for evaluating the performance of supervised learning algorithms is through their learning curve, which depicts the expected error as a function of the number of training examples. In meta-learning, the learning curve becomes a two-dimensional learning surface, which evaluates the expected error on unseen domains for varying values of $n$ (number of tasks) and $m$ (number of training examples). Our findings characterize the distribution-free learning surfaces of meta-Empirical Risk Minimizers when either $m$ or $n$ tend to infinity: we show that the number of tasks must increase inversely with the desired error. In contrast, we show that the number of examples exhibits very different behavior: it satisfies a dichotomy where every meta-class conforms to one of the following conditions: (i) either $m$ must grow inversely with the error, or (ii) a \emph{finite} number of examples per task suffices for the error to vanish as $n$ goes to infinity. This finding illustrates and characterizes cases in which a small number of examples per task is sufficient for successful learning. We further refine this for positive values of $\varepsilon$ and identify for each $\varepsilon$ how many examples per task are needed to achieve an error of $\varepsilon$ in the limit as the number of tasks $n$ goes to infinity. We achieve this by developing a necessary and sufficient condition for meta-learnability using a bounded number of examples per domain.
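To make the setting concrete, the following is a minimal sketch of a meta-ERM rule on a toy problem. It assumes, for illustration only, that meta-ERM selects the hypothesis class in the meta-class whose average best-in-class empirical error over the $n$ training tasks is smallest; the paper's exact definition may differ. The threshold classifiers, the two-class meta-class, and all function names (`meta_erm`, `meta_empirical_risk`, `sample_tasks`) are hypothetical and do not come from the paper.

```python
import random
from typing import Callable, List, Tuple

Example = Tuple[float, int]            # (x, y) with label y in {0, 1}
Hypothesis = Callable[[float], int]
HypothesisClass = List[Hypothesis]


def threshold(t: float) -> Hypothesis:
    """Toy hypothesis: predict 1 iff x >= t."""
    return lambda x: int(x >= t)


def empirical_error(h: Hypothesis, sample: List[Example]) -> float:
    """Fraction of the m examples in one task that h misclassifies."""
    return sum(h(x) != y for x, y in sample) / len(sample)


def best_in_class_error(H: HypothesisClass, sample: List[Example]) -> float:
    """Empirical error of the ERM hypothesis from H on one task."""
    return min(empirical_error(h, sample) for h in H)


def meta_empirical_risk(H: HypothesisClass, tasks: List[List[Example]]) -> float:
    """Average over the n tasks of the best-in-class empirical error."""
    return sum(best_in_class_error(H, s) for s in tasks) / len(tasks)


def meta_erm(meta_class: List[HypothesisClass], tasks: List[List[Example]]) -> int:
    """Index of the class in the meta-class with minimal meta-empirical risk."""
    risks = [meta_empirical_risk(H, tasks) for H in meta_class]
    return min(range(len(meta_class)), key=risks.__getitem__)


def sample_tasks(H: HypothesisClass, n: int, m: int,
                 rng: random.Random) -> List[List[Example]]:
    """Draw n tasks; each picks a target from H and m uniform points in [0, 1)."""
    tasks = []
    for _ in range(n):
        target = rng.choice(H)
        xs = [rng.random() for _ in range(m)]
        tasks.append([(x, target(x)) for x in xs])
    return tasks


# Hypothetical meta-class with two small classes of threshold functions.
low = [threshold(t) for t in (0.1, 0.2, 0.3)]
high = [threshold(t) for t in (0.7, 0.8, 0.9)]
meta_class = [low, high]

rng = random.Random(0)
tasks = sample_tasks(high, n=50, m=5, rng=rng)   # n tasks, m examples each
chosen = meta_erm(meta_class, tasks)
print("selected class index:", chosen)
```

Because the tasks are generated by targets in `high`, that class has meta-empirical risk exactly 0 on the training tasks. Re-running the sketch while varying `n` and `m`, and measuring the error of the selected class on fresh tasks, would give an empirical picture of the two-dimensional learning surface described in the abstract.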

Related papers

Cracking the Code of Action: a Generative Approach to Affordances for Reinforcement Learning [33.790048240113165]
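This work considers the low-data regime, with limited or no access to expert behavior. The authors propose Code as Generative Affordances (CoGA). By greatly reducing the number of actions the agent must consider, the method is demonstrated on a wide range of tasks in the MiniWob++ benchmark.
Reference translation of paper (metadata) (2025-04-24T06:20:08Z)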
Metalearning with Very Few Samples Per Task [19.78398372660794]
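The authors study binary classification where tasks are related through a shared representation. The amount of data is measured by the number of tasks $t$ that must be seen and the number of samples $n$ per task. The work also yields a characterization of distribution-free multitask learning and reductions between meta-learning and multitask learning.
Reference translation of paper (metadata) (2023-12-21T16:06:44Z)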
Adversarial Online Multi-Task Reinforcement Learning [12.421997449847153]
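The authors consider the adversarial online multi-task reinforcement learning setting. In each of $K$ episodes, the learner is given an unknown task taken from a finite set of $M$ unknown finite-horizon MDP models. The learner's objective is to minimize its regret with respect to the optimal policy for each task.
Reference translation of paper (metadata) (2023-01-11T02:18:26Z)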
Multi-Task Imitation Learning for Linear Dynamical Systems [50.124394757116605]
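The authors study representation learning for efficient imitation learning over linear systems. The imitation gap over trajectories generated by the learned target policy is bounded by $\tilde{O}\left(\frac{k n_x}{H N_{\mathrm{shared}}} + \frac{k n_u}{N_{\mathrm{target}}}\right)$.
Reference translation of paper (metadata) (2022-12-01T00:14:35Z)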
A Characterization of Semi-Supervised Adversarially-Robust PAC Learnability [57.502573663108535]
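The authors study the problem of learning a predictor that is adversarially robust to test-time attacks in the semi-supervised PAC model. They show that semi-supervised robust learning offers a significant benefit even in the worst-case distribution-free model.
Reference translation of paper (metadata) (2022-02-11T03:01:45Z)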
On the Power of Multitask Representation Learning in Linear MDP [61.58929164172968]
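This paper analyzes the statistical benefit of multitask representation learning in linear Markov Decision Processes (MDPs). The authors prove that a simple least-squares algorithm learns a policy that is $\tilde{O}\left(H^2\sqrt{\frac{\kappa\,\mathcal{C}(\Phi)^2 \kappa d}{NT}+\frac{\kappa d}{n}}\right)$-suboptimal.
Reference translation of paper (metadata) (2021-06-15T11:21:06Z)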
Learning to extrapolate using continued fractions: Predicting the critical temperature of superconductor materials [5.905364646955811]
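In Artificial Intelligence (AI) and Machine Learning (ML), approximating an unknown target function $y=f(\mathbf{x})$ is a common objective. Referring to $S$ as the training set, the aim is to identify a low-complexity mathematical model that can effectively approximate this target function for new instances $\mathbf{x}$.
Reference translation of paper (metadata) (2020-11-27T04:57:40Z)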
Improving Robustness and Generality of NLP Models Using Disentangled Representations [62.08794500431367]
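Supervised neural networks first map an input $x$ to a single representation $z$, and then map $z$ to the output label $y$. This work proposes to improve the robustness and generality of NLP models from the standpoint of disentangled representation learning. Models trained with the proposed criteria are shown to provide better robustness and domain adaptation ability across a wide range of supervised learning tasks.
Reference translation of paper (metadata) (2020-09-21T02:48:46Z)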
On the Theory of Transfer Learning: The Importance of Task Diversity [114.656572506859]
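The authors consider $t+1$ tasks parameterized by functions of the form $f_j \circ h$ in a general function class $\mathcal{F} \circ \mathcal{H}$. They show that, for diverse training tasks, the sample complexity needed to learn the shared representation across the first $t$ training tasks scales as $C(\mathcal{H}) + t C(\mathcal{F})$.
Reference translation of paper (metadata) (2020-06-20T20:33:59Z)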
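As a rough arithmetic illustration of the scaling $C(\mathcal{H}) + t C(\mathcal{F})$ quoted in this entry: dividing the total by $t$ gives a per-task cost of $C(\mathcal{H})/t + C(\mathcal{F})$, so the cost of the shared representation is amortized as the number of tasks grows. The sketch below ignores constants and logarithmic factors, and the complexity values are hypothetical numbers chosen only for illustration.

```python
def total_samples(c_h: float, c_f: float, t: int) -> float:
    """Total sample count under the scaling C(H) + t * C(F) (constants ignored)."""
    return c_h + t * c_f


def per_task_samples(c_h: float, c_f: float, t: int) -> float:
    """Per-task share of the total: C(H) / t + C(F)."""
    return total_samples(c_h, c_f, t) / t


# Hypothetical complexities: a rich shared representation, simple task heads.
C_H, C_F = 1000, 10
for t in (1, 10, 100):
    print(t, total_samples(C_H, C_F, t), per_task_samples(C_H, C_F, t))
```

With these made-up values the per-task share falls from 1010 at $t=1$ toward $C(\mathcal{F}) = 10$ as $t$ increases.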
Sample Efficient Reinforcement Learning via Low-Rank Matrix Estimation [30.137884459159107]
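The authors consider how to efficiently learn the $Q$-function in reinforcement learning with continuous state and action spaces. They develop a simple iterative learning algorithm that finds an $\epsilon$-optimal $Q$-function with sample complexity $\widetilde{O}\left(\frac{1}{\epsilon^{\max(d_1, d_2)+2}}\right)$.
Reference translation of paper (metadata) (2020-06-11T00:55:35Z)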
On the Modularity of Hypernetworks [103.1147622394852]
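The authors show that, for a structured target function, the total number of trainable parameters in a hypernetwork is smaller by orders of magnitude than the number of trainable parameters of a standard neural network or an embedding method.
Reference translation of paper (metadata) (2020-02-23T22:51:52Z)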
Task-Robust Model-Agnostic Meta-Learning [42.27488241647739]
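This paper introduces the notion of "task-robustness" by revising the objective of Model-Agnostic Meta-Learning (MAML). The solution to the new formulation is task-robust in the sense that it places equal importance on even the most difficult or rare tasks.
Reference translation of paper (metadata) (2020-02-12T02:20:51Z)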

The related-papers list is generated automatically from the titles and abstracts of the papers on this site.
