Fugu-MT 論文翻訳(概要): Are Greedy Task Orderings Better Than Random in Continual Linear Regression?

論文の概要: Are Greedy Task Orderings Better Than Random in Continual Linear Regression?

arxiv url: http://arxiv.org/abs/2510.19941v1
Date: Wed, 22 Oct 2025 18:09:59 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-25 03:08:16.627153
Title: Are Greedy Task Orderings Better Than Random in Continual Linear Regression?
Title（参考訳）: 連続線形回帰におけるグレディタスク命令はランダムよりも優れているか?
Authors: Matan Tsipory, Ran Levinstein, Itay Evron, Mark Kong, Deanna Needell, Daniel Soudry,
Abstract要約: 線形回帰のための連続学習におけるタスク順序の解析を行う。我々は,連続タスク間の相違性を極度に最大化する順序付けに着目する。我々は,タスク間の平均損失の点で,欲求順序はランダムな順序よりも早く収束することを示した。
参考スコア（独自算出の注目度）: 23.706463629642155
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We analyze task orderings in continual learning for linear regression, assuming joint realizability of training data. We focus on orderings that greedily maximize dissimilarity between consecutive tasks, a concept briefly explored in prior work but still surrounded by open questions. Using tools from the Kaczmarz method literature, we formalize such orderings and develop geometric and algebraic intuitions around them. Empirically, we demonstrate that greedy orderings converge faster than random ones in terms of the average loss across tasks, both for linear regression with random data and for linear probing on CIFAR-100 classification tasks. Analytically, in a high-rank regression setting, we prove a loss bound for greedy orderings analogous to that of random ones. However, under general rank, we establish a repetition-dependent separation. Specifically, while prior work showed that for random orderings, with or without replacement, the average loss after $k$ iterations is bounded by $\mathcal{O}(1/\sqrt{k})$, we prove that single-pass greedy orderings may fail catastrophically, whereas those allowing repetition converge at rate $\mathcal{O}(1/\sqrt[3]{k})$. Overall, we reveal nuances within and between greedy and random orderings.
Abstract（参考訳）: 学習データの連成実現可能性を考慮した連続学習におけるタスク順序の分析を行った。我々は、連続するタスク間の相違性を鮮やかに最大化する順序付けに重点を置いている。カッツマルツ法文学の道具を用いて、そのような順序付けを形式化し、それらを取り巻く幾何学的および代数的直観を発達させる。実験により,CIFAR-100分類タスクにおける線形回帰と線形回帰の両方において,不規則な順序付けがタスク間の平均損失の点で,ランダムな順序付けよりも高速に収束することが実証された。解析的に、高階回帰環境では、無作為な順序に対する損失境界がランダムな順序と類似していることを証明する。しかし、一般の階級では、繰り返し依存的な分離を確立する。特に、事前の研究では、置換の有無にかかわらず、$k$ の反復後の平均損失は $\mathcal{O}(1/\sqrt{k})$ で有界である一方で、単一パスの欲求順序が破滅的に失敗することを証明する一方、反復を許すものは $\mathcal{O}(1/\sqrt[3]{k})$ で収束することを示した。全体として、欲求と無作為な秩序の間のニュアンスを明らかにする。

論文の概要: Are Greedy Task Orderings Better Than Random in Continual Linear Regression?

関連論文リスト