Fugu-MT 論文翻訳(概要): Revisit the Imbalance Optimization in Multi-task Learning: An Experimental Analysis

論文の概要: Revisit the Imbalance Optimization in Multi-task Learning: An Experimental Analysis

arxiv url: http://arxiv.org/abs/2509.23915v1
Date: Sun, 28 Sep 2025 14:40:06 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-30 22:32:19.530055
Title: Revisit the Imbalance Optimization in Multi-task Learning: An Experimental Analysis
Title（参考訳）: マルチタスク学習における不均衡最適化の再検討:実験的検討
Authors: Yihang Guo, Tianyuan Yu, Liang Bai, Yanming Guo, Yirun Ruan, William Li, Weishi Zheng,
Abstract要約: マルチタスク学習(MTL)は、1つのネットワークをトレーニングして複数のタスクを共同実行することで汎用的な視覚システムを構築することを目的としている。期待されているが、そのポテンシャルは「不均衡な最適化」によって妨げられることが多い。本稿では,この持続的問題に寄与する要因を解明するための系統的な実験的検討を行った。
参考スコア（独自算出の注目度）: 44.410446932443
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Multi-task learning (MTL) aims to build general-purpose vision systems by training a single network to perform multiple tasks jointly. While promising, its potential is often hindered by "unbalanced optimization", where task interference leads to subpar performance compared to single-task models. To facilitate research in MTL, this paper presents a systematic experimental analysis to dissect the factors contributing to this persistent problem. Our investigation confirms that the performance of existing optimization methods varies inconsistently across datasets, and advanced architectures still rely on costly grid-searched loss weights. Furthermore, we show that while powerful Vision Foundation Models (VFMs) provide strong initialization, they do not inherently resolve the optimization imbalance, and merely increasing data quantity offers limited benefits. A crucial finding emerges from our analysis: a strong correlation exists between the optimization imbalance and the norm of task-specific gradients. We demonstrate that this insight is directly applicable, showing that a straightforward strategy of scaling task losses according to their gradient norms can achieve performance comparable to that of an extensive and computationally expensive grid search. Our comprehensive analysis suggests that understanding and controlling gradient dynamics is a more direct path to stable MTL than developing increasingly complex methods.
Abstract（参考訳）: マルチタスク学習(MTL)は、1つのネットワークをトレーニングして複数のタスクを共同実行することで汎用的な視覚システムを構築することを目的としている。期待されているものの、タスク干渉がシングルタスクモデルと比較してサブパーパフォーマンスにつながる「アンバランスな最適化」によってそのポテンシャルが妨げられることが多い。 MTLの研究を容易にするために,本論文では,この永続的問題に寄与する要因を解明するための系統的な実験的検討を行った。本研究は,既存の最適化手法の性能がデータセット間で不整合に変化していることを確認し,高度なアーキテクチャは依然としてグリッド探索による損失重みに頼っている。さらに、強力なビジョン基礎モデル(VFM)が強力な初期化を提供する一方で、最適化の不均衡を本質的に解決するわけではなく、単にデータ量を増やすだけでは利点が限られていることを示す。最適化の不均衡とタスク固有の勾配のノルムの間には強い相関関係が存在する。この知見は直接適用可能であることを示し、その勾配基準に従ってタスク損失をスケーリングする簡単な戦略が、広範で計算コストのかかるグリッドサーチに匹敵する性能を達成できることを示した。我々の包括的分析は、勾配力学の理解と制御が、より複雑な手法の開発よりも、安定なMLLへの直接的な経路であることを示唆している。

論文の概要: Revisit the Imbalance Optimization in Multi-task Learning: An Experimental Analysis

関連論文リスト