Fugu-MT 論文翻訳(概要): Delve into the Applicability of Advanced Optimizers for Multi-Task Learning

論文の概要: Delve into the Applicability of Advanced Optimizers for Multi-Task Learning

arxiv url: http://arxiv.org/abs/2604.08939v1
Date: Fri, 10 Apr 2026 04:15:31 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-13 17:57:53.677755
Title: Delve into the Applicability of Advanced Optimizers for Multi-Task Learning
Title（参考訳）: マルチタスク学習における高度な最適化手法の適用性についての一考察
Authors: Zhipeng Zhou, Linxiao Cao, Pengcheng Wu, Peilin Zhao, Chunyan Miao,
Abstract要約: MTL(Multi-Task Learning)は、機械学習の問題であり、過去10年間に広範囲に開発されてきた。 APT(Advanced oPizers)は,高度とMTLの強度のバランスをとるために設計された,シンプルな適応運動量機構を備えたフレームワークである。
参考スコア（独自算出の注目度）: 70.80216936064654
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Multi-Task Learning (MTL) is a foundational machine learning problem that has seen extensive development over the past decade. Recently, various optimization-based MTL approaches have been proposed to learn multiple tasks simultaneously by altering the optimization trajectory. Although these methods strive to de-conflict and re-balance tasks, we empirically identify that their effectiveness is often undermined by an overlooked factor when employing advanced optimizers: the instant-derived gradients play only a marginal role in the actual parameter updates. This discrepancy prevents MTL frameworks from fully releasing its power on learning dynamics. Furthermore, we observe that Muon-a recently emerged advanced optimizer-inherently functions as a multi-task learner, which underscores the critical importance of the gradients used for its orthogonalization. To address these issues, we propose APT (Applicability of advanced oPTimizers), a framework featuring a simple adaptive momentum mechanism designed to balance the strengths between advanced optimizers and MTL. Additionally, we introduce a light direction preservation method to facilitate Muon's orthogonalization. Extensive experiments across four mainstream MTL datasets demonstrate that APT consistently augments existing MTL approaches, yielding substantial performance improvements.
Abstract（参考訳）: MTL(Multi-Task Learning)は、過去10年間に幅広く開発されてきた基礎的な機械学習問題である。近年,最適化軌道を変更して複数のタスクを同時に学習するために,最適化に基づく様々なMTL手法が提案されている。これらの手法はタスクの非競合化と再バランス化を図っているが、先進的なオプティマイザを用いる場合、その効果が見落とされがちな要因によって損なわれることが実証的に確認されている。この不一致は、MTLフレームワークが動的学習のパワーを完全に解放するのを防ぐ。さらに,最近登場した多タスク学習者としての高度最適化機能であるMuon-aが,その直交化に使用される勾配の重要さを浮き彫りにした。これらの問題に対処するため,先進オプティマイザとMPLの強度のバランスをとるためのシンプルな適応運動量機構を備えたフレームワークであるAPT(Advanced oPTimizers)を提案する。さらに,ミュオンの直交化を促進するために光方向保存法を導入する。 4つのメインストリームMTLデータセットにわたる大規模な実験により、APTは既存のMTLアプローチを一貫して強化し、大幅なパフォーマンス改善を実現している。

論文の概要: Delve into the Applicability of Advanced Optimizers for Multi-Task Learning

関連論文リスト