Fugu-MT 論文翻訳(概要): Accelerated Dynamic Importance Weighting with Versatile Divergence-Minimizing Estimators

論文の概要: Accelerated Dynamic Importance Weighting with Versatile Divergence-Minimizing Estimators

arxiv url: http://arxiv.org/abs/2605.25499v1
Date: Mon, 25 May 2026 06:58:51 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-26 19:50:19.424919
Title: Accelerated Dynamic Importance Weighting with Versatile Divergence-Minimizing Estimators
Title（参考訳）: 可変ダイバージェンス最小化推定器による動的重み付けの高速化
Authors: Tongtong Fang, Nan Lu, Gang Niu, Kenji Fukumizu, Masashi Sugiyama,
Abstract要約: 共同分散シフト下での深層学習のための統合的で効率的なIWフレームワークである加速DIW(ADIW)を提案する。 ADIWはDIWを多種多様な重み推定手法をサポートする統一分散最小化フレームワークに一般化する。 ADIWは最先端のIW性能を実現し、より効率的である。
参考スコア（独自算出の注目度）: 62.20244820580353
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Importance weighting (IW) is a golden solver for joint distribution shift, where the joint distributions differ between the training and test data. To solve this problem, IW estimates test-to-training density ratios as importance weights and reweights the training losses accordingly. Recent advances in dynamic IW (DIW) integrate weight estimation into model training, enabling scalable IW for deep models and achieving strong performance on large modern datasets. Despite its promise, DIW remains limited in two aspects. First, it incurs substantial computational overhead by solving a kernel mean matching (KMM)-induced optimization problem to convergence in every mini-batch. Second, it relies solely on KMM for weight estimation, whereas the IW literature contains diverse estimation methods based on different divergence measures. In this paper, we propose accelerated DIW (ADIW), a unified and efficient IW framework for deep learning under joint distribution shift. ADIW performs a few lightweight projected gradient descent updates that warm-start from previously updated weights, substantially improving efficiency. Moreover, ADIW generalizes DIW into a unified divergence-minimization framework that supports diverse weight-estimation methods in a plug-and-play manner, including those based on the Kullback-Leibler divergence, squared distance, and Wasserstein-1 distance. We establish convergence guarantees for ADIW under mild conditions, and empirical results demonstrate that ADIW achieves state-of-the-art IW performance while being substantially more efficient.
Abstract（参考訳）: 重要度重み付け (IW) は, トレーニングデータと試験データとで共同分布が異なる, 共同分布シフトの黄金解法である。この問題を解決するために、IWはテスト対トレーニング密度比を重みとして推定し、それに応じてトレーニング損失を再重み付けする。動的IW(DIW)の最近の進歩は、重み推定をモデルトレーニングに統合し、ディープモデルにスケーラブルなIWを可能にし、大規模なモダンデータセット上で強力なパフォーマンスを実現する。約束にもかかわらず、DIWは2つの面で制限されている。第一に、カーネル平均マッチング(KMM)による最適化問題を各ミニバッチに収束させることで、かなりの計算オーバーヘッドを発生させる。第二に、重量推定はKMMのみに頼っているのに対し、IWの文献は異なる発散度尺度に基づく様々な推定方法を含んでいる。本稿では,共同分散シフト下での深層学習のための統合的で効率的なIWフレームワークであるアクセラレーションDIW(ADIW)を提案する。 ADIWは、以前更新された重みからウォームスタートする軽量な勾配降下アップデートをいくつか実行し、効率を大幅に改善した。さらに、ADIWは、Kullback-Leiblerの発散、二乗距離、Wasserstein-1距離などを含む様々な重み推定手法をプラグ・アンド・プレイ方式でサポートする統一的な発散最小化フレームワークにDIWを一般化する。我々は,ADIWの収束保証を軽度条件下で確立し,ADIWが最先端IW性能を著しく向上しつつ達成できることを実証した。

論文の概要: Accelerated Dynamic Importance Weighting with Versatile Divergence-Minimizing Estimators

関連論文リスト