Fugu-MT 論文翻訳(概要): Free on the Fly: Enhancing Flexibility in Test-Time Adaptation with Online EM

論文の概要: Free on the Fly: Enhancing Flexibility in Test-Time Adaptation with Online EM

arxiv url: http://arxiv.org/abs/2507.06973v1
Date: Wed, 09 Jul 2025 16:03:07 GMT
ステータス: 翻訳完了
システム内更新日: 2025-07-10 17:37:43.657398
Title: Free on the Fly: Enhancing Flexibility in Test-Time Adaptation with Online EM
Title（参考訳）: フリー・オン・ザ・フライ:オンラインEMによるテスト時間適応の柔軟性向上
Authors: Qiyuan Dai, Sibei Yang,
Abstract要約: FreeTTAはトレーニング不要で普遍的に利用できるメソッドであり、仮定はしない。本研究は,想定しない学習自由かつ普遍的な手法であるFreeTTAを提案する。
参考スコア（独自算出の注目度）: 13.924553294859315
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Vision-Language Models (VLMs) have become prominent in open-world image recognition for their strong generalization abilities. Yet, their effectiveness in practical applications is compromised by domain shifts and distributional changes, especially when test data distributions diverge from training data. Therefore, the paradigm of test-time adaptation (TTA) has emerged, enabling the use of online off-the-shelf data at test time, supporting independent sample predictions, and eliminating reliance on test annotations. Traditional TTA methods, however, often rely on costly training or optimization processes, or make unrealistic assumptions about accessing or storing historical training and test data. Instead, this study proposes FreeTTA, a training-free and universally available method that makes no assumptions, to enhance the flexibility of TTA. More importantly, FreeTTA is the first to explicitly model the test data distribution, enabling the use of intrinsic relationships among test samples to enhance predictions of individual samples without simultaneous access--a direction not previously explored. FreeTTA achieves these advantages by introducing an online EM algorithm that utilizes zero-shot predictions from VLMs as priors to iteratively compute the posterior probabilities of each online test sample and update parameters. Experiments demonstrate that FreeTTA achieves stable and significant improvements compared to state-of-the-art methods across 15 datasets in both cross-domain and out-of-distribution settings.
Abstract（参考訳）: VLM(Vision-Language Models)はその強力な一般化能力のために、オープンワールドイメージ認識において顕著になっている。しかし、実際の応用におけるそれらの効果は、特にテストデータ分布がトレーニングデータから分岐する場合に、ドメインシフトや分布の変化によって損なわれる。そのため、テスト時間適応(TTA)パラダイムが出現し、オンラインオフザシェルフデータをテスト時に使用でき、独立したサンプル予測をサポートし、テストアノテーションへの依存を排除できる。しかし、従来のTTA手法は、しばしばコストのかかるトレーニングや最適化プロセスに依存し、過去のトレーニングやテストデータへのアクセスや保存に関して非現実的な仮定をする。代わりに、本研究では、TTAの柔軟性を高めるために、仮定をしないトレーニングフリーで普遍的に利用可能なFreeTTAを提案する。さらに重要なのは、FreeTTAが最初にテストデータ分布を明示的にモデル化し、テストサンプル間の固有の関係を利用して、同時アクセスなしで個々のサンプルの予測を強化することである。 FreeTTAは、オンラインテストサンプルの後方確率を反復的に計算し、更新パラメータを更新するために、VLMからゼロショット予測を利用するオンラインEMアルゴリズムを導入することで、これらの利点を享受している。実験によると、FreeTTAは、クロスドメインとアウト・オブ・ディストリビューション設定の両方で15のデータセットにわたる最先端のメソッドと比較して、安定的で重要な改善を実現している。

論文の概要: Free on the Fly: Enhancing Flexibility in Test-Time Adaptation with Online EM

関連論文リスト