Fugu-MT Paper Translation (Abstract): The Geometry of Robustness: Optimizing Loss Landscape Curvature and Feature Manifold Alignment for Robust Finetuning of Vision-Language Models

Paper overview: The Geometry of Robustness: Optimizing Loss Landscape Curvature and Feature Manifold Alignment for Robust Finetuning of Vision-Language Models

arxiv url: http://arxiv.org/abs/2603.27139v1
Date: Sat, 28 Mar 2026 05:22:00 GMT
Status: Translation complete
Last updated in system: 2026-03-31 23:18:44.806886
Title: The Geometry of Robustness: Optimizing Loss Landscape Curvature and Feature Manifold Alignment for Robust Finetuning of Vision-Language Models
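Title (reference translation): The Geometry of Robustness: Optimizing Loss Landscape Curvature and Feature Manifold Alignment for Robust Fine-tuning of Vision-Language Models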
Authors: Shivang Chopra, Shaunak Halbe, Chengyue Huan, Brisa Maneechotesuwan, Zsolt Kira
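Abstract summary: Generalization-preserving methods retain ID/OOD performance but leave models vulnerable to adversarial attacks. Our key insight is that the robustness trade-off stems from two geometric failures: sharp, anisotropic minima in parameter space and unstable feature representations that deform under perturbation. We propose GRACE, a unified fine-tuning framework that jointly regularizes parameter-space curvature and feature-space invariance.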
Reference score (independently computed attention score): 29.489099268602544
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Fine-tuning approaches for Vision-Language Models (VLMs) face a critical three-way trade-off between In-Distribution (ID) accuracy, Out-of-Distribution (OOD) generalization, and adversarial robustness. Existing robust fine-tuning strategies resolve at most two axes of this trade-off. Generalization-preserving methods retain ID/OOD performance but leave models vulnerable to adversarial attacks, while adversarial training improves robustness to targeted attacks but degrades ID/OOD accuracy. Our key insight is that the robustness trade-off stems from two geometric failures: sharp, anisotropic minima in parameter space and unstable feature representations that deform under perturbation. To address this, we propose GRACE (Gram-aligned Robustness via Adaptive Curvature Estimation), a unified fine-tuning framework that jointly regularizes the parameter-space curvature and feature-space invariance for VLMs. Grounded in Robust PAC-Bayes theory, GRACE employs adaptive weight perturbations scaled by local curvature to promote flatter minima, combined with a feature alignment loss that maintains representation consistency across clean, adversarial, and OOD inputs. On ImageNet fine-tuning of CLIP models, GRACE simultaneously improves ID accuracy by 10.8%, and adversarial accuracy by 13.5% while maintaining 57.0% OOD accuracy (vs. 57.4% zero-shot baseline). Geometric analysis confirms that GRACE converges to flatter minima without feature distortion across distribution shifts, providing a principled step toward generalized robustness in foundation VLMs.
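Abstract (reference translation): Fine-tuning approaches for Vision-Language Models (VLMs) face a critical three-way trade-off between In-Distribution (ID) accuracy, Out-of-Distribution (OOD) generalization, and adversarial robustness. Existing robust fine-tuning strategies resolve at most two axes of this trade-off. Generalization-preserving methods retain ID/OOD performance but leave models vulnerable to adversarial attacks, while adversarial training improves robustness to attacks but degrades ID/OOD accuracy. Our key insight is that the robustness trade-off stems from two geometric failures: sharp, anisotropic minima in parameter space and unstable feature representations that deform under perturbation. We therefore propose GRACE, a unified fine-tuning framework that jointly regularizes parameter-space curvature and feature-space invariance. Grounded in Robust PAC-Bayes theory, GRACE uses adaptive weight perturbations scaled by local curvature to promote flatter minima, combined with a feature alignment loss that maintains representation consistency across clean, adversarial, and OOD inputs. On ImageNet fine-tuning of CLIP models, GRACE simultaneously improves ID accuracy by 10.8% and adversarial accuracy by 13.5% while maintaining 57.0% OOD accuracy (vs. a 57.4% zero-shot baseline). Geometric analysis confirms that GRACE converges to flatter minima without feature distortion across distribution shifts, providing a principled step toward generalized robustness in foundation VLMs.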
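The abstract names a feature alignment loss and a "Gram-aligned" design but does not give the formula. The sketch below is only an illustration of the general idea, under the assumption that alignment is measured by comparing Gram matrices (pairwise cosine similarities) of clean and perturbed feature batches. The function names `gram_matrix` and `gram_alignment_loss` and the mean-squared-difference form are hypothetical and are not taken from the paper.

```python
import math


def l2_normalize(vector):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector] if norm > 0 else list(vector)


def gram_matrix(features):
    """Pairwise cosine similarities between all feature vectors in a batch."""
    unit = [l2_normalize(f) for f in features]
    return [[sum(a * b for a, b in zip(u, w)) for w in unit] for u in unit]


def gram_alignment_loss(clean_features, perturbed_features):
    """Assumed loss: mean squared difference between the two Gram matrices.

    The loss is zero when the perturbed batch keeps the same pairwise
    geometry as the clean batch, and grows as that geometry deforms.
    """
    g_clean = gram_matrix(clean_features)
    g_pert = gram_matrix(perturbed_features)
    n = len(g_clean)
    total = sum(
        (g_clean[i][j] - g_pert[i][j]) ** 2
        for i in range(n)
        for j in range(n)
    )
    return total / (n * n)


# Toy batches: two orthogonal feature vectors.
clean = [[1.0, 0.0], [0.0, 1.0]]
rotated = [[0.0, 1.0], [-1.0, 0.0]]    # rigid rotation: geometry preserved
collapsed = [[1.0, 0.0], [1.0, 0.0]]   # both features collapse together

loss_rotated = gram_alignment_loss(clean, rotated)
loss_collapsed = gram_alignment_loss(clean, collapsed)
print(loss_rotated, loss_collapsed)
```

In this toy case a rigid rotation of the features leaves the loss at zero, while collapsing the two features onto one direction is penalized, which matches the abstract's description of representations that "deform under perturbation". The curvature-scaled weight perturbation is a separate component and is not sketched here.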

Related papers