Fugu-MT 論文翻訳(概要): CHIPS: Efficient CLIP Adaptation via Curvature-aware Hybrid Influence-based Data Selection

論文の概要: CHIPS: Efficient CLIP Adaptation via Curvature-aware Hybrid Influence-based Data Selection

arxiv url: http://arxiv.org/abs/2511.18519v1
Date: Sun, 23 Nov 2025 16:25:42 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-25 18:34:24.880086
Title: CHIPS: Efficient CLIP Adaptation via Curvature-aware Hybrid Influence-based Data Selection
Title（参考訳）: CHIPS: Curvature-aware Hybrid Influence-based data SelectionによるCLIP適応の効率化
Authors: Xinlin Zhuang, Yichen Li, Xiwei Liu, Haolin Yang, Yifan Lu, Ziyun Zou, Yulong Li, Huifa Li, Dongliang Chen, Qinglei Wang, Weiyang Liu, Ying Qian, Jiangming Shi, Imran Razzak,
Abstract要約: CLIPを垂直領域に適用することは、通常、新しい微調整戦略や、大規模なドメイン固有のデータセット上での継続事前トレーニング(CPT)によってアプローチされる。我々は、このタスクをデータ中心の観点から再考する: CPTの大規模データセットの代わりに、効果的なデータ選択は可能か? そこで,CHIPS(Curvature-aware Hybrid Influence in Projection Subspace)を導入し,各画像テキスト対に3つの相補的要素を3つの目標に整合させるユーティリティスコアを割り当てる。
参考スコア（独自算出の注目度）: 41.61500990573312
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Adapting CLIP to vertical domains is typically approached by novel fine-tuning strategies or by continual pre-training (CPT) on large domain-specific datasets. Yet, data itself remains an underexplored factor in this process. We revisit this task from a data-centric perspective: Can effective data selection substitute for large-scale datasets in CPT? We introduce CHIPS (Curvature-aware Hybrid Influence in Projection Subspace), which assigns each image-text pair a utility score that integrates three complementary factors aligned with three goals: faithfulness via a curvature-aware, Newton-style alignment computed in CLIP's end-point subspace; scalability via an InfoNCE-aware curvature estimator with Johnson-Lindenstrauss (JL) sketching; and retention via a selection-aware relevance weight combined with learnability to balance target adaptation against general-domain preservation. We justify this design theoretically by proving a lower-bound guarantee on the proxy's correlation with full-parameter alignment and by characterizing the bias-variance trade-offs introduced by curvature mixing and JL sketching. We evaluate CHIPS empirically across various settings: 1) CHIPS attains state-of-the-art performance among selection baselines on 17 medical benchmarks, matches full-dataset CPT with 30% of the data, and outperforms half-dataset CPT using only 10%; 2) on 31 general-domain benchmarks, CHIPS yields the smallest performance drop under 10-30% data-retention budgets. Code, data, and checkpoints will be released.
Abstract（参考訳）: CLIPを垂直領域に適用することは、通常、新しい微調整戦略や、大規模なドメイン固有のデータセット上での継続事前トレーニング(CPT)によってアプローチされる。しかし、このプロセスではデータ自体が未探索の要素である。我々は、このタスクをデータ中心の観点から再考する: CPTの大規模データセットの代わりに、効果的なデータ選択は可能か? 提案手法では,CLIPの終点部分空間で計算されたニュートン型アライメントの忠実度,Johnson-Lindenstrauss (JL) スケッチによるInfoNCE対応曲率推定器によるスケーラビリティ,および一般領域保存に対する目標適応のバランスをとるための学習性を組み合わせた選択認識関連度重みによる保持という,3つの目標に整合した3つの相補的因子を統合するユーティリティスコアを,各画像テキストペアに割り当てるCHIPS(Curvature-aware Hybrid Influence in Projection Subspace)を導入する。我々は、この設計を理論的に正当化するために、プロキシとフルパラメータアライメントとの相関に対する低いバウンド保証を証明し、曲率混合とJLスケッチによって導入されたバイアス分散トレードオフを特徴付ける。様々な設定でCHIPSを実証的に評価する。 1)CHIPSは17の医用ベンチマークにおける選抜ベースライン間の最先端性能を達成し、全データセットCPTと30%のデータとを一致させ、半データセットCPTを10%で上回ります。 2)31の一般ドメインベンチマークにおいて、CHIPSは10～30%のデータ保持予算以下で最小のパフォーマンス低下をもたらす。コード、データ、チェックポイントがリリースされる。

論文の概要: CHIPS: Efficient CLIP Adaptation via Curvature-aware Hybrid Influence-based Data Selection

関連論文リスト