Fugu-MT 論文翻訳(概要): MC-RFM: Geometry-Aware Few-Shot Adaptation via Mixed-Curvature Riemannian Flow Matching

論文の概要: MC-RFM: Geometry-Aware Few-Shot Adaptation via Mixed-Curvature Riemannian Flow Matching

arxiv url: http://arxiv.org/abs/2605.08557v1
Date: Fri, 08 May 2026 23:36:37 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-12 23:28:49.73391
Title: MC-RFM: Geometry-Aware Few-Shot Adaptation via Mixed-Curvature Riemannian Flow Matching
Title（参考訳）: MC-RFM:混合曲率リーマン流マッチングによる幾何認識Few-Shot適応
Authors: Salim Khazem, Ibrahim Mohamed Serouis, Zakaria Ezzahed,
Abstract要約: textscMC-RFMは、凍結した視覚バックボーンの少数ショット適応のための混合曲率フローマッチングフレームワークである。適応は、凍結した特徴からサポートセットプロトタイプへのタスク条件付き連続輸送として定式化されている。その結果, 混合曲率ヘッド, タスク条件付け, 適応分岐ゲーティング, プロトタイプ縮小, 識別的監督がそれぞれ性能に寄与していることが示唆された。
参考スコア（独自算出の注目度）: 0.764671395172401
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Parameter-efficient adaptation of pretrained vision models is commonly performed through linear probes, prompts, low-rank updates, or lightweight residual modules. While effective, these methods usually treat adaptation as a discrete Euclidean perturbation of frozen representations, without explicitly modeling the geometry of the task-induced feature displacement. We propose \textsc{MC-RFM}, a mixed-curvature Riemannian flow-matching framework for few-shot adaptation of frozen visual backbones. The key idea is to represent adapted features on a product manifold combining a hyperbolic factor, which captures hierarchy-sensitive semantic structure, and a Euclidean factor, which preserves locally discriminative visual variation. Adaptation is formulated as a task-conditioned continuous transport from frozen features to support-set prototypes, trained with a flow-matching objective and coupled to a hybrid prototype-linear classifier. The method is lightweight, backbone-agnostic, and operates entirely on cached frozen features. Across seven visual recognition benchmarks, five frozen backbones, and 1/4/16-shot regimes, \textsc{MC-RFM} is the best-performing method in a majority of evaluated settings, with the strongest gains on Transformer backbones and fine-grained datasets. Ablations show that the mixed-curvature head, task conditioning, adaptive branch gating, prototype shrinkage, and discriminative supervision each contribute to performance. These results suggest that few-shot adaptation benefits not only from deciding which parameters to update, but also from modeling how representations should move through a geometry matched to the structure of the downstream task.
Abstract（参考訳）: 事前学習された視覚モデルのパラメータ効率の適応は、線形プローブ、プロンプト、低ランク更新、軽量残余モジュールによって一般的に行われる。有効ではあるが、これらの手法は通常、タスクによって引き起こされる特徴変位の幾何学を明示的にモデル化することなく、凍結表現の離散ユークリッド摂動として適応を扱う。凍結した視覚バックボーンの少数ショット適応のための混合曲率リーマン流マッチングフレームワークである \textsc{MC-RFM} を提案する。鍵となる考え方は、階層性に敏感な意味構造を捉えた双曲的因子と、局所的な識別的視覚的変動を保存するユークリッド因子を組み合わせた積多様体上の適応的特徴を表現することである。適応は、凍結した特徴からサポートセットのプロトタイプへのタスク条件付き連続輸送として定式化され、フローマッチングの目的で訓練され、ハイブリッドプロトタイプ-線形分類器に結合される。このメソッドは軽量でバックボーンに依存しず、完全にキャッシュされた凍結機能で動作する。 7つの視覚的認識ベンチマーク、5つの凍結したバックボーン、1/4/16ショットのレシエーション、 \textsc{MC-RFM} は、ほとんどの評価済み設定において最高のパフォーマンスの方法であり、Transformerのバックボーンときめ細かいデータセットで最大の利益を得ている。その結果, 混合曲率ヘッド, タスク条件付け, 適応分岐ゲーティング, プロトタイプ縮小, 識別的監督がそれぞれ性能に寄与していることが示唆された。これらの結果は,どのパラメータを更新するかを判断するだけでなく,下流タスクの構造と一致した形状を表現がどのように移動すべきかをモデル化することによる利点を示唆している。

論文の概要: MC-RFM: Geometry-Aware Few-Shot Adaptation via Mixed-Curvature Riemannian Flow Matching

関連論文リスト