Fugu-MT 論文翻訳(概要): Sparsity Hurts: Simple Linear Adapter Can Boost Generalized Category Discovery

論文の概要: Sparsity Hurts: Simple Linear Adapter Can Boost Generalized Category Discovery

arxiv url: http://arxiv.org/abs/2605.08183v1
Date: Tue, 05 May 2026 10:14:30 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-12 23:28:49.437125
Title: Sparsity Hurts: Simple Linear Adapter Can Boost Generalized Category Discovery
Title（参考訳）: Sparsity Hurts:単純な線形アダプタは一般化されたカテゴリー発見を促進する
Authors: Bo Ye, Kai Gan, Tong Wei, Min-Ling Zhang,
Abstract要約: Generalized Category Discovery (GCD)は、ラベルのないデータから新しいカテゴリを識別する。以前のGCD法は、通常、事前訓練されたモデルからの転送可能な表現を利用する。本稿では,各 ViT ブロックに線形アダプタを組み込む,単純かつ効果的な GCD 手法である LAGCD を提案する。
参考スコア（独自算出の注目度）: 54.695641187741536
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Generalized Category Discovery (GCD) seeks to identify novel categories from unlabeled data while retaining the classification ability of seen categories. Prior GCD methods commonly leverage transferable representations from pre-trained models, adapting to downstream datasets via partial fine-tuning (updating only the final ViT block) and visual prompt tuning (appending learnable vectors to inputs). However, conventional partial fine-tuning offers limited flexibility, as it fails to adapt the entire model; meanwhile, visual prompt tuning is prone to overfitting, due to its sensitivity to initialization and inherently constrained capacity. To address these limitations, we propose LAGCD, a simple yet effective GCD approach that embeds a residual linear adapter into each ViT block. From the perspective of feature sparsity, we systematically show that non-linearity in conventional adapters impairs performance, whereas our linear adapter enhances it by enabling more flexible model capacity. We further introduce an auxiliary distribution alignment loss to mitigate the negative impact of biased predictions between seen and novel categories. Extensive experiments on both generic and fine-grained datasets confirm that LAGCD consistently improves performance over many sophisticated baselines. The source code is available at https://github.com/yebo0216best/LAGCD
Abstract（参考訳）: Generalized Category Discovery (GCD) は、未ラベルのデータから新しいカテゴリを識別し、そのカテゴリの分類能力を維持する。従来のGCD手法では、事前訓練されたモデルからの転送可能な表現を利用し、部分的な微調整(最後のViTブロックのみの更新)と視覚的プロンプトチューニング(入力への学習可能なベクトルの付加)によって下流のデータセットに適応する。しかし、従来の部分的な微調整は、モデル全体の適応に失敗するため、柔軟性が制限されるが、一方、初期化に対する感受性と本質的に制約された能力のため、視覚的なプロンプトチューニングは過度に適合する傾向がある。これらの制約に対処するために,各 ViT ブロックに線形アダプタを埋め込む,単純かつ効果的な GCD アプローチである LAGCD を提案する。特徴空間の観点からは,従来のアダプタの非線形性は性能を損なうが,線形アダプタはより柔軟なモデルキャパシティを実現することで性能を向上させる。さらに,視線と新カテゴリー間の偏差予測の負の影響を軽減するために,補助分布アライメント損失を導入する。汎用データセットと細粒度データセットの両方に対する大規模な実験により、RAGCDが多くの洗練されたベースラインよりも一貫してパフォーマンスを改善していることが確認された。ソースコードはhttps://github.com/yebo0216best/LAGCDで入手できる。

論文の概要: Sparsity Hurts: Simple Linear Adapter Can Boost Generalized Category Discovery

関連論文リスト