Fugu-MT 論文翻訳(概要): How to trust unlabeled data? Instance Credibility Inference for Few-Shot Learning

論文の概要: How to trust unlabeled data? Instance Credibility Inference for Few-Shot Learning

arxiv url: http://arxiv.org/abs/2007.08461v4
Date: Tue, 11 May 2021 03:21:15 GMT
ステータス: 翻訳完了
システム内更新日: 2022-11-10 06:11:31.011052
Title: How to trust unlabeled data? Instance Credibility Inference for Few-Shot Learning
Title（参考訳）: ラベルのないデータを信用する方法? 単発学習におけるインスタンス信頼度推定
Authors: Yikai Wang, Li Zhang, Yuan Yao, Yanwei Fu
Abstract要約: 本稿では,未ラベルのインスタンスを数発の視覚認識に利用するために,ICI (Instance Credibility Inference) と呼ばれる統計的アプローチを提案する。擬似ラベル付きインスタンスの信頼性は, それらの付随パラメータの正規化経路に沿ってランク付けし, 最も信頼性の高い擬似ラベル付きインスタンスを拡張ラベル付きインスタンスとして保存する。
参考スコア（独自算出の注目度）: 47.21354101796544
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Deep learning based models have excelled in many computer vision tasks and appear to surpass humans' performance. However, these models require an avalanche of expensive human labeled training data and many iterations to train their large number of parameters. This severely limits their scalability to the real-world long-tail distributed categories, some of which are with a large number of instances, but with only a few manually annotated. Learning from such extremely limited labeled examples is known as Few-shot learning (FSL). Different to prior arts that leverage meta-learning or data augmentation strategies to alleviate this extremely data-scarce problem, this paper presents a statistical approach, dubbed Instance Credibility Inference (ICI) to exploit the support of unlabeled instances for few-shot visual recognition. Typically, we repurpose the self-taught learning paradigm to predict pseudo-labels of unlabeled instances with an initial classifier trained from the few shot and then select the most confident ones to augment the training set to re-train the classifier. This is achieved by constructing a (Generalized) Linear Model (LM/GLM) with incidental parameters to model the mapping from (un-)labeled features to their (pseudo-)labels, in which the sparsity of the incidental parameters indicates the credibility of the corresponding pseudo-labeled instance. We rank the credibility of pseudo-labeled instances along the regularization path of their corresponding incidental parameters, and the most trustworthy pseudo-labeled examples are preserved as the augmented labeled instances. Theoretically, under mild conditions of restricted eigenvalue, irrepresentability, and large error, our approach is guaranteed to collect all the correctly-predicted instances from the noisy pseudo-labeled set.
Abstract（参考訳）: ディープラーニングベースのモデルは、多くのコンピュータビジョンタスクに優れ、人間のパフォーマンスを上回っているように見える。しかし、これらのモデルには、高価なラベル付きトレーニングデータの雪崩と、多数のパラメータをトレーニングするための多くのイテレーションが必要である。これにより、実世界のロングテール分散カテゴリへのスケーラビリティが大幅に制限される。このようなラベル付き例から学ぶことはFew-shot Learning(FSL)として知られている。メタラーニングやデータ拡張戦略を駆使して、この極端なデータ収集問題を軽減する先行技術とは異なり、本稿では、少数の視覚認識のためにラベルなしインスタンスのサポートを利用するためのインスタンス信頼度推論(ici)と呼ばれる統計的アプローチを提案する。典型的には、自己学習パラダイムを用いて、未ラベルのインスタンスの擬似ラベルを、いくつかのショットから訓練された初期分類器で予測し、最も自信のある分類器を選択して、トレーニングセットを増強し、分類器を再訓練する。これは(一般化された)線形モデル(lm/glm)と、(未)ラベル付き特徴からそれらのラベル(pseudo-)へのマッピングをモデル化するために、付随的パラメータを持つ(一般化された)線形モデル(lm/glm)を構築して実現される。擬似ラベル付きインスタンスの信頼性は, それらの付随パラメータの正規化経路に沿ってランク付けし, 最も信頼性の高い擬似ラベル付きインスタンスを拡張ラベル付きインスタンスとして保存する。理論的には、制限された固有値、表現可能性、大きな誤差の条件下では、ノイズの多い擬似ラベル集合から正しい予測された全てのインスタンスを収集することが保証される。

論文の概要: How to trust unlabeled data? Instance Credibility Inference for Few-Shot Learning

関連論文リスト