Fugu-MT 論文翻訳(概要): A Systematic Performance Analysis of Deep Perceptual Loss Networks: Breaking Transfer Learning Conventions

論文の概要: A Systematic Performance Analysis of Deep Perceptual Loss Networks: Breaking Transfer Learning Conventions

arxiv url: http://arxiv.org/abs/2302.04032v2
Date: Tue, 24 Oct 2023 07:35:41 GMT
ステータス: 翻訳完了
システム内更新日: 2023-10-26 01:32:29.082183
Title: A Systematic Performance Analysis of Deep Perceptual Loss Networks: Breaking Transfer Learning Conventions
Title（参考訳）: ディープ・パーセプチュアル・ロス・ネットワークの系統的性能解析--Breaking Transfer Learning Conventions-
Authors: Gustav Grund Pihlgren, Konstantina Nikolaidou, Prakash Chandra Chhipa, Nosheen Abid, Rajkumar Saini, Fredrik Sandin, Marcus Liwicki
Abstract要約: ディープ・パーセプチュアル・ロス(英: Deep Perceptual Los)は、ニューラルネットワークから抽出されたディープ・フィーチャを用いて人間の知覚を模倣することを目的としたコンピュータビジョンにおけるロス関数の一種である。この手法の多くのアプリケーションは事前訓練されたネットワーク(しばしば畳み込みネットワーク)を損失計算に利用する。この研究は、多くの異なる特徴抽出ポイントに対して、広く使われ、容易に利用できる事前訓練されたネットワークのホストを体系的に評価する。
参考スコア（独自算出の注目度）: 5.675846065608753
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Deep perceptual loss is a type of loss function in computer vision that aims to mimic human perception by using the deep features extracted from neural networks. In recent years, the method has been applied to great effect on a host of interesting computer vision tasks, especially for tasks with image or image-like outputs, such as image synthesis, segmentation, depth prediction, and more. Many applications of the method use pretrained networks, often convolutional networks, for loss calculation. Despite the increased interest and broader use, more effort is needed toward exploring which networks to use for calculating deep perceptual loss and from which layers to extract the features. This work aims to rectify this by systematically evaluating a host of commonly used and readily available, pretrained networks for a number of different feature extraction points on four existing use cases of deep perceptual loss. The use cases of perceptual similarity, super-resolution, image segmentation, and dimensionality reduction, are evaluated through benchmarks. The benchmarks are implementations of previous works where the selected networks and extraction points are evaluated. The performance on the benchmarks, and attributes of the networks and extraction points are then used as a basis for an in-depth analysis. This analysis uncovers insight regarding which architectures provide superior performance for deep perceptual loss and how to choose an appropriate extraction point for a particular task and dataset. Furthermore, the work discusses the implications of the results for deep perceptual loss and the broader field of transfer learning. The results show that deep perceptual loss deviates from two commonly held conventions in transfer learning, which suggests that those conventions are in need of deeper analysis.
Abstract（参考訳）: ディープ・パーセプチュアル・ロス(deep perceptual loss)は、ニューラルネットワークから抽出された深い特徴を用いて人間の知覚を模倣することを目的としたコンピュータビジョンにおける損失関数の一種である。近年,画像合成やセグメンテーション,奥行き予測など,画像や画像ライクなアウトプットを持つタスクにおいて,興味深いコンピュータビジョンタスクのホストに対して大きな効果が与えられている。この手法の多くのアプリケーションは事前訓練されたネットワーク(しばしば畳み込みネットワーク)を損失計算に利用する。関心が高まり、広く使われるようになったにも拘わらず、深い知覚的損失を計算するためにどのネットワークを使うか、どの層から特徴を抽出するかを探索するにはより多くの努力が必要である。本研究の目的は,既存の4つの重度知覚喪失例において,多種多様な特徴抽出点に対して,広く利用され,容易に利用できる事前学習ネットワークのホストを体系的に評価することである。知覚的類似性,超解像,画像分割,次元化のユースケースをベンチマークにより評価した。ベンチマークは、選択したネットワークと抽出ポイントを評価する以前の作業の実装である。ベンチマークのパフォーマンスとネットワークの属性と抽出ポイントは、詳細な分析の基盤として使用される。この分析は、どのアーキテクチャが深い知覚損失に対して優れたパフォーマンスを提供するのか、特定のタスクやデータセットの適切な抽出ポイントをどのように選択するかに関する洞察を明らかにする。さらに本研究は, 深い知覚喪失に対する結果の意義と, 転校学習の幅広い分野について論じる。その結果, 転校学習における2つの慣例から深い知覚損失が逸脱し, それらの規則がより深い分析を必要とすることが示唆された。

論文の概要: A Systematic Performance Analysis of Deep Perceptual Loss Networks: Breaking Transfer Learning Conventions

関連論文リスト