Fugu-MT 論文翻訳(概要): End-to-end One-shot Human Parsing

論文の概要: End-to-end One-shot Human Parsing

arxiv url: http://arxiv.org/abs/2105.01241v3
Date: Sun, 24 Sep 2023 05:03:21 GMT
ステータス: 翻訳完了
システム内更新日: 2023-09-27 05:21:22.818077
Title: End-to-end One-shot Human Parsing
Title（参考訳）: エンドツーエンドのワンショットパーシング
Authors: Haoyu He, Bohan Zhuang, Jing Zhang, Jianfei Cai, Dacheng Tao
Abstract要約: One-shot Human Parsing (OSHP) タスクでは、テスト例によって定義されたオープンなクラスに人間を解析する必要がある。 EOP-Net (End-to-end One-shot Human Parsing Network) が提案されている。 EOP-Netは、一発セグメンテーションモデルを大きなマージンで上回る。
参考スコア（独自算出の注目度）: 91.5113227694443
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Previous human parsing methods are limited to parsing humans into pre-defined classes, which is inflexible for practical fashion applications that often have new fashion item classes. In this paper, we define a novel one-shot human parsing (OSHP) task that requires parsing humans into an open set of classes defined by any test example. During training, only base classes are exposed, which only overlap with part of the test-time classes. To address three main challenges in OSHP, i.e., small sizes, testing bias, and similar parts, we devise an End-to-end One-shot human Parsing Network (EOP-Net). Firstly, an end-to-end human parsing framework is proposed to parse the query image into both coarse-grained and fine-grained human classes, which embeds rich semantic information that is shared across different granularities to identify the small-sized human classes. Then, we gradually smooth the training-time static prototypes to get robust class representations. Moreover, we employ a dynamic objective to encourage the network's enhancing features' representational capability in the early training phase while improving features' transferability in the late training phase. Therefore, our method can quickly adapt to the novel classes and mitigate the testing bias issue. In addition, we add a contrastive loss at the prototype level to enforce inter-class distances, thereby discriminating the similar parts. For comprehensive evaluations on the new task, we tailor three existing popular human parsing benchmarks to the OSHP task. Experiments demonstrate that EOP-Net outperforms representative one-shot segmentation models by large margins and serves as a strong baseline for further research. The source code is available at https://github.com/Charleshhy/One-shot-Human-Parsing.
Abstract（参考訳）: 従来の人間の構文解析法は、人間が定義済みのクラスにパースすることに限定されており、新しいファッションアイテムクラスを持つ実用的なファッションアプリケーションには柔軟性がない。本稿では,人間を任意のテスト例で定義されたクラスのオープンセットにパースする必要がある,新しいone-shot human parse (oshp)タスクを定義する。トレーニング中は、ベースクラスのみが公開され、テスト時間クラスの一部にのみ重複する。 oshpの3つの主な課題、すなわち小さなサイズ、テストバイアス、および類似部分に対処するために、エンドツーエンドのワンショット人間解析ネットワーク(eop-net)を考案する。まず、エンド・ツー・エンドのヒューマン・パース・フレームワークが、クエリ・イメージを粗粒度と細粒度の両方のヒューマン・クラスに解析するために提案されている。そして、トレーニング時の静的プロトタイプを徐々にスムーズにして、堅牢なクラス表現を得る。さらに,初期訓練段階ではネットワークの表現能力の向上を奨励し,後期訓練段階では特徴の伝達性を向上させるために動的目標を用いる。そこで本手法は,新しいクラスに迅速に適応し,テストバイアスを軽減できる。さらに,クラス間距離を強制するために,プロトタイプレベルでの対比的損失を追加し,類似部分の識別を行う。新しいタスクに関する総合的な評価のために、既存の3つのヒトパースベンチマークをOSHPタスクに調整する。実験により、eop-netは代表的なワンショットセグメンテーションモデルを大きなマージンで上回り、さらなる研究の基盤となることが示されている。ソースコードはhttps://github.com/Charleshhy/One-shot-Human-Parsingで入手できる。

論文の概要: End-to-end One-shot Human Parsing

関連論文リスト