Fugu-MT 論文翻訳(概要): Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing

論文の概要: Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing

arxiv url: http://arxiv.org/abs/2203.14448v1
Date: Mon, 28 Mar 2022 02:12:30 GMT
ステータス: 翻訳完了
システム内更新日: 2022-03-29 15:36:28.414838
Title: Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
Title（参考訳）: 顔解析のための周期的自己制御型マルチタスク学習
Authors: Qingping Zheng, Jiankang Deng, Zheng Zhu, Ying Li, Stefanos Zafeiriou
Abstract要約: 顔解析のための周期的自己統制型デカップリング型マルチタスク学習を提案する。具体的には、DML-CSRは、顔解析、バイナリエッジ、カテゴリエッジ検出を含むマルチタスクモデルを設計する。提案手法は,Helen,CelebA-HQ,LapaMaskのデータセット上での最先端性能を実現する。
参考スコア（独自算出の注目度）: 71.19528222206088
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: This paper probes intrinsic factors behind typical failure cases (e.g. spatial inconsistency and boundary confusion) produced by the existing state-of-the-art method in face parsing. To tackle these problems, we propose a novel Decoupled Multi-task Learning with Cyclical Self-Regulation (DML-CSR) for face parsing. Specifically, DML-CSR designs a multi-task model which comprises face parsing, binary edge, and category edge detection. These tasks only share low-level encoder weights without high-level interactions between each other, enabling to decouple auxiliary modules from the whole network at the inference stage. To address spatial inconsistency, we develop a dynamic dual graph convolutional network to capture global contextual information without using any extra pooling operation. To handle boundary confusion in both single and multiple face scenarios, we exploit binary and category edge detection to jointly obtain generic geometric structure and fine-grained semantic clues of human faces. Besides, to prevent noisy labels from degrading model generalization during training, cyclical self-regulation is proposed to self-ensemble several model instances to get a new model and the resulting model then is used to self-distill subsequent models, through alternating iterations. Experiments show that our method achieves the new state-of-the-art performance on the Helen, CelebAMask-HQ, and Lapa datasets. The source code is available at https://github.com/deepinsight/insightface/tree/master/parsing/dml_csr.
Abstract（参考訳）: 本稿では,既存の顔解析における最先端手法が生み出す典型的な障害(例えば,空間的不整合や境界混乱)の背後にある本質的な要因について検討する。これらの問題に対処するために,顔解析のためのDML-CSR(Decoupled Multi-task Learning with Cyclical Self-Regulation)を提案する。具体的には、DML-CSRは、顔解析、バイナリエッジ、カテゴリエッジ検出を含むマルチタスクモデルを設計する。これらのタスクは、互いに高レベルな相互作用なしに低レベルエンコーダ重みを共有するだけで、推論段階で補助モジュールをネットワーク全体から切り離すことができる。空間的不整合に対処するため,余分なプール操作を使わずにグローバルな文脈情報をキャプチャする動的二重グラフ畳み込みネットワークを開発した。単一面と複数面の両方における境界の混乱に対処するために,二項エッジ検出とカテゴリエッジ検出を併用して,人間の顔の汎用幾何学的構造と詳細な意味的手がかりを得る。さらに、学習中にノイズラベルがモデル一般化を損なうのを防ぐために、複数のモデルインスタンスを自己アンサンブルして新しいモデルを得るための循環自己規制を提案し、その結果、繰り返しを交互に繰り返して後続モデルに自己拡散させる。実験により,本手法がHelen,CelebAMask-HQ,Lapaデータセット上での最先端性能を実現することを示す。ソースコードはhttps://github.com/deepinsight/insightface/tree/master/parsing/dml_csrで入手できる。

論文の概要: Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing

関連論文リスト