Fugu-MT Paper Translation (Abstract): Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

Paper summary: Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

arxiv url: http://arxiv.org/abs/2107.14428v1
Date: Fri, 30 Jul 2021 04:50:56 GMT
Status: Translation complete
Last updated in system: 2021-08-02 13:02:10.056476
Title: Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation
Title (reference translation): Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation
Authors: Bowen Zhang, Yifan Liu, Zhi Tian, Chunhua Shen
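Abstract summary: The paper proposes a novel decoder called the dynamic neural representational decoder (NRD). Because each location on the encoder's output corresponds to a local patch of semantic labels, these local patches are represented with compact neural networks. This neural representation lets the decoder exploit the smoothness prior in the semantic label space, which makes the decoder more efficient.
Reference score (independently computed attention score): 98.05643473345474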
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Semantic segmentation requires per-pixel prediction for a given image. Typically, the output resolution of a segmentation network is severely reduced due to the downsampling operations in the CNN backbone. Most previous methods employ upsampling decoders to recover the spatial resolution. Various decoders were designed in the literature. Here, we propose a novel decoder, termed dynamic neural representational decoder (NRD), which is simple yet significantly more efficient. As each location on the encoder's output corresponds to a local patch of the semantic labels, in this work, we represent these local patches of labels with compact neural networks. This neural representation enables our decoder to leverage the smoothness prior in the semantic label space, and thus makes our decoder more efficient. Furthermore, these neural representations are dynamically generated and conditioned on the outputs of the encoder networks. The desired semantic labels can be efficiently decoded from the neural representations, resulting in high-resolution semantic segmentation predictions. We empirically show that our proposed decoder can outperform the decoder in DeeplabV3+ with only 30% computational complexity, and achieve competitive performance with the methods using dilated encoders with only 15% computation. Experiments on the Cityscapes, ADE20K, and PASCAL Context datasets demonstrate the effectiveness and efficiency of our proposed method.
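Abstract (reference translation): Semantic segmentation requires per-pixel prediction for a given image. The output resolution of a segmentation network is usually reduced substantially by the downsampling operations in the CNN backbone. Previous methods use upsampling decoders to recover the spatial resolution, and various decoders have been designed in the literature. This paper proposes a novel decoder called the dynamic neural representational decoder (NRD). Because each location on the encoder output corresponds to a local patch of semantic labels, these local label patches are represented with compact neural networks. This neural representation lets the decoder exploit the smoothness prior in the semantic label space, which makes the decoder more efficient. Furthermore, the neural representations are generated dynamically and conditioned on the outputs of the encoder network. The desired semantic labels can be decoded efficiently from the neural representations, yielding high-resolution semantic segmentation predictions. Empirically, the proposed decoder can outperform the decoder in DeeplabV3+ with only 30% of its computational complexity, and achieves performance competitive with methods that use dilated encoders with only 15% of their computation. Experiments on the Cityscapes, ADE20K, and PASCAL Context datasets demonstrate the effectiveness and efficiency of the proposed method.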
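
The idea described in the abstract, that each encoder location dynamically generates a compact network which is then decoded into a local patch of labels, can be illustrated with a toy sketch. This is not the authors' architecture: the linear "controller" (ctrl_w, ctrl_b), the coordinate-based two-layer MLP, the sizes C, HIDDEN, K, STRIDE, and every function name below are assumptions chosen for illustration, and the weights are random rather than trained.

```python
import random

def linear(weights, bias, x):
    """Dense layer; weights holds one row per output unit."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]

def relu(x):
    return [max(0.0, v) for v in x]

def num_params(hidden, num_classes):
    """Parameter count of a coords(2) -> hidden -> num_classes MLP."""
    return 2 * hidden + hidden + hidden * num_classes + num_classes

def split_params(theta, hidden, num_classes):
    """Unpack a flat parameter vector into the layers of the tiny MLP."""
    w1 = [theta[2 * h:2 * h + 2] for h in range(hidden)]
    i = 2 * hidden
    b1 = theta[i:i + hidden]
    i += hidden
    w2 = [theta[i + hidden * k:i + hidden * (k + 1)] for k in range(num_classes)]
    i += hidden * num_classes
    b2 = theta[i:i + num_classes]
    return w1, b1, w2, b2

def decode_patch(feature, ctrl_w, ctrl_b, hidden, num_classes, stride):
    """Generate a tiny MLP from one encoder feature, then evaluate it on a patch."""
    # Hypothetical controller: a linear map from the feature to the MLP weights.
    theta = linear(ctrl_w, ctrl_b, feature)
    w1, b1, w2, b2 = split_params(theta, hidden, num_classes)
    patch = []
    for y in range(stride):
        row = []
        for x in range(stride):
            # Pixel-centre coordinates relative to the patch, scaled to (-1, 1).
            coord = [(2 * x + 1) / stride - 1.0, (2 * y + 1) / stride - 1.0]
            logits = linear(w2, b2, relu(linear(w1, b1, coord)))
            row.append(max(range(num_classes), key=logits.__getitem__))
        patch.append(row)
    return patch

def decode(features, ctrl_w, ctrl_b, hidden, num_classes, stride):
    """features: H x W grid of vectors -> (H*stride) x (W*stride) label map."""
    out = []
    for feat_row in features:
        patches = [decode_patch(f, ctrl_w, ctrl_b, hidden, num_classes, stride)
                   for f in feat_row]
        for y in range(stride):
            out.append([label for p in patches for label in p[y]])
    return out

rng = random.Random(0)
C, HIDDEN, K, STRIDE = 4, 3, 5, 8  # illustrative sizes, not from the paper
P = num_params(HIDDEN, K)
ctrl_w = [[rng.uniform(-1, 1) for _ in range(C)] for _ in range(P)]
ctrl_b = [rng.uniform(-1, 1) for _ in range(P)]
features = [[[rng.uniform(-1, 1) for _ in range(C)] for _ in range(3)]
            for _ in range(2)]
labels = decode(features, ctrl_w, ctrl_b, HIDDEN, K, STRIDE)
```

With these toy sizes, a 2 x 3 feature grid decodes to a 16 x 24 label map. Each 8 x 8 patch is described by 29 generated parameters, whereas predicting every logit in the patch directly would take 8 x 8 x 5 = 320 values. That gap is one way to read the abstract's claim that a compact representation exploits smoothness in the label space.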

Related papers