Fugu-MT 論文翻訳(概要): Self-Supervised Scene De-occlusion

論文の概要: Self-Supervised Scene De-occlusion

arxiv url: http://arxiv.org/abs/2004.02788v1
Date: Mon, 6 Apr 2020 16:31:11 GMT
ステータス: 翻訳完了
システム内更新日: 2022-12-16 07:02:34.532643
Title: Self-Supervised Scene De-occlusion
Title（参考訳）: 自己監督シーンの閉鎖
Authors: Xiaohang Zhan, Xingang Pan, Bo Dai, Ziwei Liu, Dahua Lin, Chen Change Loy
Abstract要約: 本稿では,隠蔽対象の隠蔽順序を復元し,隠蔽対象の見えない部分を完成させることを目的としたシーン非隠蔽問題について検討する。そこで本研究では,隠されたシーン構造を監視対象として指示やアモーダルアノテーションを使わずに復元する,新規で統一的なフレームワークを用いて,この問題に対処する試みを行う。そこで,PCNet-M と PCNet-C をベースとして,プログレッシブ・オーダリング・リカバリ,アモーダル・コンプリーメント,コンテント・コンプリートを通じてシーン・デオクルージョンを実現する新しい推論手法を考案した。
参考スコア（独自算出の注目度）: 186.89979151728636
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Natural scene understanding is a challenging task, particularly when encountering images of multiple objects that are partially occluded. This obstacle is given rise by varying object ordering and positioning. Existing scene understanding paradigms are able to parse only the visible parts, resulting in incomplete and unstructured scene interpretation. In this paper, we investigate the problem of scene de-occlusion, which aims to recover the underlying occlusion ordering and complete the invisible parts of occluded objects. We make the first attempt to address the problem through a novel and unified framework that recovers hidden scene structures without ordering and amodal annotations as supervisions. This is achieved via Partial Completion Network (PCNet)-mask (M) and -content (C), that learn to recover fractions of object masks and contents, respectively, in a self-supervised manner. Based on PCNet-M and PCNet-C, we devise a novel inference scheme to accomplish scene de-occlusion, via progressive ordering recovery, amodal completion and content completion. Extensive experiments on real-world scenes demonstrate the superior performance of our approach to other alternatives. Remarkably, our approach that is trained in a self-supervised manner achieves comparable results to fully-supervised methods. The proposed scene de-occlusion framework benefits many applications, including high-quality and controllable image manipulation and scene recomposition (see Fig. 1), as well as the conversion of existing modal mask annotations to amodal mask annotations.
Abstract（参考訳）: 自然の風景理解は、特に部分的に遮蔽された複数の物体の画像に遭遇する場合、難しい課題である。この障害は、オブジェクトの順序や位置を変えることで生じる。既存のシーン理解パラダイムは、可視部分のみを解析することができ、不完全で非構造的なシーン解釈をもたらす。そこで本研究では, 咬合順序を回復し, 咬合対象の目に見えない部分を完備することを目的とした, 閉鎖シーンの課題について検討する。オーダリングやアモーダルアノテーションをスーパーバイザとして使わずに隠れたシーン構造を復元する、新しく統一されたフレームワークを通じて、この問題に対処する最初の試みを行ないます。これはPCNet (Partial Completion Network)-mask (M) と-content (C) によって実現され、オブジェクトマスクとコンテンツの分画を自己管理的に復元する。そこで,PCNet-M と PCNet-C をベースとして,プログレッシブ・オーダリング・リカバリ,アモーダル・コンプリーメント,コンテント・コンプリートを通じてシーン・デクルージョンを実現する新しい推論手法を提案する。実世界のシーンでの広範囲な実験は、他の選択肢に対する我々のアプローチの優れたパフォーマンスを示しています。驚くべきことに、自己監督的な方法で訓練された我々のアプローチは、完全に監督された方法と同等の結果を得る。提案したシーン除去フレームワークは,高品質で制御可能な画像操作やシーン再構成など,多数のアプリケーションに有効である(図1参照)。

論文の概要: Self-Supervised Scene De-occlusion

関連論文リスト