Fugu-MT 論文翻訳(概要): You Only Need One Thing One Click: Self-Training for Weakly Supervised 3D Scene Understanding

論文の概要: You Only Need One Thing One Click: Self-Training for Weakly Supervised 3D Scene Understanding

arxiv url: http://arxiv.org/abs/2303.14727v2
Date: Sat, 9 Sep 2023 23:11:09 GMT
ステータス: 翻訳完了
システム内更新日: 2023-09-13 17:18:43.231366
Title: You Only Need One Thing One Click: Self-Training for Weakly Supervised 3D Scene Understanding
Title（参考訳）: ワンクリックで3dシーン理解の弱さを学習できるセルフトレーニング(動画あり)
Authors: Zhengzhe Liu, Xiaojuan Qi, Chi-Wing Fu
Abstract要約: 私たちはOne Thing One Click''を提案する。つまり、アノテーションはオブジェクトごとに1つのポイントをラベル付けするだけです。グラフ伝搬モジュールによって促進されるトレーニングとラベル伝搬を反復的に行う。我々のモデルは、ポイントクラスタリング戦略を備えた3Dインスタンスセグメンテーションと互換性がある。
参考スコア（独自算出の注目度）: 107.06117227661204
License: http://creativecommons.org/licenses/by/4.0/
Abstract: 3D scene understanding, e.g., point cloud semantic and instance segmentation, often requires large-scale annotated training data, but clearly, point-wise labels are too tedious to prepare. While some recent methods propose to train a 3D network with small percentages of point labels, we take the approach to an extreme and propose ``One Thing One Click,'' meaning that the annotator only needs to label one point per object. To leverage these extremely sparse labels in network training, we design a novel self-training approach, in which we iteratively conduct the training and label propagation, facilitated by a graph propagation module. Also, we adopt a relation network to generate the per-category prototype to enhance the pseudo label quality and guide the iterative training. Besides, our model can be compatible to 3D instance segmentation equipped with a point-clustering strategy. Experimental results on both ScanNet-v2 and S3DIS show that our self-training approach, with extremely-sparse annotations, outperforms all existing weakly supervised methods for 3D semantic and instance segmentation by a large margin, and our results are also comparable to those of the fully supervised counterparts. Codes and models are available at https://github.com/liuzhengzhe/One-Thing-One-Click.
Abstract（参考訳）: ポイントクラウドセマンティクスやインスタンスセグメンテーションといった3dシーンの理解には、多くの場合、大規模な注釈付きトレーニングデータが必要であるが、ポイントワイズラベルを準備するには面倒すぎることは明らかである。近年,ポイントラベルの少ない3dネットワークを訓練する手法が提案されているが,そのアプローチを極端に取り入れて,アノテータがオブジェクト毎に1つのポイントをラベル付けるだけでよい‘one thing one click,’を提案する。ネットワークトレーニングにおいて,これらの極端にスパースなラベルを活用するために,グラフ伝搬モジュールによって促進されるトレーニングとラベル伝搬を反復的に行う,新たな自己学習アプローチを設計する。また,カテゴリごとのプロトタイプを生成するために関係ネットワークを採用し,擬似ラベル品質を高め,反復学習を指導する。さらに,本モデルは,ポイントクラスタリング戦略を備えた3次元インスタンスセグメンテーションと互換性がある。 scannet-v2 と s3dis の両方の実験結果は、非常に疎結合なアノテーションを持つ我々の自己学習アプローチが、既存の3dセマンティクスとインスタンスセグメンテーションのための弱い教師付きメソッドを大きなマージンで上回っていることを示している。コードとモデルはhttps://github.com/liuzhengzhe/One-Thing-One-Click.comで公開されている。

論文の概要: You Only Need One Thing One Click: Self-Training for Weakly Supervised 3D Scene Understanding

関連論文リスト