Fugu-MT 論文翻訳(概要): Share With Thy Neighbors: Single-View Reconstruction by Cross-Instance Consistency

論文の概要: Share With Thy Neighbors: Single-View Reconstruction by Cross-Instance Consistency

arxiv url: http://arxiv.org/abs/2204.10310v1
Date: Thu, 21 Apr 2022 17:47:35 GMT
ステータス: 翻訳完了
システム内更新日: 2022-04-22 14:21:43.010598
Title: Share With Thy Neighbors: Single-View Reconstruction by Cross-Instance Consistency
Title（参考訳）: Thy Neighbors氏との共有: クロスインスタンス一貫性による単一ビュー再構築
Authors: Tom Monnier, Matthew Fisher, Alexei A. Efros, Mathieu Aubry
Abstract要約: 単一ビューの再構築は通常、視点アノテーション、シルエット、背景の欠如、同じインスタンスの複数のビュー、テンプレートの形状、対称性に依存する。異なるオブジェクトインスタンスのイメージ間の一貫性を明確に活用することで、これらの監督と仮説をすべて回避します。 i)プログレッシブ・コンディショニング(プログレッシブ・コンディショニング)、(ii)類似の形状やテクスチャを持つインスタンス間の一貫性の喪失、(ii)モデルのカテゴリからインスタンスへと徐々に専門化するためのトレーニング戦略。
参考スコア（独自算出の注目度）: 59.427074701985795
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Approaches to single-view reconstruction typically rely on viewpoint annotations, silhouettes, the absence of background, multiple views of the same instance, a template shape, or symmetry. We avoid all of these supervisions and hypotheses by leveraging explicitly the consistency between images of different object instances. As a result, our method can learn from large collections of unlabelled images depicting the same object category. Our main contributions are two approaches to leverage cross-instance consistency: (i) progressive conditioning, a training strategy to gradually specialize the model from category to instances in a curriculum learning fashion; (ii) swap reconstruction, a loss enforcing consistency between instances having similar shape or texture. Critical to the success of our method are also: our structured autoencoding architecture decomposing an image into explicit shape, texture, pose, and background; an adapted formulation of differential rendering, and; a new optimization scheme alternating between 3D and pose learning. We compare our approach, UNICORN, both on the diverse synthetic ShapeNet dataset - the classical benchmark for methods requiring multiple views as supervision - and on standard real-image benchmarks (Pascal3D+ Car, CUB-200) for which most methods require known templates and silhouette annotations. We also showcase applicability to more challenging real-world collections (CompCars, LSUN), where silhouettes are not available and images are not cropped around the object.
Abstract（参考訳）: 単一ビュー再構築へのアプローチは通常、視点アノテーション、シルエット、背景の欠如、同じインスタンスの複数のビュー、テンプレートの形状、対称性に依存する。異なるオブジェクトインスタンスのイメージ間の一貫性を明確に活用することで、これらの監督と仮説をすべて回避します。その結果,同じ対象カテゴリを表現したラベルなし画像の膨大なコレクションから学習することができる。私たちの主な貢献は、クロスインスタンス一貫性を活用する2つのアプローチです。一プログレッシブ・コンディショニング、カリキュラム学習の方法によるカテゴリーから事例までモデルを徐々に専門化する訓練戦略 (ii)形状又はテクスチャの類似したインスタンス間の一貫性を強制する損失。提案手法の成功には, イメージを明示的な形状, テクスチャ, ポーズ, 背景に分解する構造化オートエンコーディングアーキテクチャ, 微分レンダリングの適合した定式化, 3dとポーズ学習を交互に交互に行う新しい最適化スキームなども重要である。当社のアプローチであるUNICORNは,さまざまな合成ShapeNetデータセット – 監視対象として複数のビューを必要とするメソッドの古典的なベンチマーク – と,既知のテンプレートやシルエットアノテーションを必要とする標準的なリアルタイムベンチマーク(Pascal3D+ Car, CUB-200)を比較しています。また、シルエットが利用できず、画像がオブジェクトの周りにトリミングされない、より挑戦的な実世界のコレクション(compcars、lsun)に適用する可能性も示しています。

論文の概要: Share With Thy Neighbors: Single-View Reconstruction by Cross-Instance Consistency

関連論文リスト