Denoise and Contrast for Category Agnostic Shape Completion
- URL: http://arxiv.org/abs/2103.16671v1
- Date: Tue, 30 Mar 2021 20:33:24 GMT
- Title: Denoise and Contrast for Category Agnostic Shape Completion
- Authors: Antonio Alliegro, Diego Valsesia, Giulia Fracastoro, Enrico Magli,
Tatiana Tommasi
- Abstract summary: We present a deep learning model that exploits the power of self-supervision to perform 3D point cloud completion.
A denoising pretext task provides the network with the needed local cues, decoupled from the high-level semantics.
contrastive learning maximizes the agreement between variants of the same shape with different missing portions.
- Score: 48.66519783934386
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we present a deep learning model that exploits the power of
self-supervision to perform 3D point cloud completion, estimating the missing
part and a context region around it. Local and global information are encoded
in a combined embedding. A denoising pretext task provides the network with the
needed local cues, decoupled from the high-level semantics and naturally shared
over multiple classes. On the other hand, contrastive learning maximizes the
agreement between variants of the same shape with different missing portions,
thus producing a representation which captures the global appearance of the
shape. The combined embedding inherits category-agnostic properties from the
chosen pretext tasks. Differently from existing approaches, this allows to
better generalize the completion properties to new categories unseen at
training time. Moreover, while decoding the obtained joint representation, we
better blend the reconstructed missing part with the partial shape by paying
attention to its known surrounding region and reconstructing this frame as
auxiliary objective. Our extensive experiments and detailed ablation on the
ShapeNet dataset show the effectiveness of each part of the method with new
state of the art results. Our quantitative and qualitative analysis confirms
how our approach is able to work on novel categories without relying neither on
classification and shape symmetry priors, nor on adversarial training
procedures.
Related papers
- Unsupervised Non-Rigid Point Cloud Matching through Large Vision Models [1.3030624795284795]
We propose a learning-based framework for non-rigid point cloud matching.
Key insight is to incorporate semantic features derived from large vision models (LVMs)
Our framework effectively leverages the structural information contained in the semantic features to address ambiguities arise from self-similarities among local geometries.
arXiv Detail & Related papers (2024-08-16T07:02:19Z) - Understanding Imbalanced Semantic Segmentation Through Neural Collapse [81.89121711426951]
We show that semantic segmentation naturally brings contextual correlation and imbalanced distribution among classes.
We introduce a regularizer on feature centers to encourage the network to learn features closer to the appealing structure.
Our method ranks 1st and sets a new record on the ScanNet200 test leaderboard.
arXiv Detail & Related papers (2023-01-03T13:51:51Z) - Global and Local Features through Gaussian Mixture Models on Image
Semantic Segmentation [0.38073142980732994]
We propose an internal structure for the feature representations while extracting a global representation that supports the former.
During training, we predict a Gaussian Mixture Model from the data, which, merged with the skip connections and the decoding stage, helps avoid wrong inductive biases.
Our results show that we can improve semantic segmentation by providing both learning representations (global and local) with a clustering behavior and combining them.
arXiv Detail & Related papers (2022-07-19T10:10:49Z) - Bending Graphs: Hierarchical Shape Matching using Gated Optimal
Transport [80.64516377977183]
Shape matching has been a long-studied problem for the computer graphics and vision community.
We investigate a hierarchical learning design, to which we incorporate local patch-level information and global shape-level structures.
We propose a novel optimal transport solver by recurrently updating features on non-confident nodes to learn globally consistent correspondences between the shapes.
arXiv Detail & Related papers (2022-02-03T11:41:46Z) - Point Discriminative Learning for Unsupervised Representation Learning
on 3D Point Clouds [54.31515001741987]
We propose a point discriminative learning method for unsupervised representation learning on 3D point clouds.
We achieve this by imposing a novel point discrimination loss on the middle level and global level point features.
Our method learns powerful representations and achieves new state-of-the-art performance.
arXiv Detail & Related papers (2021-08-04T15:11:48Z) - Learning to Segment Human Body Parts with Synthetically Trained Deep
Convolutional Networks [58.0240970093372]
This paper presents a new framework for human body part segmentation based on Deep Convolutional Neural Networks trained using only synthetic data.
The proposed approach achieves cutting-edge results without the need of training the models with real annotated data of human body parts.
arXiv Detail & Related papers (2021-02-02T12:26:50Z) - Adaptive Prototypical Networks with Label Words and Joint Representation
Learning for Few-Shot Relation Classification [17.237331828747006]
This work focuses on few-shot relation classification (FSRC)
We propose an adaptive mixture mechanism to add label words to the representation of the class prototype.
Experiments have been conducted on FewRel under different few-shot (FS) settings.
arXiv Detail & Related papers (2021-01-10T11:25:42Z) - Commonality-Parsing Network across Shape and Appearance for Partially
Supervised Instance Segmentation [71.59275788106622]
We propose to learn the underlying class-agnostic commonalities that can be generalized from mask-annotated categories to novel categories.
Our model significantly outperforms the state-of-the-art methods on both partially supervised setting and few-shot setting for instance segmentation on COCO dataset.
arXiv Detail & Related papers (2020-07-24T07:23:44Z) - Focus on Semantic Consistency for Cross-domain Crowd Understanding [34.560447389853614]
Some domain adaptation algorithms try to liberate it by training models with synthetic data.
We found that a mass of estimation errors in the background areas impede the performance of the existing methods.
In this paper, we propose a domain adaptation method to eliminate it.
arXiv Detail & Related papers (2020-02-20T08:51:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.