MATE: Masked Autoencoders are Online 3D Test-Time Learners
- URL: http://arxiv.org/abs/2211.11432v3
- Date: Mon, 20 Mar 2023 09:44:58 GMT
- Title: MATE: Masked Autoencoders are Online 3D Test-Time Learners
- Authors: M. Jehanzeb Mirza, Inkyu Shin, Wei Lin, Andreas Schriebl, Kunyang Sun,
Jaesung Choe, Horst Possegger, Mateusz Kozinski, In So Kweon, Kun-Jin Yoon,
Horst Bischof
- Abstract summary: MATE is the first Test-Time-Training (TTT) method designed for 3D data.
It makes deep networks trained for point cloud classification robust to distribution shifts occurring in test data.
- Score: 63.3907730920114
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Our MATE is the first Test-Time-Training (TTT) method designed for 3D data,
which makes deep networks trained for point cloud classification robust to
distribution shifts occurring in test data. Like existing TTT methods from the
2D image domain, MATE also leverages test data for adaptation. Its test-time
objective is that of a Masked Autoencoder: a large portion of each test point
cloud is removed before it is fed to the network, tasked with reconstructing
the full point cloud. Once the network is updated, it is used to classify the
point cloud. We test MATE on several 3D object classification datasets and show
that it significantly improves robustness of deep networks to several types of
corruptions commonly occurring in 3D point clouds. We show that MATE is very
efficient in terms of the fraction of points it needs for the adaptation. It
can effectively adapt given as few as 5% of tokens of each test sample, making
it extremely lightweight. Our experiments show that MATE also achieves
competitive performance by adapting sparsely on the test data, which further
reduces its computational overhead, making it ideal for real-time applications.
Related papers
- Test-Time Adaptation of 3D Point Clouds via Denoising Diffusion Models [19.795578581043745]
Test-time adaptation of 3D point clouds is crucial for mitigating discrepancies between training and testing samples in real-world scenarios.
We introduce a novel 3D test-time adaptation method, termed 3DD-TTA, which stands for 3D Denoising Diffusion Test-Time Adaptation.
arXiv Detail & Related papers (2024-11-21T00:04:38Z) - Test-Time Augmentation for 3D Point Cloud Classification and
Segmentation [40.62640761825697]
Data augmentation is a powerful technique to enhance the performance of a deep learning task.
This work explores test-time augmentation (TTA) for 3D point clouds.
arXiv Detail & Related papers (2023-11-22T04:31:09Z) - AutoSynth: Learning to Generate 3D Training Data for Object Point Cloud
Registration [69.21282992341007]
Auto Synth automatically generates 3D training data for point cloud registration.
We replace the point cloud registration network with a much smaller surrogate network, leading to a $4056.43$ speedup.
Our results on TUD-L, LINEMOD and Occluded-LINEMOD evidence that a neural network trained on our searched dataset yields consistently better performance than the same one trained on the widely used ModelNet40 dataset.
arXiv Detail & Related papers (2023-09-20T09:29:44Z) - Clustering based Point Cloud Representation Learning for 3D Analysis [80.88995099442374]
We propose a clustering based supervised learning scheme for point cloud analysis.
Unlike current de-facto, scene-wise training paradigm, our algorithm conducts within-class clustering on the point embedding space.
Our algorithm shows notable improvements on famous point cloud segmentation datasets.
arXiv Detail & Related papers (2023-07-27T03:42:12Z) - Focal Sparse Convolutional Networks for 3D Object Detection [121.45950754511021]
We introduce two new modules to enhance the capability of Sparse CNNs.
They are focal sparse convolution (Focals Conv) and its multi-modal variant of focal sparse convolution with fusion.
For the first time, we show that spatially learnable sparsity in sparse convolution is essential for sophisticated 3D object detection.
arXiv Detail & Related papers (2022-04-26T17:34:10Z) - TANet++: Triple Attention Network with Filtered Pointcloud on 3D
Detection [7.64943832687184]
TANet is one of state-of-the-art 3D object detection method on KITTI and JRDB benchmark.
In this paper, we propose TANet++ to improve the performance on 3D Detection.
In order to reduce the negative impact by the weak samples, the training strategy previously filtered the training data, and then the TANet++ is trained by the rest of data.
arXiv Detail & Related papers (2021-06-26T16:56:35Z) - MT3: Meta Test-Time Training for Self-Supervised Test-Time Adaption [69.76837484008033]
An unresolved problem in Deep Learning is the ability of neural networks to cope with domain shifts during test-time.
We combine meta-learning, self-supervision and test-time training to learn to adapt to unseen test distributions.
Our approach significantly improves the state-of-the-art results on the CIFAR-10-Corrupted image classification benchmark.
arXiv Detail & Related papers (2021-03-30T09:33:38Z) - D3Feat: Joint Learning of Dense Detection and Description of 3D Local
Features [51.04841465193678]
We leverage a 3D fully convolutional network for 3D point clouds.
We propose a novel and practical learning mechanism that densely predicts both a detection score and a description feature for each 3D point.
Our method achieves state-of-the-art results in both indoor and outdoor scenarios.
arXiv Detail & Related papers (2020-03-06T12:51:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.