Joint prototype and coefficient prediction for 3D instance segmentation
- URL: http://arxiv.org/abs/2407.06958v1
- Date: Tue, 9 Jul 2024 15:36:13 GMT
- Title: Joint prototype and coefficient prediction for 3D instance segmentation
- Authors: Remco Royen, Leon Denis, Adrian Munteanu,
- Abstract summary: 3D instance segmentation is crucial for applications demanding comprehensive 3D scene understanding.
In this paper, we introduce a novel method that simultaneously learns coefficients and prototypes.
Our method demonstrates superior performance on S3DIS-blocks, consistently outperforming existing methods in mRec and mPrec.
With only 0.8% of the total inference time, our method exhibits an over 20-fold reduction in the variance of inference time compared to existing methods.
- Score: 6.632158868486343
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: 3D instance segmentation is crucial for applications demanding comprehensive 3D scene understanding. In this paper, we introduce a novel method that simultaneously learns coefficients and prototypes. Employing an overcomplete sampling strategy, our method produces an overcomplete set of instance predictions, from which the optimal ones are selected through a Non-Maximum Suppression (NMS) algorithm during inference. The obtained prototypes are visualizable and interpretable. Our method demonstrates superior performance on S3DIS-blocks, consistently outperforming existing methods in mRec and mPrec. Moreover, it operates 32.9% faster than the state-of-the-art. Notably, with only 0.8% of the total inference time, our method exhibits an over 20-fold reduction in the variance of inference time compared to existing methods. These attributes render our method well-suited for practical applications requiring both rapid inference and high reliability.
Related papers
- ProtoSeg: A Prototype-Based Point Cloud Instance Segmentation Method [6.632158868486343]
This paper presents a novel neural network architecture for performing instance segmentation on 3D point clouds.
We propose to jointly learn coefficients and prototypes in parallel which can be combined to obtain the instance predictions.
The proposed method is not only 28% faster than the state-of-the-art, it also exhibits the lowest standard deviation.
arXiv Detail & Related papers (2024-10-03T10:05:27Z) - Efficient Fairness-Performance Pareto Front Computation [51.558848491038916]
We show that optimal fair representations possess several useful structural properties.
We then show that these approxing problems can be solved efficiently via concave programming methods.
arXiv Detail & Related papers (2024-09-26T08:46:48Z) - Importance of Aligning Training Strategy with Evaluation for Diffusion
Models in 3D Multiclass Segmentation [8.649603931882227]
We studied the DDPM-based segmentation model for 3D multiclass segmentation on two large multiclass data sets.
We observed that the difference between training and test methods led to inferior performance for existing DDPM methods.
To mitigate the inconsistency, we proposed a recycling method which generated corrupted masks based on the model's prediction at a previous time step.
arXiv Detail & Related papers (2023-03-10T16:30:09Z) - PointInst3D: Segmenting 3D Instances by Points [136.7261709896713]
We propose a fully-convolutional 3D point cloud instance segmentation method that works in a per-point prediction fashion.
We find the key to its success is assigning a suitable target to each sampled point.
Our approach achieves promising results on both ScanNet and S3DIS benchmarks.
arXiv Detail & Related papers (2022-04-25T02:41:46Z) - Distributed Dynamic Safe Screening Algorithms for Sparse Regularization [73.85961005970222]
We propose a new distributed dynamic safe screening (DDSS) method for sparsity regularized models and apply it on shared-memory and distributed-memory architecture respectively.
We prove that the proposed method achieves the linear convergence rate with lower overall complexity and can eliminate almost all the inactive features in a finite number of iterations almost surely.
arXiv Detail & Related papers (2022-04-23T02:45:55Z) - Revitalizing Optimization for 3D Human Pose and Shape Estimation: A
Sparse Constrained Formulation [21.710205047008916]
We propose a novel sparse constrained formulation and from it derive a real-time optimization method for 3D human pose and shape estimation.
We show that this computation scales linearly with the number of joints of a complex 3D human model, in contrast to prior work where it scales cubically due to their dense unconstrained formulation.
We present a real-time motion capture framework that estimates 3D human poses and shapes from a single image at over 30 FPS.
arXiv Detail & Related papers (2021-05-28T16:44:56Z) - Sample and Computation Redistribution for Efficient Face Detection [137.19388513633484]
Training data sampling and computation distribution strategies are the keys to efficient and accurate face detection.
scrfdf34 outperforms the best competitor, TinaFace, by $3.86%$ (AP at hard set) while being more than emph3$times$ faster on GPUs with VGA-resolution images.
arXiv Detail & Related papers (2021-05-10T23:51:14Z) - Learning Expectation of Label Distribution for Facial Age and
Attractiveness Estimation [65.5880700862751]
We analyze the essential relationship between two state-of-the-art methods (Ranking-CNN and DLDL) and show that the Ranking method is in fact learning label distribution implicitly.
We propose a lightweight network architecture and propose a unified framework which can jointly learn facial attribute distribution and regress attribute value.
Our method achieves new state-of-the-art results using the single model with 36$times$ fewer parameters and 3$times$ faster inference speed on facial age/attractiveness estimation.
arXiv Detail & Related papers (2020-07-03T15:46:53Z) - 3DSSD: Point-based 3D Single Stage Object Detector [61.67928229961813]
We present a point-based 3D single stage object detector, named 3DSSD, achieving a good balance between accuracy and efficiency.
Our method outperforms all state-of-the-art voxel-based single stage methods by a large margin, and has comparable performance to two stage point-based methods as well.
arXiv Detail & Related papers (2020-02-24T12:01:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.