GMS-3DQA: Projection-based Grid Mini-patch Sampling for 3D Model Quality
Assessment
- URL: http://arxiv.org/abs/2306.05658v2
- Date: Wed, 31 Jan 2024 09:30:09 GMT
- Title: GMS-3DQA: Projection-based Grid Mini-patch Sampling for 3D Model Quality
Assessment
- Authors: Zicheng Zhang, Wei Sun, Houning Wu, Yingjie Zhou, Chunyi Li, Xiongkuo
Min, Guangtao Zhai, Weisi Lin
- Abstract summary: Previous projection-based 3DQA methods directly extract features from multi-projections to ensure quality prediction accuracy.
We propose a no-reference (NR) projection-based textitunderlineGrid underlineMini-patch underlineSampling underline3D Model underlineQuality underlineAssessment (GMS-3DQA) method.
The proposed GMS-3DQA requires far less computational resources and inference time than other 3D
- Score: 82.93561866101604
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Nowadays, most 3D model quality assessment (3DQA) methods have been aimed at
improving performance. However, little attention has been paid to the
computational cost and inference time required for practical applications.
Model-based 3DQA methods extract features directly from the 3D models, which
are characterized by their high degree of complexity. As a result, many
researchers are inclined towards utilizing projection-based 3DQA methods.
Nevertheless, previous projection-based 3DQA methods directly extract features
from multi-projections to ensure quality prediction accuracy, which calls for
more resource consumption and inevitably leads to inefficiency. Thus in this
paper, we address this challenge by proposing a no-reference (NR)
projection-based \textit{\underline{G}rid \underline{M}ini-patch
\underline{S}ampling \underline{3D} Model \underline{Q}uality
\underline{A}ssessment (GMS-3DQA)} method. The projection images are rendered
from six perpendicular viewpoints of the 3D model to cover sufficient quality
information. To reduce redundancy and inference resources, we propose a
multi-projection grid mini-patch sampling strategy (MP-GMS), which samples grid
mini-patches from the multi-projections and forms the sampled grid mini-patches
into one quality mini-patch map (QMM). The Swin-Transformer tiny backbone is
then used to extract quality-aware features from the QMMs. The experimental
results show that the proposed GMS-3DQA outperforms existing state-of-the-art
NR-3DQA methods on the point cloud quality assessment databases. The efficiency
analysis reveals that the proposed GMS-3DQA requires far less computational
resources and inference time than other 3DQA competitors. The code will be
available at https://github.com/zzc-1998/GMS-3DQA.
Related papers
- Activating Frequency and ViT for 3D Point Cloud Quality Assessment
without Reference [0.49157446832511503]
We propose no-reference quality metric of a given 3D-PC.
To map the input attributes to quality score, we use a light-weight hybrid deep model; combined of Deformable Convolutional Network (DCN) and Vision Transformers (ViT)
The results show that our approach outperforms state-of-the-art NR-PCQA measures and even some FR-PCQA on PointXR.
arXiv Detail & Related papers (2023-12-10T19:13:34Z) - EEP-3DQA: Efficient and Effective Projection-based 3D Model Quality
Assessment [58.16279881415622]
It is difficult to perform an efficient module to extract quality-aware features of 3D models.
We develop a no-reference (NR) underlineEfficient and underlineEffective underlineProjection-based underline3D Model underlineQuality underlineAssessment (textbfEEP-3DQA) method.
The proposed EEP-3DQA and EEP-3DQA-t (tiny version) achieve
arXiv Detail & Related papers (2023-02-17T06:14:37Z) - MM-PCQA: Multi-Modal Learning for No-reference Point Cloud Quality
Assessment [32.495387943305204]
We propose a novel no-reference point cloud quality assessment (NR-PCQA) metric in a multi-modal fashion.
In specific, we split the point clouds into sub-models to represent local geometry distortions such as point shift and down-sampling.
To achieve the goals, the sub-models and projected images are encoded with point-based and image-based neural networks.
arXiv Detail & Related papers (2022-09-01T06:11:12Z) - From 2D to 3D: Re-thinking Benchmarking of Monocular Depth Prediction [80.67873933010783]
We argue that MDP is currently witnessing benchmark over-fitting and relying on metrics that are only partially helpful to gauge the usefulness of the predictions for 3D applications.
This limits the design and development of novel methods that are truly aware of - and improving towards estimating - the 3D structure of the scene rather than optimizing 2D-based distances.
We propose a set of metrics well suited to evaluate the 3D geometry of MDP approaches and a novel indoor benchmark, RIO-D3D, crucial for the proposed evaluation methodology.
arXiv Detail & Related papers (2022-03-15T17:50:54Z) - No-Reference Quality Assessment for Colored Point Cloud and Mesh Based
on Natural Scene Statistics [36.017914479449864]
We propose an NSS-based no-reference quality assessment metric for colored 3D models.
Our method is mainly validated on the colored point cloud quality assessment database (SJTU-PCQA) and the colored mesh quality assessment database (CMDM)
arXiv Detail & Related papers (2021-07-05T14:03:15Z) - Soft Expectation and Deep Maximization for Image Feature Detection [68.8204255655161]
We propose SEDM, an iterative semi-supervised learning process that flips the question and first looks for repeatable 3D points, then trains a detector to localize them in image space.
Our results show that this new model trained using SEDM is able to better localize the underlying 3D points in a scene.
arXiv Detail & Related papers (2021-04-21T00:35:32Z) - PLUME: Efficient 3D Object Detection from Stereo Images [95.31278688164646]
Existing methods tackle the problem in two steps: first depth estimation is performed, a pseudo LiDAR point cloud representation is computed from the depth estimates, and then object detection is performed in 3D space.
We propose a model that unifies these two tasks in the same metric space.
Our approach achieves state-of-the-art performance on the challenging KITTI benchmark, with significantly reduced inference time compared with existing methods.
arXiv Detail & Related papers (2021-01-17T05:11:38Z) - Reinforced Axial Refinement Network for Monocular 3D Object Detection [160.34246529816085]
Monocular 3D object detection aims to extract the 3D position and properties of objects from a 2D input image.
Conventional approaches sample 3D bounding boxes from the space and infer the relationship between the target object and each of them, however, the probability of effective samples is relatively small in the 3D space.
We propose to start with an initial prediction and refine it gradually towards the ground truth, with only one 3d parameter changed in each step.
This requires designing a policy which gets a reward after several steps, and thus we adopt reinforcement learning to optimize it.
arXiv Detail & Related papers (2020-08-31T17:10:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.