TSMD: A Database for Static Color Mesh Quality Assessment Study
- URL: http://arxiv.org/abs/2308.01940v2
- Date: Mon, 30 Oct 2023 09:15:52 GMT
- Title: TSMD: A Database for Static Color Mesh Quality Assessment Study
- Authors: Qi Yang, Joel Jung, Haiqiang Wang, Xiaozhong Xu, and Shan Liu
- Abstract summary: Static meshes with texture map are widely used in modern industrial and manufacturing sectors.
To facilitate the study of static mesh compression algorithm and objective quality metric, we create the Tencent - Static Mesh dataset.
210 distorted samples are generated by the lossy compression scheme developed for the Call for Proposals on polygonal static mesh coding.
- Score: 24.702740013624613
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Static meshes with texture map are widely used in modern industrial and
manufacturing sectors, attracting considerable attention in the mesh
compression community due to its huge amount of data. To facilitate the study
of static mesh compression algorithm and objective quality metric, we create
the Tencent - Static Mesh Dataset (TSMD) containing 42 reference meshes with
rich visual characteristics. 210 distorted samples are generated by the lossy
compression scheme developed for the Call for Proposals on polygonal static
mesh coding, released on June 23 by the Alliance for Open Media Volumetric
Visual Media group. Using processed video sequences, a large-scale,
crowdsourcing-based, subjective experiment was conducted to collect subjective
scores from 74 viewers. The dataset undergoes analysis to validate its sample
diversity and Mean Opinion Scores (MOS) accuracy, establishing its
heterogeneous nature and reliability. State-of-the-art objective metrics are
evaluated on the new dataset. Pearson and Spearman correlations around 0.75 are
reported, deviating from results typically observed on less heterogeneous
datasets, demonstrating the need for further development of more robust
metrics. The TSMD, including meshes, PVSs, bitstreams, and MOS, is made
publicly available at the following location:
https://multimedia.tencent.com/resources/tsmd.
Related papers
- SJTU-TMQA: A quality assessment database for static mesh with texture
map [28.821971310570436]
We create a large-scale textured mesh quality assessment database, namely SJTU-TMQA, which includes 21 reference meshes and 945 distorted samples.
13 state-of-the-art objective metrics are evaluated on SJTU-TMQA. The results report the highest correlation of around 0.6, indicating the need for more effective objective metrics.
arXiv Detail & Related papers (2023-09-27T14:18:04Z) - ArSDM: Colonoscopy Images Synthesis with Adaptive Refinement Semantic
Diffusion Models [69.9178140563928]
Colonoscopy analysis is essential for assisting clinical diagnosis and treatment.
The scarcity of annotated data limits the effectiveness and generalization of existing methods.
We propose an Adaptive Refinement Semantic Diffusion Model (ArSDM) to generate colonoscopy images that benefit the downstream tasks.
arXiv Detail & Related papers (2023-09-03T07:55:46Z) - Robustness Analysis on Foundational Segmentation Models [28.01242494123917]
In this work, we perform a robustness analysis of Visual Foundation Models (VFMs) for segmentation tasks.
We benchmark seven state-of-the-art segmentation architectures using 2 different datasets.
Our findings reveal several key insights: VFMs exhibit vulnerabilities to compression-induced corruptions, despite not outpacing all of unimodal models in robustness, multimodal models show competitive resilience in zero-shot scenarios, and VFMs demonstrate enhanced robustness for certain object categories.
arXiv Detail & Related papers (2023-06-15T16:59:42Z) - Revisiting the Evaluation of Image Synthesis with GANs [55.72247435112475]
This study presents an empirical investigation into the evaluation of synthesis performance, with generative adversarial networks (GANs) as a representative of generative models.
In particular, we make in-depth analyses of various factors, including how to represent a data point in the representation space, how to calculate a fair distance using selected samples, and how many instances to use from each set.
arXiv Detail & Related papers (2023-04-04T17:54:32Z) - A Study on the Generality of Neural Network Structures for Monocular
Depth Estimation [14.09373215954704]
We deeply investigate the various backbone networks toward the generalization of monocular depth estimation.
We evaluate state-of-the-art models on both in-distribution and out-of-distribution datasets.
We observe that the Transformers exhibit a strong shape-bias rather than CNNs, which have a strong texture-bias.
arXiv Detail & Related papers (2023-01-09T04:58:12Z) - TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual
Environments [84.6017003787244]
This work proposes a synthetic data generation pipeline to address the difficulties and domain-gaps present in simulated datasets.
We show that using annotations and visual cues from existing datasets, we can facilitate automated multi-modal data generation.
arXiv Detail & Related papers (2022-08-16T20:46:08Z) - Negative Data Augmentation [127.28042046152954]
We show that negative data augmentation samples provide information on the support of the data distribution.
We introduce a new GAN training objective where we use NDA as an additional source of synthetic data for the discriminator.
Empirically, models trained with our method achieve improved conditional/unconditional image generation along with improved anomaly detection capabilities.
arXiv Detail & Related papers (2021-02-09T20:28:35Z) - TraDE: Transformers for Density Estimation [101.20137732920718]
TraDE is a self-attention-based architecture for auto-regressive density estimation.
We present a suite of tasks such as regression using generated samples, out-of-distribution detection, and robustness to noise in the training data.
arXiv Detail & Related papers (2020-04-06T07:32:51Z) - Diversity inducing Information Bottleneck in Model Ensembles [73.80615604822435]
In this paper, we target the problem of generating effective ensembles of neural networks by encouraging diversity in prediction.
We explicitly optimize a diversity inducing adversarial loss for learning latent variables and thereby obtain diversity in the output predictions necessary for modeling multi-modal data.
Compared to the most competitive baselines, we show significant improvements in classification accuracy, under a shift in the data distribution.
arXiv Detail & Related papers (2020-03-10T03:10:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.