Generalized Two-Dimensional Quaternion Principal Component Analysis with
Weighting for Color Image Recognition
- URL: http://arxiv.org/abs/2010.01477v2
- Date: Mon, 12 Jun 2023 03:28:15 GMT
- Title: Generalized Two-Dimensional Quaternion Principal Component Analysis with
Weighting for Color Image Recognition
- Authors: Zhi-Gang Jia, Zi-Jin Qiu, Qian-Yu Wang, Mei-Xiang Zhao, and Dan-Dan
Zhu
- Abstract summary: One of the most powerful methods of color image recognition is the two-dimensional principle component analysis (2DQPCA) approach.
In this paper, a generalized 2DQPCA approach with weighting is presented with imposing $L_p$ norms on both constraint and objective functions.
The numerical results based on the real face databases validate that the newly proposed method is robust to noise and performs better than the state-of-the-art 2DQPCA-based algorithms and four prominent deep learning methods.
- Score: 1.516937009186805
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: One of the most powerful methods of color image recognition is the
two-dimensional principle component analysis (2DQPCA) approach, which is based
on quaternion representation and preserves color information very well.
However, the current versions of 2DQPCA are still not feasible to extract
different geometric properties of color images according to practical data
analysis requirements and they are vulnerable to strong noise. In this paper, a
generalized 2DQPCA approach with weighting is presented with imposing $L_{p}$
norms on both constraint and objective functions. As a unit 2DQPCA framework,
this new version makes it possible to choose adaptive regularizations and
constraints according to actual applications and can extract both geometric
properties and color information of color images. The projection vectors
generated by the deflating scheme are required to be orthogonal to each other.
A weighting matrix is defined to magnify the effect of main features. This
overcomes the shortcomings of traditional 2DQPCA that the recognition rate
decreases as the number of principal components increases. The numerical
results based on the real face databases validate that the newly proposed
method is robust to noise and performs better than the state-of-the-art
2DQPCA-based algorithms and four prominent deep learning methods.
Related papers
- Towards Human-Level 3D Relative Pose Estimation: Generalizable, Training-Free, with Single Reference [62.99706119370521]
Humans can easily deduce the relative pose of an unseen object, without label/training, given only a single query-reference image pair.
We propose a novel 3D generalizable relative pose estimation method by elaborating (i) with a 2.5D shape from an RGB-D reference, (ii) with an off-the-shelf differentiable, and (iii) with semantic cues from a pretrained model like DINOv2.
arXiv Detail & Related papers (2024-06-26T16:01:10Z) - RGB-based Category-level Object Pose Estimation via Decoupled Metric
Scale Recovery [72.13154206106259]
We propose a novel pipeline that decouples the 6D pose and size estimation to mitigate the influence of imperfect scales on rigid transformations.
Specifically, we leverage a pre-trained monocular estimator to extract local geometric information.
A separate branch is designed to directly recover the metric scale of the object based on category-level statistics.
arXiv Detail & Related papers (2023-09-19T02:20:26Z) - Data-Driven Bilateral Generalized Two-Dimensional Quaternion Principal
Component Analysis with Application to Color Face Recognition [4.471812624045322]
New data-driven bilateral generalized two-dimensional quaternion principal component analysis (BiG2DQPCA) is presented.
BiG2DQPCA directly works on the 2D color images without vectorizing and well preserves the spatial and color information.
New approach based on BiG2DQPCA is presented for color face recognition and image reconstruction with a new data-driven weighting technique.
arXiv Detail & Related papers (2023-06-12T11:45:59Z) - $PC^2$: Projection-Conditioned Point Cloud Diffusion for Single-Image 3D
Reconstruction [97.06927852165464]
Reconstructing the 3D shape of an object from a single RGB image is a long-standing and highly challenging problem in computer vision.
We propose a novel method for single-image 3D reconstruction which generates a sparse point cloud via a conditional denoising diffusion process.
arXiv Detail & Related papers (2023-02-21T13:37:07Z) - Quasi Non-Negative Quaternion Matrix Factorization with Application to
Color Face Recognition [0.0]
A novel quasi-negative quaternion matrix factorization (QNQMF) is presented for color image processing.
The accuracy of the rate of face recognition on the quaternion model is better than on the red, green and blue channels of color image.
arXiv Detail & Related papers (2022-11-30T04:51:09Z) - Quaternion Optimized Model with Sparse Regularization for Color Image
Recovery [10.137095668835439]
This paper is inspired by an appreciation of the fact that different signal types, including audio formats and images, possess structures that are inherently sparse in respect of their respective bases.
Since color images can be processed as a whole in the quaternion domain, we depicted the sparsity of the color image in the quaternion discrete cosine transform (QDCT) domain.
To achieve a more superior low-rank approximation, the quatenrion-based truncated nuclear norm (QTNN) is employed in the proposed model.
arXiv Detail & Related papers (2022-04-19T03:07:12Z) - A DCT-based Tensor Completion Approach for Recovering Color Images and
Videos from Highly Undersampled Data [0.8399688944263843]
We propose a novel tensor completion approach to recover color images and videos from undersampled data.
A series of numerical experiments including color image inpainting and video data recovery demonstrate that our proposed approach performs better than many existing state-of-theart tensor completion methods.
arXiv Detail & Related papers (2021-10-18T13:41:27Z) - Spatial-Separated Curve Rendering Network for Efficient and
High-Resolution Image Harmonization [59.19214040221055]
We propose a novel spatial-separated curve rendering network (S$2$CRNet) for efficient and high-resolution image harmonization.
The proposed method reduces more than 90% parameters compared with previous methods.
Our method can work smoothly on higher resolution images in real-time which is more than 10$times$ faster than the existing methods.
arXiv Detail & Related papers (2021-09-13T07:20:16Z) - Self-supervised Geometric Perception [96.89966337518854]
Self-supervised geometric perception is a framework to learn a feature descriptor for correspondence matching without any ground-truth geometric model labels.
We show that SGP achieves state-of-the-art performance that is on-par or superior to the supervised oracles trained using ground-truth labels.
arXiv Detail & Related papers (2021-03-04T15:34:43Z) - Perceptual Robust Hashing for Color Images with Canonical Correlation
Analysis [21.22196411212803]
We propose a novel perceptual image hashing scheme for color images based on ring-ribbon quadtree and color vector angle.
Our scheme has satisfactory performances with respect to robustness, discrimination and security, which can be effectively used in copy detection and content authentication.
arXiv Detail & Related papers (2020-12-08T09:35:21Z) - Bi-directional Cross-Modality Feature Propagation with
Separation-and-Aggregation Gate for RGB-D Semantic Segmentation [59.94819184452694]
Depth information has proven to be a useful cue in the semantic segmentation of RGBD images for providing a geometric counterpart to the RGB representation.
Most existing works simply assume that depth measurements are accurate and well-aligned with the RGB pixels and models the problem as a cross-modal feature fusion.
In this paper, we propose a unified and efficient Crossmodality Guided to not only effectively recalibrate RGB feature responses, but also to distill accurate depth information via multiple stages and aggregate the two recalibrated representations alternatively.
arXiv Detail & Related papers (2020-07-17T18:35:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.