Quaternion-based dynamic mode decomposition for background modeling in
color videos
- URL: http://arxiv.org/abs/2112.13982v1
- Date: Tue, 28 Dec 2021 03:35:39 GMT
- Title: Quaternion-based dynamic mode decomposition for background modeling in
color videos
- Authors: Juan Han, Kit Ian Kou, Jifei Miao
- Abstract summary: Dynamic mode decomposition (DMD) is a recently proposed method to robustly decompose a video sequence into the background model and the corresponding foreground part.
We propose a quaternion-based DMD (Q-DMD), which extends the DMD by quaternion matrix analysis.
We exploit the standard eigenvalues of the quaternion matrix to compute its spectral decomposition and calculate the corresponding Q-DMD modes and eigenvalues.
- Score: 1.0098114696565863
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Scene Background Initialization (SBI) is one of the challenging problems in
computer vision. Dynamic mode decomposition (DMD) is a recently proposed method
to robustly decompose a video sequence into the background model and the
corresponding foreground part. However, this method needs to convert the color
image into the grayscale image for processing, which leads to the neglect of
the coupling information between the three channels of the color image. In this
study, we propose a quaternion-based DMD (Q-DMD), which extends the DMD by
quaternion matrix analysis, so as to completely preserve the inherent color
structure of the color image and the color video. We exploit the standard
eigenvalues of the quaternion matrix to compute its spectral decomposition and
calculate the corresponding Q-DMD modes and eigenvalues. The results on the
publicly available benchmark datasets prove that our Q-DMD outperforms the
exact DMD method, and experiment results also demonstrate that the performance
of our approach is comparable to that of the state-of-the-art ones.
Related papers
- M3DM-NR: RGB-3D Noisy-Resistant Industrial Anomaly Detection via Multimodal Denoising [63.39134873744748]
Existing industrial anomaly detection methods primarily concentrate on unsupervised learning with pristine RGB images.
This paper proposes a novel noise-resistant M3DM-NR framework to leverage strong multi-modal discriminative capabilities of CLIP.
Extensive experiments show that M3DM-NR outperforms state-of-the-art methods in 3D-RGB multi-modal noisy anomaly detection.
arXiv Detail & Related papers (2024-06-04T12:33:02Z) - Real-Time Motion Detection Using Dynamic Mode Decomposition [0.40964539027092906]
We propose a simple and interpretable motion detection algorithm for streaming video data rooted in Dynamic Mode Decomposition (DMD)
Our method leverages the fact that there exists a correspondence between the evolution of important video features, such as foreground motion, and the eigenvalues of the matrix which results from applying DMD to segments of video.
arXiv Detail & Related papers (2024-05-08T13:52:14Z) - MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation [54.27399121779011]
We present MVD-Fusion: a method for single-view 3D inference via generative modeling of multi-view-consistent RGB-D images.
We show that our approach can yield more accurate synthesis compared to recent state-of-the-art, including distillation-based 3D inference and prior multi-view generation methods.
arXiv Detail & Related papers (2024-04-04T17:59:57Z) - Diffusion Priors for Dynamic View Synthesis from Monocular Videos [59.42406064983643]
Dynamic novel view synthesis aims to capture the temporal evolution of visual content within videos.
We first finetune a pretrained RGB-D diffusion model on the video frames using a customization technique.
We distill the knowledge from the finetuned model to a 4D representations encompassing both dynamic and static Neural Radiance Fields.
arXiv Detail & Related papers (2024-01-10T23:26:41Z) - $PC^2$: Projection-Conditioned Point Cloud Diffusion for Single-Image 3D
Reconstruction [97.06927852165464]
Reconstructing the 3D shape of an object from a single RGB image is a long-standing and highly challenging problem in computer vision.
We propose a novel method for single-image 3D reconstruction which generates a sparse point cloud via a conditional denoising diffusion process.
arXiv Detail & Related papers (2023-02-21T13:37:07Z) - Multi-channel Nuclear Norm Minus Frobenius Norm Minimization for Color
Image Denoising [9.20787253404652]
One traditional strategy is to convert the RGB image to a less correlated color space and denoise each channel of the new space separately.
This paper proposes a new multi-channel optimization model for color image denoising under the nuclear norm minus Frobenius norm minimization framework.
Experimental results on both synthetic and real noise datasets demonstrate the proposed model outperforms state-of-the-art models.
arXiv Detail & Related papers (2022-09-16T04:10:29Z) - Quaternion Optimized Model with Sparse Regularization for Color Image
Recovery [10.137095668835439]
This paper is inspired by an appreciation of the fact that different signal types, including audio formats and images, possess structures that are inherently sparse in respect of their respective bases.
Since color images can be processed as a whole in the quaternion domain, we depicted the sparsity of the color image in the quaternion discrete cosine transform (QDCT) domain.
To achieve a more superior low-rank approximation, the quatenrion-based truncated nuclear norm (QTNN) is employed in the proposed model.
arXiv Detail & Related papers (2022-04-19T03:07:12Z) - Dynamic Mode Decomposition in Adaptive Mesh Refinement and Coarsening
Simulations [58.720142291102135]
Dynamic Mode Decomposition (DMD) is a powerful data-driven method used to extract coherent schemes.
This paper proposes a strategy to enable DMD to extract from observations with different mesh topologies and dimensions.
arXiv Detail & Related papers (2021-04-28T22:14:25Z) - Depth-conditioned Dynamic Message Propagation for Monocular 3D Object
Detection [86.25022248968908]
We learn context- and depth-aware feature representation to solve the problem of monocular 3D object detection.
We show state-of-the-art results among the monocular-based approaches on the KITTI benchmark dataset.
arXiv Detail & Related papers (2021-03-30T16:20:24Z) - Generalized Two-Dimensional Quaternion Principal Component Analysis with
Weighting for Color Image Recognition [1.516937009186805]
One of the most powerful methods of color image recognition is the two-dimensional principle component analysis (2DQPCA) approach.
In this paper, a generalized 2DQPCA approach with weighting is presented with imposing $L_p$ norms on both constraint and objective functions.
The numerical results based on the real face databases validate that the newly proposed method is robust to noise and performs better than the state-of-the-art 2DQPCA-based algorithms and four prominent deep learning methods.
arXiv Detail & Related papers (2020-10-04T03:37:23Z) - Full Quaternion Representation of Color images: A Case Study on
QSVD-based Color Image Compression [0.38073142980732994]
We propose an approach for representing color images with full quaternion numbers.
An autoencoder neural network is used to generate a global model for transforming a color image into a full quaternion matrix.
arXiv Detail & Related papers (2020-07-19T19:13:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.