Cross-Modal and Multimodal Data Analysis Based on Functional Mapping of
Spectral Descriptors and Manifold Regularization
- URL: http://arxiv.org/abs/2105.05631v1
- Date: Wed, 12 May 2021 13:00:33 GMT
- Title: Cross-Modal and Multimodal Data Analysis Based on Functional Mapping of
Spectral Descriptors and Manifold Regularization
- Authors: Maysam Behmanesh, Peyman Adibi, Jocelyn Chanussot, Sayyed Mohammad
Saeed Ehsani
- Abstract summary: This work proposes two new multimodal modeling methods.
The first method establishes a general analyzing framework to deal with the multimodal information problem for heterogeneous data.
The second method is a manifold regularized multimodal classification based on pointwise correspondences.
- Score: 21.06669693699965
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Multimodal manifold modeling methods extend the spectral geometry-aware data
analysis to learning from several related and complementary modalities. Most of
these methods work based on two major assumptions: 1) there are the same number
of homogeneous data samples in each modality, and 2) at least partial
correspondences between modalities are given in advance as prior knowledge.
This work proposes two new multimodal modeling methods. The first method
establishes a general analyzing framework to deal with the multimodal
information problem for heterogeneous data without any specific prior
knowledge. For this purpose, first, we identify the localities of each manifold
by extracting local descriptors via spectral graph wavelet signatures (SGWS).
Then, we propose a manifold regularization framework based on the functional
mapping between SGWS descriptors (FMBSD) for finding the pointwise
correspondences. The second method is a manifold regularized multimodal
classification based on pointwise correspondences (M$^2$CPC) used for the
problem of multiclass classification of multimodal heterogeneous, which the
correspondences between modalities are determined based on the FMBSD method.
The experimental results of evaluating the FMBSD method on three common
cross-modal retrieval datasets and evaluating the (M$^2$CPC) method on three
benchmark multimodal multiclass classification datasets indicate their
effectiveness and superiority over state-of-the-art methods.
Related papers
- Rethinking Few-shot 3D Point Cloud Semantic Segmentation [62.80639841429669]
This paper revisits few-shot 3D point cloud semantic segmentation (FS-PCS)
We focus on two significant issues in the state-of-the-art: foreground leakage and sparse point distribution.
To address these issues, we introduce a standardized FS-PCS setting, upon which a new benchmark is built.
arXiv Detail & Related papers (2024-03-01T15:14:47Z) - Mixture of multilayer stochastic block models for multiview clustering [0.0]
We propose an original method for aggregating multiple clustering coming from different sources of information.
The identifiability of the model parameters is established and a variational Bayesian EM algorithm is proposed for the estimation of these parameters.
The method is utilized to analyze global food trading networks, leading to structures of interest.
arXiv Detail & Related papers (2024-01-09T17:15:47Z) - Convolutional autoencoder-based multimodal one-class classification [80.52334952912808]
One-class classification refers to approaches of learning using data from a single class only.
We propose a deep learning one-class classification method suitable for multimodal data.
arXiv Detail & Related papers (2023-09-25T12:31:18Z) - Generalized Product-of-Experts for Learning Multimodal Representations
in Noisy Environments [18.14974353615421]
We propose a novel method for multimodal representation learning in a noisy environment via the generalized product of experts technique.
In the proposed method, we train a separate network for each modality to assess the credibility of information coming from that modality.
We attain state-of-the-art performance on two challenging benchmarks: multimodal 3D hand-pose estimation and multimodal surgical video segmentation.
arXiv Detail & Related papers (2022-11-07T14:27:38Z) - Unified Multi-View Orthonormal Non-Negative Graph Based Clustering
Framework [74.25493157757943]
We formulate a novel clustering model, which exploits the non-negative feature property and incorporates the multi-view information into a unified joint learning framework.
We also explore, for the first time, the multi-model non-negative graph-based approach to clustering data based on deep features.
arXiv Detail & Related papers (2022-11-03T08:18:27Z) - Consistency and Diversity induced Human Motion Segmentation [231.36289425663702]
We propose a novel Consistency and Diversity induced human Motion (CDMS) algorithm.
Our model factorizes the source and target data into distinct multi-layer feature spaces.
A multi-mutual learning strategy is carried out to reduce the domain gap between the source and target data.
arXiv Detail & Related papers (2022-02-10T06:23:56Z) - Geometric Multimodal Deep Learning with Multi-Scaled Graph Wavelet
Convolutional Network [21.06669693699965]
Multimodal data provide information of a natural phenomenon by integrating data from various domains with very different statistical properties.
Capturing the intra-modality and cross-modality information of multimodal data is the essential capability of multimodal learning methods.
Generalizing deep learning methods to the non-Euclidean domains is an emerging research field.
arXiv Detail & Related papers (2021-11-26T08:41:51Z) - Multivariate feature ranking of gene expression data [62.997667081978825]
We propose two new multivariate feature ranking methods based on pairwise correlation and pairwise consistency.
We statistically prove that the proposed methods outperform the state of the art feature ranking methods Clustering Variation, Chi Squared, Correlation, Information Gain, ReliefF and Significance.
arXiv Detail & Related papers (2021-11-03T17:19:53Z) - Quantile-based fuzzy C-means clustering of multivariate time series:
Robust techniques [2.3226893628361682]
Robustness to the presence of outliers is achieved by using the so-called metric, noise and trimmed approaches.
Results from a broad simulation study indicate that the algorithms are substantially effective in coping with the presence of outlying series.
arXiv Detail & Related papers (2021-09-22T20:26:12Z) - Quantile-based fuzzy clustering of multivariate time series in the
frequency domain [2.610470075814367]
fuzzy C-means and fuzzy C-medoids algorithms are proposed.
The performance of the proposed approach is evaluated in a broad simulation study.
Two specific applications involving air quality and financial databases illustrate the usefulness of our approach.
arXiv Detail & Related papers (2021-09-08T15:38:33Z) - Manifold Topology Divergence: a Framework for Comparing Data Manifolds [109.0784952256104]
We develop a framework for comparing data manifold, aimed at the evaluation of deep generative models.
Based on the Cross-Barcode, we introduce the Manifold Topology Divergence score (MTop-Divergence)
We demonstrate that the MTop-Divergence accurately detects various degrees of mode-dropping, intra-mode collapse, mode invention, and image disturbance.
arXiv Detail & Related papers (2021-06-08T00:30:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.