A generalized kernel machine approach to identify higher-order composite
effects in multi-view datasets
- URL: http://arxiv.org/abs/2004.14031v1
- Date: Wed, 29 Apr 2020 08:56:02 GMT
- Title: A generalized kernel machine approach to identify higher-order composite
effects in multi-view datasets
- Authors: Md Ashad Alam, Chuan Qiu, Hui Shen, Yu-Ping Wang, and Hong-Wen Deng
- Abstract summary: We propose a novel kernel machine approach to identify higher-order composite effects in multi-view biomedical datasets.
The proposed method can effectively identify higher-order composite effects and suggest that corresponding features function in a concerted effort.
- Score: 4.579719459619913
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In recent years, a comprehensive study of multi-view datasets (e.g.,
multi-omics and imaging scans) has been a focus and forefront in biomedical
research. State-of-the-art biomedical technologies are enabling us to collect
multi-view biomedical datasets for the study of complex diseases. While all the
views of data tend to explore complementary information of a disease,
multi-view data analysis with complex interactions is challenging for a deeper
and holistic understanding of biological systems. In this paper, we propose a
novel generalized kernel machine approach to identify higher-order composite
effects in multi-view biomedical datasets. This generalized semi-parametric (a
mixed-effect linear model) approach includes the marginal and joint Hadamard
product of features from different views of data. The proposed kernel machine
approach considers multi-view data as predictor variables to allow more
thorough and comprehensive modeling of a complex trait. The proposed method can
be applied to the study of any disease model, where multi-view datasets are
available. We applied our approach to both synthesized datasets and real
multi-view datasets from adolescence brain development and osteoporosis study,
including an imaging scan dataset and five omics datasets. Our experiments
demonstrate that the proposed method can effectively identify higher-order
composite effects and suggest that corresponding features (genes, region of
interests, and chemical taxonomies) function in a concerted effort. We show
that the proposed method is more generalizable than existing ones.
Related papers
- Multiplex Imaging Analysis in Pathology: a Comprehensive Review on Analytical Approaches and Digital Toolkits [0.7968706282619793]
Multi multiplexed imaging allows for simultaneous visualization of multiple biomarkers in a single section.
Data from multiplexed imaging requires sophisticated computational methods for preprocessing, segmentation, feature extraction, and spatial analysis.
PathML is an AI-powered platform that streamlines image analysis, making complex interpretation accessible for clinical and research settings.
arXiv Detail & Related papers (2024-11-01T18:02:41Z) - Supervised Multiple Kernel Learning approaches for multi-omics data integration [1.3032276477872158]
Multiple kernel learning (MKL) has shown to be a flexible and valid approach to consider the diverse nature of multi-omics inputs.
We provide novel MKL approaches based on different kernel fusion strategies.
Results show that MKL-based models can compete with more complex, state-of-the-art, supervised multi-omics integrative approaches.
arXiv Detail & Related papers (2024-03-27T08:48:16Z) - Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning [65.54680361074882]
Eye-gaze Guided Multi-modal Alignment (EGMA) framework harnesses eye-gaze data for better alignment of medical visual and textual features.
We conduct downstream tasks of image classification and image-text retrieval on four medical datasets.
arXiv Detail & Related papers (2024-03-19T03:59:14Z) - Seeing Unseen: Discover Novel Biomedical Concepts via
Geometry-Constrained Probabilistic Modeling [53.7117640028211]
We present a geometry-constrained probabilistic modeling treatment to resolve the identified issues.
We incorporate a suite of critical geometric properties to impose proper constraints on the layout of constructed embedding space.
A spectral graph-theoretic method is devised to estimate the number of potential novel classes.
arXiv Detail & Related papers (2024-03-02T00:56:05Z) - HEALNet: Multimodal Fusion for Heterogeneous Biomedical Data [10.774128925670183]
This paper presents the Hybrid Early-fusion Attention Learning Network (HEALNet), a flexible multimodal fusion architecture.
We conduct multimodal survival analysis on Whole Slide Images and Multi-omic data on four cancer datasets from The Cancer Genome Atlas (TCGA)
HEALNet achieves state-of-the-art performance compared to other end-to-end trained fusion models.
arXiv Detail & Related papers (2023-11-15T17:06:26Z) - Building Flexible, Scalable, and Machine Learning-ready Multimodal
Oncology Datasets [17.774341783844026]
This work proposes Multimodal Integration of Oncology Data System (MINDS)
MINDS is a flexible, scalable, and cost-effective metadata framework for efficiently fusing disparate data from public sources.
By harmonizing multimodal data, MINDS aims to potentially empower researchers with greater analytical ability.
arXiv Detail & Related papers (2023-09-30T15:44:39Z) - Incomplete Multimodal Learning for Complex Brain Disorders Prediction [65.95783479249745]
We propose a new incomplete multimodal data integration approach that employs transformers and generative adversarial networks.
We apply our new method to predict cognitive degeneration and disease outcomes using the multimodal imaging genetic data from Alzheimer's Disease Neuroimaging Initiative cohort.
arXiv Detail & Related papers (2023-05-25T16:29:16Z) - CLCLSA: Cross-omics Linked embedding with Contrastive Learning and Self
Attention for multi-omics integration with incomplete multi-omics data [47.2764293508916]
Integration of heterogeneous and high-dimensional multi-omics data is becoming increasingly important in understanding genetic data.
One obstacle faced when performing multi-omics data integration is the existence of unpaired multi-omics data due to instrument sensitivity and cost.
We propose a deep learning method for multi-omics integration with incomplete data by Cross-omics Linked unified embedding with Contrastive Learning and Self Attention.
arXiv Detail & Related papers (2023-04-12T00:22:18Z) - A Variational Information Bottleneck Approach to Multi-Omics Data
Integration [98.6475134630792]
We propose a deep variational information bottleneck (IB) approach for incomplete multi-view observations.
Our method applies the IB framework on marginal and joint representations of the observed views to focus on intra-view and inter-view interactions that are relevant for the target.
Experiments on real-world datasets show that our method consistently achieves gain from data integration and outperforms state-of-the-art benchmarks.
arXiv Detail & Related papers (2021-02-05T06:05:39Z) - Modeling Shared Responses in Neuroimaging Studies through MultiView ICA [94.31804763196116]
Group studies involving large cohorts of subjects are important to draw general conclusions about brain functional organization.
We propose a novel MultiView Independent Component Analysis model for group studies, where data from each subject are modeled as a linear combination of shared independent sources plus noise.
We demonstrate the usefulness of our approach first on fMRI data, where our model demonstrates improved sensitivity in identifying common sources among subjects.
arXiv Detail & Related papers (2020-06-11T17:29:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.