R-Mixup: Riemannian Mixup for Biological Networks
- URL: http://arxiv.org/abs/2306.02532v1
- Date: Mon, 5 Jun 2023 01:41:23 GMT
- Title: R-Mixup: Riemannian Mixup for Biological Networks
- Authors: Xuan Kan, Zimu Li, Hejie Cui, Yue Yu, Ran Xu, Shaojun Yu, Zilong
Zhang, Ying Guo, Carl Yang
- Abstract summary: We propose R-mixUP, a Mixup-based data augmentation technique that suits the symmetric positive definite (SPD) property of adjacency matrices from biological networks.
We demonstrate the effectiveness of R-mixUP with five real-world biological network datasets on both regression and classification tasks.
- Score: 15.48899766304136
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Biological networks are commonly used in biomedical and healthcare domains to
effectively model the structure of complex biological systems with interactions
linking biological entities. However, due to their characteristics of high
dimensionality and low sample size, directly applying deep learning models on
biological networks usually faces severe overfitting. In this work, we propose
R-MIXUP, a Mixup-based data augmentation technique that suits the symmetric
positive definite (SPD) property of adjacency matrices from biological networks
with optimized training efficiency. The interpolation process in R-MIXUP
leverages the log-Euclidean distance metrics from the Riemannian manifold,
effectively addressing the swelling effect and arbitrarily incorrect label
issues of vanilla Mixup. We demonstrate the effectiveness of R-MIXUP with five
real-world biological network datasets on both regression and classification
tasks. Besides, we derive a commonly ignored necessary condition for
identifying the SPD matrices of biological networks and empirically study its
influence on the model performance. The code implementation can be found in
Appendix E.
Related papers
- A Review of Link Prediction Applications in Network Biology [2.7624021966289605]
Link prediction (LP) methodologies are instrumental in inferring missing or prospective associations within biological networks.
We systematically dissect the attributes of local, centrality, and embedding-based LP approaches, applied to static and dynamic biological networks.
We conclude the review with an exploration of the essential characteristics expected from future LP models, poised to advance our comprehension of the intricate interactions governing biological systems.
arXiv Detail & Related papers (2023-12-03T04:23:51Z) - Heterogenous Memory Augmented Neural Networks [84.29338268789684]
We introduce a novel heterogeneous memory augmentation approach for neural networks.
By introducing learnable memory tokens with attention mechanism, we can effectively boost performance without huge computational overhead.
We show our approach on various image and graph-based tasks under both in-distribution (ID) and out-of-distribution (OOD) conditions.
arXiv Detail & Related papers (2023-10-17T01:05:28Z) - PhyloGFN: Phylogenetic inference with generative flow networks [57.104166650526416]
We introduce the framework of generative flow networks (GFlowNets) to tackle two core problems in phylogenetics: parsimony-based and phylogenetic inference.
Because GFlowNets are well-suited for sampling complex structures, they are a natural choice for exploring and sampling from the multimodal posterior distribution over tree topologies.
We demonstrate that our amortized posterior sampler, PhyloGFN, produces diverse and high-quality evolutionary hypotheses on real benchmark datasets.
arXiv Detail & Related papers (2023-10-12T23:46:08Z) - SBMLtoODEjax: Efficient Simulation and Optimization of Biological
Network Models in JAX [19.55237447763145]
This paper introduces SBMLtoODEjax, a lightweight library designed to seamlessly integrate SBML models with ML-supported pipelines, powered by JAX.
It harnesses JAX's capabilities for efficient parallel simulations and optimization, with the aim to accelerate research in biological network analysis.
arXiv Detail & Related papers (2023-07-17T12:47:33Z) - Differentiable Agent-based Epidemiology [71.81552021144589]
We introduce GradABM: a scalable, differentiable design for agent-based modeling that is amenable to gradient-based learning with automatic differentiation.
GradABM can quickly simulate million-size populations in few seconds on commodity hardware, integrate with deep neural networks and ingest heterogeneous data sources.
arXiv Detail & Related papers (2022-07-20T07:32:02Z) - A deep learning driven pseudospectral PCE based FFT homogenization
algorithm for complex microstructures [68.8204255655161]
It is shown that the proposed method is able to predict central moments of interest while being magnitudes faster to evaluate than traditional approaches.
It is shown, that the proposed method is able to predict central moments of interest while being magnitudes faster to evaluate than traditional approaches.
arXiv Detail & Related papers (2021-10-26T07:02:14Z) - CNN-based Approaches For Cross-Subject Classification in Motor Imagery:
From The State-of-The-Art to DynamicNet [0.2936007114555107]
Motor imagery (MI)-based brain-computer interface (BCI) systems are being increasingly employed to provide alternative means of communication and control.
accurately classifying MI from brain signals is essential to obtain reliable BCI systems.
Deep learning approaches have started to emerge as valid alternatives to standard machine learning techniques.
arXiv Detail & Related papers (2021-05-17T14:57:13Z) - Comparisons among different stochastic selection of activation layers
for convolutional neural networks for healthcare [77.99636165307996]
We classify biomedical images using ensembles of neural networks.
We select our activations among the following ones: ReLU, leaky ReLU, Parametric ReLU, ELU, Adaptive Piecewice Linear Unit, S-Shaped ReLU, Swish, Mish, Mexican Linear Unit, Parametric Deformable Linear Unit, Soft Root Sign.
arXiv Detail & Related papers (2020-11-24T01:53:39Z) - Mycorrhiza: Genotype Assignment usingPhylogenetic Networks [2.286041284499166]
We introduce Mycorrhiza, a machine learning approach for the genotype assignment problem.
Our algorithm makes use of phylogenetic networks to engineer features that encode the evolutionary relationships among samples.
Mycorrhiza yields particularly significant gains on datasets with a large average fixation index (FST) or deviation from the Hardy-Weinberg equilibrium.
arXiv Detail & Related papers (2020-10-14T02:36:27Z) - TaBooN -- Boolean Network Synthesis Based on Tabu Search [0.0]
Omics-technologies revolutionized the investigation of biology by producing molecular data in multiple dimensions and scale.
Biological network is composed of nodes referring to the components such as genes or proteins, and the edges/arcs formalizing interactions between them.
arXiv Detail & Related papers (2020-09-08T08:56:14Z) - Repulsive Mixture Models of Exponential Family PCA for Clustering [127.90219303669006]
The mixture extension of exponential family principal component analysis ( EPCA) was designed to encode much more structural information about data distribution than the traditional EPCA.
The traditional mixture of local EPCAs has the problem of model redundancy, i.e., overlaps among mixing components, which may cause ambiguity for data clustering.
In this paper, a repulsiveness-encouraging prior is introduced among mixing components and a diversified EPCA mixture (DEPCAM) model is developed in the Bayesian framework.
arXiv Detail & Related papers (2020-04-07T04:07:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.