DeepProphet2 -- A Deep Learning Gene Recommendation Engine
- URL: http://arxiv.org/abs/2208.01918v4
- Date: Wed, 22 Mar 2023 11:15:58 GMT
- Title: DeepProphet2 -- A Deep Learning Gene Recommendation Engine
- Authors: Daniele Brambilla (1), Davide Maria Giacomini (1), Luca Muscarnera,
Andrea Mazzoleni (1) ((1) TheProphetAI)
- Abstract summary: The paper discusses the potential advantages of gene recommendation performed by artificial intelligence (AI)
A transformer-based model has been trained on a well-curated freely available paper corpus, PubMed.
A set of use cases illustrates the algorithm's potential applications in a real word setting.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: New powerful tools for tackling life science problems have been created by
recent advances in machine learning. The purpose of the paper is to discuss the
potential advantages of gene recommendation performed by artificial
intelligence (AI). Indeed, gene recommendation engines try to solve this
problem: if the user is interested in a set of genes, which other genes are
likely to be related to the starting set and should be investigated? This task
was solved with a custom deep learning recommendation engine, DeepProphet2
(DP2), which is freely available to researchers worldwide via
https://www.generecommender.com?utm_source=DeepProphet2_paper&utm_medium=pdf.
Hereafter, insights behind the algorithm and its practical applications are
illustrated.
The gene recommendation problem can be addressed by mapping the genes to a
metric space where a distance can be defined to represent the real semantic
distance between them. To achieve this objective a transformer-based model has
been trained on a well-curated freely available paper corpus, PubMed. The paper
describes multiple optimization procedures that were employed to obtain the
best bias-variance trade-off, focusing on embedding size and network depth. In
this context, the model's ability to discover sets of genes implicated in
diseases and pathways was assessed through cross-validation. A simple
assumption guided the procedure: the network had no direct knowledge of
pathways and diseases but learned genes' similarities and the interactions
among them. Moreover, to further investigate the space where the neural network
represents genes, the dimensionality of the embedding was reduced, and the
results were projected onto a human-comprehensible space. In conclusion, a set
of use cases illustrates the algorithm's potential applications in a real word
setting.
Related papers
- An Evolutional Neural Network Framework for Classification of Microarray Data [0.0]
This research aims to apply a hybrid model of Genetic Algorithm and Neural Network to overcome the problem during subset selection of informative genes.
Experimental results show the proposed method suggested high accuracy and minimum number of selected genes in comparison with other machine learning algorithms.
arXiv Detail & Related papers (2024-11-20T13:48:40Z) - Neural Lineage [56.34149480207817]
We introduce a novel task known as neural lineage detection, aiming at discovering lineage relationships between parent and child models.
For practical convenience, we introduce a learning-free approach, which integrates an approximation of the finetuning process into the neural network representation similarity metrics.
For the pursuit of accuracy, we introduce a learning-based lineage detector comprising encoders and a transformer detector.
arXiv Detail & Related papers (2024-06-17T01:11:53Z) - BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation Experiments [112.25067497985447]
We introduce BioDiscoveryAgent, an agent that designs new experiments, reasons about their outcomes, and efficiently navigates the hypothesis space to reach desired solutions.
BioDiscoveryAgent can uniquely design new experiments without the need to train a machine learning model.
It achieves an average of 21% improvement in predicting relevant genetic perturbations across six datasets.
arXiv Detail & Related papers (2024-05-27T19:57:17Z) - GENEVIC: GENetic data Exploration and Visualization via Intelligent interactive Console [6.786793669890866]
GENEVIC is an AI-driven chat framework that bridges the gap between genetic data generation and biomedical knowledge discovery.
It automates the analysis, retrieval, and visualization of customized domain-specific genetic information.
It integrates functionalities to generate protein interaction networks, enrich gene sets, and search scientific literature from PubMed, Google Scholar, and arXiv.
arXiv Detail & Related papers (2024-04-04T20:53:30Z) - Genetic InfoMax: Exploring Mutual Information Maximization in
High-Dimensional Imaging Genetics Studies [50.11449968854487]
Genome-wide association studies (GWAS) are used to identify relationships between genetic variations and specific traits.
Representation learning for imaging genetics is largely under-explored due to the unique challenges posed by GWAS.
We introduce a trans-modal learning framework Genetic InfoMax (GIM) to address the specific challenges of GWAS.
arXiv Detail & Related papers (2023-09-26T03:59:21Z) - A New Deep Learning and XAI-Based Algorithm for Features Selection in
Genomics [5.787117733071415]
The paper proposes a novel algorithm to perform Feature Selection on genomic-scale data.
Results of the application on a Chronic Lymphocytic Leukemia dataset evidence the effectiveness of the algorithm.
arXiv Detail & Related papers (2023-03-29T16:44:13Z) - DDeMON: Ontology-based function prediction by Deep Learning from Dynamic
Multiplex Networks [0.7349727826230864]
The goal of this work is to explore how the fusion of systems' level information with temporal dynamics of gene expression can be used to predict novel gene functions.
We propose DDeMON, an approach for scalable, systems-level inference of function annotation using time-dependent multiscale biological information.
arXiv Detail & Related papers (2023-02-08T06:53:02Z) - Graph Based Link Prediction between Human Phenotypes and Genes [5.1398743023989555]
Recent advances in the field of machine learning is efficient to predict these interactions between abnormal human phenotypes and genes.
In this study, we developed a framework to predict links between human phenotype ontology (HPO) and genes.
Compared to the other 4 methods LightGBM is able to find more accurate interaction/link between human phenotype & gene pairs.
arXiv Detail & Related papers (2021-05-25T14:47:07Z) - Epigenetic evolution of deep convolutional models [81.21462458089142]
We build upon a previously proposed neuroevolution framework to evolve deep convolutional models.
We propose a convolutional layer layout which allows kernels of different shapes and sizes to coexist within the same layer.
The proposed layout enables the size and shape of individual kernels within a convolutional layer to be evolved with a corresponding new mutation operator.
arXiv Detail & Related papers (2021-04-12T12:45:16Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - AutoML-Zero: Evolving Machine Learning Algorithms From Scratch [76.83052807776276]
We show that it is possible to automatically discover complete machine learning algorithms just using basic mathematical operations as building blocks.
We demonstrate this by introducing a novel framework that significantly reduces human bias through a generic search space.
We believe these preliminary successes in discovering machine learning algorithms from scratch indicate a promising new direction in the field.
arXiv Detail & Related papers (2020-03-06T19:00:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.