On The Nature Of The Phenotype In Tree Genetic Programming
- URL: http://arxiv.org/abs/2402.08011v1
- Date: Mon, 12 Feb 2024 19:19:29 GMT
- Title: On The Nature Of The Phenotype In Tree Genetic Programming
- Authors: Wolfgang Banzhaf, Illya Bakurov
- Abstract summary: We discuss the basic concepts of genotypes and phenotypes in tree-based GP (TGP)
We then analyze their behavior using five benchmark datasets.
To generate phenotypes, we provide a unique technique for removing semantically ineffective code from GP trees.
- Score: 3.8642945120580703
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In this contribution, we discuss the basic concepts of genotypes and
phenotypes in tree-based GP (TGP), and then analyze their behavior using five
benchmark datasets. We show that TGP exhibits the same behavior that we can
observe in other GP representations: At the genotypic level trees show
frequently unchecked growth with seemingly ineffective code, but on the
phenotypic level, much smaller trees can be observed. To generate phenotypes,
we provide a unique technique for removing semantically ineffective code from
GP trees. The approach extracts considerably simpler phenotypes while not being
limited to local operations in the genotype. We generalize this transformation
based on a problem-independent parameter that enables a further simplification
of the exact phenotype by coarse-graining to produce approximate phenotypes.
The concept of these phenotypes (exact and approximate) allows us to clarify
what evolved solutions truly predict, making GP models considered at the
phenotypic level much better interpretable.
Related papers
- A Non-negative VAE:the Generalized Gamma Belief Network [49.970917207211556]
The gamma belief network (GBN) has demonstrated its potential for uncovering multi-layer interpretable latent representations in text data.
We introduce the generalized gamma belief network (Generalized GBN) in this paper, which extends the original linear generative model to a more expressive non-linear generative model.
We also propose an upward-downward Weibull inference network to approximate the posterior distribution of the latent variables.
arXiv Detail & Related papers (2024-08-06T18:18:37Z) - Seeing Unseen: Discover Novel Biomedical Concepts via
Geometry-Constrained Probabilistic Modeling [53.7117640028211]
We present a geometry-constrained probabilistic modeling treatment to resolve the identified issues.
We incorporate a suite of critical geometric properties to impose proper constraints on the layout of constructed embedding space.
A spectral graph-theoretic method is devised to estimate the number of potential novel classes.
arXiv Detail & Related papers (2024-03-02T00:56:05Z) - A Comparative Analysis of Gene Expression Profiling by Statistical and
Machine Learning Approaches [1.8954222800767324]
We discuss the biological and the methodological limitations of machine learning models to classify cancer samples.
Gene rankings are obtained from explainability methods adapted to these models.
We observe that the information learned by black-box neural networks is related to the notion of differential expression.
arXiv Detail & Related papers (2024-02-01T18:17:36Z) - PhyloGFN: Phylogenetic inference with generative flow networks [57.104166650526416]
We introduce the framework of generative flow networks (GFlowNets) to tackle two core problems in phylogenetics: parsimony-based and phylogenetic inference.
Because GFlowNets are well-suited for sampling complex structures, they are a natural choice for exploring and sampling from the multimodal posterior distribution over tree topologies.
We demonstrate that our amortized posterior sampler, PhyloGFN, produces diverse and high-quality evolutionary hypotheses on real benchmark datasets.
arXiv Detail & Related papers (2023-10-12T23:46:08Z) - Unsupervised ensemble-based phenotyping helps enhance the
discoverability of genes related to heart morphology [57.25098075813054]
We propose a new framework for gene discovery entitled Un Phenotype Ensembles.
It builds a redundant yet highly expressive representation by pooling a set of phenotypes learned in an unsupervised manner.
These phenotypes are then analyzed via (GWAS), retaining only highly confident and stable associations.
arXiv Detail & Related papers (2023-01-07T18:36:44Z) - Probabilistic Genotype-Phenotype Maps Reveal Mutational Robustness of RNA Folding, Spin Glasses, and Quantum Circuits [0.0]
We introduce probabilistic genotype-phenotype maps, where each genotype maps to a vector of phenotype probabilities.
We study three model systems to show that PrGP maps offer a generalized framework which can handle uncertainty emerging from various physical sources.
We derive an analytical theory for the behavior of PrGP robustness, and we demonstrate that the theory is highly predictive of empirical robustness.
arXiv Detail & Related papers (2023-01-04T23:09:38Z) - Phenotype Search Trajectory Networks for Linear Genetic Programming [8.079719491562305]
Neutrality is the observation that some mutations do not lead to phenotypic changes.
We study the search trajectories of a genetic programming system as graph-based models.
We measure the characteristics of phenotypes including their genotypic abundance and Kolmogorov complexity.
arXiv Detail & Related papers (2022-11-15T21:20:50Z) - Layers, Folds, and Semi-Neuronal Information Processing [0.0]
We use a type of embodied agent that exhibits layered representational capacity: meta-brain models.
We focus on two candidate structures that potentially explain this capacity: folding and layering.
The paper concludes with a discussion on how the meta-brains method can assist us in the investigation of enactivism, holism, and cognitive processing in the context of biological simulation.
arXiv Detail & Related papers (2022-07-07T21:47:23Z) - rfPhen2Gen: A machine learning based association study of brain imaging
phenotypes to genotypes [71.1144397510333]
We learned machine learning models to predict SNPs using 56 brain imaging QTs.
SNPs within the known Alzheimer disease (AD) risk gene APOE had lowest RMSE for lasso and random forest.
Random forests identified additional SNPs that were not prioritized by the linear models but are known to be associated with brain-related disorders.
arXiv Detail & Related papers (2022-03-31T20:15:22Z) - Graph Based Link Prediction between Human Phenotypes and Genes [5.1398743023989555]
Recent advances in the field of machine learning is efficient to predict these interactions between abnormal human phenotypes and genes.
In this study, we developed a framework to predict links between human phenotype ontology (HPO) and genes.
Compared to the other 4 methods LightGBM is able to find more accurate interaction/link between human phenotype & gene pairs.
arXiv Detail & Related papers (2021-05-25T14:47:07Z) - Complexity-based speciation and genotype representation for
neuroevolution [81.21462458089142]
This paper introduces a speciation principle for neuroevolution where evolving networks are grouped into species based on the number of hidden neurons.
The proposed speciation principle is employed in several techniques designed to promote and preserve diversity within species and in the ecosystem as a whole.
arXiv Detail & Related papers (2020-10-11T06:26:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.